22
HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN 1

HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

HADOOP UNDER ATTACK

SECURING DATA IN

A BANKING DOMAIN

1

Page 2: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

2

Federico Leven

WHOAMI

@ ReactoData

Big Data + Open Source from 2012

Big Data Meetup coordinator

(http://www.iaar.site), speaker ...

[email protected]

Web : http://www.reactodata.net

Twitter: @reactodata

Linkedin : https://www.linkedin.com/in/federicoleven/

Page 3: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

3

We are a startup based in Buenos and Poland, providing Big

Data + Cloud solutions based on Open Source and

proprietary software and Hadoop consultancy.

• Big Data and Hadoop applications development

• Machine Learning

• Cloud

• UX/UI and Mobile Apps for Big Data platforms

• Hadoop Consultancy

Page 4: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

4

Agenda

The Challenge : Best Practices + Regulations

How to do it in Hadoop

End-to-End Secured Architecture

What can go wrong ?

References

Conclusion & Questions

Page 5: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

Buzzconf 2018 5

The Challenge

Best Practices

and regulations

Page 6: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

The Challenge : Data Security

6

The set of preventive, detective and corrective measures to protect the integrity,

confidenciality and availability of the data.

CAAIN

• CONFIDENCIALITY

• AVAILABILITY

• AUTHENTICITY

• INTEGRITY

❑ ACCOUNTABILITY / AUDITING

❑ TRACEABILITY

• NON-REPUDIATION

Cain and Abel

Page 7: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

C(A)AIN

7

CONFIDENCIALITY : Data is not made available or disclosed to unauthorized

parties.

AVAILABILITY : Data is available when is needed.

AUTHENTICITY : Data source identity is verifiable.

INTEGRITY : Data is accurate and complete over its entire lifecycle.

NON-REPUDIATION : Parties of a data transaction cannot deny having

received/sent the data .

Page 8: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

The Challenge : Threats in financial and banking domain

8

Emerging Technologies

Challenges- Botnet

- IoT unsecured devices

- DDoS (Distributed Denial of Service

Attack)

Target- Sensitive data

- Access credentials

Regulation Challenges- Periodically new and/or stricter

regulations

- US Data Protection rules

- EUR : GDPR

Insider Challenges- Unintentional actions

- Malicious users

Page 9: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

The Challenge : Best Practices in banking

9

• Security Officer

• End User Guidelines

• Access Policies

• Governance

• …

HumanOrganization Technological

• Networking

• Software Updates

• Data Protection

• Auditing

• …

• Employees Awareness

• Training

• …

Page 10: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

Buzzconf 2018 10

How to do it in

Hadoop

Page 11: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

From concepts to technology

11

AUTHENTICATION : Identify the user.

AUTHORIZATION : Grant user access to the data.

PROTECTION : Protect data from being used except by authorized users.

AVAILABILITY : Make data accessible when needed.

Page 12: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

From concepts to technology

12

C

Authentication

Authorization

A

Authentication

I

Protection

Availability

A

Availability

N

• Kerberos

• LDAP

• Sentry

• HBase ACLs

• +

• Hadoop

• HA (HDFS,

HBASE …)

• Kerberos

• LDAP• Encryption

(Motion &

Rest)

• Redaction

• Hadoop/HA

Auditing

Traceability

Metadata

Lineage

Log Audit

Cloudera

Navigator

Page 13: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

From concepts to technology

13

https://www.cloudera.com/documentation/enterprise/5-14-x/topics/sg_edh_overview.html

• BCRA A6375

• BCRA A6495

• ISO 17799/27001

Page 14: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

What we needed in a banking infrastructure for Hadoop

14

Page 15: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

Buzzconf 2018 15

End-to-End

Secured

Architecture

Page 16: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

Example Production deployment (CDH 5.13)

16

HDFS

YARN

ZOOKEEPER

HBASE

Kerberos AD

Oracle DB

HIVE METASTOREHIVESERVER

SSL/TLS

KTS KMS

C. MANAGER

SENTRY

Page 17: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

HDFS

Secure data pipeline example

17

ORACLE

Sources IngestHDFS Landing

Area

HDFS Business

Area

Rabbit

MQFlume Agent

Sqoop

Web UI

HDFS

Impala

SparkSQL

Hive

Encrypted

Zone

Sqoop SSL ON

Data redaction

custom component

Spark ETL

Page 18: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

Buzzconf 2018 18

What can go

wrong ?

Page 19: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

What can go wrong ? Some good news and some bad news

19

UNSECURE APPLICATIONS WILL NOT WORK ON SECURE ENVIRONMENTS

Sentry HDFS synchronization does not support Hive Metastore HA (CDH 5.9)

Sentry HA not supported (CDH 5.9)

To use CM Kerberos wizard, you need a high level privileges user

SparkSQL does not respect Sentry permissions (Latest)

Enabling Sentry turns off Hive impersonation (CDH 5.9)

Spark Streaming cannot consume from secure Kafka (CDH 5.9)

Page 20: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

Buzzconf 2018 20

References

Page 21: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

21

✓ http://www.bcra.gob.ar/Pdfs/Texord/t-rmsist.pdf

✓ http://www.bcra.gov.ar/pdfs/texord/t-seguef.pdf

✓ https://en.wikipedia.org/wiki/ISO/IEC_27002

✓ http://web.iram.org.ar/index.php?vernorma&id=2439

✓ https://www.cloudera.com/documentation/enterprise/latest/PDF/cloudera-security.pdf

✓ https://www.cloudera.com/documentation/enterprise/5-9-x/topics/security.html

✓ https://www.forbes.com/sites/gregorymcneal/2014/05/26/banks-challenged-by-

cybersecurity-threats-state-regulators-acting/#228d745597f7

References

Page 22: HADOOP UNDER ATTACK SECURING DATA IN A BANKING DOMAIN · 2020. 7. 13. · To use CM Kerberos wizard, you need a high level privileges user SparkSQL does not respect Sentry permissions

22

Thank you !

Questions, suggestions or complaints ?

“No Hadoop was harmed in the making of this presentation”