EFF secure IM scorecard review


1

EFF “Secure” IM Scorecard Review

2

Hello.

3

We’re going to be talking about Privacy vs Security.

4

We’re also going to talk about challenges in securing IM communication.

5

Agenda

• Epilog
• Review Part 1 – RetroShare
• Review Part 2a – libotr
• Threat landscape
• Hardening approaches
• Prolog

6

Epilog

7

Privacy and encryption have been regular mainstream topics

8

It’s common for security and privacy to be talked about in the same breath

9

It’s common for security and privacy to be talked about in the same breath

Security is a foundation to building privacy “features”

10

Introducing…

EFF “Secure” “IM” “Scorecard”

“Which apps and tools actually keep your messages safe”.

11

“Actually”?!

12

13

EFF “Secure” IM Scorecard Summary

• A webpage on the EFF site guiding users on the security of IM clients

• It drastically simplifies the problem domain into several “features”:

14

The scorecard logic is a bit perplexing to us...

15

We decided to set aside time to investigate…

• Spend casual hours through 2016 reviewing a selection of clients

• Dedicate up to 24 hours to each client and document the process

• Research the development lifecycle, patching, bug tracking, etc.
• Responsibly disclose any vulnerabilities uncovered during the audits

16

Review Part 1 - RetroShare

17

RetroShare

• Started in 2006
• Transport via TLS (OpenSSL)
• Auth via PGP keys
• Features include:
  • Chat / chat channels
  • Voice and video
  • Mail
  • File sharing
  • Forums
  • Tor / I2P support

18

Code Repo

19

First impressions

• Trivial web-based vulnerabilities (e.g. XSS) via the UI
• Large attack surface:
  • HTTP protocol handling
  • Rich UI
  • Low-level serialised protocols
  • File handling
  • Multiple 3rd party libraries
• General code quality below par:
  • Inconsistent retval checking
  • Signed-ness inconsistencies
  • Relaxed edge-case handling
  • Poor usage of typecasting
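As a minimal illustration of the signed-ness class above (a generic sketch, not RetroShare code): a length that arrives as a signed int, is bounds-checked while still signed, and only converted to size_t at the copy site will pass the check when negative, then balloon into a huge unsigned size.

```c
#include <stddef.h>

#define BUF_SIZE 64

/* BUG: a negative (error) length passes the signed comparison;
 * at a later memcpy() call site it converts to a huge size_t. */
int unsafe_check(int len)
{
    return len <= BUF_SIZE;          /* -1 <= 64 is true */
}

/* FIX: reject negatives before ever treating the value as a size. */
int safe_check(int len)
{
    return len >= 0 && (size_t)len <= BUF_SIZE;
}
```

Static analysers flag exactly this signed/unsigned mismatch, which is one reason the warnings are worth chasing down rather than casting away.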

20

21

22

23

Static analysis results

24

Static analysis results

25

• Vanilla integer overflows to heap corruption in VOIP:
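The pattern behind this bug class (sketched generically here, not the actual VOIP code): an attacker-influenced element count is multiplied into an allocation size, the multiplication wraps, malloc() returns a short buffer, and the subsequent copy corrupts the heap. A hedged sketch of the guard that prevents it:

```c
#include <stddef.h>
#include <stdint.h>

/* Refuses multiplications that would wrap; returns 0 on overflow
 * (callers treat 0 as "reject the packet"). */
size_t checked_mul(size_t count, size_t size)
{
    if (size != 0 && count > SIZE_MAX / size)
        return 0;                    /* count * size would wrap */
    return count * size;
}
```

The unsafe version is simply `malloc(count * size)` with no guard; one wrapped multiplication is enough to turn a parser into a heap-corruption primitive.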

26

• Vanilla integer overflows to OOBR in base serialisation library:
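The OOBR variant looks similar (again a generic sketch, not the real serialisation library): a wire-supplied length is added to the current offset, `offset + len` wraps past the bounds check, and the parser reads out of bounds. Comparing against the space actually remaining avoids the wrap:

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* BUG form would be: if (offset + len <= buflen) ... -- the
 * addition can wrap. FIX: check the remaining space instead. */
int read_record(const uint8_t *buf, size_t buflen, size_t offset,
                uint32_t len, uint8_t *out)
{
    if (offset > buflen || len > buflen - offset)
        return -1;                   /* truncated or malformed record */
    memcpy(out, buf + offset, len);
    return 0;
}
```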

27

28

Developer response

• Communication was easy to establish with PGP keys
• Overnight response – very positive and friendly
• Issues patched promptly, updated binaries shortly following

11th Jan 2016 – Disclosure and response
13th Jan 2016 – Patch developed
9th Feb 2016 – Binaries published

29

Developer response

30

Review Part 2 – libotr

31

Libotr

• Is a library, not a client.
• Many popular clients for IM protocols leverage it as a plugin to provide privacy-based communication:
  • Pidgin
  • Irssi
  • Miranda
  • Jitsi
  • Etc.

32

Code repo

33

Initial Impressions

• Relatively small code repository
• Inconsistency with defensive programming patterns:
  • A lot of repetition / lack of centralised routines
  • A lot of “potentially” unsafe integer arithmetic
  • Not testing return values of memory allocators etc.

It was clear to see “knee jerk” patching, where validation is done in response to security advisories.
Some crushing of hopes…
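A sketch of the centralisation point above, assuming nothing about libotr's actual internals: one helper that owns both the overflow check and the allocator retval check, so individual call sites cannot forget either:

```c
#include <stdlib.h>
#include <string.h>
#include <stdint.h>

/* Hypothetical helper (not libotr code): allocate a+b bytes and
 * concatenate, with the two checks that scattered call sites tend
 * to repeat inconsistently -- size wrap and malloc() failure. */
void *alloc_concat(const void *a, size_t alen,
                   const void *b, size_t blen, size_t *outlen)
{
    unsigned char *p;
    if (alen > SIZE_MAX - blen)      /* a + b would wrap */
        return NULL;
    p = malloc(alen + blen);
    if (p == NULL)                   /* allocator failure is a real path */
        return NULL;
    memcpy(p, a, alen);
    memcpy(p + alen, b, blen);
    *outlen = alen + blen;
    return p;
}
```

Routing all "build a buffer from pieces" logic through one audited routine shrinks the surface that knee-jerk patches otherwise have to chase.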

34

35

36

37

<- check

38

Developer Responses

• We sent one e-mail overnight and had a response waiting in the inbox by the next morning.
• Activity on the bug tracker shows a keen interest in resolving issues.
• The developers seem to understand secure coding and pay attention to bug reports that might have an impact.
• So far so good.

39

Threat landscape

40

Things may never be NSA-proof…

41

Things may never be NSA-proof…
However, attacker cost needs to be as high as possible

42

Privacy applications also have different security requirements
Compromising the confidentiality of “secrets” == RCE

43

Application functionality

• These clients often have fairly extensive functionality, and each feature brings its own potential attack scenarios, including:
  • Web and XML-style bugs in UI and rendering = RCE or deanonymisation
  • Memory corruption = RCE
  • Memory leaks = critical information disclosure, RCE reliability
  • File handle issues = potential information disclosure / RCE
  • And the list goes on…

44

Further, these clients also pass heavy-lifting to libraries:
PGP, SSL, Onion Routing, UI, Compression, Images/Video/Audio…

45

Scenario breakdown

• Low investment
  Common application-level vulnerability classes (web to low level)

• Moderate investment
  More intricate application-level vulnerabilities
  Logic-level and design issues with adverse security impact
  Common 3rd party dependency vulnerabilities
  Passive end-point classification and intrusive targeting

• High investment
  Overarching low-level protocol vulnerabilities (both passive/intrusive)

46

Hardening improvements

47

Bug bounties (pentesting v2.0)

• Bug bounties can certainly serve a purpose, but their fit and limitations need to be well understood (or else it resembles the next slide)

• At a certain proven maturity level, high-value bounties can entice skilled researchers to invest time identifying moderate–high-resource attack scenarios and findings

• Can be excellent for quickly identifying low-hanging fruit and uncovering a range of vulnerability classes, but:
  • Should not be viewed as a level of assurance or maturity for the project
  • Introduces a risk of bandaid / kneejerk patching for specific vulnerability constructs, without getting to the systemic or implementation-pattern flaw
  • Can create a false sense of security by eradicating simpler bugs while higher-skilled threat actors sit on moderate–high-skill strategic attacks

48

General best practices

• Clear documentation of architecture, protocols, possible configuration options, and edge cases
• Security-focused “unit” test cases
• Hardened binaries adopting latest best-practice exploit mitigation approaches
• Sandboxing and hardened platform policies would be ideal, e.g. grsec policies for containing the running binary – access to the underlying file-system and syscalls
• Good issue tracker, easy-to-find security contact with PGP keys available

49

Fuzzing

- Very useful in bashing out bugs of all kinds without much human effort
- Needs an initial time investment to get a fuzzer implemented
- Needs some time investment to follow up on its results
- It’s important that if fuzzing is used, it’s done correctly – otherwise a lack of results might lead to a false sense of security.
  - For example, encryption-heavy code is likely to fail early on each mutation. Your fuzzer should take this into consideration and do what’s needed to get better code coverage.
- It’s unlikely to find more complicated issues, so it shouldn’t be relied upon as the only security testing for a project.
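As a sketch of the "done correctly" point: a minimal libFuzzer-style harness (the parser is a made-up stand-in, not code from either project reviewed). Pairing the harness with a seed corpus of valid messages is what gets coverage past early length and crypto checks.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical message parser under test -- stands in for a real
 * deserialiser. Layout assumed: [type][len][len bytes of payload]. */
int parse_message(const uint8_t *data, size_t size)
{
    if (size < 2)
        return -1;                   /* need type + length */
    uint8_t type = data[0];
    uint8_t len  = data[1];
    if (len > size - 2)
        return -1;                   /* declared length vs bytes present */
    return type;
}

/* libFuzzer entry point: the fuzzer calls this with mutated inputs.
 * Build (assuming clang): clang -g -fsanitize=fuzzer,address fuzz.c */
int LLVMFuzzerTestOneInput(const uint8_t *data, size_t size)
{
    parse_message(data, size);       /* sanitizers catch any OOB access */
    return 0;
}
```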

50

Static analysis

• There are a number of potential compile-time static analysis techniques, from open-source (e.g. clang) to commercial (e.g. Coverity)

• Static analysis tooling can introduce both false negatives and false positives (the latter being time heavy to investigate)

• Can be beneficial for identifying low-hanging fruit, and also flags bad constructs in code – encourages overall cleanliness and developers to follow best practices

• Can provide useful metrics and tracking of security issues and improvements over time

51

Code audits

• The quality of a code audit depends on who you have on it, and the auditors’ understanding and experience of the relevant technology stack – this drastically influences effectiveness (e.g. the read() example)

• Code audits should not be a simple snapshot of “findings”, but provide ongoing guidance to the developers for the overall architecture and coding practices of the application

• We believe dedicated audits from experienced folk should only really occur after applications meet a certain bar

52

Prolog

53

Key takeaways

• Neither product so far meets the bar required for “keeping you safe”, factoring in the complicated threat scenarios for privacy software

• Judging software security on publicised vulnerabilities is misleading

• The scorecard needs improvements and ongoing maintenance otherwise it may introduce risk to users

General rule of thumb:
• simple foundations + minimal functionality = promising

54

Next steps

• We’re in disclosure with libotr now and will do part 2b at Wahckon

• A part 2 blog post will be coming out in May
• We will review 1 or 2 clients not in the scorecard, including Ricochet
• We will review 1 or 2 other popular clients in the scorecard soon
• Potentially, we will look at common libraries used by many clients

55

Thanks for listening!

Questions?