Harden Known against new attack landscape #discussion #203

mapkyca · 2014-03-24T19:30:08Z

See #202... what attacks do we see in the post snowden landscape, what can we do to protect the citizen against them?

Starting off with a bit of a brainstorm, random thoughts in no particular order:

Client -> server comms. Largely solved by TLS
User private messages. Look at TLS for transport, give users public keys for private messages
Protecting social graph. Encrypt friend list to the user(?). Database on encrypted drive. Recommendation to remove logs. Ability to route intersite comms over TLS and TOR (see Webmention code to use internal Webservices library, make method plugable #202)
Cartel attack. Mitigated by distributed nature...? What else...?
Updated 020150703 based on comment below: Work to remove unique identifiers from protocols (certainly non-secured protocols)

More later, for now food, @benwerd when would be a good time to chat? my hands are tired from typing and it'd be good to kick this around for a bit :)

Want to back this issue? Post a bounty on it! We accept bounties via Bountysource.

mapkyca · 2014-06-05T08:58:16Z

Related #288

benwerd · 2014-06-15T17:37:49Z

Bumping this thread.

I'm interested in providing a secured solution. I'm not convinced that TLS carries as much security as people think it does, but I do think it's a good start and a sensible recommendation.

The ideal would be, eg, when Known is used in a group environment, that only the members of that group can feasibly access the content. The trick is to maintain this when members are added to, or removed from, the group. Access-controlled content carries the same principle: you're essentially adding people to the group, but dealing with non-local users.

And, of course, doing this in such a way that it's as easy to use as Facebook and Twitter, and easier to install & maintain than WordPress, which is the bar we need to meet.

mapkyca · 2014-06-16T11:44:02Z

TLS depends on your attack vector: TLS is very good at securing point to point, renders passive on-the-wire attacks impractical, at least at scale. Perhaps more importantly, TLS means that you can, with some degree of certainty, ensure that resources provided by your server haven't been modified en-route... which is an attack vector previously thought nearly impossible, but now it turns out is fairly widespread.

Widespread use also provides cover/herd immunity to others, so is a generally Good Thing to do.

TLS is pretty secure, certainly much better than not using TLS, but it isn't a silver bullet. It relies on well implemented libraries with no exploitable bugs (see Heartbleed), it relies on good algorithm selection on the server (apache's defaults were terrible, they're now better, but still not perfect). Corporate solutions often have to pick compatibility over security, but the death of XP & IE6-8 will do a lot for this.

TLS is also only for transit security; typically TLS is removed at your data centre, and this is where most attacks are focussed (see NSA smiley).

My view, is a strength in depth approach is what's required and we should ABSOLUTELY use TLS as part of that, and make it super easy to set it up - via tutorials, links to certs and validators, nudging, bullying and general pestering, etc (see #210).

More widely, I think Known helps to a large degree by providing target dispersal - lots of one/two user installs rather than one or two installs with millions of users. If we can make the communication between them secure (webmentions over TLS sent via a TOR proxy to disguise the social graph, as I'm keen to do) then that will go a large way towards our goals.

Next thing to look at I think is data at rest: largely this boils down to storing stuff on encrypted drives, which is fine if your attack vector is someone breaking in to your data centre, but not if your attack vector is someone holding a gun to your kids head until you type in your password (cartel attack). Encryption of data at rest then, I think, is really only a factor if you are holding data for a third party (known hosted, rather than self hosted).

So, to mitigate cartel attacks (which in reality generally come in the form of national security letters), it is best to be able to hand-on-heart say that you have no data to turn over. Again, target dispersal plays a part here (you can tell them who your friends are, but they couldn't then find out who your friends of friends are, without going through the process over again), however it brings us back to data retention.

This was a point I made on #292 but was overruled: to really stand a chance of resisting a cartel attack you need to not be holding the data in the first place (since even if it's encrypted you can be compelled either by threat of arrest or a $5 wrench to decrypt it, and at some point someone somewhere needs to have the key). I made the point that we should only collect the minimum amount of data necessary to complete a specific task, and hold it only as long as necessary to do it.

Since the only reason to collect Exif right now is to sort out the orientation of thumbnails, we shouldn't be keeping it for any longer than that. Sure, in the future we might have another use for it, and we can start collecting it again when we do, but right now we don't. So all we're doing is making a specific node a more juicy target.

I got overruled on this unfortunately, but if you're want to get serious on this issue, you're going to have to start addressing the "collect everything just in case" instinct that pervades our industry. Encryption is only part of the solution...

mapkyca · 2014-06-17T12:16:24Z

Not that I'm necessarily advocating this as a "solution", but I've also been experimenting with client side encryption of data e.g. https://www.marcus-povey.co.uk/2014/06/05/openpgp-encrypted-forms-for-idno/

This might be a technique to explore for truly locked down data stores - i.e. keys generated on and never leave the client, meaning even if we get hacked/NSLed we have no way of getting the data back. Of course, the flip side of this means that if the client loses the key then their data is gone forever.

mapkyca · 2014-07-14T16:20:25Z

Bumping this again with thoughts regarding the hosted version.

If you're going to offer a managed solution, you're going to need to put in protection for your customers, especially if you become successful.

If you provide services to UK based users, you'll fall under the data retention laws introduced in DRIP. If you provide services to Russian users, you will have to store your data on a cloud within the RF. The trend is continuing.

Step 1 would be to make your customers fully aware of what might go on via a data jurisdiction statement.
Step 2 would be to start building in countermeasures, first and foremost you should aim to store the absolute minimum amount of data necessary to perform a service (see #292 etc), and encrypt what you do, ideally in a way that only the customer can retrieve.

One thing I was mulling is, could you move the hosted solution to a model whereby a user, on signup, enters details of their own cloud based database server (or better yet, can click on a button, enter in their AWS details and have one generated automatically). That way, you could have known run as a halfway house between hosted and centralised and fully indieweb - i.e. it's not "on their own server", but they own and pay for their own database that they legally own, but benefit from the management and software updates of a hosted service.

Since the customer owns the database, data retention laws shouldn't apply. An attacker then is left to going after the individual customer, because if they go after you your customer can simply change the access credentials, install their own copy of the codebase and go fully indieweb, or they can attack AWS itself, inwhich case they should in theory face some push back.

Plus, if you start encrypting data based on a key provided by your customer, an attack on aws itself would be of minimal utility.

Just thinking aloud....

mapkyca · 2014-07-16T19:31:44Z

@benwerd what are your thoughts about hardening the hosted solution? Centralised solutions are a target, and the UK's new law for example means you'd have to retain data.

harsh5455555 · 2014-07-17T08:08:10Z

You Can Give Franchise of Known to the people of other Companies for Hosting and Managing Known

benwerd · 2014-07-18T01:30:52Z

The hosted solution will support TLS by default. Beyond that I don't want
to open up the hosted version for discussion in the open source community.
On Jul 17, 2014 1:08 AM, "harsh5455555" notifications@github.com wrote:

You Can Give Franchise of Known to the people of other Companies for
Hosting and Managing Known

—
Reply to this email directly or view it on GitHub
#203 (comment).

benwerd · 2014-07-18T01:44:38Z

We will absolutely be creating partnerships with other companies providing Known services.

benwerd · 2014-07-18T01:55:47Z

It's also worth saying that Known will be on the IndieBox, which will let anyone easily run it on a friendly appliance that can actually live inside your home or office.

harsh5455555 · 2014-07-18T05:51:21Z

If u want we can take up the responsibility for INDIA we have a nice infrastructure and server facility also

…ring.

…ed and served from the local server. Refs idno#203

mapkyca · 2015-01-07T13:41:54Z

Bumping this with some notes on corporate environments:

It is quite common for corporate installs to install a SSL proxy with a certificate trusted by all the machines in the network.

Not a great deal coders can do (although folk might want to look at my in browser PGP form submission tool [/plug]). More of a FYI - got very disturbed when, on a visit to a corporate a while ago, I started getting warnings when testing out stuff over curl and started getting massive numbers of cert errors.

mapkyca · 2015-01-07T17:32:59Z

Also, it seems, on airline wifi: https://nakedsecurity.sophos.com/2015/01/07/gogo-forges-youtube-ssl-certificate-to-throttle-high-bandwith-usage-on-flights/

mapkyca · 2015-07-03T09:09:21Z

The IETF have an RFC (https://tools.ietf.org/html/rfc7258) which all internet protocols should factor.

In a nutshell, things should be built with an understanding of selector based surveillance. Largely this means never sending unique identifiers in the clear.

In an ideal world I'd suggest not even using sessions on non TLS connections, but I'm aware that's not practical for the project, at least at this stage.

mapkyca · 2016-01-31T22:20:19Z

Bumping with the info that letsencrypt is now out. Is it possible to integrate with this somehow?

For example, could the installer detect whether it's installed and trigger a generation of cert?

mapkyca · 2016-02-04T17:21:00Z

Also noting that given that TLS is now essentially free, can we revisit the level of nudging and pestering we do?

Modern browsers now start giving in-your-face warnings when a password input field is delivered over a non-TLS connection, with the obvious intention of eventually blocking POSTing of those forms entirely.

Could we start warnings? Perhaps starting with admin logged in warnings on the diagnostics terminal and ramping up from there...?

mapkyca · 2019-06-18T11:58:44Z

Archiving this.

Good discussion, but this is going to be an ongoing thing

benwerd changed the title ~~Harden Idno against new attack landscape #discussion~~ Harden Known against new attack landscape #discussion Jun 15, 2014

benwerd added enhancement labels Jun 15, 2014

mapkyca added a commit to mapkyca/idno that referenced this issue Aug 16, 2014

Refs idno#203: Web service calls can now be passed a proxy connect st…

3d989bb

…ring.

This was referenced Aug 31, 2014

SSL browser warning with commenter images #420

Closed

Cache remote user icons #431

Closed

mapkyca added a commit to mapkyca/idno that referenced this issue Sep 2, 2014

Closes idno#431 & idno#420: Profile icons in webmentions are now cach…

93de5c7

…ed and served from the local server. Refs idno#203

mapkyca mentioned this issue Sep 4, 2014

2FA #434

Closed

mapkyca mentioned this issue Nov 22, 2014

Hosted service doesn't default to https #597

Closed

mapkyca mentioned this issue Feb 15, 2015

Known needs a queue #691

Closed

mapkyca mentioned this issue Mar 13, 2015

https in hub link #789

Merged

mapkyca added a commit to mapkyca/idno that referenced this issue Feb 4, 2016

Refs idno#203: Display diagnostics warning for non TLS connections

2caee1a

mapkyca added the Security label Feb 4, 2016

mapkyca mentioned this issue Oct 29, 2016

Discussion: Block logins and API on non-tls connections #1579

Closed

mapkyca closed this as completed Jun 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harden Known against new attack landscape #discussion #203

Harden Known against new attack landscape #discussion #203

mapkyca commented Mar 24, 2014

mapkyca commented Jun 5, 2014

benwerd commented Jun 15, 2014

mapkyca commented Jun 16, 2014

mapkyca commented Jun 17, 2014

mapkyca commented Jul 14, 2014

mapkyca commented Jul 16, 2014

harsh5455555 commented Jul 17, 2014

benwerd commented Jul 18, 2014

benwerd commented Jul 18, 2014

benwerd commented Jul 18, 2014

harsh5455555 commented Jul 18, 2014

mapkyca commented Jan 7, 2015

mapkyca commented Jan 7, 2015

mapkyca commented Jul 3, 2015

mapkyca commented Jan 31, 2016

mapkyca commented Feb 4, 2016

mapkyca commented Jun 18, 2019

Harden Known against new attack landscape #discussion #203

Harden Known against new attack landscape #discussion #203

Comments

mapkyca commented Mar 24, 2014

mapkyca commented Jun 5, 2014

benwerd commented Jun 15, 2014

mapkyca commented Jun 16, 2014

mapkyca commented Jun 17, 2014

mapkyca commented Jul 14, 2014

mapkyca commented Jul 16, 2014

harsh5455555 commented Jul 17, 2014

benwerd commented Jul 18, 2014

benwerd commented Jul 18, 2014

benwerd commented Jul 18, 2014

harsh5455555 commented Jul 18, 2014

mapkyca commented Jan 7, 2015

mapkyca commented Jan 7, 2015

mapkyca commented Jul 3, 2015

mapkyca commented Jan 31, 2016

mapkyca commented Feb 4, 2016

mapkyca commented Jun 18, 2019