| 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174 | 
Challenges in bringing low-latency stream anonymity to the masses\section{Introduction}We deployed this thing called Tor. it's got all these different types ofusers. it's been backed by navy and eff, and prime and anonymizer looked atit. Because we're this cool, you should believe us when we tell you stuff.In this paper we give the reader an understanding of Tor's contextin the anonymity space and then we go on to describe the variety ofpractical challenges that stand in the way of moving from a practicaluseful network to a practical useful anonymous network.% The goal of the paper is to get the PET-audience reader up to speed% on all the issues we have with Tor, so he can, if he wants,% * understand the technical and policy and legal issues and why they're%   tricky in practice% * help us out with answering some of the technical decisions%   (and in writing it, we'll clarify our own opinions about them)% * help us out with answering some of the anonymity questions\section{What Is Tor}Tor works like this.weasel's graph of # nodes and of bandwidth, ideally from week 0.Tor has the following goals.and we made these assumptions when trying to design the thing.\section{Tor's position in the anonymity field}There are many other classes of systems: single-hop proxies, open proxies,jap, mixminion, flash mixes, freenet, i2p, mute/ants/etc, tarzan,morphmix, freedom. Give brief descriptions and brief characterizationsof how we differ. This is not the breakthrough stuff and we only havea page or two for it.\section{Crossroads}Discuss each item that Tor hasn't solved yet that isn't just codingwork.  Perhaps we'll have so many that we can pick out the best ones todiscuss, so it's a bit less of a laundry list. Maybe they'll even fitinto categories. The trick to making the paper good will be to findthe right balance between going into depth and breadth of coverage.Peer-to-peer / practical issues:Network discovery, sybil, node admission, scaling. It seems that the codewill ship with something and that's our trust root. We could try to getpeople to build a web of trust, but no. Where we go from here dependson what threats we have in mind. Really decentralized if your threat isRIAA; less so if threat is to application data or individuals or...Making use of servers with little bandwidth. How to handle hammering bycertain applications.Handling servers that are far away from the rest of the network, e.g. onthe continents that aren't North America and Europe. High latency,often high packet loss.Running Tor servers behind NATs, behind great-firewalls-of-China, etc.Restricted routes. How to propagate to everybody the topology? BGPstyle doesn't work because we don't want just *one* path. Point toGeoff's stuff.Routing-zones. It seems that our threat model comes down to diversity anddispersal. But hard for Alice to know how to act. Many questions remain.The China problem. We have lots of users in Iran and similar (we stoppedlogging, so it's hard to know now, but many Persian sites on how to useTor), and they seem to be doing ok. But the China problem is bigger. CiteStefan's paper, and talk about how we need to route through clients,and we maybe we should start with a time-release IP publishing system +advogato based reputation system, to bound the number of IPs leaked to theadversary.Policy issues:Bittorrent and dmca. Should we add an IDS to autodetect protocols andsnipe them? Takedowns and efnet abuse and wikipedia complaints and ircnetworks. Should we allow revocation of anonymity if a threshold ofservers want to?Image: substantial non-infringing uses. Image is a security parameter,since it impacts user base and perceived sustainability.Sustainability. Previous attempts have been commercial which we thinkadds a lot of unnecessary complexity and accountability. Freedom didn'tcollect enough money to pay its servers; JAP bandwidth is supported bycontinued money, and they periodically ask what they will do when itdries up.Logging. Making logs not revealing. A happy coincidence that verboselogging is our #2 performance bottleneck. Is there a way to detectmodified servers, or to have them volunteer the information that they'relogging verbosely? Would that actually solve any attacks?Anonymity issues:Transporting the stream vs transporting the packets.The DNS problem in practice.Applications that leak data. We can say they're not our problem, butthey're somebody's problem.How to measure performance without letting people selectively deny serviceby distinguishing pings. Heck, just how to measure performance at all. Inpractice people have funny firewalls that don't match up to their exitpolicies and Tor doesn't deal.Mid-latency. Can we do traffic shape to get any defense against George'sPET2004 paper? Will padding or long-range dummies do anything then? Willit kill the user base or can we get both approaches to play well together?Does running a server help you or harm you? George's Oakland attack.Plausible deniability -- without even running your traffic through Tor! Wehave to pick the path length so adversary can't distinguish client fromserver (how many hops is good?).When does fixing your entry or exit node help you?Helper nodes in the literature don't deal with churn, andespecially active attacks to induce churn.Survivable services are new in practice, yes? Hidden services seemless hidden than we'd like, since they stay in one place and get useda lot. They're the epitome of the need for helper nodes. This meansthat using Tor as a building block for Free Haven is going to be reallyhard. Also, they're brittle in terms of intersection and observationattacks. Would be nice to have hot-swap services, but hard to design.P2P + anonymity issues:Incentives. Copy the page I wrote for the NSF proposal, and maybe extendit if we're feeling smart.Usability: fc03 paper was great, except the lower latency you are theless useful it seems it is.A Tor gui, how jap's gui is nice but does not reflect the securitythey provide.Public perception, and thus advertising, is a security parameter.Network investigation: Is all this bandwidth publishing thing a good idea?How can we collect stats better? Note weasel's smokeping, athttp://seppia.noreply.org/cgi-bin/smokeping.cgi?target=Torwhich probably gives george and steven enough info to break tor?Do general DoS attacks have anonymity implications? See e.g. AdamBack's IH paper, but I think there's more to be pointed out here.% need to do somewhere in the paper:have a serious discussion of morphmix's assumptions, since they wouldseem to be the direct competition. in fact tor is a flexible architecturethat would encompass morphmix, and they're nearly identical except forpath selection and node discovery. and the trust system morphmix hasseems overkill (and/or insecure) based on the threat model we've picked.need to discuss how we take the approach of building the thing, and thenassuming that, how much anonymity can we get. we're not here to model orto simulate or to produce equations and formulae. but those have theirroles too.
 |