20 years ago · cb97bf8c06
--- a/doc/tor-design.tex
+++ b/doc/tor-design.tex
@@ -1351,11 +1351,6 @@ acknowledge his existence.
 
															 \Section{Attacks and Defenses}
														
 
															 \label{sec:attacks}
														
 
															-% XXX In sec9 we should talk about bandwidth classes, which will
														
 
															-%     enable us to accept a lot more ORs than if we continue to
														
 
															-%     require 10mbit connections for all ORs. -RD
														
 
															-
														
 
															-  
														
 
															 Below we summarize a variety of attacks, and discuss how well our
														
 
															 design withstands them.
														
@@ -1647,236 +1642,157 @@ by the session key shared by the client and server.
 
															 \Section{Open Questions in Low-latency Anonymity}
														
 
															 \label{sec:maintaining-anonymity}
														
 
															-% There must be a better intro than this! -NM
														
 
															 In addition to the open problems discussed in
														
 
															-Section~\ref{subsec:non-goals}, many other questions remain to be
														
 
															-solved by future research before we can be confident of our security.
														
 
															-
														
 
															-Many of these open issues are questions of balance.  For example,
														
 
															-how often should users rotate to fresh circuits?  Too-frequent
														
 
															-rotation is inefficient, expensive, and may lead to intersection attacks
														
 
															-and predecessor attacks \cite{wright03},
														
 
															-but too-infrequent rotation
														
 
															-makes the user's traffic linkable.   Along with opening a fresh
														
 
															-circuit, clients can also limit linkability by exiting from a middle point
														
 
															-of the circuit, or by truncating and re-extending the circuit; but
														
 
															-more analysis is needed to determine the proper trade-off.
														
 
															-
														
 
															-A similar question surrounds timing of directory operations:
														
 
															-how often should directories be updated?  With too-infrequent
														
 
															-updates clients receive an inaccurate picture of the network; with
														
 
															-too-frequent updates the directory servers are overloaded.
														
 
															-
														
 
															-%do different exit policies at different exit nodes trash anonymity sets,
														
 
															-%or not mess with them much?
														
 
															-%% Why would they?  By routing traffic to certain nodes preferentially?
														
 
															-
														
 
															-%[XXX Choosing paths and path lengths: I'm not writing this bit till
														
 
															-%  Arma's pathselection stuff is in. -NM]
														
 
															-%Alice always chooses her path to contain at least
														
 
															-%three nodes unrelated to herself and her destination, choosing the
														
 
															-%number of nodes beyond the third from a geometric distribution;
														
 
															-%explain why. -NM
														
 
															-
														
 
															-%%%% Roger said that he'd put a path selection paragraph into section
														
 
															-%%%% 4 that would replace this.
														
 
															-%
														
 
															-%I probably should have noted that this means loops will be on at least
														
 
															-%five hop routes, which should be rare given the distribution.  I'm    
														
 
															-%realizing that this is reproducing some of the thought that led to a  
														
 
															-%default of five hops in the original onion routing design.  There were
														
 
															-%some different assumptions, which I won't spell out now.  Note that   
														
 
															-%enclave level protections really change these assumptions.  If most   
														
 
															-%circuits are just two hops, then just a single link observer will be  
														
 
															-%able to tell that two enclaves are communicating with high probability.
														
 
															-%So, it would seem that enclaves should have a four node minimum circuit
														
 
															-%to prevent trivial circuit insider identification of the whole circuit,
														
 
															-%and three hop minimum for circuits from an enclave to some nonclave    
														
 
															-%responder. But then... we would have to make everyone obey these rules 
														
 
															-%or a node that through timing inferred it was on a four hop circuit    
														
 
															-%would know that it was probably carrying enclave to enclave traffic.   
														
 
															-%Which... if there were even a moderate number of bad nodes in the      
														
 
															-%network would make it advantageous to break the connection to conduct  
														
 
															-%a reformation intersection attack. Ahhh! I gotta stop thinking         
														
 
															-%about this and work on the paper some before the family wakes up.  
														
 
															-%On Sat, Oct 25, 2003 at 06:57:12AM -0400, Paul Syverson wrote:
														
 
															-%> Which... if there were even a moderate number of bad nodes in the
														
 
															-%> network would make it advantageous to break the connection to conduct
														
 
															-%> a reformation intersection attack. Ahhh! I gotta stop thinking
														
 
															-%> about this and work on the paper some before the family wakes up. 
														
 
															-%This is the sort of issue that should go in the 'maintaining anonymity
														
 
															-%with tor' section towards the end. :)
														
 
															-%Email from between roger and me to beginning of section above. Fix and move.
														
 
															+Section~\ref{subsec:non-goals}, many other questions must be solved
														
 
															+before we can be confident of Tor's security.
														
 
															+
														
 
															+Many of these open issues are questions of balance. For example,
														
 
															+how often should users rotate to fresh circuits? Frequent rotation
														
 
															+is inefficient, expensive, and may lead to intersection attacks and
														
 
															+predecessor attacks \cite{wright03}, but infrequent rotation makes the
														
 
															+user's traffic linkable. Along with opening a fresh circuit, clients can
														
 
															+also limit linkability by exiting from a middle point of the circuit,
														
 
															+or by truncating and re-extending the circuit; but more analysis is
														
 
															+needed to determine the proper trade-off.
														
 
															+
														
 
															+A similar question surrounds timing of directory operations: how often
														
 
															+should directories be updated?  Clients that update infrequently receive
														
 
															+an inaccurate picture of the network, but frequent updates can overload
														
 
															+the directory servers. More generally, we must find more
														
 
															+decentralized yet practical ways to distribute up-to-date snapshots of
														
 
															+network status without introducing new attacks.
														
 
															+
														
 
															+How should we choose path lengths? If she uses only two hops, then both
														
 
															+these nodes are certain that by colluding they will learn about Alice
														
 
															+and Bob. Our current approach is that Alice always chooses at least three
														
 
															+nodes unrelated to herself and her destination. Thus normally she chooses
														
 
															+three nodes, but if she is running an OR and her destination is on an OR,
														
 
															+she uses five. Should Alice choose a nondeterministic path length (say,
														
 
															+increasing it from a geometric distribution), to foil an attacker who
														
 
															+uses timing to learn that he is the fifth hop and thus concludes that
														
 
															+both Alice and the responder are on ORs?
														
 
															 Throughout this paper, we have assumed that end-to-end traffic
														
 
															 confirmation will immediately and automatically defeat a low-latency
														
 
															-anonymity system. Even high-latency anonymity
														
 
															-systems can be vulnerable to end-to-end traffic confirmation, if the
														
 
															-traffic volumes are high enough, and if users' habits are sufficiently
														
 
															-distinct \cite{limits-open,statistical-disclosure}.  \emph{Can
														
 
															-  anything be done to make low-latency systems resist these attacks as
														
 
															-  well as high-latency systems?}
														
 
															-Tor already makes some effort to conceal the starts and
														
 
															-ends of streams by wrapping all long-range control commands in
														
 
															-identical-looking relay cells, but more analysis is needed.  Link
														
 
															-padding could frustrate passive observers who count packets; long-range
														
 
															-padding could work against observers who own the first hop in a
														
 
															-circuit.  But more research needs to be done in order to find an
														
 
															-efficient and practical approach.  Volunteers prefer not to run
														
 
															-constant-bandwidth padding; but more sophisticated traffic shaping
														
 
															-approaches remain somewhat unanalyzed. 
														
 
															-%[XXX is this so?] 
														
 
															-Recent work
														
 
															-on long-range padding \cite{defensive-dropping} shows promise.  One
														
 
															-could also try to reduce correlation in packet timing by batching and
														
 
															-re-ordering packets, but it is unclear whether this could improve
														
 
															-anonymity without introducing so much latency as to render the
														
 
															+anonymity system. Even high-latency anonymity systems can be
														
 
															+vulnerable to end-to-end traffic confirmation, if the traffic volumes
														
 
															+are high enough, and if users' habits are sufficiently distinct
														
 
															+\cite{limits-open,statistical-disclosure}. Can anything be done to
														
 
															+make low-latency systems resist these attacks as well as high-latency
														
 
															+systems? Tor already makes some effort to conceal the starts and ends of
														
 
															+streams by wrapping all long-range control commands in identical-looking
														
 
															+relay cells. Link padding could frustrate passive observers who count
														
 
															+packets; long-range padding could work against observers who own the
														
 
															+first hop in a circuit. But more research remains to find an efficient
														
 
															+and practical approach. Volunteers prefer not to run constant-bandwidth
														
 
															+padding; but no convincing traffic shaping approach has ever been
														
 
															+specified. Recent work on long-range padding \cite{defensive-dropping}
														
 
															+shows promise. One could also try to reduce correlation in packet timing
														
 
															+by batching and re-ordering packets, but it is unclear whether this could
														
 
															+improve anonymity without introducing so much latency as to render the
														
 
															 network unusable.
														
 
															-Even if passive timing attacks were wholly solved, active timing
														
 
															-attacks would remain.  \emph{What can
														
 
															-  be done to address attackers who can introduce timing patterns into
														
 
															-  a user's traffic?}  % [XXX mention likely approaches]
														
 
															-
														
 
															-%%% I think we cover this by framing the problem as ``Can we make 
														
 
															-%%% end-to-end characteristics of low-latency systems as good as
														
 
															-%%% those of high-latency systems?''  Eliminating long-term
														
 
															-%%% intersection is a hard problem.
														
 
															-%
														
 
															-%Even regardless of link padding from Alice to the cloud, there will be
														
 
															-%times when Alice is simply not online. Link padding, at the edges or
														
 
															-%inside the cloud, does not help for this.
														
 
															-
														
 
															-In order to scale to many users, and to prevent an
														
 
															-attacker from observing the whole network at once, it may be necessary
														
 
															-for low-latency anonymity systems to support far more servers than Tor
														
 
															-currently anticipates.  This introduces several issues.  First, if
														
 
															-approval by a centralized set of directory servers is no longer
														
 
															-feasible, what mechanism should be used to prevent adversaries from
														
 
															-signing up many spurious servers? 
														
 
															-Second, if clients can no longer have a complete
														
 
															-picture of the network at all times, how can they perform
														
 
															-discovery while preventing attackers from manipulating or exploiting
														
 
															-gaps in client knowledge?  Third, if there are too many servers
														
 
															-for every server to constantly communicate with every other, what kind
														
 
															-of non-clique topology should the network use?   Restricted-route
														
 
															-topologies promise comparable anonymity with better scalability
														
 
															-\cite{danezis-pets03}, but whatever topology we choose, we need some
														
 
															-way to keep attackers from manipulating their position within it.
														
 
															-Fourth, since no centralized authority is tracking server reliability,
														
 
															-How do we prevent unreliable servers from rendering the network
														
 
															-unusable?  Fifth, do clients receive so much anonymity benefit from
														
 
															-running their own servers that we should expect them all to do so, or
														
 
															-do we need to find another incentive structure to motivate them?
														
 
															-(Tarzan and MorphMix present possible solutions.)
														
 
															-
														
 
															-% [[ XXX how to approve new nodes (advogato, sybil, captcha (RTT));]
														
 
															-
														
 
															-Alternatively, it may be the case that one of these problems proves
														
 
															-intractable, or that the drawbacks to many-server systems prove
														
 
															-greater than the benefits.  Nevertheless, we may still do well to
														
 
															-consider non-clique topologies.  A cascade topology may provide more
														
 
															-defense against traffic confirmation.
														
 
															-% XXX Why would it?   Cite.  -NM
														
 
															-%
														
 
															-% Huh? Do you mean for simple attacks just because of larger anonymity
														
 
															-% sets? -PS
														
 
															-Does the hydra topology (many input nodes, few output nodes) work
														
 
															-better? Are we going to get a hydra anyway because most nodes will be
														
 
															-middleman nodes?
														
 
															-
														
 
															-As mentioned in Section~\ref{subsec:dos}, Tor could improve its
														
 
															-robustness against node failure by buffering transmitted stream data
														
 
															-at the network's edges until the data has been acknowledged by the
														
 
															-other end of the stream.  The efficacy of this approach remains to be
														
 
															-tested, however, and there may be more effective means for ensuring
														
 
															-reliable connections in the presence of unreliable nodes.
														
 
															-
														
 
															-%%% Keeping this original paragraph for a little while, since it 
														
 
															-%%% is not the same as what's written there now.
														
 
															-%
														
 
															-%Because Tor depends on TLS and TCP to provide a reliable transport,
														
 
															-%when one of the servers goes down, all the circuits (and thus streams)
														
 
															-%traveling over that server must break.  This reduces anonymity because
														
 
															-%everybody needs to reconnect right then (does it? how much?)  and
														
 
															-%because exit connections all break at the same time, and it also harms
														
 
															-%usability. It seems the problem is even worse in a peer-to-peer
														
 
															-%environment, because so far such systems don't really provide an
														
 
															-%incentive for nodes to stay connected when they're done browsing, so
														
 
															-%we would expect a much higher churn rate than for onion routing.
														
 
															-%there ways of allowing streams to survive the loss of a node in the
														
 
															-%path?
														
 
															-
														
 
															-% Roger or Paul suggested that we say something about incentives,
														
 
															-% too, but I think that's a better candidate for our future work
														
 
															-% section.  After all, we will doubtlessly learn very much about why
														
 
															-% people do or don't run and use Tor in the near future. -NM
														
 
															-
														
 
															-%We should run a squid at each exit node, to provide comparable anonymity
														
 
															-%to private exit nodes for cache hits, to speed everything up, and to
														
 
															-%have a buffer for funny stuff coming out of port 80.
														
 
															-% on the other hand, it hampers PFS, because ORs have pages in the cache.
														
 
															-%I previously elsewhere suggested bulk transfer proxies to carve
														
 
															-%up big things so that they could be downloaded in less noticeable
														
 
															-%pieces over several normal looking connections. We could suggest
														
 
															-%similarly one or a handful of squid nodes that might serve up
														
 
															-%some of the more sensitive but common material, especially if
														
 
															-%the relevant sites didn't want to or couldn't run their own OR.
														
 
															-%This would be better than having everyone run a squid which would
														
 
															-%just help identify after the fact the different history of that
														
 
															-%node's activity. All this kind of speculation needs to move to
														
 
															-%future work section I guess. -PS]
														
 
															+Common wisdom suggests that Alice should run her own onion router for best
														
 
															+anonymity, because traffic coming through her node could plausibly have
														
 
															+come from elsewhere. How much mixing do we need before this is actually
														
 
															+effective, or is it immediately beneficial because many real-world
														
 
															+adversaries won't be able to observe Alice's router?
														
 
															+
														
 
															+To scale to many users, and to prevent an attacker from observing the
														
 
															+whole network at once, it may be necessary for low-latency anonymity
														
 
															+systems to support far more servers than Tor currently anticipates.
														
 
															+This introduces several issues.  First, if approval by a centralized set
														
 
															+of directory servers is no longer feasible, what mechanism should be used
														
 
															+to prevent adversaries from signing up many colluding servers? Second,
														
 
															+if clients can no longer have a complete picture of the network at all
														
 
															+times, how can they perform discovery while preventing attackers from
														
 
															+manipulating or exploiting gaps in client knowledge?  Third, if there
														
 
															+are too many servers for every server to constantly communicate with
														
 
															+every other, what kind of non-clique topology should the network use?
														
 
															+Restricted-route topologies promise comparable anonymity with better
														
 
															+scalability \cite{danezis-pets03}, but whatever topology we choose, we
														
 
															+need some way to keep attackers from manipulating their position within
														
 
															+it \cite{casc-rep}. Fourth, since no centralized authority is tracking
														
 
															+server reliability, How do we prevent unreliable servers from rendering
														
 
															+the network unusable?  Fifth, do clients receive so much anonymity benefit
														
 
															+from running their own servers that we should expect them all to do so
														
 
															+\cite{econymics}, or do we need to find another incentive structure to
														
 
															+motivate them?  Tarzan and MorphMix present possible solutions.
														
 
															+
														
 
															+% advogato, captcha
														
 
															+
														
 
															+A cascade topology with long-range padding and mixing may provide more
														
 
															+defense against traffic confirmation against a large adversary, because
														
 
															+it aggregates many users. Does the hydra topology (many input nodes,
														
 
															+few output nodes) work better against some adversaries? Are we going to
														
 
															+get a hydra anyway because most nodes will be middleman nodes?
														
 
															+
														
 
															+When a Tor node goes down, all its circuits (and thus streams) must break.
														
 
															+Do users abandon the system because of this brittleness? How well
														
 
															+does the method in Section~\ref{subsec:dos} allow streams to survive
														
 
															+node failure? If affected users rebuild circuits immediately, how much
														
 
															+anonymity is lost? It seems the problem is even worse in a peer-to-peer
														
 
															+environment---so far such systems don't provide an incentive for peers to
														
 
															+stay connected when they're done retrieving content, so we would expect
														
 
															+a higher churn rate.
														
 
															 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
														
 
															 \Section{Future Directions}
														
 
															 \label{sec:conclusion}
														
 
															-Tor brings together many innovations into
														
 
															-a unified deployable system. But there are still several attacks that
														
 
															-work quite well, as well as a number of sustainability and run-time
														
 
															-issues remaining to be ironed out. In particular:
														
 
															-
														
 
															-% Many of these (Scalability, cover traffic, morphmix) 
														
 
															-% are duplicates from open problems.
														
 
															-%
														
 
															-
														
 
															-\emph{Scalability:} Tor's emphasis on design simplicity and
														
 
															-deployability has led us to adopt a clique topology, a
														
 
															-semi-centralized model for directories and trusts, and a
														
 
															-full-network-visibility model for client knowledge.  None of these
														
 
															-properties will scale to more than a few hundred servers.
														
 
															-Promising approaches to better scalability exist (see
														
 
															-Section~\ref{sec:maintaining-anonymity}), but more deployment
														
 
															-experience would be helpful in learning the relative importance of
														
 
															-these bottlenecks.
														
 
															-
														
 
															-\emph{Incentives:} Volunteers may want to run nodes for publicity
														
 
															-or better anonymity \cite{econymics}. 
														
 
															-more users -> more anonymity
														
 
															-
														
 
															-\emph{Cover traffic:} Currently we avoid cover traffic because
														
 
															-whereas its costs in performance and bandwidth are clear, and because its
														
 
															-security benefits are not well understood. With more research
														
 
															-\cite{SS03,defensive-dropping}, this price/value ratio may change,
														
 
															-both for link-level cover traffic and also long-range cover traffic.
														
 
															-
														
 
															-\emph{Better directory distribution:} Even with the threshold
														
 
															-directory agreement algorithm described in Section~\ref{subsec:dirservers},
														
 
															-directory distribution is still performance-critical. We must find more
														
 
															-decentralized yet practical ways to distribute up-to-date snapshots of
														
 
															-network status without introducing new attacks.  Also, directory
														
 
															-retrieval presents a scaling problem, since clients currently
														
 
															-download a description of the entire network state every 15
														
 
															-minutes.  As the state grows larger and clients more numerous, we
														
 
															-may need to move to a solution in which clients only receive
														
 
															-incremental updates to directory state.
														
 
															-
														
 
															-\emph{Implementing location-hidden servers:} While
														
 
															-Section~\ref{sec:rendezvous} describes a design for rendezvous
														
 
															-points and location-hidden servers, these features have not yet been
														
 
															-implemented.  While doing so we are likely to encounter additional
														
 
															-issues that must be resolved, both in terms of usability and anonymity.
														
 
															+Tor brings together many innovations into a unified deployable system. The
														
 
															+immediate next steps include:
														
 
															+
														
 
															+\emph{Scalability:} Tor's emphasis on design simplicity and deployability
														
 
															+has led us to adopt a clique topology, a semi-centralized model for
														
 
															+directories and trusts, and a full-network-visibility model for client
														
 
															+knowledge. These properties will not scale past a few hundred servers.
														
 
															+Section~\ref{sec:maintaining-anonymity} describes some promising
														
 
															+approaches, but more deployment experience will be helpful in learning
														
 
															+the relative importance of these bottlenecks.
														
 
															+
														
 
															+\emph{Bandwidth classes:} In this paper we assume all onion routers have
														
 
															+good bandwidth and latency. We should adapt the Morphmix model,
														
 
															+where nodes advertise their bandwidth level (DSL, T1, T3), and
														
 
															+Alice avoids bottlenecks in her path by choosing nodes that match or
														
 
															+exceed her bandwidth. In this way DSL users can join the Tor network.
														
 
															+
														
 
															+\emph{Incentives:} Volunteers who run nodes are rewarded with publicity
														
 
															+and possibly better anonymity \cite{econymics}. More nodes means increased
														
 
															+scalability, and more users can mean more anonymity. We need to continue
														
 
															+examining the incentive structures for participating in Tor.
														
 
															+
														
 
															+\emph{Cover traffic:} Currently Tor avoids cover traffic because its costs
														
 
															+in performance and bandwidth are clear, whereas its security benefits are
														
 
															+not well-understood. We must pursue more research on both link-level cover
														
 
															+traffic and long-range cover traffic to determine some simple padding
														
 
															+schemes that offer provable protection against our chosen adversary.
														
 
															+
														
 
															+%%\emph{Offer two relay cell sizes:} Traffic on the Internet tends to be
														
 
															+%%large for bulk transfers and small for interactive traffic. One cell
														
 
															+%%size cannot be optimal for both types of traffic.
														
 
															+% This should go in the spec and todo, but not the paper yet. -RD
														
 
															+
														
 
															+\emph{Caching at exit nodes:} We should run a caching web proxy at each
														
 
															+exit node, to provide anonymity for cached pages (Alice's request never
														
 
															+leaves the Tor network), to improve speed, and to reduce bandwidth cost.
														
 
															+%XXX and to have a layer to block to block funny stuff out of port 80.
														
 
															+% is that a useful thing to say?
														
 
															+On the other hand, forward security is weakened because routers have the
														
 
															+pages in their cache. We must find the right balance between usability
														
 
															+and security.
														
 
															+
														
 
															+\emph{Better directory distribution:} Directory retrieval presents
														
 
															+a scaling problem, since clients currently download a description of
														
 
															+the entire network state every 15 minutes. As the state grows larger
														
 
															+and clients more numerous, we may need to move to a solution in which
														
 
															+clients only receive incremental updates to directory state.
														
 
															+
														
 
															+\emph{Implement location-hidden services:} The design in
														
 
															+Section~\ref{sec:rendezvous} has not yet been implemented.  While doing
														
 
															+so we are likely to encounter additional issues that must be resolved,
														
 
															+both in terms of usability and anonymity.
														
 
															 \emph{Further specification review:} Although we have a public,
														
 
															 byte-level specification for the Tor protocols, this document has
														
@@ -1889,7 +1805,6 @@ designer of MorphMix to make the common elements of our two systems
 
															 share a common specification and implementation. So far, this seems
														
 
															 to be relatively straightforward.  Interoperability will allow testing
														
 
															 and direct comparison of the two designs for trust and scalability.
														
 
															-% XXXX Bandwidth classes.
														
 
															 \emph{Wider-scale deployment:} The original goal of Tor was to
														
 
															 gain experience in deploying an anonymizing overlay network, and
														
@@ -1900,7 +1815,6 @@ able to evaluate some of our design decisions, including our
 
															 robustness/latency trade-offs, our performance trade-offs (including
														
 
															 cell size), our abuse-prevention mechanisms, and
														
 
															 our overall usability.
														
 
															-% XXX large and small cells on same network.
														
 
															 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%