TODO 12 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294
  1. Legend:
  2. SPEC!! - Not specified
  3. SPEC - Spec not finalized
  4. NICK - nick claims
  5. ARMA - arma claims
  6. - Not done
  7. * Top priority
  8. . Partially done
  9. o Done
  10. D Deferred
  11. X Abandoned
  12. For 0.0.9:
  13. N&R. bring tor-spec up to date
  14. o cache and serve running-routers on other nodes?
  15. o cache running-routers
  16. o download running-routers from servers running rc5-cvs or later
  17. o pump up periods for fetching things; figure out how to do this
  18. backward-compatibily, so that people who did set dirfetchpostperiod
  19. get the right behavior.
  20. o If dirport is set, we should have a maximum dirfetchperiod and
  21. a maximum statusfetchperiod, or else we'll serve very stale stuff.
  22. o Adapt version parsing code to handle new version scheme; document new
  23. version scheme.
  24. N&R. make loglevels info,debug less noisy
  25. R - fix dfc/weasel's intro point bug
  26. R - add goodell's .exit tld
  27. Beyond 0.0.9:
  28. - server descriptor declares min log level, clients avoid servers
  29. that are too loggy.
  30. N - Clean up NT service code
  31. N - OS X package (and bundle?)
  32. - controller should have 'getinfo' command to query about rephist,
  33. about rendezvous status, etc.
  34. - allow transition from ORPort to !ORPort, and back
  35. R . bandwidth buckets for write as well as read.
  36. - Limit to 2 dir, 2 OR, N SOCKS connections per IP.
  37. - Implement If-Modified-Since for directories.
  38. - Make more configuration variables into CSVs.
  39. N - Handle rendezvousing with unverified nodes.
  40. - Specify: Stick rendezvous point's key in INTRODUCE cell.
  41. Bob should _always_ use key from INTRODUCE cell.
  42. - Implement.
  43. R - figure out enclaves, e.g. so we know what to recommend that people
  44. do, and so running a tor server on your website is helpful.
  45. - Do enclaves for same IP only.
  46. - Resolve first, then if IP is an OR, connect to next guy.
  47. N . the user interface interface
  48. - Implement a trivial fun gui.
  49. N - add ipv6 support.
  50. - Spec issue: if a resolve returns an IP4 and an IP6 address,
  51. which to use?
  52. N&R - Update Spec
  53. R X learn from ben about his openssl-reinitialization-trick to
  54. rotate tls keys without making new connections.
  55. - Do something to prevent spurious EXTEND cells from making middleman
  56. nodes connect all over. Rate-limit failed connections, perhaps?
  57. - christian grothoff's attack of infinite-length circuit.
  58. the solution is to have a separate 'extend-data' cell type
  59. which is used for the first N data cells, and only
  60. extend-data cells can be extend requests.
  61. - have a pool of circuits available, cannibalize them
  62. for your purposes (e.g. rendezvous, etc).
  63. - Once we have a trusted directory on port 80, stop falling back to
  64. forbidden ports when fascistfirewall blocks all good dirservers.
  65. o fix sprintf's to snprintf's?
  66. . Make intro points and rendezvous points accept $KEYID in addition
  67. to nicknames.
  68. o Specify
  69. o Implement parsing
  70. - Generate new formats (Not till 007 is dead)
  71. - Facility to automatically choose long-term helper nodes; perhaps
  72. on by default for hidden services.
  73. o Make command-line strict about checking options; make only certain
  74. option prefixes work.
  75. - Rate-limit OR and directory connections overall and per-IP and
  76. maybe per subnet.
  77. D put expiry date on onion-key, so people don't keep trying
  78. old ones that they could know are expired?
  79. * Leave on todo list, see if pre3 onion fixes helped enough.
  80. D should the running-routers list put unverified routers at the
  81. end?
  82. * Cosmetic, don't do it yet.
  83. D make advertised_server_mode() ORs fetch dirs more often.
  84. * not necessary yet.
  85. D Add a notion of nickname->Pubkey binding that's not 'verification'
  86. * eventually, only when needed
  87. D ORs use uniquer default nicknames
  88. * Don't worry about this for now
  89. D Handle full buffers without totally borking
  90. * do this eventually, no rush.
  91. D if destination IP is running a tor node, extend a circuit there
  92. before sending begin.
  93. * don't do this for now. figure out how enclaves work. but do
  94. enclaves soon.
  95. - Support egd or other non-OS-integrated strong entropy sources
  96. more features, complex:
  97. - password protection for on-disk identity key
  98. - Have clients and dirservers preserve reputation info over
  99. reboots.
  100. * continue not doing until we have something we need to preserve
  101. - round detected bandwidth up to nearest 10KB?
  102. - client software not upload descriptor until:
  103. - you've been running for an hour
  104. - it's sufficiently satisfied with its bandwidth
  105. - it decides it is reachable
  106. - start counting again if your IP ever changes.
  107. - never regenerate identity keys, for now.
  108. - you can set a bit for not-being-an-OR.
  109. * no need to do this yet. few people define their ORPort.
  110. - authdirserver lists you as running iff:
  111. - he can connect to you
  112. - he has successfully extended to you
  113. - you have sufficient mean-time-between-failures
  114. * keep doing nothing for now.
  115. - Include HTTP status messages in logging (see parse_http_response).
  116. blue sky:
  117. - Possible to get autoconf to easily install things into ~/.tor?
  118. ongoing:
  119. . rename/rearrange functions for what file they're in
  120. - generalize our transport: add transport.c in preparation for
  121. http, airhook, etc transport.
  122. o investigate sctp for alternate transport.
  123. For September:
  124. N . Windows port
  125. o works as client
  126. - deal with pollhup / reached_eof on all platforms
  127. . robust as a client
  128. . works as server
  129. - can be configured
  130. - robust as a server
  131. . Usable as NT service
  132. - docs for building in win
  133. o installer, including all needed libs.
  134. - and including privoxy
  135. - and including a sockscap equivalent
  136. - Docs
  137. . FAQ
  138. - a howto tutorial with examples
  139. * put a stub on the wiki
  140. o tutorial: how to set up your own tor network
  141. o (need to not hardcode dirservers file in config.c)
  142. o Make tutorial reflect this.
  143. . port forwarding howto for ipchains, etc
  144. . correct, update, polish spec
  145. - document the exposed function api?
  146. - Document where we differ from tor-design
  147. . packages
  148. . find a long-term rpm maintainer
  149. - code
  150. - better warn/info messages
  151. - write howto for setting up tsocks, socat.
  152. - including on osx and win32
  153. - freecap handling
  154. - tsocks
  155. o gather patches, submit to maintainer
  156. * send him a reminder mail and see what's up.
  157. - intercept gethostbyname and others
  158. * add this to tsocks
  159. o do resolve via tor
  160. - redesign and thorough code revamp, with particular eye toward:
  161. - support half-open tcp connections
  162. - conn key rotation
  163. - other transports -- http, airhook
  164. - modular introduction mechanism
  165. - allow non-clique topology
  166. Other details and small and hard things:
  167. - tor should be able to have a pool of outgoing IP addresses
  168. that it is able to rotate through. (maybe)
  169. - tie into squid
  170. - hidserv offerers shouldn't need to define a SocksPort
  171. * figure out what breaks for this, and do it.
  172. - when the client fails to pick an intro point for a hidserv,
  173. it should refetch the hidserv desc.
  174. . should maybe make clients exit(1) when bad things happen?
  175. e.g. clock skew.
  176. - should retry exitpolicy end streams even if the end cell didn't
  177. resolve the address for you
  178. o Make logs handle it better when writing to them fails.
  179. o Dirserver shouldn't put you in running-routers list if you haven't
  180. uploaded a descriptor recently
  181. . Refactor: add own routerinfo to routerlist. Right now, only
  182. router_get_by_nickname knows about 'this router', as a hack to
  183. get circuit_launch_new to do the right thing.
  184. . Scrubbing proxies
  185. - Find an smtp proxy?
  186. . Get socks4a support into Mozilla
  187. - Need a relay teardown cell, separate from one-way ends.
  188. - Make it harder to circumvent bandwidth caps: look at number of bytes
  189. sent across sockets, not number sent inside TLS stream.
  190. - fix router_get_by_* functions so they can get ourselves too,
  191. and audit everything to make sure rend and intro points are
  192. just as likely to be us as not.
  193. ***************************Future tasks:****************************
  194. Rendezvous and hidden services:
  195. make it fast:
  196. - preemptively build and start rendezvous circs.
  197. - preemptively build n-1 hops of intro circs?
  198. - cannibalize general circs?
  199. make it reliable:
  200. - standby/hotswap/redundant services.
  201. - store stuff to disk? dirservers forget service descriptors when
  202. they restart; nodes offering hidden services forget their chosen
  203. intro points when they restart.
  204. make it robust:
  205. - auth mechanisms to let midpoint and bob selectively choose
  206. connection requests.
  207. make it scalable:
  208. - right now the hidserv store/lookup system is run by the dirservers;
  209. this won't scale.
  210. Tor scalability:
  211. Relax clique assumptions.
  212. Redesign how directories are handled.
  213. o Separate running-routers lookup from descriptor list lookup.
  214. - Resolve directory agreement somehow.
  215. o Cache directory on all servers.
  216. Find and remove bottlenecks
  217. - Address linear searches on e.g. circuit and connection lists.
  218. Reputation/memory system, so dirservers can measure people,
  219. and so other people can verify their measurements.
  220. - Need to measure via relay, so it's not distinguishable.
  221. Bandwidth-aware path selection. So people with T3's are picked
  222. more often than people with DSL.
  223. Reliability-aware node selection. So people who are stable are
  224. preferred for long-term circuits such as intro and rend circs,
  225. and general circs for irc, aim, ssh, etc.
  226. Let dissidents get to Tor servers via Tor users. ("Backbone model")
  227. Anonymity improvements:
  228. Is abandoning the circuit the only option when an extend fails, or
  229. can we do something without impacting anonymity too much?
  230. Is exiting from the middle of the circuit always a bad idea?
  231. Helper nodes. Decide how to use them to improve safety.
  232. DNS resolution: need to make tor support resolve requests. Need to write
  233. a script and an interface (including an extension to the socks
  234. protocol) so we can ask it to do resolve requests. Need to patch
  235. tsocks to intercept gethostbyname, else we'll continue leaking it.
  236. Improve path selection algorithms based on routing-zones paper. Be sure
  237. to start and end circuits in different ASs. Ideally, consider AS of
  238. source and destination -- maybe even enter and exit via nearby AS.
  239. Intermediate model, with some delays and mixing.
  240. Add defensive dropping regime?
  241. Make it more correct:
  242. Handle half-open connections: right now we don't support all TCP
  243. streams, at least according to the protocol. But we handle all that
  244. we've seen in the wild.
  245. Support IPv6.
  246. Efficiency/speed/robustness:
  247. Congestion control. Is our current design sufficient once we have heavy
  248. use? Need to measure and tweak, or maybe overhaul.
  249. Allow small cells and large cells on the same network?
  250. Cell buffering and resending. This will allow us to handle broken
  251. circuits as long as the endpoints don't break, plus will allow
  252. connection (tls session key) rotation.
  253. Implement Morphmix, so we can compare its behavior, complexity, etc.
  254. Use cpuworker for more heavy lifting.
  255. - Signing (and verifying) hidserv descriptors
  256. - Signing (and verifying) intro/rend requests
  257. - Signing (and verifying) router descriptors
  258. - Signing (and verifying) directories
  259. - Doing TLS handshake (this is very hard to separate out, though)
  260. Buffer size pool: allocate a maximum size for all buffers, not
  261. a maximum size for each buffer. So we don't have to give up as
  262. quickly (and kill the thickpipe!) when there's congestion.
  263. Exit node caching: tie into squid or other caching web proxy.
  264. Other transport. HTTP, udp, rdp, airhook, etc. May have to do our own
  265. link crypto, unless we can bully openssl into it.
  266. P2P Tor:
  267. Do all the scalability stuff above, first.
  268. Incentives to relay. Not so hard.
  269. Incentives to allow exit. Possibly quite hard.
  270. Sybil defenses without having a human bottleneck.
  271. How to gather random sample of nodes.
  272. How to handle nodelist recommendations.
  273. Consider incremental switches: a p2p tor with only 50 users has
  274. different anonymity properties than one with 10k users, and should
  275. be treated differently.