TODO 14 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333
  1. Legend:
  2. SPEC!! - Not specified
  3. SPEC - Spec not finalized
  4. NICK - nick claims
  5. ARMA - arma claims
  6. - Not done
  7. * Top priority
  8. . Partially done
  9. o Done
  10. D Deferred
  11. X Abandoned
  12. 0.0.8:
  13. - fix sprintf's to snprintf's?
  14. . Make it work on win32 with no $home
  15. o Don't crash.
  16. NICK - Put files someplace reasonable.
  17. o Why is the first entry of kill -USR1 a router with a 0 key?
  18. ARMA . Tors deal appropriately when a newly-verified router has the
  19. same nickname as another router they know about
  20. NICK . put ip:port:keyhash in intro points, rendezvous points,
  21. and hidserv descriptors.
  22. o Specify
  23. - Implement parsing
  24. - Generate new formats (Not till 007 is dead)
  25. NICK . unify similar config entries that need to be split. put them
  26. into a smartlist, and have things take a smartlist.
  27. - figure out what to do when somebody asks to extend to
  28. ip:port:differentkey
  29. * reject it. assuming this is as dumb as it sounds.
  30. - make loglevel info less noisy
  31. o if there's only one entrynode preference, don't pick the
  32. desired entrynode as exit.
  33. o "AllowUnverifiedRouters" config option
  34. o Parse it into 3 bits
  35. o Consider it when picking nodes for your path
  36. o 'fascistfirewall' option to pick dirservers on port 80 and
  37. ORs on port 443.
  38. o extend it to take a range of ports
  39. o parse uptime into router->uptime
  40. o Handle servers with dynamic IP addresses: don't replace
  41. options->Address with the resolved one at startup.
  42. o detect our address right before we make a routerinfo each time.
  43. o external IP vs bind-IP. Already done, just use options->Address.
  44. o OutboundBindAddress config option, to bind to a specific
  45. IP address for outgoing connect()s.
  46. o Add '[...truncated]' or similar to truncated log entries.
  47. o if a begin failed due to exit policy, but we believe the IP
  48. should have been allowed, switch that router to exitpolicy
  49. reject *:* until we get our next directory.
  50. o tor-resolve needs a man page
  51. o clients choose nodes proportional to advertised bandwidth
  52. o and/or while avoiding unreliable nodes, depending on goals
  53. o defining an ORPort isn't necessary anymore, if you use
  54. ORAddress:port. Same with DirPort, SocksPort.
  55. X why did common/util.c build-depend on or/or.h ?
  56. bug fixes, might be handy:
  57. - the directory servers complain a lot about people using the
  58. old key. does 0.0.7 use dirservers before it's pulled down
  59. the directory?
  60. - put expiry date on onion-key, so people don't keep trying
  61. old ones that they could know are expired?
  62. * Leave on todo list, see if pre3 onion fixes helped enough.
  63. - should the running-routers list put unverified routers at the
  64. end?
  65. * Cosmetic, don't do it yet.
  66. - make advertised_server_mode() ORs fetch dirs more often.
  67. * not necessary yet.
  68. - Add a notion of nickname->Pubkey binding that's not 'verification'
  69. * eventually, only when needed
  70. - ORs use uniquer default nicknames
  71. * Don't worry about this for now
  72. - Handle full buffers without totally borking
  73. * do this eventually, no rush.
  74. more features, easy:
  75. o check the date in the http headers, compare for clock skew.
  76. o requiredentrynode vs preferredentrynode
  77. - per-month byte allowances
  78. * nick will spec something.
  79. - have a pool of circuits available, cannibalize them
  80. for your purposes (e.g. rendezvous, etc).
  81. * hold off on that.
  82. - node 'groups' that are known to be in the same zone of control
  83. * nick and roger will talk about it
  84. - do resolve before trying to attach the stream
  85. * don't do this for now.
  86. - if destination IP is running a tor node, extend a circuit there
  87. before sending begin.
  88. * don't do this for now. figure out how enclaves work. but do enclaves soon.
  89. more features, complex:
  90. - compress the directory. client sends http header
  91. "accept-transfer-encoding: gzip", server might send http header
  92. "transfer-encoding: gzip". ta-da.
  93. - grow a zlib dependency. keep a cached compressed directory.
  94. * nick will look into this. not critical priority.
  95. - Switch dirservers entries to config lines:
  96. - read in and parse each TrustedDir config line.
  97. - stop reading dirservers file.
  98. - add some default TrustedDir lines if none defined, or if
  99. no torrc.
  100. - remove notion of ->is_trusted_dir from the routerlist. that's
  101. no longer where you look.
  102. - clean up router parsing flow, since it's simpler now?
  103. - when checking signature on a directory, look it up in
  104. options.TrustedDirs, and make sure there's a descriptor
  105. with that nickname, whose key hashes to the fingerprint,
  106. and who correctly signed the directory.
  107. * nick will do the above
  108. - when fetching a directory, if you want a trusted one,
  109. choose from the trusteddir list.
  110. - which means keeping track of which ones are "up"
  111. - if you don't need a trusted one, choose from the routerinfo
  112. list if you have one, else from the trusteddir list.
  113. * roger will do the above
  114. - add a listener for a ui
  115. * nick chats with weasel
  116. - and a basic gui
  117. - Have clients and dirservers preserve reputation info over
  118. reboots.
  119. * continue not doing until we have something we need to preserve
  120. - users can set their bandwidth, or we auto-detect it:
  121. - advertised bandwidth defaults to 10KB
  122. o advertised bandwidth is the min of max seen in each direction
  123. in the past N seconds.
  124. o calculate this
  125. o not counting "local" connections
  126. - round detected bandwidth up to nearest 10KB
  127. - client software not upload descriptor until:
  128. - you've been running for an hour
  129. - it's sufficiently satisfied with its bandwidth
  130. - it decides it is reachable
  131. - start counting again if your IP ever changes.
  132. - never regenerate identity keys, for now.
  133. - you can set a bit for not-being-an-OR.
  134. * no need to do this yet. few people define their ORPort.
  135. - authdirserver lists you as running iff:
  136. - he can connect to you
  137. - he has successfully extended to you
  138. - you have sufficient mean-time-between-failures
  139. * keep doing nothing for now.
  140. blue sky:
  141. - Possible to get autoconf to easily install things into ~/.tor?
  142. ongoing:
  143. . rename/rearrange functions for what file they're in
  144. - generalize our transport: add transport.c in preparation for
  145. http, airhook, etc transport.
  146. NICK - investigate sctp for alternate transport.
  147. For September:
  148. NICK . Windows port
  149. o works as client
  150. - deal with pollhup / reached_eof on all platforms
  151. . robust as a client
  152. . works as server
  153. - can be configured
  154. - robust as a server
  155. . Usable as NT service
  156. - docs for building in win
  157. - installer
  158. - Docs
  159. . FAQ
  160. o overview of tor. how does it work, what's it do, pros and
  161. cons of using it, why should I use it, etc.
  162. - a howto tutorial with examples
  163. * put a stub on the wiki
  164. o tutorial: how to set up your own tor network
  165. - (need to not hardcode dirservers file in config.c)
  166. * this will be solved when we put dirservers in config lines
  167. - port forwarding howto for ipchains, etc
  168. * roger add to wiki of requests
  169. . correct, update, polish spec
  170. - document the exposed function api?
  171. o document what we mean by socks.
  172. NICK . packages
  173. . rpm
  174. * nick will look at the spec file
  175. - find a long-term rpm maintainer
  176. * roger will start guilting people
  177. - code
  178. - better warn/info messages
  179. o let tor do resolves.
  180. o extend socks4 to do resolves?
  181. o make script to ask tor for resolves
  182. - write howto for setting up tsocks, socat.
  183. - including on osx and win32
  184. - freecap handling
  185. - tsocks
  186. o gather patches, submit to maintainer
  187. * send him a reminder mail and see what's up.
  188. - intercept gethostbyname and others
  189. * add this to tsocks
  190. o do resolve via tor
  191. - redesign and thorough code revamp, with particular eye toward:
  192. - support half-open tcp connections
  193. - conn key rotation
  194. - other transports -- http, airhook
  195. - modular introduction mechanism
  196. - allow non-clique topology
  197. Other details and small and hard things:
  198. - tor should be able to have a pool of outgoing IP addresses
  199. that it is able to rotate through. (maybe)
  200. - tie into squid
  201. - hidserv offerers shouldn't need to define a SocksPort
  202. * figure out what breaks for this, and do it.
  203. - when the client fails to pick an intro point for a hidserv,
  204. it should refetch the hidserv desc.
  205. . should maybe make clients exit(1) when bad things happen?
  206. e.g. clock skew.
  207. - should retry exitpolicy end streams even if the end cell didn't
  208. resolve the address for you
  209. . Make logs handle it better when writing to them fails.
  210. o Dirserver shouldn't put you in running-routers list if you haven't
  211. uploaded a descriptor recently
  212. . Refactor: add own routerinfo to routerlist. Right now, only
  213. router_get_by_nickname knows about 'this router', as a hack to
  214. get circuit_launch_new to do the right thing.
  215. . Scrubbing proxies
  216. - Find an smtp proxy?
  217. . Get socks4a support into Mozilla
  218. - Need a relay teardown cell, separate from one-way ends.
  219. - Make it harder to circumvent bandwidth caps: look at number of bytes
  220. sent across sockets, not number sent inside TLS stream.
  221. - fix router_get_by_* functions so they can get ourselves too,
  222. and audit everything to make sure rend and intro points are
  223. just as likely to be us as not.
  224. ***************************Future tasks:****************************
  225. Rendezvous and hidden services:
  226. make it fast:
  227. - preemptively build and start rendezvous circs.
  228. - preemptively build n-1 hops of intro circs?
  229. - cannibalize general circs?
  230. make it reliable:
  231. - standby/hotswap/redundant services.
  232. - store stuff to disk? dirservers forget service descriptors when
  233. they restart; nodes offering hidden services forget their chosen
  234. intro points when they restart.
  235. make it robust:
  236. - auth mechanisms to let midpoint and bob selectively choose
  237. connection requests.
  238. make it scalable:
  239. - right now the hidserv store/lookup system is run by the dirservers;
  240. this won't scale.
  241. Tor scalability:
  242. Relax clique assumptions.
  243. Redesign how directories are handled.
  244. - Separate running-routers lookup from descriptor list lookup.
  245. - Resolve directory agreement somehow.
  246. - Cache directory on all servers.
  247. Find and remove bottlenecks
  248. - Address linear searches on e.g. circuit and connection lists.
  249. Reputation/memory system, so dirservers can measure people,
  250. and so other people can verify their measurements.
  251. - Need to measure via relay, so it's not distinguishable.
  252. Bandwidth-aware path selection. So people with T3's are picked
  253. more often than people with DSL.
  254. Reliability-aware node selection. So people who are stable are
  255. preferred for long-term circuits such as intro and rend circs,
  256. and general circs for irc, aim, ssh, etc.
  257. Let dissidents get to Tor servers via Tor users. ("Backbone model")
  258. Anonymity improvements:
  259. Is abandoning the circuit the only option when an extend fails, or
  260. can we do something without impacting anonymity too much?
  261. Is exiting from the middle of the circuit always a bad idea?
  262. Helper nodes. Decide how to use them to improve safety.
  263. DNS resolution: need to make tor support resolve requests. Need to write
  264. a script and an interface (including an extension to the socks
  265. protocol) so we can ask it to do resolve requests. Need to patch
  266. tsocks to intercept gethostbyname, else we'll continue leaking it.
  267. Improve path selection algorithms based on routing-zones paper. Be sure
  268. to start and end circuits in different ASs. Ideally, consider AS of
  269. source and destination -- maybe even enter and exit via nearby AS.
  270. Intermediate model, with some delays and mixing.
  271. Add defensive dropping regime?
  272. Make it more correct:
  273. Handle half-open connections: right now we don't support all TCP
  274. streams, at least according to the protocol. But we handle all that
  275. we've seen in the wild.
  276. Support IPv6.
  277. Efficiency/speed/robustness:
  278. Congestion control. Is our current design sufficient once we have heavy
  279. use? Need to measure and tweak, or maybe overhaul.
  280. Allow small cells and large cells on the same network?
  281. Cell buffering and resending. This will allow us to handle broken
  282. circuits as long as the endpoints don't break, plus will allow
  283. connection (tls session key) rotation.
  284. Implement Morphmix, so we can compare its behavior, complexity, etc.
  285. Use cpuworker for more heavy lifting.
  286. - Signing (and verifying) hidserv descriptors
  287. - Signing (and verifying) intro/rend requests
  288. - Signing (and verifying) router descriptors
  289. - Signing (and verifying) directories
  290. - Doing TLS handshake (this is very hard to separate out, though)
  291. Buffer size pool: allocate a maximum size for all buffers, not
  292. a maximum size for each buffer. So we don't have to give up as
  293. quickly (and kill the thickpipe!) when there's congestion.
  294. Exit node caching: tie into squid or other caching web proxy.
  295. Other transport. HTTP, udp, rdp, airhook, etc. May have to do our own
  296. link crypto, unless we can bully openssl into it.
  297. P2P Tor:
  298. Do all the scalability stuff above, first.
  299. Incentives to relay. Not so hard.
  300. Incentives to allow exit. Possibly quite hard.
  301. Sybil defenses without having a human bottleneck.
  302. How to gather random sample of nodes.
  303. How to handle nodelist recommendations.
  304. Consider incremental switches: a p2p tor with only 50 users has
  305. different anonymity properties than one with 10k users, and should
  306. be treated differently.