TODO 17 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405
  1. $Id$
  2. Legend:
  3. SPEC!! - Not specified
  4. SPEC - Spec not finalized
  5. NICK - nick claims
  6. ARMA - arma claims
  7. PHOBOS - phobos claims
  8. - Not done
  9. * Top priority
  10. . Partially done
  11. o Done
  12. D Deferred
  13. X Abandoned
  14. For 0.1.0.x:
  15. - Why do solaris cpuworks go dormant?
  16. (Apparently, disabling threads fixes this.)
  17. - Why does kevent barf with EINVAL on some freebsd boxes?
  18. (Lucky is trying a patch that should tell us why.)
  19. - Fix the remaining flyspray bugs marked for 0.1.0.9
  20. (Only 124 remains.)
  21. X Free remaining unfreed memory (arma will run valgrind)
  22. (Not for a stable release)
  23. o Note libevent/method/platform combos that are unlikely to work.
  24. . Memory use on Linux: what's happening?
  25. - Is it threading? (Maybe, maybe not)
  26. - Is it the buf_shrink bug? (Quite possibly)
  27. - instrument the code to figure out where our memory is going. (all platforms?)
  28. X change torrc to point to abuse-faq (once abuse-faq is posted)
  29. for 0.1.1.x:
  30. - cpu fixes:
  31. M rewrite how libevent does select() on win32 so it's not so very slow.
  32. - create-light
  33. - see if we should make use of truncate to retry
  34. - hardware accelerator support
  35. - use openssl aes when available
  36. - do the kernel buffer style design
  37. - continue decentralizing the directory
  38. M have two router descriptor formats
  39. - dirservers verify reachability claims
  40. - find 10 dirservers. (what are criteria to be a dirserver?)
  41. - some back-out mechanism?
  42. - dirservers have blacklist of IPs they hate
  43. - a way of rolling back approvals to before a timestamp
  44. - have new people be in limbo and need to demonstrate usefulness
  45. before we approve them
  46. - other?
  47. - dirservers publish router-status with all these flags.
  48. - alices fetch many router-statuses and update descriptors as needed.
  49. - add if-newer-than fetch options
  50. - dirservers allow people to lookup by N descriptors, or to fetch all.
  51. - alices avoid duplicate class C nodes.
  52. - everybody with a dirport will give you his descriptor.
  53. - config option, on by default, to cache all descriptors.
  54. - Compress router desc sets before transmitting them
  55. M Analyze how bad the partitioning is or isn't.
  56. - Naming:
  57. - some dirservers announce that they manage bindings (a flag in
  58. router-status).
  59. - other dirservers mention a binding if there is no conflict for
  60. that binding among the dirservers that manage it.
  61. no conflict == any of them bind it and no disagreement.
  62. - alice can specify a nickname and it will record that name in her
  63. datadir along with the key *if* it is bound. otherwise her specifying
  64. will fail (loudly we hope).
  65. - thus when a binding vanishes (e.g. conflict) alice will keep using
  66. the one she meant.
  67. - if the binding changes keys, the entry in her datadir will silently
  68. get corrected.
  69. - helper nodes (at least preliminary)
  70. - enclaves (at least preliminary)
  71. - packaging and ui stuff:
  72. - uninstallers
  73. - something, anything, for sys tray on Windows.
  74. - let ORPort config option change.
  75. - new controller protocol
  76. For sometime soon:
  77. - Server instructions for OSX and Windows operators.
  78. - Audit all changes to bandwidth buckets for integer over/underflow.
  79. - whine if your socks port is an open proxy.
  80. Refactoring and infrastructure:
  81. N . Switch to libevent
  82. - Hold-open-until-flushed now works by accident; it should work by
  83. design.
  84. . The logic for reading from TLS sockets is likely to overrun the
  85. bandwidth buckets under heavy load. (Really, the logic was
  86. never right in the first place.) Also, we should audit all users
  87. of get_pending_bytes().
  88. Security:
  89. . Make sure logged info is "safe"ish.
  90. Functionality
  91. - Tests for new controller features
  92. N . NT Service code
  93. o Clean up NT service code even more.
  94. o Enable it by default.
  95. o Make sure it works.
  96. . Document it.
  97. Documentation
  98. r - Correct and clarify the wiki entry on port forwarding.
  99. o Document where OSX logs and torrc go.
  100. o Document where windows logs and torrc go.
  101. - (Make sure they actually go there.)
  102. Installers
  103. N - Vet all pending installer patches
  104. - Win32 installer plus privoxy, sockscap/freecap, etc.
  105. - Vet win32 systray helper code
  106. N . Make logs go into platform default locations.
  107. o OSX
  108. - Windows. (?)
  109. Correctness
  110. - how do ulimits work on win32, anyway? (We should handle WSAENOBUFS as
  111. needed, look at the MaxConnections registry entry, look at the
  112. MaxUserPort entry, and look at the TcpTimedWaitDelay entry. We may also
  113. want to provide a way to set them as needed. See bug 98.)
  114. Arguable
  115. - Bug: Why do idle cpuworkers sometimes get thought of as busy?
  116. - IP-based blacklisting of which servers get recommended by dirservers.
  117. N - tor-resolve script should use socks5 to get better error messages.
  118. o Script to try pulling bytes through slow-seeming servers so they can
  119. notice that they might be fast.
  120. N . Reverse DNS
  121. o specify
  122. - implement
  123. r - make min uptime a function of the available choices (say, choose 60th
  124. percentile, not 1 day.)
  125. r - kill dns workers more slowly
  126. r - build testing circuits? going through non-verified nodes?
  127. - config option to publish what ports you listen on, beyond ORPort/DirPort
  128. N - It would be nice to have a FirewalledIPs thing that works like
  129. FirewallPorts.
  130. - If we have a trusted directory on port 80, stop falling back to
  131. forbidden ports when fascistfirewall blocks all good dirservers.
  132. N - Code cleanup
  133. - Make configure.in handle cross-compilation
  134. - Have NULL_REP_IS_ZERO_BYTES default to 1.
  135. - Make with-ssl-dir disable search for ssl.
  136. - Efficiency/speed improvements.
  137. - Write limiting; configurable token buckets.
  138. - Make it harder to circumvent bandwidth caps: look at number of bytes
  139. sent across sockets, not number sent inside TLS stream.
  140. o Hidden service improvements
  141. o Investigate hidden service performance/reliability
  142. - Add private:* alias in exit policies to make it easier to ban all the
  143. fiddly little 192.168.foo addresses.
  144. - controller should have an event to learn about new addressmappings?
  145. No
  146. Todo: when you connect and get a guy you didn't expect, tell him hey i wasn't
  147. expecting you i'm going to go now bye, instead of just hanging up. This lets
  148. him know that he's doing something funny.
  149. - choose entry node to be one you're already connected to?
  150. - Convert man pages to pod, or whatever's right.
  151. - support hostnames as well as IPs for authdirservers.
  152. - GPSLocation optional config string.
  153. - Windows
  154. - Make millisecond accuracy work on win32
  155. - IPv6 support
  156. - teach connection_ap_handshake_socks_reply() about ipv6 and friends
  157. so connection_ap_handshake_socks_resolved() doesn't also need
  158. to know about them.
  159. - Let more config options (e.g. ORPort) change dynamically.
  160. - hidserv offerers shouldn't need to define a SocksPort
  161. * figure out what breaks for this, and do it.
  162. - Destroy and truncated cells should have reasons.
  163. - Packaging
  164. - Figure out how to make the rpm not strip the binaries it makes.
  165. - Integrate an http proxy into Tor (maybe as a third class of worker
  166. process), so we can stop shipping with the beast that is Privoxy.
  167. - Implement If-Modified-Since for directories.
  168. - Big, incompatible re-architecting and decentralization of directory
  169. system.
  170. - Only the top of a directory needs to be signed.
  171. - Windows
  172. - Get a controller to launch tor and keep it on the system tray.
  173. For 0.1.1.x:
  174. Decentralizing:
  175. - self-measurement
  176. - remote measurement
  177. - you've been running for an hour
  178. - it's sufficiently satisfied with its bandwidth
  179. - remove approval crap, add blacklisting by IP
  180. - gather more permanent dirservers and put their keys into the code
  181. - ship with a master key, and implement a way to query dirservers for
  182. a blob which is a timestamped signed newest pile of dirservers. put
  183. that on disk and use it on startup rather than the built-in default.
  184. - threshold belief from clients about up-ness
  185. - a way for clients to get fresh enough server descriptors
  186. - a way for clients to partition the set of servers in a safe way:
  187. so they don't have to learn all of them but so they're not easily
  188. partitionable.
  189. Tier two:
  190. N - Handle rendezvousing with unverified nodes.
  191. - Specify: Stick rendezvous point's key in INTRODUCE cell.
  192. Bob should _always_ use key from INTRODUCE cell.
  193. - Implement.
  194. N - IPv6 support (For exit addresses)
  195. - Spec issue: if a resolve returns an IP4 and an IP6 address,
  196. which to use?
  197. - Add to exit policy code
  198. - Make tor_gethostbyname into tor_getaddrinfo
  199. - Make everything that uses uint32_t as an IP address change to use
  200. a generalize address struct.
  201. - Change relay cell types to accept new addresses.
  202. - Add flag to serverdescs to tell whether IPv6 is supported.
  203. - Security fixes
  204. - christian grothoff's attack of infinite-length circuit.
  205. the solution is to have a separate 'extend-data' cell type
  206. which is used for the first N data cells, and only
  207. extend-data cells can be extend requests.
  208. - Code cleanup
  209. o fix router_get_by_* functions so they can get ourselves too ...
  210. - and audit everything to make sure rend and intro points are
  211. just as likely to be us as not.
  212. - tor should be able to have a pool of outgoing IP addresses
  213. that it is able to rotate through. (maybe)
  214. Packaging, docs, etc:
  215. - Exit node caching: tie into squid or other caching web proxy.
  216. Deferred until needed:
  217. - Do something to prevent spurious EXTEND cells from making middleman
  218. nodes connect all over. Rate-limit failed connections, perhaps?
  219. - Limit to 2 dir, 2 OR, N SOCKS connections per IP.
  220. - Handle full buffers without totally borking
  221. * do this eventually, no rush.
  222. - Rate-limit OR and directory connections overall and per-IP and
  223. maybe per subnet.
  224. - DoS protection: TLS puzzles, public key ops, bandwidth exhaustion.
  225. - Have clients and dirservers preserve reputation info over
  226. reboots.
  227. - authdirserver lists you as running iff:
  228. - he can connect to you
  229. - he has successfully extended to you
  230. - you have sufficient mean-time-between-failures
  231. * keep doing nothing for now.
  232. - Include HTTP status messages in logging (see parse_http_response).
  233. Blue sky or deferred indefinitely:
  234. - Support egd or other non-OS-integrated strong entropy sources
  235. - password protection for on-disk identity key
  236. - Possible to get autoconf to easily install things into ~/.tor?
  237. - server descriptor declares min log level, clients avoid servers
  238. that are too loggy.
  239. - put expiry date on onion-key, so people don't keep trying
  240. old ones that they could know are expired?
  241. - Add a notion of nickname->Pubkey binding that's not 'verification'
  242. - Conn key rotation.
  243. - Need a relay teardown cell, separate from one-way ends.
  244. Big tasks that would demonstrate progress:
  245. - Facility to automatically choose long-term helper nodes; perhaps
  246. on by default for hidden services.
  247. - patch privoxy and socks protocol to pass strings to the browser.
  248. - patch tsocks with our current patches + gethostbyname, getpeername, etc.
  249. - make freecap (or whichever) do what we want.
  250. - scrubbing proxies for protocols other than http.
  251. - Find an smtp proxy?
  252. . Get socks4a support into Mozilla
  253. - figure out enclaves, e.g. so we know what to recommend that people
  254. do, and so running a tor server on your website is helpful.
  255. - Do enclaves for same IP only.
  256. - Resolve first, then if IP is an OR, extend to him first.
  257. - implement a trivial fun gui to demonstrate our control interface.
  258. ************************ Roadmap for 2004-2005 **********************
  259. Hard problems that need to be solved:
  260. - Separating node discovery from routing.
  261. - Arranging membership management for independence.
  262. Sybil defenses without having a human bottleneck.
  263. How to gather random sample of nodes.
  264. How to handle nodelist recommendations.
  265. Consider incremental switches: a p2p tor with only 50 users has
  266. different anonymity properties than one with 10k users, and should
  267. be treated differently.
  268. - Measuring performance of other nodes. Measuring whether they're up.
  269. - Choosing exit node by meta-data, e.g. country.
  270. - Incentives to relay; incentives to exit.
  271. - Allowing dissidents to relay through Tor clients.
  272. - How to intercept, or not need to intercept, dns queries locally.
  273. - Improved anonymity:
  274. - Experiment with mid-latency systems. How do they impact usability,
  275. how do they impact safety?
  276. - Understand how powerful fingerprinting attacks are, and experiment
  277. with ways to foil them (long-range padding?).
  278. - Come up with practical approximations to picking entry and exit in
  279. different routing zones.
  280. - Find ideal churn rate for helper nodes; how safe is it?
  281. - What info squeaks by Privoxy? Are other scrubbers better?
  282. - Attacking freenet-gnunet/timing-delay-randomness-arguments.
  283. - Is abandoning the circuit the only option when an extend fails, or
  284. can we do something without impacting anonymity too much?
  285. - Is exiting from the middle of the circuit always a bad idea?
  286. Sample Publicity Landmarks:
  287. - we have N servers / N users
  288. - we have servers at epic and aclu and foo
  289. - hidden services are robust and fast
  290. - a more decentralized design
  291. - tor win32 installer works
  292. - win32 tray icon for end-users
  293. - tor server works on win32
  294. - win32 service for servers
  295. - mac installer works
  296. ***************************Future tasks:****************************
  297. Rendezvous and hidden services:
  298. make it fast:
  299. o preemptively build and start rendezvous circs.
  300. o preemptively build n-1 hops of intro circs?
  301. o cannibalize general circs?
  302. make it reliable:
  303. - standby/hotswap/redundant services.
  304. - store stuff to disk? dirservers forget service descriptors when
  305. they restart; nodes offering hidden services forget their chosen
  306. intro points when they restart.
  307. make it robust:
  308. - auth mechanisms to let midpoint and bob selectively choose
  309. connection requests.
  310. make it scalable:
  311. - robust decentralized storage for hidden service descriptors.
  312. make it accessible:
  313. - web proxy gateways to let normal people browse hidden services.
  314. Tor scalability:
  315. Relax clique assumptions.
  316. Redesign how directories are handled.
  317. - Resolve directory agreement somehow.
  318. Find and remove bottlenecks
  319. - Address linear searches on e.g. circuit and connection lists.
  320. Reputation/memory system, so dirservers can measure people,
  321. and so other people can verify their measurements.
  322. - Need to measure via relay, so it's not distinguishable.
  323. Let dissidents get to Tor servers via Tor users. ("Backbone model")
  324. Make it more correct:
  325. Handle half-open connections: right now we don't support all TCP
  326. streams, at least according to the protocol. But we handle all that
  327. we've seen in the wild.
  328. Support IPv6.
  329. Efficiency/speed/robustness:
  330. Congestion control. Is our current design sufficient once we have heavy
  331. use? Need to measure and tweak, or maybe overhaul.
  332. Allow small cells and large cells on the same network?
  333. Cell buffering and resending. This will allow us to handle broken
  334. circuits as long as the endpoints don't break, plus will allow
  335. connection (tls session key) rotation.
  336. Implement Morphmix, so we can compare its behavior, complexity, etc.
  337. Use cpuworker for more heavy lifting.
  338. - Signing (and verifying) hidserv descriptors
  339. - Signing (and verifying) intro/rend requests
  340. - Signing (and verifying) router descriptors
  341. - Signing (and verifying) directories
  342. - Doing TLS handshake (this is very hard to separate out, though)
  343. Buffer size pool: allocate a maximum size for all buffers, not
  344. a maximum size for each buffer. So we don't have to give up as
  345. quickly (and kill the thickpipe!) when there's congestion.
  346. Other transport. HTTP, udp, rdp, airhook, etc. May have to do our own
  347. link crypto, unless we can bully openssl into it.
  348. *********** uncategorized
  349. - why gnutls is bad/not good for tor
  350. P - flesh out the rest of the section 6 of the faq
  351. - compare 0.1.0.5-rc vs 0.1.0.8-rc memory usage to test out old buffer (1015) vs new buffer (0108) algorithms
  352. P - gather pointers to livecd distros that include tor
  353. - we should remove our libevent tree from cvs. it's obsolete now.
  354. - desired contribute.html patch: patches for dir-servers to verify server reachability.
  355. P - i want to put the logo on the website, in source form, so people can put it on stickers directly, etc.
  356. - i want more pictures from ren. he wants to describe the tor handshake, i want to talk about hidden services.
  357. o switch accountingmax to count total in+out, not either in or out. it's easy to move in this direction (not risky), but hard to back, out if we decide we prefer it the way it already is. hm.
  358. - clean up the places where our docs are redundant (or worse, obsolete in one file and correct elsewhere). agl has a start on a global list-of-tor-docs.
  359. P - update window's docs to clarify which versions of windows, and why a DOS window, how it's used, for the less technical users
  360. - multiple sample torrc files
  361. P . os x uninstall click-click script