TODO

  1. Legend:
  2. SPEC!! - Not specified
  3. SPEC - Spec not finalized
  4. NICK - nick claims
  5. ARMA - arma claims
  6. - Not done
  7. * Top priority
  8. . Partially done
  9. o Done
  10. D Deferred
  11. X Abandoned
  12. For 0.0.9.6:
  13. - Server instructions for OSX and Windows operators.
  14. - Audit all changes to bandwidth buckets for integer over/underflow.
  15. For 0.1.0.1-rc:
  16. R - write a changelog
  17. R o pick the whole path when you start the circuit.
  18. o and then the controller can call that for extendcircuit
  19. o finish messing with reachability stuff
  20. o if we jump in time a lot, then mark our circs and note that we
  21. haven't made a circ yet.
  22. o actually give http reason phrases to dir clients, so they know why
  23. they're rejected.
  24. - controller should have an event to learn about new addressmappings?
  25. - how do ulimits work on win32, anyway?
  26. o have a separate config option which caps bandwidth-to-advertise.
  27. For 0.1.0.x:
  28. Refactoring and infrastructure:
  29. N . Switch to libevent
  30. - Hold-open-until-flushed now works by accident; it should work by
  31. design.
  32. - The logic for reading from TLS sockets is likely to overrun the
  33. bandwidth buckets under heavy load. (Really, the logic was
  34. never right in the first place.) Also, we should audit all users
  35. of get_pending_bytes().
  36. . Find a way to make sure we have libevent 1.0 or later.
  37. o Implement patch to libevent
  38. o Submit patch to niels making this possible.
  39. - Implement Tor side once patch is accepted.
  40. . Log which poll method we're using.
  41. o Implement patch to libevent
  42. o Submit patch to niels making this possible.
  43. - Implement Tor side once patch is accepted.
  44. . Intercept libevent's "log" messages.
  45. o Ask Niels whether a patch would be accepted.
  46. o Implement patch, if so.
  47. - Implement Tor side once patch is accepted.
  48. o Check return from event_set, event_add, event_del.
  49. o Keep pushing to get a windows patch accepted.
  50. - After about 26 March, check back with Niels; he should be back
  51. by then.
  52. Security:
  53. - Make sure logged info is "safe"ish.
  54. Stability
  55. R o Reset uptime when IP changes.
  56. Functionality
  57. o Implement pending controller features.
  58. o Stubs for new functions.
  59. o GETINFO
  60. o Version
  61. o Descriptor list
  62. o Individual descriptors
  63. o Need to remember descriptors for all routers.
  64. o Replace everything else that remembers serverdescs with
  65. routerinfo.
  66. o List of address mappings
  67. o POSTDESCRIPTOR
  68. o MAPADDRESS
  69. o Map A->B.
  70. o Map DontCare->B.
  71. o Reuse mappings when asked to map DontCare->B for the same B.
  72. o But only when the DontCare is of the same type. :/
  73. o Way to handle overlong messages
  74. o Specify fragmented format
  75. o Implement fragmented format
  76. o Event for "new descriptors"
  77. o Better stream IDs
  78. o Stream status changed: "new" state.
  79. o EXTENDCIRCUIT
  80. o revised circ selection stuff.
  81. o Implement controller interface.
  82. o ATTACHSTREAM
  83. o Make streams have an 'unattached and not-automatically-attachable'
  84. state. ("Controller managed.")
  85. o Add support to put new streams into this state rather than try to
  86. attach them automatically. ("Hidden" config option.)
  87. o Implement 'attach stream X to circuit Y' logic.
  88. o Time out never-attached streams.
  89. o If we never get a CONNECTED back, we should put the stream back in
  90. CONTROLLER_WAIT, not in CIRCUIT_WAIT.
  91. o Add a way for the controller to say, "Hey, nuke this stream."
  92. o Specify
  93. o Implement
  94. o Add a way for the controller to say, "Hey, nuke this circuit."
  95. o Specify
  96. o Implement
  97. - Tests for new controller features
  98. R o HTTPS proxy for OR CONNECT stuff. (For outgoing SSL connections to
  99. other ORs.)
  100. o Changes for forward compatibility
  101. o If a version is later than the last in its series, but a version
  102. in the next series is recommended, that doesn't mean it's bad.
  103. o Do end reasons better
  104. o Start using RESOURCELIMIT more.
  105. o Try to use MISC a lot less.
  106. o bug: if the exit node fails to create a socket (e.g. because it
  107. has too many open), we will get a generic stream end response.
  108. o Fix on platforms with set_max_file_descriptors.
  109. o niels's "did it fail because conn refused or timeout or what"
  110. relay end feature.
  111. o Realize that unrecognized end reasons are probably features rather than
  112. bugs. (backport to 009x)
  113. o Push the work of sending the end cell deeper into package_raw_inbuf.
  114. (Turns out, if package_raw_inbuf fails, it *can't* send an end cell.)
  115. o Check for any place where we can close an edge connection without
  116. sending an end; see if we should send an end.
  117. o Feed end reason back into SOCKS5 as reasonable.
  118. R o cache .foo.exit names better, or differently, or not.
  119. o make !advertised_server_mode() ORs fetch dirs less often.
  120. N . NT Service code
  121. o Clean up NT service code even more.
  122. o Enable it by default.
  123. o Make sure it works.
  124. . Document it.
  125. Documentation
  126. o Document new version system.
  127. r - Correct and clarify the wiki entry on port forwarding.
  128. o Document where OSX logs and torrc go.
  129. o Document where windows logs and torrc go.
  130. - (Make sure they actually go there.)
  131. Installers
  132. N - Vet all pending installer patches
  133. - Win32 installer plus privoxy, sockscap/freecap, etc.
  134. - Vet win32 systray helper code
  135. o Make OSX man pages go into man directory.
  136. N . Make logs go into platform default locations.
  137. o OSX
  138. - Windows. (?)
  139. Correctness
  140. - Mark bugs for 010 or post 010 in bugtracker.
  141. - Bugfixes
  142. R - when we haven't explicitly sent a socks reject, sending one in
  143. connection_about_to_close_connection() fails because we never give it
  144. a chance to flush. right answer is to do the socks reply manually in
  145. each appropriate case, and then about-to-close-connection can simply
  146. warn us if we forgot one. [Tag this 010 in flyspray.]
  147. R - should retry exitpolicy end streams even if the end cell didn't
  148. resolve the address for you
  149. o Figure out when to reset addressmaps (on hup, on reconfig, etc)
  150. Improvements to self-measurement.
  151. R X round detected bandwidth up to nearest 10KB?
  152. R o client software should not upload its descriptor until:
  153. . it decides it is reachable
  154. o dirport
  155. . orport
  156. - rule for now: "If you process a CREATE cell that did not come from
  157. your own IP, you are reachable."
  158. o start counting again if your IP ever changes.
  159. o never regenerate identity keys, for now.
  160. o you can set a bit for not-being-an-OR.
  161. * no need to do this yet. few people define their ORPort.
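A minimal sketch of the ORPort reachability rule quoted above, assuming a single in-memory state object. The class and method names are hypothetical and are not taken from the Tor source; the DirPort check is not modeled.

```python
from dataclasses import dataclass


@dataclass
class ReachabilityState:
    """Hypothetical tracker; none of these names come from Tor itself."""

    own_ip: str
    reachable: bool = False

    def note_create_cell(self, source_ip: str) -> None:
        # Rule from the list: processing a CREATE cell that did not come
        # from our own IP is taken as evidence that we are reachable.
        if source_ip != self.own_ip:
            self.reachable = True

    def note_ip_change(self, new_ip: str) -> None:
        # "Start counting again if your IP ever changes."
        if new_ip != self.own_ip:
            self.own_ip = new_ip
            self.reachable = False

    def should_upload_descriptor(self) -> bool:
        # Assumption: reachability is the only gate modeled here.
        return self.reachable
```

A self-originated CREATE cell leaves the state unchanged, so a server that only ever tests itself would never publish under this sketch.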
  162. Arguable
  163. N - Script to try pulling bytes through slow-seeming servers so they can
  164. notice that they might be fast.
  165. N . Reverse DNS
  166. o specify
  167. - implement
  168. r - make min uptime a function of the available choices (say, choose 60th
  169. percentile, not 1 day.)
  170. r - kill dns workers more slowly
  171. r - build testing circuits? going through non-verified nodes?
  172. - config option to publish what ports you listen on, beyond ORPort/DirPort
  173. N - It would be nice to have a FirewalledIPs thing that works like
  174. FirewallPorts.
  175. - If we have a trusted directory on port 80, stop falling back to
  176. forbidden ports when fascistfirewall blocks all good dirservers.
  177. N - Code cleanup
  178. - Make configure.in handle cross-compilation
  179. - Have NULL_REP_IS_ZERO_BYTES default to 1.
  180. - Make with-ssl-dir disable search for ssl.
  181. - Efficiency/speed improvements.
  182. - Write limiting; configurable token buckets.
  183. - Make it harder to circumvent bandwidth caps: look at number of bytes
  184. sent across sockets, not number sent inside TLS stream.
  185. - Hidden service improvements
  186. - Investigate hidden service performance/reliability
  187. - Add private:* alias in exit policies to make it easier to ban all the
  188. fiddly little 192.168.foo addresses.
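One way the private:* alias could work is as a pure text expansion applied before the exit policy is parsed. The sketch below assumes a policy is a list of "action address:port" lines; the set of ranges in PRIVATE_NETS and the function names are assumptions for illustration, since the item does not say which addresses the alias should cover.

```python
import ipaddress

# Assumed set of "private" ranges; the list item does not specify them.
PRIVATE_NETS = (
    "10.0.0.0/8",
    "172.16.0.0/12",
    "192.168.0.0/16",
    "127.0.0.0/8",
    "169.254.0.0/16",
)


def expand_policy(lines):
    """Replace each 'private' address with one line per assumed range."""
    out = []
    for line in lines:
        line = line.strip()
        action, _, target = line.partition(" ")
        addr, sep, port = target.rpartition(":")
        if sep and addr == "private":
            out.extend(f"{action} {net}:{port}" for net in PRIVATE_NETS)
        else:
            out.append(line)
    return out


def is_private(ip):
    """True if ip falls inside any of the assumed private ranges."""
    address = ipaddress.ip_address(ip)
    return any(address in ipaddress.ip_network(net) for net in PRIVATE_NETS)
```

For example, expand_policy(["reject private:*", "accept *:80"]) yields five reject lines followed by the unchanged accept line, so the rest of the policy code never needs to know about the alias.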
  189. No
  190. - choose entry node to be one you're already connected to?
  191. - Convert man pages to pod, or whatever's right.
  192. - support hostnames as well as IPs for authdirservers.
  193. - GPSLocation optional config string.
  194. - Windows
  195. - Make millisecond accuracy work on win32
  196. - IPv6 support
  197. - teach connection_ap_handshake_socks_reply() about ipv6 and friends
  198. so connection_ap_handshake_socks_resolved() doesn't also need
  199. to know about them.
  200. - Let more config options (e.g. ORPort) change dynamically.
  201. - hidserv offerers shouldn't need to define a SocksPort
  202. * figure out what breaks for this, and do it.
  203. - Destroy and truncated cells should have reasons.
  204. - Packaging
  205. - Figure out how to make the rpm not strip the binaries it makes.
  206. - Integrate an http proxy into Tor (maybe as a third class of worker
  207. process), so we can stop shipping with the beast that is Privoxy.
  208. - Implement If-Modified-Since for directories.
  209. - Big, incompatible re-architecting and decentralization of directory
  210. system.
  211. - Only the top of a directory needs to be signed.
  212. - Windows
  213. - Get a controller to launch tor and keep it on the system tray.
  214. For 0.1.1.x:
  215. Decentralizing:
  216. - self-measurement
  217. - remote measurement
  218. - you've been running for an hour
  219. - it's sufficiently satisfied with its bandwidth
  220. - remove approval crap, add blacklisting by IP
  221. - gather more permanent dirservers and put their keys into the code
  222. - ship with a master key, and implement a way to query dirservers for
  223. a blob which is a timestamped signed newest pile of dirservers. put
  224. that on disk and use it on startup rather than the built-in default.
  225. - threshold belief from clients about up-ness
  226. - a way for clients to get fresh enough server descriptors
  227. - a way for clients to partition the set of servers in a safe way:
  228. so they don't have to learn all of them but so they're not easily
  229. partitionable.
  230. Tier two:
  231. N - Handle rendezvousing with unverified nodes.
  232. - Specify: Stick rendezvous point's key in INTRODUCE cell.
  233. Bob should _always_ use key from INTRODUCE cell.
  234. - Implement.
  235. N - IPv6 support (For exit addresses)
  236. - Spec issue: if a resolve returns an IP4 and an IP6 address,
  237. which to use?
  238. - Add to exit policy code
  239. - Make tor_gethostbyname into tor_getaddrinfo
  240. - Make everything that uses uint32_t as an IP address change to use
  241. a generalized address struct.
  242. - Change relay cell types to accept new addresses.
  243. - Add flag to serverdescs to tell whether IPv6 is supported.
  244. - Security fixes
  245. - christian grothoff's attack of infinite-length circuit.
  246. the solution is to have a separate 'extend-data' cell type
  247. which is used for the first N data cells, and only
  248. extend-data cells can be extend requests.
  249. - Code cleanup
  250. o fix router_get_by_* functions so they can get ourselves too ...
  251. - and audit everything to make sure rend and intro points are
  252. just as likely to be us as not.
  253. - tor should be able to have a pool of outgoing IP addresses
  254. that it is able to rotate through. (maybe)
  255. Packaging, docs, etc:
  256. - Exit node caching: tie into squid or other caching web proxy.
  257. Deferred until needed:
  258. - Do something to prevent spurious EXTEND cells from making middleman
  259. nodes connect all over. Rate-limit failed connections, perhaps?
  260. - Limit to 2 dir, 2 OR, N SOCKS connections per IP.
  261. - Handle full buffers without totally borking
  262. * do this eventually, no rush.
  263. - Rate-limit OR and directory connections overall and per-IP and
  264. maybe per subnet.
  265. - DoS protection: TLS puzzles, public key ops, bandwidth exhaustion.
  266. - Have clients and dirservers preserve reputation info over
  267. reboots.
  268. - authdirserver lists you as running iff:
  269. - he can connect to you
  270. - he has successfully extended to you
  271. - you have sufficient mean-time-between-failures
  272. * keep doing nothing for now.
  273. - Include HTTP status messages in logging (see parse_http_response).
  274. Blue sky or deferred indefinitely:
  275. - Support egd or other non-OS-integrated strong entropy sources
  276. - password protection for on-disk identity key
  277. - Possible to get autoconf to easily install things into ~/.tor?
  278. - server descriptor declares min log level, clients avoid servers
  279. that are too loggy.
  280. - put expiry date on onion-key, so people don't keep trying
  281. old ones that they could know are expired?
  282. - Add a notion of nickname->Pubkey binding that's not 'verification'
  283. - Conn key rotation.
  284. - Need a relay teardown cell, separate from one-way ends.
  285. Big tasks that would demonstrate progress:
  286. - Facility to automatically choose long-term helper nodes; perhaps
  287. on by default for hidden services.
  288. - patch privoxy and socks protocol to pass strings to the browser.
  289. - patch tsocks with our current patches + gethostbyname, getpeername, etc.
  290. - make freecap (or whichever) do what we want.
  291. - scrubbing proxies for protocols other than http.
  292. - Find an smtp proxy?
  293. . Get socks4a support into Mozilla
  294. - figure out enclaves, e.g. so we know what to recommend that people
  295. do, and so running a tor server on your website is helpful.
  296. - Do enclaves for same IP only.
  297. - Resolve first, then if IP is an OR, extend to him first.
  298. - implement a trivial fun gui to demonstrate our control interface.
  299. ************************ Roadmap for 2004-2005 **********************
  300. Hard problems that need to be solved:
  301. - Separating node discovery from routing.
  302. - Arranging membership management for independence.
  303. Sybil defenses without having a human bottleneck.
  304. How to gather random sample of nodes.
  305. How to handle nodelist recommendations.
  306. Consider incremental switches: a p2p tor with only 50 users has
  307. different anonymity properties than one with 10k users, and should
  308. be treated differently.
  309. - Measuring performance of other nodes. Measuring whether they're up.
  310. - Choosing exit node by meta-data, e.g. country.
  311. - Incentives to relay; incentives to exit.
  312. - Allowing dissidents to relay through Tor clients.
  313. - How to intercept, or not need to intercept, dns queries locally.
  314. - Improved anonymity:
  315. - Experiment with mid-latency systems. How do they impact usability,
  316. how do they impact safety?
  317. - Understand how powerful fingerprinting attacks are, and experiment
  318. with ways to foil them (long-range padding?).
  319. - Come up with practical approximations to picking entry and exit in
  320. different routing zones.
  321. - Find ideal churn rate for helper nodes; how safe is it?
  322. - What info squeaks by Privoxy? Are other scrubbers better?
  323. - Attacking freenet-gnunet/timing-delay-randomness-arguments.
  324. - Is abandoning the circuit the only option when an extend fails, or
  325. can we do something without impacting anonymity too much?
  326. - Is exiting from the middle of the circuit always a bad idea?
  327. Sample Publicity Landmarks:
  328. - we have N servers / N users
  329. - we have servers at epic and aclu and foo
  330. - hidden services are robust and fast
  331. - a more decentralized design
  332. - tor win32 installer works
  333. - win32 tray icon for end-users
  334. - tor server works on win32
  335. - win32 service for servers
  336. - mac installer works
  337. ***************************Future tasks:****************************
  338. Rendezvous and hidden services:
  339. make it fast:
  340. o preemptively build and start rendezvous circs.
  341. o preemptively build n-1 hops of intro circs?
  342. o cannibalize general circs?
  343. make it reliable:
  344. - standby/hotswap/redundant services.
  345. - store stuff to disk? dirservers forget service descriptors when
  346. they restart; nodes offering hidden services forget their chosen
  347. intro points when they restart.
  348. make it robust:
  349. - auth mechanisms to let midpoint and bob selectively choose
  350. connection requests.
  351. make it scalable:
  352. - robust decentralized storage for hidden service descriptors.
  353. make it accessible:
  354. - web proxy gateways to let normal people browse hidden services.
  355. Tor scalability:
  356. Relax clique assumptions.
  357. Redesign how directories are handled.
  358. - Resolve directory agreement somehow.
  359. Find and remove bottlenecks
  360. - Address linear searches on e.g. circuit and connection lists.
  361. Reputation/memory system, so dirservers can measure people,
  362. and so other people can verify their measurements.
  363. - Need to measure via relay, so it's not distinguishable.
  364. Let dissidents get to Tor servers via Tor users. ("Backbone model")
  365. Make it more correct:
  366. Handle half-open connections: right now we don't support all TCP
  367. streams, at least according to the protocol. But we handle all that
  368. we've seen in the wild.
  369. Support IPv6.
  370. Efficiency/speed/robustness:
  371. Congestion control. Is our current design sufficient once we have heavy
  372. use? Need to measure and tweak, or maybe overhaul.
  373. Allow small cells and large cells on the same network?
  374. Cell buffering and resending. This will allow us to handle broken
  375. circuits as long as the endpoints don't break, plus will allow
  376. connection (tls session key) rotation.
  377. Implement Morphmix, so we can compare its behavior, complexity, etc.
  378. Use cpuworker for more heavy lifting.
  379. - Signing (and verifying) hidserv descriptors
  380. - Signing (and verifying) intro/rend requests
  381. - Signing (and verifying) router descriptors
  382. - Signing (and verifying) directories
  383. - Doing TLS handshake (this is very hard to separate out, though)
  384. Buffer size pool: allocate a maximum size for all buffers, not
  385. a maximum size for each buffer. So we don't have to give up as
  386. quickly (and kill the thickpipe!) when there's congestion.
  387. Other transport. HTTP, udp, rdp, airhook, etc. May have to do our own
  388. link crypto, unless we can bully openssl into it.