TODO 14 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308
  1. Legend:
  2. SPEC!! - Not specified
  3. SPEC - Spec not finalized
  4. NICK - nick claims
  5. ARMA - arma claims
  6. - Not done
  7. * Top priority
  8. . Partially done
  9. o Done
  10. D Deferred
  11. X Abandoned
  12. Bugs:
  13. o When it can't resolve any dirservers, it is useless from then on.
  14. We should make it reload the RouterFile if it has no dirservers.
  15. o Sometimes it picks a middleman node as the exit for a circuit.
  16. - if you specify a non-dirserver as exitnode or entrynode, when it
  17. makes the first few circuits it hasn't yet fetched the directory,
  18. so it warns that it doesn't know the node.
  19. - make 'make test' exit(1) if a test fails.
  20. - fix buffer unit test so it passes
  21. Short-term:
  22. o when you hup, rewrite the router.desc file (and maybe others)
  23. - consider handling broken socks4 implementations
  24. - improve how it behaves when i remove a line from the approved-routers files
  25. - Make tls connections tls_close intentionally
  26. o Rename ACI to circID
  27. . integrate rep_ok functions, see what breaks
  28. - update tor faq
  29. o obey SocksBindAddress, ORBindAddress
  30. o warn if we're running as root
  31. o make connection_flush_buf() more obviously obsolete
  32. o let hup reread the config file, eg so we can get new exit
  33. policies without restarting
  34. o Put recommended_versions in a config entry
  35. X use times(2) rather than gettimeofday to measure how long it
  36. takes to process a cell
  37. o Separate trying to rebuild a circuit because you have none from trying
  38. to rebuild a circuit because the current one is stale
  39. X Continue reading from socks port even while waiting for connect.
  40. o Exit policies
  41. o Spec how to write the exit policies
  42. o Path selection algorithms
  43. o Choose path more incrementally
  44. o Let user request first/last node
  45. o And disallow certain nodes
  46. D Choose path by jurisdiction, etc?
  47. o Make relay end cells have failure status and payload attached
  48. X let non-approved routers handshake.
  49. X Dirserver shouldn't put you in running-routers list if you haven't
  50. uploaded a descriptor recently
  51. X migrate to using nickname rather than addr:port for routers
  52. - migrate to using IPv6 sizes everywhere
  53. o Move from onions to ephemeral DH
  54. o incremental path building
  55. o transition circuit-level sendmes to hop-level sendmes
  56. o implement truncate, truncated
  57. o move from 192byte DH to 128byte DH, so it isn't so damn slow
  58. X exiting from not-last hop
  59. X OP logic to decide to extend/truncate a path
  60. X make sure exiting from the not-last hop works
  61. X logic to find last *open* hop, not last hop, in cpath
  62. o Remember address and port when beginning.
  63. - Extend by nickname/hostname/something, not by IP.
  64. - Need a relay teardown cell, separate from one-way ends.
  65. - remove per-connection rate limiting
  66. - Make it harder to circumvent bandwidth caps: look at number of bytes
  67. sent across sockets, not number sent inside TLS stream.
  68. On-going
  69. . Better comments for functions!
  70. . Go through log messages, reduce confusing error messages.
  71. . make the logs include more info (fd, etc)
  72. . Unit tests
  73. . Update the spec so it matches the code
  74. Mid-term:
  75. - Rotate tls-level connections -- make new ones, expire old ones.
  76. So we get actual key rotation, not just symmetric key rotation
  77. o Are there anonymity issues with sequential streamIDs? Sequential
  78. circIDs? Eg an attacker can learn how many there have been.
  79. The fix is to initialize them randomly rather than at 1.
  80. - Look at having smallcells and largecells
  81. . Redo scheduler
  82. o fix SSL_read bug for buffered records
  83. - make round-robining more fair
  84. - What happens when a circuit's length is 1? What breaks?
  85. . streams / circuits
  86. o Implement streams
  87. o Rotate circuits after N minutes?
  88. X Circuits should expire when circuit->expire triggers
  89. NICK . Handle half-open connections
  90. o openssh is an application that uses half-open connections
  91. o Figure out what causes connections to close, standardize
  92. when we mark a connection vs when we tear it down
  93. o Look at what ssl does to keep from mutating data streams
  94. o Put CPU workers in separate processes
  95. o Handle multiple cpu workers (one for each cpu, plus one)
  96. o Queue for pending tasks if all workers full
  97. o Support the 'process this onion' task
  98. D Merge dnsworkers and cpuworkers to some extent
  99. o Handle cpuworkers dying
  100. . Scrubbing proxies
  101. - Find an smtp proxy?
  102. - Check the old smtp proxy code
  103. o Find an ftp proxy? wget --passive
  104. D Wait until there are packet redirectors for Linux
  105. . Get socks4a support into Mozilla
  106. . Develop rendezvous points
  107. X Handle socks commands other than connect, eg, bind?
  108. o Design
  109. - Spec
  110. - Implement
  111. . Tests
  112. o Testing harness/infrastructure
  113. D System tests (how?)
  114. - Performance tests, so we know when we've improved
  115. . webload infrastructure (Bruce)
  116. . httperf infrastructure (easy to set up)
  117. . oprofile (installed in RH >8.0)
  118. NICK . Daemonize and package
  119. o Teach it to fork and background
  120. - Red Hat spec file
  121. o Debian spec file equivalent
  122. . Portability
  123. . Which .h files are we actually using?
  124. . Port to:
  125. o Linux
  126. o BSD
  127. . Solaris
  128. o Cygwin
  129. . Win32
  130. o OS X
  131. - deal with pollhup / reached_eof on all platforms
  132. o openssl randomness
  133. o inet_ntoa
  134. o stdint.h
  135. - Make a script to set up a local network on your machine
  136. o More flexibility in node addressing
  137. D Support IPv6 rather than just 4
  138. o Handle multihomed servers (config variable to set IP)
  139. In the distant future:
  140. D Load balancing between router twins
  141. D Keep track of load over links/nodes, to
  142. know who's hosed
  143. SPEC!! D Non-clique topologies
  144. D Implement our own memory management, at least for common structs
  145. (Not ever necessary?)
  146. D Advanced directory servers
  147. D Automated reputation management
  148. SPEC!! D Figure out how to do threshold directory servers
  149. D jurisdiction info in dirserver entries? other info?
  150. Older (done) todo stuff:
  151. For 0.0.2pre17:
  152. o Put a H(K | handshake) into the onionskin response
  153. o Make cells 512 bytes
  154. o Reduce streamid footprint from 7 bytes to 2 bytes
  155. X Check for collisions in streamid (now possible with
  156. just 2 bytes), and back up & replace with padding if so
  157. o Use the 4 reserved bytes in each cell header to keep 1/5
  158. of a sha1 of the ongoing relay payload (move into stream header)
  159. o Move length into the stream header too
  160. o Make length 2 bytes
  161. D increase DH key length
  162. D increase RSA key length
  163. D Spec the stream_id stuff. Clarify that nobody on the backward
  164. stream should look at stream_id.
  165. Cell:
  166. ACI (anonymous circuit identifier) [2 bytes]
  167. Command [1 byte]
  168. Payload (padded with 0 bytes) [509 bytes]
  169. Relay payload:
  170. Relay command [1 byte]
  171. Stream ID [7 bytes]
  172. Partial SHA-1 [4 bytes]
  173. Length [2 bytes]
  174. Relay payload [495 bytes]
  175. For 0.0.2pre15:
  176. o don't pick exit nodes which will certainly reject all things.
  177. o don't pick nodes that the directory says are down
  178. o choose randomly from running dirservers, not just first one
  179. o install the man page
  180. o warn when client-side tries an address/port which no router in the dir accepts.
  181. For 0.0.2pre14:
  182. o More flexible exit policies (18.*, 18.0.0.0/8)
  183. o Work to succeed in the precense of exit policy violation
  184. o Replace desired_path_len with opaque path-selection specifier
  185. o Client-side DNS caching
  186. o Add entries to client DNS cache based on END cells
  187. o Remove port from END_REASON_EXITPOLICY cells
  188. o Start building new circuits when we get an exit-policy
  189. failure. (Defer exiting from the middle of existing
  190. circuits or extending existing circuits for later.)
  191. o Implement function to check whether a routerinfo_t
  192. supports a given exit addr.
  193. o Choose the exit node of an in-progress circuit based on
  194. pending AP connections.
  195. o Choose the exit node _first_, then beginning, then
  196. middle nodes.
  197. Previous:
  198. o Get tor to act like a socks server
  199. o socks4, socks4a
  200. o socks5
  201. o routers have identity key, link key, onion key.
  202. o link key certs are
  203. D signed by identity key
  204. D not in descriptor
  205. o not in config
  206. D not on disk
  207. o identity and onion keys are in descriptor (and disk)
  208. o upon boot, if it doesn't find identity key, generate it and write it.
  209. o also write a file with the identity key fingerprint in it
  210. o router generates descriptor: flesh out router_get_my_descriptor()
  211. o Routers sign descriptors with identity key
  212. o routers put version number in descriptor
  213. o routers should maybe have `uname -a` in descriptor?
  214. o Give nicknames to routers
  215. o in config
  216. o in descriptors
  217. o router posts descriptor
  218. o when it boots
  219. o every DirFetchPostPeriod seconds
  220. D when it changes
  221. o change tls stuff so certs don't get written to disk, or read from disk
  222. o make directory.c 'thread'safe
  223. o dirserver parses descriptor
  224. o dirserver checks signature
  225. D client checks signature?
  226. o dirserver writes directory to file
  227. o reads that file upon boot
  228. o directory includes all routers, up and down
  229. o add "up" line to directory, listing nicknames
  230. o instruments ORs to report stats
  231. o average cell fullness
  232. o average bandwidth used
  233. o configure log files. separate log file, separate severities.
  234. o what assumptions break if we fclose(0) when we daemonize?
  235. o make buffer struct elements opaque outside buffers.c
  236. o add log convention to the HACKING file
  237. o make 'make install' do the right thing
  238. o change binary name to tor
  239. o change config files so you look at commandline, else look in
  240. /etc/torrc. no cascading.
  241. o have an absolute datadir with fixed names for files, and fixed-name
  242. keydir under that with fixed names
  243. o Move (most of) the router/directory code out of main.c
  244. o Simple directory servers
  245. o Include key in source; sign directories
  246. o Signed directory backend
  247. o Document
  248. o Integrate
  249. o Add versions to code
  250. o Have directories list recommended-versions
  251. o Include line in directories
  252. o Check for presence of line.
  253. o Quit if running the wrong version
  254. o Command-line option to override quit
  255. o Add more information to directory server entries
  256. o Exit policies
  257. o Clearer bandwidth management
  258. o Do we want to remove bandwidth from OR handshakes?
  259. o What about OP handshakes?
  260. X Move away from openssl
  261. o Abstract out crypto calls
  262. X Look at nss, others? Just include code?
  263. o Use a stronger cipher
  264. o aes now, by including the code ourselves
  265. X On the fly compression of each stream
  266. o Clean up the event loop (optimize and sanitize)
  267. o Remove that awful concept of 'roles'
  268. o Terminology
  269. o Circuits, topics, cells stay named that
  270. o 'Connection' gets divided, or renamed, or something?
  271. o DNS farm
  272. o Distribute queries onto the farm, get answers
  273. o Preemptively grow a new worker before he's needed
  274. o Prune workers when too many are idle
  275. o DNS cache
  276. o Clear DNS cache over time
  277. D Honor DNS TTL info (how??)
  278. o Have strategy when all workers are busy
  279. o Keep track of which connections are in dns_wait
  280. o Need to cache positives/negatives on the tor side
  281. o Keep track of which queries have been asked
  282. o Better error handling when
  283. o An address doesn't resolve
  284. o We have max workers running
  285. o Consider taking the master out of the loop?
  286. X Implement reply onions
  287. o Total rate limiting
  288. o Look at OR handshake in more detail
  289. o Spec it
  290. o Merge OR and OP handshakes
  291. o rearrange connection_or so it doesn't suck so much to read
  292. D Periodic link key rotation. Spec?
  293. o wrap malloc with something that explodes when it fails
  294. o Clean up the number of places that get to look at prkey