TODO 12 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266
  1. rename ACI to CircID
  2. rotate tls-level connections -- make new ones, expire old ones.
  3. dirserver shouldn't put you in running-routers list if you haven't
  4. uploaded a descriptor recently
  5. look at having smallcells and largecells
  6. separate trying to rebuild a circuit because you have none from trying to rebuild a
  7. circuit because the current one is stale
  8. <nickm> If I compromise a node, and streamIDs are sequential, I learn
  9. how many streams have been open and closed on this circuit at this point.
  10. > hm. you learn this for circuits too, do you not?
  11. <nickm> True. But how-many-circuits-from-A-to-B only leaks how long
  12. the connection from A to B has been alive and how much use it's seen.
  13. > ok. needs more investigation.
  14. Legend:
  15. SPEC!! - Not specified
  16. SPEC - Spec not finalized
  17. NICK - nick claims
  18. ARMA - arma claims
  19. - Not done
  20. * Top priority
  21. . Partially done
  22. o Done
  23. D Deferred
  24. X Abandoned
  25. Short-term:
  26. . integrate rep_ok functions, see what breaks
  27. - update tor faq
  28. o obey SocksBindAddress, ORBindAddress
  29. - warn if we're running as root
  30. o make connection_flush_buf() more obviously obsolete
  31. . let hup reread the config file, eg so we can get new exit
  32. policies without restarting
  33. - use times(2) rather than gettimeofday to measure how long it
  34. takes to process a cell
  35. . Exit policies
  36. o Spec how to write the exit policies
  37. - Path selection algorithms
  38. - Let user request certain nodes
  39. - And disallow certain nodes
  40. D Choose path by jurisdiction, etc?
  41. - Make relay end cells have failure status and payload attached
  42. - Streams that fail due to exit policy must reextend to new node
  43. - Add extend_wait state to edge connections, thumb through them
  44. when the AP get an extended cell.
  45. - let non-approved routers handshake.
  46. - just list approved routers in directory.
  47. . migrate to using nickname rather than addr:port for routers
  48. o decide_aci_type
  49. - generate onion skins
  50. - circuit_send_next_onion_skin
  51. - circuit_extend
  52. - onion_generate_cpath
  53. - get_unique_aci_by_addr_port
  54. - circ->n_addr and circ->n_port
  55. - circuit_enumerate_by_naddr_nport
  56. - cpath layers
  57. - connection_or_connect
  58. - connection_exact_get_by_addr_port
  59. - connection_twin_get_by_addr_port
  60. - router_get_by_addr_port
  61. - connection_or_init_conn_from_router
  62. - tag_pack, tag_unpack, connection_cpu_process_inbuf
  63. - directory_initiate_command
  64. . Move from onions to ephemeral DH
  65. o incremental path building
  66. o transition circuit-level sendmes to hop-level sendmes
  67. o implement truncate, truncated
  68. o move from 192byte DH to 128byte DH, so it isn't so damn slow
  69. - exiting from not-last hop
  70. - OP logic to decide to extend/truncate a path
  71. - make sure exiting from the not-last hop works
  72. - logic to find last *open* hop, not last hop, in cpath
  73. - choose exit nodes by exit policies
  74. - Remember address and port when beginning.
  75. - Extend by nickname/hostname/something, not by IP.
  76. On-going
  77. . Better comments for functions!
  78. . Go through log messages, reduce confusing error messages.
  79. . make the logs include more info (fd, etc)
  80. . Unit tests
  81. Mid-term:
  82. . Redo scheduler
  83. o fix SSL_read bug for buffered records
  84. - make round-robining more fair
  85. - What happens when a circuit's length is 1? What breaks?
  86. . streams / circuits
  87. o Implement streams
  88. o Rotate circuits after N minutes?
  89. X Circuits should expire when circuit->expire triggers
  90. NICK . Handle half-open connections
  91. o openssh is an application that uses half-open connections
  92. o Figure out what causes connections to close, standardize
  93. when we mark a connection vs when we tear it down
  94. o Look at what ssl does to keep from mutating data streams
  95. ARMA - Reduce streamid footprint from 7 bytes to 2 bytes
  96. - Check for collisions in streamid (now possible with
  97. just 2 bytes), and back up & replace with padding if so
  98. - Use the 3 saved bytes to put pseudorandomness in each relay cell
  99. - Use the 4 reserved bytes in each cell header to keep 1/5
  100. of a sha1 of the relay payload (move into stream header)
  101. - Move length into the stream header too
  102. - Spec the stream_id stuff. Clarify that nobody on the backward
  103. stream should look at stream_id.
  104. . Put CPU workers in separate processes
  105. o Handle multiple cpu workers (one for each cpu, plus one)
  106. o Queue for pending tasks if all workers full
  107. o Support the 'process this onion' task
  108. D Merge dnsworkers and cpuworkers to some extent
  109. - Handle cpuworkers dying
  110. . Scrubbing proxies
  111. - Find an smtp proxy?
  112. - Check the old smtp proxy code
  113. o Find an ftp proxy? wget --passive
  114. D Wait until there are packet redirectors for Linux
  115. . Get socks4a support into Mozilla
  116. . Develop rendezvous points
  117. SPEC!! - Handle socks commands other than connect, eg, bind?
  118. o Design
  119. - Spec
  120. - Implement
  121. - Tests
  122. o Testing harness/infrastructure
  123. D System tests (how?)
  124. - Performance tests, so we know when we've improved
  125. . webload infrastructure (Bruce)
  126. . httperf infrastructure (easy to set up)
  127. . oprofile (installed in RH >8.0)
  128. NICK . Daemonize and package
  129. o Teach it to fork and background
  130. - Red Hat spec file
  131. - Debian spec file equivalent
  132. . Portability
  133. . Which .h files are we actually using?
  134. . Port to:
  135. o Linux
  136. o BSD
  137. . Solaris
  138. o Cygwin
  139. . Win32
  140. o OS X
  141. - deal with pollhup / reached_eof on all platforms
  142. o openssl randomness
  143. o inet_ntoa
  144. . stdint.h
  145. - Make a script to set up a local network on your machine
  146. - More flexibility in node addressing
  147. D Support IPv6 rather than just 4
  148. - Handle multihomed servers (config variable to set IP)
  149. In the distant future:
  150. D Load balancing between router twins
  151. D Keep track of load over links/nodes, to
  152. know who's hosed
  153. SPEC!! D Non-clique topologies
  154. D Implement our own memory management, at least for common structs
  155. (Not ever necessary?)
  156. D Advanced directory servers
  157. D Automated reputation management
  158. SPEC!! D Figure out how to do threshold directory servers
  159. D jurisdiction info in dirserver entries? other info?
  160. Older (done) todo stuff:
  161. o Get tor to act like a socks server
  162. o socks4, socks4a
  163. o socks5
  164. o routers have identity key, link key, onion key.
  165. o link key certs are
  166. D signed by identity key
  167. D not in descriptor
  168. o not in config
  169. D not on disk
  170. o identity and onion keys are in descriptor (and disk)
  171. o upon boot, if it doesn't find identity key, generate it and write it.
  172. o also write a file with the identity key fingerprint in it
  173. o router generates descriptor: flesh out router_get_my_descriptor()
  174. o Routers sign descriptors with identity key
  175. o routers put version number in descriptor
  176. o routers should maybe have `uname -a` in descriptor?
  177. o Give nicknames to routers
  178. o in config
  179. o in descriptors
  180. o router posts descriptor
  181. o when it boots
  182. D when it changes
  183. o change tls stuff so certs don't get written to disk, or read from disk
  184. o make directory.c 'thread'safe
  185. o dirserver parses descriptor
  186. o dirserver checks signature
  187. D client checks signature?
  188. o dirserver writes directory to file
  189. o reads that file upon boot
  190. o directory includes all routers, up and down
  191. o add "up" line to directory, listing nicknames
  192. o instruments ORs to report stats
  193. o average cell fullness
  194. o average bandwidth used
  195. o configure log files. separate log file, separate severities.
  196. o what assumptions break if we fclose(0) when we daemonize?
  197. o make buffer struct elements opaque outside buffers.c
  198. o add log convention to the HACKING file
  199. o make 'make install' do the right thing
  200. o change binary name to tor
  201. o change config files so you look at commandline, else look in
  202. /etc/torrc. no cascading.
  203. o have an absolute datadir with fixed names for files, and fixed-name
  204. keydir under that with fixed names
  205. o Move (most of) the router/directory code out of main.c
  206. o Simple directory servers
  207. o Include key in source; sign directories
  208. o Signed directory backend
  209. o Document
  210. o Integrate
  211. o Add versions to code
  212. o Have directories list recommended-versions
  213. o Include line in directories
  214. o Check for presence of line.
  215. o Quit if running the wrong version
  216. o Command-line option to override quit
  217. o Add more information to directory server entries
  218. o Exit policies
  219. o Clearer bandwidth management
  220. o Do we want to remove bandwidth from OR handshakes?
  221. o What about OP handshakes?
  222. X Move away from openssl
  223. o Abstract out crypto calls
  224. X Look at nss, others? Just include code?
  225. o Use a stronger cipher
  226. o aes now, by including the code ourselves
  227. X On the fly compression of each stream
  228. o Clean up the event loop (optimize and sanitize)
  229. o Remove that awful concept of 'roles'
  230. o Terminology
  231. o Circuits, topics, cells stay named that
  232. o 'Connection' gets divided, or renamed, or something?
  233. o DNS farm
  234. o Distribute queries onto the farm, get answers
  235. o Preemptively grow a new worker before he's needed
  236. o Prune workers when too many are idle
  237. o DNS cache
  238. o Clear DNS cache over time
  239. D Honor DNS TTL info (how??)
  240. o Have strategy when all workers are busy
  241. o Keep track of which connections are in dns_wait
  242. o Need to cache positives/negatives on the tor side
  243. o Keep track of which queries have been asked
  244. o Better error handling when
  245. o An address doesn't resolve
  246. o We have max workers running
  247. o Consider taking the master out of the loop?
  248. X Implement reply onions
  249. o Total rate limiting
  250. o Look at OR handshake in more detail
  251. o Spec it
  252. o Merge OR and OP handshakes
  253. o rearrange connection_or so it doesn't suck so much to read
  254. D Periodic link key rotation. Spec?
  255. o wrap malloc with something that explodes when it fails
  256. o Clean up the number of places that get to look at prkey