108-mtbf-based-uptime.txt 1.6 KB

123456789101112131415161718192021222324252627282930313233343536373839404142
  1. Filename: 108-mtbf-based-stability.txt
  2. Title: Base "Stable" Flag on Mean Time Between Failures
  3. Version: $Revision: 12105 $
  4. Last-Modified: $Date: 2007-01-30T07:50:01.643717Z $
  5. Author: Nick Mathewson
  6. Created:
  7. Status: Open
  8. Overview:
  9. This document proposes that we change how directory authorities set the
  10. stability flag from inspection of routers declared Uptime to the
  11. authorities' perceived mean time between failure for the router.
  12. Motivation:
  13. Clients prefer nodes that the authorities call Stable. This flags are (as
  14. of 0.2.0.0-alpha-dev) set entirely based on the nodes' declared values for
  15. uptime. This creates an opportunity for malicious nodes to declare
  16. falsely high uptimes in order to get more traffic.
  17. Spec changes:
  18. Instead of setting the current rule for setting the Stable flag:
  19. "An authority should call a server Stable if its observed MTBF for
  20. the past month is at or above the median MTBF for Valid servers.
  21. MTBF shall be defined as the mean length of the runs observed by a
  22. given directory authority. A run begins when an authority decides
  23. that the server is Running, and ends when the authority decides that
  24. the server is not Running. In-progress runs are counted when
  25. measuring MTBF."
  26. Issues:
  27. How do you define a clipped MTBF? If the current month begins with one
  28. day at the end of a one-year uptime, and then has 29 days of uptime, do we
  29. average one day and 29 days? Or do we average one year and 29 days? Or
  30. take 29 days on its own and discard the year?
  31. Surely somebody has done this kinds of thing before.