On Estimations

Alice needs to make a decision that depends on the time that Bob needs to complete a task. Hi Bob! How long will you need to complete this task?.

Bob scratches his head and answers, About 5 days... yeah, a working week.

Alice notes down the estimate. But when she gets back to it, she realises that it is rather ambiguous, leaving room for various interpretations, including:

Estimates are a tool to address uncertainty. Bob doesn't know how long he needs to complete the task, but he has an intuition about what time frame is realistic. With his estimate, Bob is trying to communicate the likelihood of different outcomes, which he can do more precisely borrowing from the abstractions that probability theory uses to model uncertainty.

Bob and Alice agree to describe estimates as confidence intervals (x, y), where:

Bob can now say I estimate that this task will take between 2 and 10 working days, and be confident that Alice understands, precisely, that the probability of completing the task in 1 or 2 days is 15% and the probability of finishing within 10 days is 85%. She can also infer that there is a 70% chance to complete the task within 2 to 10 days.

While confidence intervals don't fully describe the probability distribution of the estimate, they provide sufficient information for practical decision making and rigorous assessment of the estimation process quality. For instance, assuming that Bob's estimates are accurate, Alice can infer that:

Unfortunately, there is no reason to believe that Bob's estimates are magically accurate. After all, humans are famously bad at estimating. However, Alice and Bob can evaluate the quality of the estimates and understand how to calibrate the estimation process to rectify the inaccuracies. They start collecting estimates and the actual duration of the tasks once they are completed. Analysing the samples, they can determine how often the task duration aligns with the predicted interval, surpasses the pessimistic end, or falls short of the optimistic end. Bob can then adjust his estimation process until he provides reliable estimates:

Through this iterative process, Bob can refine his estimation process to consistently provide accurate estimates.

Confidence intervals are a relatively straightforward and effective way to describe estimates using probability theory. Nonetheless, Alice and Bob have numerous other possibilities to explore. What matters is that:

Without these properties, estimates are unreliable and lack the necessary feedback loop to improve them with practice. Unsurprisingly, people find these estimation processes frustrating. Estimators feel that they are expected to provide wild guesses that are so uncertain that everyone feels compelled to artificially extend all timelines significantly to have any chance of meeting them. In contrast, when estimates defined precisely using probability theory, can be collected and compared to actual outcomes. This validation process enables estimators to improve their skills and allows stakeholders to make better decisions, only padding timelines affected by low confidence estimates.