Human Level Performance
202502012209
tags: #machine-learning #benchmarking #performance
Human level performance provides a benchmark for machine learning systems, helping determine achievable error rates and guide optimization efforts.
Why it matters:
- Establishes theoretical lower bound (Bayes error)
- Helps distinguish between bias and variance problems
- Guides where to focus improvement efforts
- Provides business context for model performance
When human performance is useful:
- Tasks humans excel at (image recognition, natural language)
- Domains where human expertise is well-established
- Problems where human-level accuracy is the business requirement
Limitation: For some tasks, machines can surpass human performance (structured data analysis, chess), so human benchmarks become less relevant.
Practical application:
If your model performs much worse than humans, focus on reducing bias through better algorithms or Feature Engineering. If close to human level, collect more data or improve Error Analysis process.
This connects to Bias vs Variance diagnosis - human performance helps identify the achievable baseline.
Reference
Machine Learning Yearning by Andrew Ng