Error Analysis

202502012210
tags: #machine-learning #debugging #model-improvement

Error analysis involves systematically examining misclassified examples to identify patterns and guide improvement efforts.

The process:

Collect a set of examples your algorithm misclassified
Look at these examples manually (eyeball test)
Count how many errors fall into different categories
Focus improvement efforts on the most frequent error types

Example categories for image classification:

Blurry images: 8% of errors
Low lighting: 15% of errors
Mislabeled data: 6% of errors
Animal photos (in non-animal dataset): 43% of errors

Key insights:

Focus on categories that represent largest error percentages
Don't spend time on rare error types
Some errors may fall into multiple categories

Mislabeled data consideration:
If training data has incorrect labels, only fix them if they significantly impact performance and you can fix them systematically.

Error analysis guides Feature Engineering decisions and helps prioritize which aspects of the model to improve next.

Reference

Machine Learning Yearning by Andrew Ng