Precision and recall are the two basic metrics for evaluating how well a model detects or classifies things. Precision answers "of everything the model flagged as positive, how much was actually correct?" It's calculated as TP / (TP + FP), where TP and FP are the counts of true and false positives. Recall answers "of everything that actually exists, how much did the model find?" It's calculated as TP / (TP + FN), where FN is the count of false negatives. Together, they capture the two ways a model can fail: false alarms (hurting precision) and missed detections (hurting recall).
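The two formulas can be sketched in a few lines of Python; the helper name and toy data here are illustrative, not from any library:

```python
def precision_recall(y_true, y_pred):
    """Precision and recall for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0  # of flagged, how many correct
    recall = tp / (tp + fn) if tp + fn else 0.0     # of actual, how many found
    return precision, recall

# Four actual positives; the model flags four items, three of them correctly:
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
print(precision_recall(y_true, y_pred))  # (0.75, 0.75)
```

Note the guards against zero denominators: a model that flags nothing has undefined precision, and conventions for that edge case vary.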
There's a natural tension between them. Lowering the confidence threshold means the model flags more detections, catching more true positives (higher recall) but also introducing more false positives (lower precision). Raising the threshold does the opposite. The precision-recall curve plots this tradeoff across all thresholds, and the area under it gives Average Precision (AP). In object detection, these metrics are computed at specific IoU thresholds: a prediction only counts as a true positive if it overlaps sufficiently with a ground truth box.
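The threshold tradeoff and the IoU matching rule can both be demonstrated with toy data (the function names and scores below are made up for illustration):

```python
def pr_at_threshold(y_true, scores, t):
    """Precision and recall when only detections scoring >= t are kept."""
    y_pred = [1 if s >= t else 0 for s in scores]
    tp = sum(1 for yt, yp in zip(y_true, y_pred) if yt == 1 and yp == 1)
    fp = sum(1 for yt, yp in zip(y_true, y_pred) if yt == 0 and yp == 1)
    fn = sum(1 for yt, yp in zip(y_true, y_pred) if yt == 1 and yp == 0)
    precision = tp / (tp + fp) if tp + fp else 1.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union

# Three real positives, two negatives; higher score = more confident.
y_true = [1, 1, 1, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.45]
# Lowering the threshold raises recall but lowers precision:
for t in (0.8, 0.5, 0.3):
    p, r = pr_at_threshold(y_true, scores, t)
    print(f"threshold {t}: precision {p:.2f}, recall {r:.2f}")

# Two boxes overlapping in half their width: IoU = 50 / 150 = 1/3,
# so this prediction fails a 0.5 IoU threshold and counts as a false positive.
print(iou((0, 0, 10, 10), (5, 0, 15, 10)))
```

Sweeping `t` over all distinct score values produces the full precision-recall curve; AP is then the area under those points.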
Which metric matters more depends on the application. Medical screening prioritizes recall because missing a tumor is worse than a false alarm. A self-driving car's emergency braking system may prioritize precision to avoid unnecessary hard stops. The F1 score (harmonic mean of precision and recall) gives a balanced single number when you care about both equally.
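The F1 score is a one-liner, but the choice of harmonic rather than arithmetic mean matters, as this sketch shows:

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; defined as 0 when both are 0."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# The harmonic mean punishes imbalance: a model with precision 0.9 but
# recall 0.1 scores far below their arithmetic mean of 0.5.
print(round(f1_score(0.9, 0.1), 2))  # 0.18
print(round(f1_score(0.5, 0.5), 2))  # 0.5
```

Because the harmonic mean is dragged toward the smaller of the two values, a high F1 requires the model to do well on both precision and recall at once.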

