Anchor Box

An anchor box is a predefined rectangular template used by certain object detection models to propose candidate object locations. Before the model processes any image, a fixed set of boxes at various sizes and aspect ratios (for example, 1:1, 1:2, 2:1) is tiled across every position in the feature map. During training, the model learns to classify each anchor as object or background and to refine its coordinates to match the actual object boundaries.
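The tiling described above can be sketched in a few lines. This is a simplified illustration, not any particular library's implementation; the function name, the convention that each feature-map cell corresponds to a `stride`-pixel patch of the input image, and the choice to hold each anchor's area constant across aspect ratios are all assumptions made for the example.

```python
import itertools

def generate_anchors(feature_size, stride, scales, aspect_ratios):
    """Tile one anchor per (scale, ratio) pair at every feature-map cell.

    Illustrative sketch: returns (cx, cy, w, h) boxes in input-image
    pixel coordinates for a square feature map of size `feature_size`.
    """
    anchors = []
    for y, x in itertools.product(range(feature_size), repeat=2):
        # Each cell maps to a stride x stride patch of the input image;
        # center the anchors on that patch.
        cx, cy = (x + 0.5) * stride, (y + 0.5) * stride
        for scale, ratio in itertools.product(scales, aspect_ratios):
            # ratio = h / w; scale w and h so the area stays scale**2
            w = scale * (1.0 / ratio) ** 0.5
            h = scale * ratio ** 0.5
            anchors.append((cx, cy, w, h))
    return anchors

# A 2x2 feature map with stride 16, one scale, three aspect ratios
boxes = generate_anchors(feature_size=2, stride=16,
                         scales=[32], aspect_ratios=[0.5, 1.0, 2.0])
print(len(boxes))  # 2 * 2 * 1 * 3 = 12 anchors
```

Note how the anchor count grows multiplicatively: cells × scales × ratios. A real detector scores and regresses every one of these boxes, which is why anchor configuration matters for both accuracy and compute.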

Anchor-based detectors like Faster R-CNN, SSD, and earlier YOLO versions (v2 through v5) rely heavily on this mechanism. The anchor sizes are usually set by clustering the bounding box dimensions in the training dataset using k-means, so the anchors match the typical object shapes in that domain. Getting the anchor configuration wrong (too few sizes, wrong aspect ratios) directly hurts detection accuracy, especially for objects with unusual proportions.
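A minimal sketch of that clustering step follows. It mirrors the approach popularized by YOLOv2, clustering only box widths and heights with distance defined as 1 − IoU (computed as if all boxes shared a corner, so only shape matters). The function names, the toy dataset, and the fixed random seed are illustrative assumptions, not a specific library's API.

```python
import numpy as np

def iou_wh(boxes, centroids):
    """Pairwise IoU between (w, h) shapes, as if boxes shared one corner."""
    inter = (np.minimum(boxes[:, None, 0], centroids[None, :, 0])
             * np.minimum(boxes[:, None, 1], centroids[None, :, 1]))
    areas = boxes[:, 0] * boxes[:, 1]
    c_areas = centroids[:, 0] * centroids[:, 1]
    return inter / (areas[:, None] + c_areas[None, :] - inter)

def kmeans_anchors(wh, k, iters=100, seed=0):
    """Cluster training-set (w, h) pairs into k anchor shapes (toy sketch)."""
    rng = np.random.default_rng(seed)
    centroids = wh[rng.choice(len(wh), k, replace=False)]
    for _ in range(iters):
        # Assign each box to its highest-IoU centroid (distance = 1 - IoU)
        assign = iou_wh(wh, centroids).argmax(axis=1)
        new = np.array([wh[assign == i].mean(axis=0) if (assign == i).any()
                        else centroids[i] for i in range(k)])
        if np.allclose(new, centroids):
            break
        centroids = new
    return centroids[np.argsort(centroids.prod(axis=1))]  # sort by area

# Toy dataset: small squares and wide boxes separate into two anchors
wh = np.array([[10, 10], [12, 11], [11, 12],
               [40, 20], [42, 19], [38, 21]], dtype=float)
anchors = kmeans_anchors(wh, k=2)
```

Using IoU rather than Euclidean distance keeps large boxes from dominating the clusters, so the resulting anchors reflect typical object shapes, not just typical object sizes.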

The trend in recent architectures has moved away from anchors. Anchor-free detectors like FCOS, CenterNet, and the latest YOLO versions predict object locations directly from feature map points, removing the need to predefine box templates. Transformer-based detectors like DETR go further, replacing anchors entirely with learned object queries. These approaches simplify the training pipeline and eliminate anchor-related hyperparameter tuning.
