Tutorial

Train a 3D Medical Segmentation Model with Swin UNETR on Datature

https://www.youtube.com/embed/alUdDeqQlo8

Swin UNETR is one of the top-performing architectures for 3D medical image segmentation, and this is the most in-depth tutorial on training one without writing code. At 17 minutes, it covers the full pipeline on Datature Nexus: from uploading volumetric scans to reviewing 3D predictions.

What This Tutorial Covers

  • Uploading and preparing 3D medical datasets (CT, MRI) on Datature Nexus
  • Annotating volumetric data with slice-by-slice and MPR tools
  • Selecting Swin UNETR as the segmentation architecture
  • Configuring training parameters for 3D inputs (patch size, spacing, augmentations)
  • Launching the training run and monitoring loss curves
  • Reviewing 3D segmentation predictions on test volumes

Why Swin UNETR for Medical Imaging

Swin UNETR combines the hierarchical feature extraction of Swin Transformers with the encoder-decoder structure of UNETR. It captures both local tissue patterns and global anatomical context across volumetric slices. Published benchmarks show strong results on organ segmentation (BTCV), brain tumor segmentation (BraTS), and cardiac imaging tasks.

Training this model from scratch typically requires custom data loaders, MONAI or nnU-Net frameworks, and significant GPU memory management. Datature Nexus handles the preprocessing, patch extraction, and distributed training automatically.

Use Cases

Organ segmentation for surgical planning. Brain tumor boundary detection. Cardiac chamber segmentation. Lung nodule analysis. Any task where labeled 3D medical volumes are the input and voxel-level predictions are the output.

Go Deeper

Video Description Lorem Ipsum

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua.Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua.

Resources

More reading...

Building VLMs for Phrase Grounding with Datature Vi
January 14, 2026
Datature Vi

Build a vision-language model for phrase grounding on Datature Vi. Annotate multimodal data, configure a VLM workflow, train, and run inference.

Read
Improving Your Computer Vision Models with Metadata
July 1, 2025
Explained

Improve model accuracy by adding metadata to your training pipeline. Learn how camera settings, timestamps, and sensor data boost CV predictions.

Read
Class Imbalance in Computer Vision, Explained
June 6, 2025
Explained

Learn why class imbalance hurts model performance and how to fix it. Covers oversampling, weighted loss functions, focal loss, and augmentation strategies.

Read
Get Started Now

Get Started using Datature’s computer vision platform now for free.