This repository provides a set of tools and examples for converting and using powerful vision models, DINOv3 and EdgeTAM (SAM2), within the ONNX ecosystem. The focus is on building efficient, PyTorch-independent inference pipelines for tasks such as one-shot segmentation, foreground extraction, and robust video object tracking. TFLite/LiteRT exports and demos are also included.
├── notebooks/
│   ├── dinov3_onnx_export.ipynb                    # Exports DINOv3 to ONNX
│   ├── dinov3_tflite_export.ipynb                  # Exports DINOv3 to TFLite
│   ├── edgetam_onnx_export.ipynb                   # Exports EdgeTAM encoder/decoder to ONNX
│   ├── foreground_segmentation_onnx_export.ipynb   # Trains and exports a foreground classifier
│   ├── dinov3_one_shot_segmentation_onnx.ipynb     # Demo for one-shot segmentation with ONNX
│   └── dinov3_one_shot_segmentation_tflite.ipynb   # Demo for one-shot segmentation with TFLite
│
└── scripts/
    └── hybrid_tracker.py

Each notebook is self-contained and can be run directly in Google Colab.
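Once exported, the models need only ONNX Runtime and NumPy at inference time. A minimal sketch of loading a DINOv3 export and extracting features is shown below; the file name `dinov3.onnx`, the input shape, and the preprocessing are assumptions that depend on how the export notebook was run.

```python
import numpy as np
import onnxruntime as ort

# Load the exported DINOv3 feature extractor.
# "dinov3.onnx" is an assumed file name; use the path produced by the export notebook.
session = ort.InferenceSession("dinov3.onnx", providers=["CPUExecutionProvider"])

# Placeholder input: a preprocessed RGB image as NCHW float32.
# Real preprocessing (resize + normalisation) should follow the demo notebooks.
image = np.random.rand(1, 3, 224, 224).astype(np.float32)

input_name = session.get_inputs()[0].name
features = session.run(None, {input_name: image})[0]
print(features.shape)  # e.g. (1, num_patches, feature_dim), depending on the export
```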
| Notebook | Description | Link |
|---|---|---|
| dinov3_onnx_export.ipynb | Converts the DINOv3 Vision Transformer (ViT) feature extractor to ONNX format. | link |
| dinov3_tflite_export.ipynb | Converts the DINOv3 Vision Transformer (ViT) feature extractor to TFLite format. | link |
| edgetam_onnx_export.ipynb | Exports the EdgeTAM image encoder and mask decoder models to ONNX for efficient segmentation. | link |
| foreground_segmentation_onnx_export.ipynb | Trains a logistic regression classifier on DINOv3 features for foreground segmentation and exports it to ONNX (sketched below the table). | link |
| dinov3_one_shot_segmentation_onnx.ipynb | Demonstrates one-shot segmentation using DINOv3 features and a reference mask, all in ONNX (sketched below the table). | link |
| dinov3_one_shot_segmentation_tflite.ipynb | Demonstrates one-shot segmentation using DINOv3 features and a reference mask, all in TFLite. | link |
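The foreground-segmentation notebook pairs DINOv3 patch features with a lightweight classifier. A hedged sketch of that idea, training a logistic regression on per-patch features and exporting it with skl2onnx, is below; the feature dimension, the random placeholder data, the file names, and the use of skl2onnx are assumptions, not necessarily the notebook's exact setup.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from skl2onnx import convert_sklearn
from skl2onnx.common.data_types import FloatTensorType

# Placeholder training data: one row per DINOv3 patch feature,
# label 1 = foreground, 0 = background (in practice these come from annotated reference images).
feature_dim = 384  # assumption; depends on the DINOv3 variant
X = np.random.rand(1000, feature_dim).astype(np.float32)
y = (np.random.rand(1000) > 0.5).astype(np.int64)

clf = LogisticRegression(max_iter=1000).fit(X, y)

# Export the classifier to ONNX so inference needs neither PyTorch nor scikit-learn.
onnx_clf = convert_sklearn(
    clf, initial_types=[("patch_features", FloatTensorType([None, feature_dim]))]
)
with open("foreground_classifier.onnx", "wb") as f:
    f.write(onnx_clf.SerializeToString())
```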
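The one-shot segmentation demos follow a simple pattern: extract DINOv3 patch features for a reference and a target image, pool the reference features under the reference mask into a prototype, and score target patches by cosine similarity. A hedged sketch of that flow is below; the model file name, the I/O shapes, the square patch grid without a CLS token, and the similarity threshold are all assumptions.

```python
import numpy as np
import onnxruntime as ort

def patch_features(session, image):
    """Run the DINOv3 ONNX export and return L2-normalised patch features of shape (N, C)."""
    input_name = session.get_inputs()[0].name
    feats = session.run(None, {input_name: image})[0][0]  # assumes output shape (1, N, C)
    return feats / (np.linalg.norm(feats, axis=-1, keepdims=True) + 1e-8)

session = ort.InferenceSession("dinov3.onnx", providers=["CPUExecutionProvider"])

# Placeholder images; in practice these are preprocessed reference/target frames.
ref_img = np.random.rand(1, 3, 224, 224).astype(np.float32)
tgt_img = np.random.rand(1, 3, 224, 224).astype(np.float32)

ref_feats = patch_features(session, ref_img)
tgt_feats = patch_features(session, tgt_img)
grid = int(np.sqrt(ref_feats.shape[0]))  # assumes a square patch grid with no CLS token

# Placeholder reference mask at patch resolution (a real mask would be downsampled to the grid).
ref_mask = np.zeros((grid, grid), dtype=bool)
ref_mask[grid // 4: 3 * grid // 4, grid // 4: 3 * grid // 4] = True

# Prototype = mean foreground feature from the reference; score target patches by cosine similarity.
prototype = ref_feats[ref_mask.ravel()].mean(axis=0)
prototype /= np.linalg.norm(prototype) + 1e-8
similarity = tgt_feats @ prototype
pred_mask = (similarity > 0.5).reshape(grid, grid)  # threshold chosen for illustration only
```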
This work builds upon the official implementations and research from the following projects:

- **DINOv3**: facebookresearch/dinov3
- **EdgeTAM**: facebookresearch/EdgeTAM
- **Space-Time Correspondence as a Contrastive Random Walk**: ajabri/videowalk