Skip to content

Conversation

@kprokofi
Copy link
Contributor

Summary

resolves #5020

  • Provide batch augmentations support per model per task
  • Change augmentation pipeline to hybrid mode (batch augmentation + torchvision.v2). Operate only tensors
  • Remove all custom unnecessary augmentations
  • Provide DataAugmentationFactory to support configurable augmentation both for dataset based and batch based. This solution should work with Geti templates.
  • Provide benchmark results

How to test

Checklist

  • The PR title and description are clear and descriptive
  • I have manually tested the changes
  • All changes are covered by automated tests
  • All related issues are linked to this PR (if applicable)
  • Documentation has been updated (if applicable)

@kprokofi kprokofi added the ALGO Any changes in OTX Algo Tasks implementation label Nov 14, 2025
@kprokofi kprokofi added this to the Geti Tune MVP milestone Nov 14, 2025
@kprokofi
Copy link
Contributor Author

kprokofi commented Nov 14, 2025

First validation results:

Batch based augmentations + tensor only operations can improve iteration time almost 2x. The larger batch the larger improvements

Method Task Model iter_time
Current classification efficientnet_b0 0.446
Proposed classification efficientnet_b0 0.225
Current detection YOLOX_X 0.369
Proposed detection YOLOX_X 0.180

@leoll2 leoll2 removed this from the Geti Tune MVP milestone Nov 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ALGO Any changes in OTX Algo Tasks implementation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Optimize OTX Data Augmentations pipeline

3 participants