Skip to content

Add MM Grounding DINO #37744

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
2 tasks done
rziga opened this issue Apr 24, 2025 · 3 comments
Open
2 tasks done

Add MM Grounding DINO #37744

rziga opened this issue Apr 24, 2025 · 3 comments

Comments

@rziga
Copy link

rziga commented Apr 24, 2025

Model description

Hi,

MM Grounding DINO is MMDetection's implementation of Grounding DINO. It improves zero-shot detection performance quite a bit (especially on LVIS) and is very similar to the original Grounding DINO implementation-wise. I'm mostly interested in it because they also have checkpoints for models with higher resolution FPN features, which are not available for the original model.

The model also appears to be the same inference-wise as LLMDet (#37334), so this could be a good stepping stone for that one.

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

Weights and implementation are available in MMDetection here.

I've already started working on porting the model with modular transformers, so I can open a draft PR if you want.

@Rocketknight1
Copy link
Member

cc @qubvel @NielsRogge

@demoncoder-crypto
Copy link

Is this open for Community contribution?

@qubvel
Copy link
Member

qubvel commented May 1, 2025

Hi @rziga, that sounds super nice! Feel free to ping me when PR is open!

@demoncoder-crypto thanks for your interest, you can collaborate with @rziga if they need any help 🤗

@qubvel qubvel added the Vision label May 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants