Skip to content

Here's a well-structured GitHub issue draft for your request: Issue: Generating Episodic Data for Object Navigation Challenge – Visibility Criteria #2149

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
sktalha68 opened this issue Jan 31, 2025 · 0 comments

Comments

@sktalha68
Copy link

sktalha68 commented Jan 31, 2025

I am working on generating episodic data for the Object Navigation Challenge. Based on the paper, the required criteria for an episode include:

Agent’s initial location
Target object
Geodesic and Euclidean distances
The ratio of geodesic/Euclidean > 1.05
Valid viewpoints: Points within a 1m radius of the target object that are navigable and have visibility of the object

I am able to satisfy most of the criteria, but I am stuck on determining object visibility for a given viewpoint.
What I Have Tried

Semantic Sensor Data
    I used the semantic sensor from both HM3D Semantics and MP3D datasets.
    However, for some reason, the semantic sensor values for the HM3D  Sem dataset are all zeroes.
    To work around this, I tried using the semantic.glb file and extracting semantic information from the RGB sensor, which seems odd to me.

Mapping Pixel Values to Object Instances
    Even when I extract semantic information, the pixel values are too large (somewhere in an 1k-3k range).
    I couldn’t find a mapping between pixel values and object instances.

Questions

How do I correctly determine the visibility of an object from a given viewpoint?
Is my approach (using semantic sensor values) correct? If not, what is the recommended method?
Is there any reference code or structure used for episodic data generation?
If I am on the right track, how can I extract visibility information more reliably?

Expected Help

Guidance on correctly obtaining object visibility
Clarification on why semantic sensor values for HM3D appear as zeros
Any available codebase for episode generation to compare my approach

Any help would be greatly appreciated! Thanks in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant