Great job with this impressive work!
I am wondering if there is any plan to open-source the ScreenshotVQA benchmark and its evaluation scripts?
Or do you plan to evaluate this on opensource long video benchmarks, such as https://dl.acm.org/doi/10.1609/aaai.v39i7.32775