[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Papers, code and datasets about deep learning and multi-modal learning for video analysis
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
Generic PyTorch dataset implementation to load and augment videos for deep-learning training loops.
Awesome papers & datasets specifically focused on long-term videos.
The official repository of "Sekai: A Video Dataset towards World Exploration"
500,000 multimodal short videos with baseline models (TensorFlow 2.0).
[ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Surveillance-Perspective Human Action Recognition Dataset: 7,759 videos across 14 action classes, aggregated from multiple sources, all spatio-temporally cropped and filmed from a surveillance-camera-like viewpoint.
SoccerAct10 is a dataset containing 10 different soccer actions, built from YouTube videos.
The Most Comprehensive Survey of Video Quality Assessment to Date.
Tools for loading video datasets and applying video transforms in PyTorch; video files can be loaded directly without preprocessing.
🌱 Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation
[NeurIPS'23] The official implementation of paper "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method"
Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
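Several of the repositories above provide PyTorch-style video dataset loaders. The common pattern is a map-style dataset: an object exposing `__len__` and `__getitem__` that decodes a video and samples a fixed-length clip for each index. The sketch below shows that protocol in plain Python, with the decoder stubbed out (the file names, frame count, and `ClipDataset` class are illustrative, not taken from any repo above); a real loader would decode frames with something like `torchvision.io.read_video` and return tensors.

```python
import random

class ClipDataset:
    """Map-style dataset sketch: PyTorch's DataLoader only requires
    __len__ and __getitem__, so the protocol can be shown without torch.
    The decoder is a stub; real code would read actual video frames."""

    def __init__(self, video_paths, clip_len=16):
        self.video_paths = list(video_paths)
        self.clip_len = clip_len

    def __len__(self):
        return len(self.video_paths)

    def _decode(self, path):
        # Stub decoder: pretend every video has 300 frames of 2x2 "pixels".
        return [[[0.0, 0.0], [0.0, 0.0]] for _ in range(300)]

    def __getitem__(self, idx):
        frames = self._decode(self.video_paths[idx])
        # Temporal augmentation: sample a random contiguous clip.
        start = random.randint(0, len(frames) - self.clip_len)
        clip = frames[start:start + self.clip_len]
        return clip, idx  # (clip, label placeholder)

ds = ClipDataset([f"video_{i}.mp4" for i in range(4)], clip_len=16)
clip, label = ds[0]
print(len(ds), len(clip))  # 4 16
```

Because only the two dunder methods are required, the same object can be handed to `torch.utils.data.DataLoader` unchanged, which then adds batching and multi-process decoding on top.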