tomg-group-umd/cinepile
Visual Question AnsweringVideo Text To TextEN
Tomg-group-umd/cinepile is a visual question answering-focused dataset in EN distributed in Parquet format.
About tomg-group-umd/cinepile
CinePile: A Long Video Question Answering Dataset and Benchmark
CinePile is a question-answering-based, long-form video understanding dataset. It has been created using advanced large language models (LLMs) with human-in-the-loop pipeline lever...
Details
- Task
- Visual Question Answering, Video Text To Text
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- tomg-group-umd
- Year
- 2024