Edit Datasets filters

Multimodal

Visual Question Answering

Video-Text-to-Text

Computer Vision

Depth Estimation

Image Classification

Object Detection

Image Segmentation

Unconditional Image Generation

Video Classification

Zero-Shot Image Classification

Mask Generation

Zero-Shot Object Detection

Image Feature Extraction

Natural Language Processing

Text Classification

Token Classification

Table Question Answering

Question Answering

Zero-Shot Classification

Feature Extraction

Text Generation

Text2Text Generation

Sentence Similarity

Multiple Choice

Audio

Automatic Speech Recognition

Audio Classification

Voice Activity Detection

Tabular

Tabular Classification

Tabular Regression

Tabular to Text

Time Series Forecasting

Reinforcement Learning

Reinforcement Learning

Other

Graph Machine Learning

Datasets

422

Full-text search

Active filters: visual-question-answering

Xkev/LLaVA-CoT-100k

Viewer • Updated 8 days ago • 98.6k • 934 • 32

allenai/pixmo-docs

Viewer • Updated 7 days ago • 255k • 1.85k • 11

HuggingFaceM4/Docmatix

Viewer • Updated Aug 26 • 2.55M • 23.8k • 228

lmms-lab/LLaVA-Video-178K

Viewer • Updated Oct 11 • 1.63M • 48.7k • 92

OpenGVLab/MMPR

Preview • Updated 17 days ago • 282 • 31

tomg-group-umd/pixelprose

Viewer • Updated Jun 23 • 15.6M • 1.2k • 131

HourVideo/HourVideo

Viewer • Updated 1 day ago • 14.2k • 49 • 3

AI4Math/MathVista

Viewer • Updated Feb 11 • 6.14k • 5.9k • 114

longvideobench/LongVideoBench

Viewer • Updated Oct 14 • 6.68k • 4.72k • 15

OpenGVLab/OmniCorpus-CC

Viewer • Updated 18 days ago • 986M • 18k • 11

HuggingFaceFV/finevideo

Viewer • Updated about 1 month ago • 39.5k • 4.83k • 274

allenai/pixmo-cap-qa

Viewer • Updated 3 days ago • 272k • 386 • 3

allenai/pixmo-point-explanations

Viewer • Updated 4 days ago • 79.6k • 311 • 3

Kendamarron/japanese-photo-instruction

Viewer • Updated 3 days ago • 6.44k • 22 • 2

ranjaykrishna/visual_genome

Updated Jun 29, 2023 • 735 • 67

facebook/textvqa

Updated Jan 18 • 722 • 30

dali-does/clevr-math

Preview • Updated Oct 31, 2022 • 204 • 15

jmhessel/newyorker_caption_contest

Viewer • Updated Dec 22, 2023 • 149k • 10.2k • 63

jamescalam/youtube-transcriptions

Viewer • Updated Oct 22, 2022 • 209k • 180 • 35

achang/plot_qa

Viewer • Updated Feb 12, 2023 • 224k • 489 • 7

derek-thomas/ScienceQA

Viewer • Updated Feb 25, 2023 • 21.2k • 2.72k • 152

liuhaotian/LLaVA-Instruct-150K

Preview • Updated Jan 3 • 3.26k • 463

flaviagiammarino/path-vqa

Viewer • Updated Jun 3, 2023 • 32.6k • 1.89k • 32

flaviagiammarino/vqa-rad

Viewer • Updated Jun 3, 2023 • 2.24k • 1.86k • 36

tabtoyou/KoLLaVA-Instruct-150k

Viewer • Updated Nov 30, 2023 • 313k • 54 • 19

tabtoyou/KoLLaVA-CC3M-Pretrain-595K

Viewer • Updated Jun 25, 2023 • 595k • 57 • 10

BAAI/SVIT

Updated Jan 2 • 103 • 28

AILab-CVC/SEED-Bench

Updated May 17 • 1.7k • 21

PetraAI/PetraAI

Updated Sep 14, 2023 • 296 • 20

AlexBlck/ANAKIN

Viewer • Updated Sep 21, 2023 • 2.85k • 2.05k • 1