Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Reset Tasks
Multimodal
Visual Question Answering
Video-Text-to-Text
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Table to Text
Multiple Choice
Text Retrieval
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Tabular to Text
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Datasets
422
Full-text search
Edit filters
Sort: Trending
Active filters:
visual-question-answering
Clear all
Xkev/LLaVA-CoT-100k
Viewer
•
Updated
8 days ago
•
98.6k
•
934
•
32
allenai/pixmo-docs
Viewer
•
Updated
7 days ago
•
255k
•
1.85k
•
11
HuggingFaceM4/Docmatix
Viewer
•
Updated
Aug 26
•
2.55M
•
23.8k
•
228
lmms-lab/LLaVA-Video-178K
Viewer
•
Updated
Oct 11
•
1.63M
•
48.7k
•
92
OpenGVLab/MMPR
Preview
•
Updated
17 days ago
•
282
•
31
tomg-group-umd/pixelprose
Viewer
•
Updated
Jun 23
•
15.6M
•
1.2k
•
131
HourVideo/HourVideo
Viewer
•
Updated
1 day ago
•
14.2k
•
49
•
3
AI4Math/MathVista
Viewer
•
Updated
Feb 11
•
6.14k
•
5.9k
•
114
longvideobench/LongVideoBench
Viewer
•
Updated
Oct 14
•
6.68k
•
4.72k
•
15
OpenGVLab/OmniCorpus-CC
Viewer
•
Updated
18 days ago
•
986M
•
18k
•
11
HuggingFaceFV/finevideo
Viewer
•
Updated
about 1 month ago
•
39.5k
•
4.83k
•
274
allenai/pixmo-cap-qa
Viewer
•
Updated
3 days ago
•
272k
•
386
•
3
allenai/pixmo-point-explanations
Viewer
•
Updated
4 days ago
•
79.6k
•
311
•
3
Kendamarron/japanese-photo-instruction
Viewer
•
Updated
3 days ago
•
6.44k
•
22
•
2
ranjaykrishna/visual_genome
Updated
Jun 29, 2023
•
735
•
67
facebook/textvqa
Updated
Jan 18
•
722
•
30
dali-does/clevr-math
Preview
•
Updated
Oct 31, 2022
•
204
•
15
jmhessel/newyorker_caption_contest
Viewer
•
Updated
Dec 22, 2023
•
149k
•
10.2k
•
63
jamescalam/youtube-transcriptions
Viewer
•
Updated
Oct 22, 2022
•
209k
•
180
•
35
achang/plot_qa
Viewer
•
Updated
Feb 12, 2023
•
224k
•
489
•
7
derek-thomas/ScienceQA
Viewer
•
Updated
Feb 25, 2023
•
21.2k
•
2.72k
•
152
liuhaotian/LLaVA-Instruct-150K
Preview
•
Updated
Jan 3
•
3.26k
•
463
flaviagiammarino/path-vqa
Viewer
•
Updated
Jun 3, 2023
•
32.6k
•
1.89k
•
32
flaviagiammarino/vqa-rad
Viewer
•
Updated
Jun 3, 2023
•
2.24k
•
1.86k
•
36
tabtoyou/KoLLaVA-Instruct-150k
Viewer
•
Updated
Nov 30, 2023
•
313k
•
54
•
19
tabtoyou/KoLLaVA-CC3M-Pretrain-595K
Viewer
•
Updated
Jun 25, 2023
•
595k
•
57
•
10
BAAI/SVIT
Updated
Jan 2
•
103
•
28
AILab-CVC/SEED-Bench
Updated
May 17
•
1.7k
•
21
PetraAI/PetraAI
Updated
Sep 14, 2023
•
296
•
20
AlexBlck/ANAKIN
Viewer
•
Updated
Sep 21, 2023
•
2.85k
•
2.05k
•
1
Previous
1
2
3
...
15
Next