Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Tasks
1
Sizes
Sub-tasks
Languages
Licenses
Other
Reset Tasks
Multimodal
Visual Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Table to Text
Multiple Choice
Text Retrieval
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Tabular to Text
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Datasets
158
Full-text search
Edit filters
Sort: Trending
Active filters:
visual-question-answering
Clear all
isidentical/moondream2-coyo-5M-captions
Viewer
•
Updated
1 day ago
•
30
Vi-VLM/Vista
Viewer
•
Updated
1 day ago
•
10
liuhaotian/LLaVA-Instruct-150K
Preview
•
Updated
Jan 3
•
153
•
352
terryoo/TableVQA-Bench
Viewer
•
Updated
20 days ago
•
3
•
4
MMMU/MMMU
Viewer
•
Updated
11 days ago
•
161k
•
135
openbmb/UltraSafety
Viewer
•
Updated
Mar 16
•
76
•
22
Lin-Chen/MMStar
Viewer
•
Updated
Apr 7
•
600
•
11
FreedomIntelligence/MileBench
Viewer
•
Updated
16 days ago
•
1
•
4
parsee-ai/finRAG
Updated
6 days ago
•
4
jamescalam/youtube-transcriptions
Viewer
•
Updated
Oct 22, 2022
•
122
•
28
SALT-NLP/LLaVAR
Viewer
•
Updated
Jul 22, 2023
•
18
Vision-Flan/vision-flan_191-task_1k
Preview
•
Updated
Sep 21, 2023
•
18
•
15
AI4Math/MathVista
Viewer
•
Updated
Feb 11
•
6.02k
•
85
Lin-Chen/ShareGPT4V
Viewer
•
Updated
Nov 22, 2023
•
205
•
205
OpenGVLab/VideoChat2-IT
Updated
Apr 6
•
4
•
18
mathvision/mathvision
Viewer
•
Updated
Feb 29
•
51
•
6
SakanaAI/JA-VG-VQA-500
Viewer
•
Updated
1 day ago
•
23
•
7
turing-motors/LLaVA-v1.5-Instruct-620K-JA
Preview
•
Updated
Apr 12
•
2
•
2
turing-motors/LLaVA-Pretrain-JA
Viewer
•
Updated
Apr 12
•
2
•
1
turing-motors/Japanese-Heron-Bench
Viewer
•
Updated
Apr 12
•
27
•
5
mhan/Shot2Story-134K
Updated
21 days ago
•
1
BUAADreamer/llava-en-zh-300k
Viewer
•
Updated
8 days ago
•
80
•
2
AILab-CVC/SEED-Bench-2-plus
Viewer
•
Updated
18 days ago
•
3
•
5
BUAADreamer/llava-med-zh-instruct-60k
Viewer
•
Updated
8 days ago
•
1
tomg-group-umd/cinepile
Updated
about 5 hours ago
•
1
compguesswhat
Viewer
•
Updated
Feb 7
•
218
•
1
visual_genome
Preview
•
Updated
Jun 29, 2023
•
1.94k
•
59
textvqa
Viewer
•
Updated
Jan 18
•
815
•
21
Leyo/TGIF
Updated
Oct 25, 2022
•
1
HuggingFaceM4/TGIF
Updated
Oct 25, 2022
•
6
•
11
Previous
1
2
3
...
6
Next