Hugging Face
Datasets: 409
Sort: Trending
Active filters: image-to-text
MMInstruction/ArxivQA • Viewer • Updated Mar 5 • 16 • 13
edesaras/CircuitSketchTextAnnotations • Viewer • Updated 24 days ago • 1 • 2
NatLibFi/Finna-HKM-images • Preview • Updated 13 days ago • 1
CaptionEmporium/anime-caption-danbooru-2021-sfw-5m-hq • Viewer • Updated 11 days ago • 7
BUAADreamer/llava-med-zh-instruct-60k • Viewer • Updated 8 days ago • 1
red_caps • Updated Jan 18 • 45.5k • 54
facebook/winoground • Updated about 1 month ago • 7.86k • 75
sbu_captions • Viewer • Updated Jan 18 • 105 • 15
visual_genome • Preview • Updated Jun 29, 2023 • 1.92k • 59
google/wit • Viewer • Updated Jul 4, 2022 • 20
AhmedSSabir/Textual-Image-Caption-Dataset • Preview • Updated Feb 20 • 9 • 6
olivierdehaene/xkcd • Viewer • Updated Oct 25, 2022 • 6
facebook/pmd • Updated Aug 9, 2022 • 122 • 31
israfelsr/img-wikipedia-simple • Viewer • Updated Aug 26, 2022
ThierryZhou/test • Updated Jan 29
davanstrien/newspaper_navigator • Viewer • Updated Oct 14, 2022
biglam/brill_iconclass • Viewer • Updated Dec 21, 2023 • 7
jordanparker6/publaynet • Viewer • Updated Jul 19, 2022 • 17 • 11
sadrasabouri/ShahNegar • Viewer • Updated Oct 21, 2022 • 4
actdan2016/sample1 • Updated Aug 29, 2022
gigant/oldbookillustrations • Viewer • Updated Dec 18, 2023 • 28 • 27
priyank-m/SROIE_2019_text_recognition • Viewer • Updated Aug 27, 2022 • 5 • 3
jamescalam/unsplash-25k-photos • Updated Sep 13, 2022 • 31 • 45
priyank-m/chinese_text_recognition • Viewer • Updated Sep 21, 2022 • 1 • 11
yerevann/coco-karpathy • Viewer • Updated Oct 31, 2022 • 507 • 8
jmhessel/newyorker_caption_contest • Viewer • Updated Dec 22, 2023 • 9.74k • 50
kenobi/SDO • Updated Oct 3, 2022 • 1
sled-umich/Action-Effect • Updated Oct 14, 2022 • 1
mcemilg/laion2B-multi-turkish-subset • Viewer • Updated Nov 8, 2022 • 3
alexandrainst/da-wit • Viewer • Updated Nov 18, 2022 • 2