Hugging Face
Models: 721
Sort: Trending
Active filters: image-to-text
Salesforce/blip-image-captioning-large • Image-to-Text • Updated Dec 7, 2023 • 1.94M • 1.19k
alibaba-damo/mgp-str-base • Image-to-Text • Updated Dec 11, 2023 • 8.86k • 60
Salesforce/blip-image-captioning-base • Image-to-Text • Updated Aug 1, 2023 • 1.65M • 521
microsoft/trocr-base-handwritten • Image-to-Text • Updated May 27 • 915k • 343
MohamedRashad/arabic-large-nougat • Image-to-Text • Updated 7 days ago • 296 • 3
kha-white/manga-ocr-base • Image-to-Text • Updated Jun 22, 2022 • 87.4k • 125
U4R/StructTable-InternVL2-1B • Image-to-Text • Updated Oct 22 • 2.75k • 25
prithivMLmods/Florence-2-VLM-Doc-VQA • Image-to-Text • Updated Oct 26 • 90 • 6
kazars24/trocr-base-handwritten-ru • Image-to-Text • Updated Oct 27 • 1.37k • 2
keras-io/ocr-for-captcha • Image-to-Text • Updated May 29, 2022 • 132 • 65
microsoft/trocr-base-printed • Image-to-Text • Updated May 27 • 73.8k • 149
microsoft/trocr-base-stage1 • Image-to-Text • Updated May 27 • 19.2k • 13
microsoft/trocr-large-handwritten • Image-to-Text • Updated May 27 • 36.8k • 95
microsoft/trocr-large-printed • Image-to-Text • Updated May 27 • 166k • 138
microsoft/trocr-large-stage1 • Image-to-Text • Updated May 27 • 3.14k • 22
microsoft/trocr-small-handwritten • Image-to-Text • Updated May 27 • 552k • 40
microsoft/trocr-small-printed • Image-to-Text • Updated May 27 • 153k • 32
nlpconnect/vit-gpt2-image-captioning • Image-to-Text • Updated Feb 27, 2023 • 2.17M • 844
TeamFnord/manga-ocr • Image-to-Text • Updated Feb 10, 2022 • 59 • 10
naver-clova-ix/donut-base-finetuned-cord-v2 • Image-to-Text • Updated Aug 13, 2022 • 18.4k • 82
naver-clova-ix/donut-base • Image-to-Text • Updated Aug 13, 2022 • 44.9k • 178
microsoft/trocr-large-str • Image-to-Text • Updated Jan 24, 2023 • 2.16k • 17
espnet/iam_handwriting_ocr • Image-to-Text • Updated Nov 8, 2022 • 5 • 5
microsoft/git-base • Image-to-Text • Updated Apr 24, 2023 • 2.07M • 82
microsoft/git-large-coco • Image-to-Text • Updated Jun 26, 2023 • 8.73k • 98
tuman/vit-rugpt2-image-captioning • Image-to-Text • Updated Jan 26, 2023 • 369 • 13
ddobokki/ko-trocr • Image-to-Text • Updated Oct 22 • 8.94k • 21
google/pix2struct-textcaps-large • Image-to-Text • Updated May 19, 2023 • 92 • 14
google/pix2struct-base • Image-to-Text • Updated Dec 24, 2023 • 6.4k • 64
Xenova/vit-gpt2-image-captioning • Image-to-Text • Updated Oct 8 • 1.5k • 21