Collection of image captioning models
Niels Rogge
nielsr
AI & ML interests
Mainly interested in diving into complex Github repos and making AI easier and more accessible to everyone
Blog posts
Organizations
Collections
5
SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released.
-
Sigmoid Loss for Language Image Pre-Training
Paper • 2303.15343 • Published • 3 -
google/siglip-base-patch16-224
Zero-Shot Image Classification • Updated • 28.9k • 7 -
google/siglip-base-patch16-256
Zero-Shot Image Classification • Updated • 176 -
google/siglip-base-patch16-384
Zero-Shot Image Classification • Updated • 360 • 5
spaces
20
models
164
nielsr/test-model
Updated
nielsr/segformer-b0-scene-parse-150
Updated
nielsr/vit-large-patch16-v-jepa
Updated
•
28
•
2
nielsr/imagebind-huge
Updated
•
11
•
2
nielsr/gemma-2b-it
Updated
nielsr/DUSt3R_ViTLarge_BaseDecoder_512_dpt
Updated
•
17
•
1
nielsr/udop-test
Text2Text Generation
•
Updated
•
16
nielsr/RMBG-1.4
Updated
nielsr/cogvlm-tiny-random
Text Generation
•
Updated
•
41
nielsr/crossmae-small-patch16
Updated
•
1
datasets
76
nielsr/llava-batched-inference
Updated
nielsr/test-cogvlm
Updated
nielsr/test-image
Viewer
•
Updated
nielsr/ml6-website-rag
Viewer
•
Updated
•
6
nielsr/breast-cancer
Viewer
•
Updated
•
1.01k
•
7
nielsr/test-files
Updated
nielsr/example-pdf
Viewer
•
Updated
nielsr/test-maskrcnn
Updated
nielsr/dinov2-test-batch
Updated
nielsr/test-data-nougat
Updated