New open Vision Language Model by @Google: PaliGemma
- Comes in 3B; pretrained, mix, and fine-tuned models at 224, 448, and 896 resolution
- Combination of the Gemma 2B LLM and the SigLIP image encoder
- Supported in 🤗 transformers

PaliGemma can do:
- Image segmentation and detection!
- Detailed document understanding and reasoning
- Visual question answering, captioning, and any other VLM task!

Read our blog: hf.co/blog/paligemma
Try the demo: hf.co/spaces/google/paligemma
Check out the Spaces and the models in the collection: google/paligemma-release-6643a9ffbf57de2ae0448dda
Collection of fine-tuned PaliGemma models: google/paligemma-ft-models-6643b03efb769dad650d2dda
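Here is a minimal sketch of captioning an image with PaliGemma in 🤗 transformers. The checkpoint id (google/paligemma-3b-mix-224) and the image URL are illustrative choices, not prescriptions; see the collection and model cards above for the full list of checkpoints and prompt prefixes.

```python
# Minimal PaliGemma captioning sketch (assumes a recent transformers release with PaliGemma support).
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration
from PIL import Image
import requests

model_id = "google/paligemma-3b-mix-224"  # example mix checkpoint at 224 resolution
model = PaliGemmaForConditionalGeneration.from_pretrained(model_id)
processor = AutoProcessor.from_pretrained(model_id)

# Placeholder image URL, any RGB image works here.
url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/tasks/car.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# Mix checkpoints use task-prefix prompts such as "caption en" or "detect car".
prompt = "caption en"
inputs = processor(text=prompt, images=image, return_tensors="pt")

# Keep only the newly generated tokens when decoding.
input_len = inputs["input_ids"].shape[-1]
output = model.generate(**inputs, max_new_tokens=30, do_sample=False)
print(processor.decode(output[0][input_len:], skip_special_tokens=True))
```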
Two new VLM benchmarks!
- BLINK (BLINK-Benchmark/BLINK): evaluates tasks that humans can solve within a blink
- SEED-2-Plus (AILab-CVC/SEED-Bench-2-plus): multiple-choice questions on charts, maps, and webs
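A quick sketch of pulling one of these benchmarks with the datasets library, assuming the repos are loadable directly from the Hub; the config and split names are listed on the dataset cards, so we query them rather than hard-coding any.

```python
# Browse the BLINK benchmark subtasks and load one of them (names come from the Hub, not hard-coded).
from datasets import get_dataset_config_names, load_dataset

configs = get_dataset_config_names("BLINK-Benchmark/BLINK")
print(configs)  # list of subtask configs

ds = load_dataset("BLINK-Benchmark/BLINK", configs[0])
print(ds)       # DatasetDict with the splits defined on the dataset card
```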
Fun Spaces
- AI Comic Factory (Running on CPU Upgrade, 6.56k): Create your own AI comic with a single prompt
- LoRA the Explorer (Running on A100, 851)
LLM Playgrounds
- Falcon-180B Demo (Running, 934)
- Co Write With Llama2 (Sleeping, 161)
- LLMs As Chatbot (Runtime error, 80)
- Explore Llamav2 With TGI (Running on CPU Upgrade, 1.2k)
BLIP2 with transformers (Running on Zero, 33): BLIP-2 (cutting-edge image captioning) in 🤗 transformers
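For reference, a minimal BLIP-2 captioning sketch with 🤗 transformers. The checkpoint (Salesforce/blip2-opt-2.7b) and the image URL are assumptions for illustration; the Space above may use a different checkpoint.

```python
# Minimal BLIP-2 image captioning sketch with transformers.
from transformers import Blip2Processor, Blip2ForConditionalGeneration
from PIL import Image
import requests

model_id = "Salesforce/blip2-opt-2.7b"  # example BLIP-2 checkpoint
processor = Blip2Processor.from_pretrained(model_id)
model = Blip2ForConditionalGeneration.from_pretrained(model_id)

# Placeholder image URL for illustration only.
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# No text prompt: plain image captioning.
inputs = processor(images=image, return_tensors="pt")
generated_ids = model.generate(**inputs, max_new_tokens=30)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip())
```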