- facebook/vjepa2-vitl-fpc64-256
  Video Classification • 0.3B • Updated • 139k • 200
- microsoft/xclip-base-patch32
  Video Classification • 0.2B • Updated • 82.2k • 113
- MCG-NJU/videomae-base
  Video Classification • 94.2M • Updated • 248k • 53
- OpenGVLab/VideoMAEv2-Base
  Video Classification • 86.2M • Updated • 11.5k • 17
Alban NYANTUDRE
- anyantudre/MooreSpeechCorpora
  Viewer • Updated • 5.54k • 14 • 3
- anyantudre/moore-speech-contes
  Viewer • Updated • 11.9k • 13 • 1
- GO AI Bench Leaderboard • 3
  Open benchmarking leaderboard for NLP models on Mooré & Dyu.
- Moore Language Space • 4
  Demo Space for Mooré language TTS, ASR and translation
- Open VLM Leaderboard • 1.02k
  VLMEvalKit Evaluation Results Collection
- moondream1 • 423
  Generate text using the Phi language model
- Ovis2 1B • 21
  Small model can do big things.
- VQA Autonomous Driving SmolVLM2 • 4
  Visual Question Answering - Autonomous Driving - SmolVLM2
- MinerU Document Extraction Tools • 625
  Embedded MinerU document extraction demo
- DeepSeek OCR 2 Demo • 471
  Try out DeepSeek-OCR-2 on your PDFs or images
- granite-docling-258M demo • 277
  Convert and query documents from images with AI
- Multimodal RAG with Granite Vision • 42
  RAG example using Granite [vision, embedding, instruct]
- OpenGVLab/InternVL3-1B
  Image-Text-to-Text • 0.9B • Updated • 212k • 85
- vikhyatk/moondream2
  Image-Text-to-Text • 2B • Updated • 1.58M • 1.42k
- microsoft/Florence-2-base
  Image-Text-to-Text • 0.2B • Updated • 2.66M • 380
- HuggingFaceTB/SmolVLM2-256M-Video-Instruct
  Image-Text-to-Text • 0.3B • Updated • 158k • 106