ocr tencent/HunyuanOCR Image-Text-to-Text • Updated Jan 13 • 223k • 560 opendatalab/MinerU2.5-2509-1.2B Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 110k • 345 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 19 days ago • 339k • 529 PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated 10 days ago • 7.31k • 1.58k
asr FireRedTeam/FireRedASR-AED-L Automatic Speech Recognition • Updated Mar 5, 2025 • 140 • 68 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k
ocr tencent/HunyuanOCR Image-Text-to-Text • Updated Jan 13 • 223k • 560 opendatalab/MinerU2.5-2509-1.2B Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 110k • 345 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 19 days ago • 339k • 529 PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated 10 days ago • 7.31k • 1.58k
asr FireRedTeam/FireRedASR-AED-L Automatic Speech Recognition • Updated Mar 5, 2025 • 140 • 68 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 283k • 1.58k