SpeechLMM v1 - a meetween Collection

Updated Nov 25, 2025

First generation of SpeechLMM models, capable of ingesting video, audio, and text and generating text as output. From the Meetween consortium (meetween.eu).

meetween/Llama-speechlmm-1.0-s
Feature Extraction • 2B • Updated Nov 25, 2025 • 5

meetween/Llama-speechlmm-1.0-m
Feature Extraction • 4B • Updated Nov 25, 2025 • 9

meetween/Llama-speechlmm-1.0-l
Feature Extraction • 8B • Updated Nov 25, 2025 • 38 • 1

meetween/Llama-speechlmm-1.0-xl
Feature Extraction • 1B • Updated Nov 25, 2025 • 1

meetween/Llama-speechlmm-1.0-l-ASR
0.6B • Updated Jun 5, 2025 • 4

meetween/Llama-speechlmm-1.0-l-ST
Translation • 9B • Updated Apr 30, 2025 • 1

meetween/Llama-speechlmm-1.0-l-MT
Translation • 9B • Updated Jun 18, 2025

meetween/Llama-speechlmm-1.0-l-SLU
9B • Updated Jun 19, 2025 • 1

meetween/Llama-speechlmm-1.0-l-LIPREAD
Other • 9B • Updated May 23, 2025 • 1

meetween/Llama-speechlmm-1.0-l-SQA
Translation • 9B • Updated May 22, 2025 • 1

meetween/Llama-speechlmm-1.0-l-SSUM
9B • Updated Apr 22, 2025 • 1

meetween/Llama-speechlmm-1.0-l-TSUM
9B • Updated Aug 22, 2025
