Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation Paper • 2410.08371 • Published Oct 10, 2024 • 3
DEPAC: a Corpus for Depression and Anxiety Detection from Speech Paper • 2306.12443 • Published Jun 20, 2023
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 11 days ago • 16
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 11 days ago • 16
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 11 days ago • 16
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 11 days ago • 16
toksuite/supertoken_models-llama_google-gemma-2-2b Text Generation • 2B • Updated 10 days ago • 94
toksuite/supertoken_models-llama_meta-llama-Llama-3.2-1B Text Generation • 2B • Updated 10 days ago • 97
toksuite/supertoken_models-llama_CohereLabs-aya-expanse-8b Text Generation • 2B • Updated 10 days ago • 52
toksuite/supertoken_models-llama_tiktoken-gpt-4o Text Generation • 2B • Updated 10 days ago • 56
toksuite/supertoken_models-llama_common-pile-comma-v0.1 Text Generation • 2B • Updated 10 days ago • 74
toksuite/supertoken_models-llama_microsoft-Phi-3-mini-4k-instruct Text Generation • 1B • Updated 10 days ago • 75