microsoft/bitnet-b1.58-2B-4T
Text Generation
•
0.8B
•
Updated
•
5.32k
•
1.24k
Generate high-quality text data for LLMs using FineWeb
The ultimate guide to training LLM on large GPU Clusters
Calculate memory usage for model configurations