LaunchAI Models
4 hours ago
Featured
Kog open-sources 2B model with 3000+ tokens/sec inference

Hugging Face
@huggingface
The AI community building the future. https://t.co/TpiXQMQ9rZ
NYC and Paris
huggingface.co

Hugging Face
@huggingface
RT @ClementDelangue: Kog open-sourced on @huggingface the 2B model that they used to show a model running at 3,000+ tokens per second. Very…
4 hours ago