LaunchDevelopers
14 days ago
Perplexity open-sources Unigram tokenizer with 5-6x CPU reduction

Perplexity
@perplexity_aiCuriosity changes everything. Download our free app on iOS, Mac, Windows, and Android.
San Francisco, CAperplexity.ai/personal-computer

Perplexity AI
@perplexity_ai
We're open-sourcing the Unigram tokenizer we rebuilt to reduce CPU utilization by 5-6x. Small rerankers and embedders run in single-digit milliseconds on GPU, making CPU tokenization a meaningful share of total latency. https://t.co/QUnHeiho56 https://t.co/Oh29f1lo51

·
14 days ago