AnalysisAI Models
8 days ago
Paper proposes economic framework for LLM inference budget allocation
Authors model inference-time compute as a resource with a shadow price, enabling optimal budget distribution across queries. The framework treats reasoning tokens as a scarce computational good.
·
8 days ago