Analysis · AI Models
23 days ago
Apple researchers propose EpiCache for long-term LLM conversations on constrained devices
EpiCache addresses the linear growth of the KV cache in LLMs during extended dialogues, targeting memory-constrained environments. Its episodic cache management method aims to maintain coherent, personalized responses over long conversation histories.
