Analysis · AI Models
6 days ago
New method localizes prompt ambiguity in LLMs with probe-targeted attribution
The paper introduces probe-targeted attribution, a method for identifying which parts of a prompt cause ambiguity in LLM outputs. It localizes latent ambiguity without requiring an observable failure.