New method localizes prompt ambiguity in LLMs with probe-targeted attribution

AnalysisAI Models

6 days ago

New method localizes prompt ambiguity in LLMs with probe-targeted attribution

The paper introduces probe-targeted attribution to identify which parts of a prompt cause ambiguity in LLM outputs. It provides a way to localize latent ambiguity without requiring observable failures.

6 days ago