STEM与日常科技·英语精读30篇(5)
22 / 30
正在确认阅读权限…
RAG Failures in Customer Service Chatbots: When Retrieval Misleads Generation
客服聊天机器人中的RAG失效:检索环节如何误导生成结果
-
Retrieval-Augmented Generation systems retrieve policy documents before generating replies, yet outdated PDFs often rank higher than revised web content due to legacy metadata.
-
A query about 'return window extension' may retrieve a discontinued holiday policy instead of current standard terms, causing misleading confidence in the response.
-
Semantic similarity scoring fails when product names change—'Galaxy S23 FE' and 'Galaxy S23 Fan Edition' yield low match scores despite identical coverage.
-
Users reporting billing issues frequently trigger irrelevant warranty clauses because retrieval engines over-index on financial keywords like 'charge' or 'refund'.
-
Post-hoc audit logs reveal that 68% of incorrect RAG outputs stem from retrieval top-k contamination—not generation hallucination.
-
Effective mitigation combines timestamp-aware ranking, domain-specific query rewriting, and explicit uncertainty signaling in low-confidence retrieval contexts.