Guardat en:
| Autors principals: | , |
|---|---|
| Format: | Recurso digital |
| Idioma: | |
| Publicat: |
Zenodo
2026
|
| Matèries: | |
| Accés en línia: | https://doi.org/10.5281/zenodo.18716285 |
| Etiquetes: |
Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!
|
Taula de continguts:
- <p>Polite Failure is the new hallucination.<br>When agents can act (email, CRM, APIs), the real risk is not what they say — it’s what they do while staying “helpful” and “confident.”</p> <p>ResponsibilityGym (Demo) is a practical eval protocol that stress-tests agentic systems with proxy traps (Goodhart’s law in action) and measures whether an agent can anticipate harm and self-correct before damage happens.</p> <p>Includes: a demo trap suite, pass/fail signals, logging guidance, and a runbook.<br>Full domain suites and automation are available on request.</p>