Bibliographic Details
Main Authors: Templeton, Adly; Conerly, Tom; Marcus, Jonathan; Lindsey, Jack; Bricken, Trenton; Chen, Brian; Pearce, Adam; Citro, Craig; Ameisen, Emmanuel; Jones, Andy; Cunningham, Hoagy; Turner, Nicholas L; McDougall, Callum; MacDiarmid, Monte; Tamkin, Alex; Durmus, Esin; Hume, Tristan; Mosconi, Francesco; Freeman, C. Daniel; Sumers, Theodore R.; Rees, Edward; Batson, Joshua; Jermyn, Adam; Carter, Shan; Olah, Chris; Henighan, Tom
Format: Preprint
Published: 2026
Subjects:
Artificial Intelligence
Online Access: https://arxiv.org/abs/2605.29358

Similar Items

  • Auditing language models for hidden objectives
    by: Marks, Samuel, et al.
    Published: (2025)
  • When Models Manipulate Manifolds: The Geometry of a Counting Task
    by: Gurnee, Wes, et al.
    Published: (2026)
  • Cross-Architecture Model Diffing with Crosscoders: Unsupervised Discovery of Differences Between LLMs
    by: Jiralerspong, Thomas, et al.
    Published: (2026)
  • Emotion Concepts and their Function in a Large Language Model
    by: Sofroniew, Nicholas, et al.
    Published: (2026)
  • Scottish meat consumption survey (Feb–Jul 2023): attitudes, COM-B measures, and perceived effectiveness of meat-reduction policies (Best-Worst Scaling)
    by: McBey, David, et al.
    Published: (2026)
