Skip to content
VuFind
  • Login
    • English
    • Deutsch
    • Español
    • Français
    • Italiano
Advanced
  • Cite this
  • Text this
  • Email this
  • Print
  • Export Record
    • Export to RefWorks
    • Export to EndNoteWeb
    • Export to EndNote
  • Save to List
  • Permanent link
Cover Image

Saved in:
Bibliographic Details
Main Authors: Mazeika, Mantas, Gatti, Alice, Menghini, Cristina, Sehwag, Udari Madhushani, Singhal, Shivam, Orlovskiy, Yury, Basart, Steven, Sharma, Manasi, Peskoff, Denis, Lau, Elaine, Lim, Jaehyuk, Carroll, Lachlan, Blair, Alice, Sivakumar, Vinaya, Basu, Sumana, Kenstler, Brad, Ma, Yuntao, Michael, Julian, Li, Xiaoke, Ingebretsen, Oliver, Mehta, Aditya, Mottola, Jean, Teichmann, John, Yu, Kevin, Shaik, Zaina, Khoja, Adam, Ren, Richard, Hausenloy, Jason, Phan, Long, Htet, Ye, Aich, Ankit, Rabbani, Tahseen, Shah, Vivswan, Novykov, Andriy, Binder, Felix, Chugunov, Kirill, Ramirez, Luis, Geralnik, Matias, Mesura, Hernán, Lee, Dean, Cardona, Ed-Yeremai Hernandez, Diamond, Annette, Yue, Summer, Wang, Alexandr, Liu, Bing, Hernandez, Ernesto, Hendrycks, Dan
Format: Preprint
Published: 2025
Subjects:
Machine Learning
Artificial Intelligence
Computation and Language
Online Access:https://arxiv.org/abs/2510.26787
Tags: Add Tag
No Tags, Be the first to tag this record!
  • Holdings
  • Description
  • Table of Contents
  • Comments
  • Similar Items
  • Staff View

Internet

https://arxiv.org/abs/2510.26787

Similar Items

  • The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
    by: Ren, Richard, et al.
    Published: (2025)
  • In-Context Learning with Topological Information for Knowledge Graph Completion
    by: Sehwag, Udari Madhushani, et al.
    Published: (2024)
  • Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
    by: Ren, Richard, et al.
    Published: (2024)
  • AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment
    by: Pathmanathan, Pankayaraj, et al.
    Published: (2024)
  • Can LLMs be Scammed? A Baseline Measurement Study
    by: Sehwag, Udari Madhushani, et al.
    Published: (2024)

Search Options

  • Search History
  • Advanced Search

Find More

  • Browse the Catalog
  • Browse Alphabetically
  • Explore Channels
  • Course Reserves
  • New Items

Need Help?

  • Search Tips
  • Ask a Librarian
  • FAQs