Saved in:
| Main Authors: | Tan, Daniel, Woodruff, Anders, Warncke, Niels, Jose, Arun, Riché, Maxime, Africa, David Demitri, Taylor, Mia |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.04340 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Identifying a Circuit for Verb Conjugation in GPT-2
by: Africa, David Demitri
Published: (2025)
by: Africa, David Demitri
Published: (2025)
Inoculation Prompting: Instructing LLMs to misbehave at train-time improves test-time alignment
by: Wichers, Nevan, et al.
Published: (2025)
by: Wichers, Nevan, et al.
Published: (2025)
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
by: Betley, Jan, et al.
Published: (2025)
by: Betley, Jan, et al.
Published: (2025)
LURE: Live-Usage Replay Evaluations for Reducing Evaluation Awareness
by: Ivanov, Igor, et al.
Published: (2026)
by: Ivanov, Igor, et al.
Published: (2026)
Steering Awareness: Detecting Activation Steering from Within
by: Rivera, Joshua Fonseca, et al.
Published: (2025)
by: Rivera, Joshua Fonseca, et al.
Published: (2025)
Does Self-Evaluation Enable Wireheading in Language Models?
by: Africa, David Demitri, et al.
Published: (2025)
by: Africa, David Demitri, et al.
Published: (2025)
Consistency Training while Mitigating Obfuscation via Rate Matching
by: Imran, Sohaib, et al.
Published: (2026)
by: Imran, Sohaib, et al.
Published: (2026)
Zwei neue Sandbienen aus der Ukraine und aus Ungarn (Hym. Apoidea)
by: Warncke, Klaus
Published: (1972)
by: Warncke, Klaus
Published: (1972)
Plan for the Development of Library Service in Montana.
by: Warncke, Ruth
Published: (1965)
by: Warncke, Ruth
Published: (1965)
Analyzing Your Community: Basis for Building Library Service.
by: Warncke, Ruth
Published: (1974)
by: Warncke, Ruth
Published: (1974)
Planning Library Workshops and Institutes.
by: Warncke, Ruth
Published: (1976)
by: Warncke, Ruth
Published: (1976)
Learning Dynamics of Meta-Learning in Small Model Pretraining
by: Africa, David Demitri, et al.
Published: (2025)
by: Africa, David Demitri, et al.
Published: (2025)
Investigating ReLoRA: Effects on the Learning Dynamics of Small Language Models
by: Weiss, Yuval, et al.
Published: (2025)
by: Weiss, Yuval, et al.
Published: (2025)
Learning Modular Exponentiation with Transformers
by: Africa, David Demitri, et al.
Published: (2025)
by: Africa, David Demitri, et al.
Published: (2025)
Meta-Pretraining for Zero-Shot Cross-Lingual Named Entity Recognition in Low-Resource Philippine Languages
by: Africa, David Demitri, et al.
Published: (2025)
by: Africa, David Demitri, et al.
Published: (2025)
Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research
by: Martinez, Richard Diehl, et al.
Published: (2025)
by: Martinez, Richard Diehl, et al.
Published: (2025)
ALA Report to IFLA for 1970-1971
by: Clift, David H., et al.
Published: (1972)
by: Clift, David H., et al.
Published: (1972)
Mixed modular perverse sheaves on affine flag varieties and Koszul duality
by: Riche, Simon
Published: (2024)
by: Riche, Simon
Published: (2024)
Emancipation's Daughters
by: Richardson, Riché
Published: (2021)
by: Richardson, Riché
Published: (2021)
Some applications of the geometric Satake equivalence to modular representation theory
by: Riche, Simon
Published: (2024)
by: Riche, Simon
Published: (2024)
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
by: Cencerrado, Iván Vicente Moreno, et al.
Published: (2025)
by: Cencerrado, Iván Vicente Moreno, et al.
Published: (2025)
LLMs cannot find reasoning errors, but can correct them given the error location
by: Tyen, Gladys, et al.
Published: (2023)
by: Tyen, Gladys, et al.
Published: (2023)
Implementing surrogate goals for safer bargaining in LLM-based agents
by: Oesterheld, Caspar, et al.
Published: (2026)
by: Oesterheld, Caspar, et al.
Published: (2026)
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
by: Puerto, Haritz, et al.
Published: (2024)
by: Puerto, Haritz, et al.
Published: (2024)
Evaluating Prompt Engineering Techniques for Accuracy and Confidence Elicitation in Medical LLMs
by: Naderi, Nariman, et al.
Published: (2025)
by: Naderi, Nariman, et al.
Published: (2025)
Useful Public Spending, Taylor Principle, and Macroeconomic Instability
by: Antoine Le Riche
Published: (2025)
by: Antoine Le Riche
Published: (2025)
Terpenes and Terpenoids: How can we use them?
by: Jay Hanssens, et al.
Published: (2025)
by: Jay Hanssens, et al.
Published: (2025)
Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in
by: Agarwal, Utkarsh, et al.
Published: (2024)
by: Agarwal, Utkarsh, et al.
Published: (2024)
MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
by: Modzelewski, Arkadiusz, et al.
Published: (2026)
by: Modzelewski, Arkadiusz, et al.
Published: (2026)
LLMs can see and hear without any training
by: Ashutosh, Kumar, et al.
Published: (2025)
by: Ashutosh, Kumar, et al.
Published: (2025)
On two modular geometric realizations of an affine Hecke algebra
by: Bezrukavnikov, Roman, et al.
Published: (2024)
by: Bezrukavnikov, Roman, et al.
Published: (2024)
Equivariant Koszul Duality, Modular Category $\mathcal{O}$, and Periodic Kazhdan--Lusztig Polynomials
by: Riche, Simon, et al.
Published: (2025)
by: Riche, Simon, et al.
Published: (2025)
On multi-graded Proj schemes
by: Mayeux, Arnaud, et al.
Published: (2023)
by: Mayeux, Arnaud, et al.
Published: (2023)
Modular affine Hecke category and regular centralizer
by: Bezrukavnikov, Roman, et al.
Published: (2022)
by: Bezrukavnikov, Roman, et al.
Published: (2022)
Koszul duality for Coxeter groups
by: Riche, Simon, et al.
Published: (2023)
by: Riche, Simon, et al.
Published: (2023)
Analogies and differences between the logic behind statistical hypothesis testing and proofs by contradiction: What can we learn from them?
by: Maria Cristina Amoretti, et al.
Published: (2026)
by: Maria Cristina Amoretti, et al.
Published: (2026)
What Doctoral Student Motivation Tells Us about the Future of LIS Education
by: Hands, Africa S.
Published: (2018)
by: Hands, Africa S.
Published: (2018)
What's Your Type? An Examination of First-Year Doctoral Student Motivation
by: Hands, Africa S.
Published: (2020)
by: Hands, Africa S.
Published: (2020)
Public Libraries: Your Partner in Increasing College Literacy among Nontraditional Prospective Students
by: Hands, Africa S.
Published: (2023)
by: Hands, Africa S.
Published: (2023)
Successfully Serving the College Bound
by: Hands, Africa S.
Published: (2015)
by: Hands, Africa S.
Published: (2015)
Similar Items
-
Identifying a Circuit for Verb Conjugation in GPT-2
by: Africa, David Demitri
Published: (2025) -
Inoculation Prompting: Instructing LLMs to misbehave at train-time improves test-time alignment
by: Wichers, Nevan, et al.
Published: (2025) -
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
by: Betley, Jan, et al.
Published: (2025) -
LURE: Live-Usage Replay Evaluations for Reducing Evaluation Awareness
by: Ivanov, Igor, et al.
Published: (2026) -
Steering Awareness: Detecting Activation Steering from Within
by: Rivera, Joshua Fonseca, et al.
Published: (2025)