Saved in:
| Main Authors: | Singh, Adhyayan Veer, Shen, Aaron, Law, Brian, Ismail, Ahmed, Rohweder, Jonas, O'Brien, Sean, Zhu, Kevin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.22626 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CBMAS: Cognitive Behavioral Modeling via Activation Steering
by: Ismail, Ahmed H., et al.
Published: (2026)
by: Ismail, Ahmed H., et al.
Published: (2026)
Hierarchical Latent Structures in Data Generation Process Unify Mechanistic Phenomena across Scale
by: Rohweder, Jonas, et al.
Published: (2026)
by: Rohweder, Jonas, et al.
Published: (2026)
Patches of Nonlinearity: Instruction Vectors in Large Language Models
by: Bigoulaeva, Irina, et al.
Published: (2026)
by: Bigoulaeva, Irina, et al.
Published: (2026)
Question-Analysis Prompting Improves LLM Performance in Reasoning Tasks
by: Yugeswardeenoo, Dharunish, et al.
Published: (2024)
by: Yugeswardeenoo, Dharunish, et al.
Published: (2024)
How Scientists Use Large Language Models to Program
by: O'Brien, Gabrielle
Published: (2025)
by: O'Brien, Gabrielle
Published: (2025)
Semantic Self-Consistency: Enhancing Language Model Reasoning via Semantic Weighting
by: Knappe, Tim, et al.
Published: (2024)
by: Knappe, Tim, et al.
Published: (2024)
Error Reflection Prompting: Can Large Language Models Successfully Understand Errors?
by: Li, Jason, et al.
Published: (2025)
by: Li, Jason, et al.
Published: (2025)
CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning
by: Rufail, Andrew, et al.
Published: (2025)
by: Rufail, Andrew, et al.
Published: (2025)
MALIBU Benchmark: Multi-Agent LLM Implicit Bias Uncovered
by: Mirza, Imran, et al.
Published: (2025)
by: Mirza, Imran, et al.
Published: (2025)
SMAGDi: Socratic Multi Agent Interaction Graph Distillation for Efficient High Accuracy Reasoning
by: Aluru, Aayush, et al.
Published: (2025)
by: Aluru, Aayush, et al.
Published: (2025)
Direct Confidence Alignment: Aligning Verbalized Confidence with Internal Confidence In Large Language Models
by: Zhang, Glenn, et al.
Published: (2025)
by: Zhang, Glenn, et al.
Published: (2025)
AAVENUE: Detecting LLM Biases on NLU Tasks in AAVE via a Novel Benchmark
by: Gupta, Abhay, et al.
Published: (2024)
by: Gupta, Abhay, et al.
Published: (2024)
Ultrafast Superconducting Qubit Readout with the Quarton Coupler
by: Ye, Yufeng, et al.
Published: (2024)
by: Ye, Yufeng, et al.
Published: (2024)
NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts
by: Gupta, Abhay, et al.
Published: (2025)
by: Gupta, Abhay, et al.
Published: (2025)
From Bias to Balance: Detecting Facial Expression Recognition Biases in Large Multimodal Foundation Models
by: Chhua, Kaylee, et al.
Published: (2024)
by: Chhua, Kaylee, et al.
Published: (2024)
Pause-Tuning for Long-Context Comprehension: A Lightweight Approach to LLM Attention Recalibration
by: Begin, James, et al.
Published: (2025)
by: Begin, James, et al.
Published: (2025)
Zum gesellschaftlichen Nutzen pharmazeutischer Innovationen
by: Wille, Eberhard, et al.
Published: (2019)
by: Wille, Eberhard, et al.
Published: (2019)
Exploring the Short-Term Impacts of a Community-Based Book Distribution Program
by: Anderson, Kimberly L., et al.
Published: (2019)
by: Anderson, Kimberly L., et al.
Published: (2019)
Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks
by: Zang, Yongyi, et al.
Published: (2025)
by: Zang, Yongyi, et al.
Published: (2025)
ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems
by: Singh, Ishneet Sukhvinder, et al.
Published: (2024)
by: Singh, Ishneet Sukhvinder, et al.
Published: (2024)
Inertia and other properties of the matrix $\left[β(i,j)\right]$
by: Grover, Priyanka, et al.
Published: (2024)
by: Grover, Priyanka, et al.
Published: (2024)
A kinematic study of energy barriers to crack formation in graphene tilt boundaries
by: Daly, Matthew, et al.
Published: (2014)
by: Daly, Matthew, et al.
Published: (2014)
STEM Agent: A Self-Adapting, Tool-Enabled, Extensible Architecture for Multi-Protocol AI Agent Systems
by: Shen, Alfred, et al.
Published: (2026)
by: Shen, Alfred, et al.
Published: (2026)
Reasoning Relay: Evaluating Stability and Interchangeability of Large Language Models in Mathematical Reasoning
by: Lu, Leo, et al.
Published: (2025)
by: Lu, Leo, et al.
Published: (2025)
HAFixAgent: History-Aware Program Repair Agent
by: Shi, Yu, et al.
Published: (2025)
by: Shi, Yu, et al.
Published: (2025)
A Few Bad Neurons: Isolating and Surgically Correcting Sycophancy
by: O'Brien, Claire, et al.
Published: (2026)
by: O'Brien, Claire, et al.
Published: (2026)
NESTING AND FORAGING HABITAT OF GREAT GRAY OWLS
by: Bull, Evelyn L., et al.
Published: (1988)
by: Bull, Evelyn L., et al.
Published: (1988)
Exploring GPT-4 for Robotic Agent Strategy with Real-Time State Feedback and a Reactive Behaviour Framework
by: O'Brien, Thomas, et al.
Published: (2025)
by: O'Brien, Thomas, et al.
Published: (2025)
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
by: Thangarasa, Vithursan, et al.
Published: (2024)
by: Thangarasa, Vithursan, et al.
Published: (2024)
Multi-Agent Large Language Models for Conversational Task-Solving
by: Becker, Jonas
Published: (2024)
by: Becker, Jonas
Published: (2024)
FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations
by: Wen, Athena, et al.
Published: (2025)
by: Wen, Athena, et al.
Published: (2025)
SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
by: Xu, Wendong, et al.
Published: (2025)
by: Xu, Wendong, et al.
Published: (2025)
Self-Updatable Large Language Models by Integrating Context into Model Parameters
by: Wang, Yu, et al.
Published: (2024)
by: Wang, Yu, et al.
Published: (2024)
Dual‐Path Transformer Fusion Model for Traffic Accident Prediction Using Dashboard Cameras
by: Veer Bhadra Pratap Singh, et al.
Published: (2026)
by: Veer Bhadra Pratap Singh, et al.
Published: (2026)
An Iterative Approach to the Complex Monge-Ampère Eigenvalue Problem
by: Zeriahi, Ahmed
Published: (2025)
by: Zeriahi, Ahmed
Published: (2025)
Green-Lazarsfeld property $N_p$ for Segre product of Hibi rings
by: Veer, Dharm
Published: (2023)
by: Veer, Dharm
Published: (2023)
Probing Audio-Generation Capabilities of Text-Based Language Models
by: Anbazhagan, Arjun Prasaath, et al.
Published: (2025)
by: Anbazhagan, Arjun Prasaath, et al.
Published: (2025)
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving
by: Islam, Md. Ashraful, et al.
Published: (2024)
by: Islam, Md. Ashraful, et al.
Published: (2024)
SonamicExamples
by: O'Brien, Harry
Published: (2026)
by: O'Brien, Harry
Published: (2026)
Contrasting nonstructural carbohydrate dynamics of tropical tree seedlings under water deficit and variability
by: O'Brien, Michael
Published: (2026)
by: O'Brien, Michael
Published: (2026)
Similar Items
-
CBMAS: Cognitive Behavioral Modeling via Activation Steering
by: Ismail, Ahmed H., et al.
Published: (2026) -
Hierarchical Latent Structures in Data Generation Process Unify Mechanistic Phenomena across Scale
by: Rohweder, Jonas, et al.
Published: (2026) -
Patches of Nonlinearity: Instruction Vectors in Large Language Models
by: Bigoulaeva, Irina, et al.
Published: (2026) -
Question-Analysis Prompting Improves LLM Performance in Reasoning Tasks
by: Yugeswardeenoo, Dharunish, et al.
Published: (2024) -
How Scientists Use Large Language Models to Program
by: O'Brien, Gabrielle
Published: (2025)