| Main Authors: | Nainani, Jatin, Vaidyanathan, Sankaran, Watts, Connor, Assis, Andre N., Rigg, Alice |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.18098 |
Similar Items
Adaptive Circuit Behavior and Generalization in Mechanistic Interpretability
by: Nainani, Jatin, et al.
Published: (2024)
Evaluating Brain-Inspired Modular Training in Automated Circuit Discovery for Mechanistic Interpretability
by: Nainani, Jatin
Published: (2024)
Empowering Interdisciplinary Research with BERT-Based Models: An Approach Through SciBERT-CNN with Topic Modeling
by: Likhareva, Darya, et al.
Published: (2024)
Quantitative LLM Judges
by: Sahoo, Aishwarya, et al.
Published: (2025)
A Voter-Based Stochastic Rejection-Method Framework for Asymptotically Safe Language Model Outputs
by: Watts, Jake R., et al.
Published: (2024)
Internal Planning in Language Models: Characterizing Horizon and Branch Awareness
by: Ustaomeroglu, Muhammed, et al.
Published: (2025)
Timing Analysis Agent: Autonomous Multi-Corner Multi-Mode (MCMM) Timing Debugging with Timing Debug Relation Graph
by: Nainani, Jatin, et al.
Published: (2025)
Evaluating Spoken Language as a Biomarker for Automated Screening of Cognitive Impairment
by: Lima, Maria R., et al.
Published: (2025)
Converting MLPs into Polynomials in Closed Form
by: Belrose, Nora, et al.
Published: (2025)
Rotary Offset Features in Large Language Models
by: Jonasson, André
Published: (2025)
Invariant Features in Language Models: Geometric Characterization and Model Attribution
by: Dasgupta, Agnibh, et al.
Published: (2026)
Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation
by: Balestriero, Randall, et al.
Published: (2023)
CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints
by: Atmakuru, Anirudh, et al.
Published: (2024)
An Empirical Study of Multi-Generation Sampling for Jailbreak Detection in Large Language Models
by: Luo, Hanrui, et al.
Published: (2026)
Reasoning Planning for Language Models
by: Nguyen, Bao, et al.
Published: (2025)
TextLap: Customizing Language Models for Text-to-Layout Planning
by: Chen, Jian, et al.
Published: (2024)
Need a Small Specialized Language Model? Plan Early!
by: Grangier, David, et al.
Published: (2024)
Interactive and Expressive Code-Augmented Planning with Large Language Models
by: Liu, Anthony Z., et al.
Published: (2024)
DE-COP: Detecting Copyrighted Content in Language Models Training Data
by: Duarte, André V., et al.
Published: (2024)
Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information?
by: Sun, Albert Yu, et al.
Published: (2023)
Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning
by: Lovelace, Justin, et al.
Published: (2026)
Improving Detection of Watermarked Language Models
by: Bahri, Dara, et al.
Published: (2025)
Leveraging Language Models to Detect Greenwashing
by: Vinella, Avalon, et al.
Published: (2023)
Characterizing the Accuracy -- Efficiency Trade-off of Low-rank Decomposition in Language Models
by: Moar, Chakshu, et al.
Published: (2024)
Can Large Language Models Reason and Plan?
by: Kambhampati, Subbarao
Published: (2024)
Guiding Language Model Reasoning with Planning Tokens
by: Wang, Xinyi, et al.
Published: (2023)
Mechanistic Anomaly Detection for "Quirky" Language Models
by: Johnston, David O., et al.
Published: (2025)
TempTest: Local Normalization Distortion and the Detection of Machine-generated Text
by: Kempton, Tom, et al.
Published: (2025)
ADaPT: As-Needed Decomposition and Planning with Language Models
by: Prasad, Archiki, et al.
Published: (2023)
Exploring and Benchmarking the Planning Capabilities of Large Language Models
by: Bohnet, Bernd, et al.
Published: (2024)
Prompting Large Language Models to Detect Dementia Family Caregivers
by: Biswas, Md Badsha, et al.
Published: (2025)
Detecting and Pruning Prominent but Detrimental Neurons in Large Language Models
by: Ali, Ameen, et al.
Published: (2025)
Leveraging Graph Structures to Detect Hallucinations in Large Language Models
by: Nonkes, Noa, et al.
Published: (2024)
Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting
by: Watts, Ishaan, et al.
Published: (2026)
KScope: A Framework for Characterizing the Knowledge Status of Language Models
by: Xiao, Yuxin, et al.
Published: (2025)
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models
by: Wang, Siwei, et al.
Published: (2024)
Mastering Board Games by External and Internal Planning with Language Models
by: Schultz, John, et al.
Published: (2024)
Graph-enhanced Large Language Models in Asynchronous Plan Reasoning
by: Lin, Fangru, et al.
Published: (2024)
Multi-Species Object Detection in Drone Imagery for Population Monitoring of Endangered Animals
by: Sankaran, Sowmya
Published: (2024)
LMD3: Language Model Data Density Dependence
by: Kirchenbauer, John, et al.
Published: (2024)