:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Nainani, Jatin, Vaidyanathan, Sankaran, Watts, Connor, Assis, Andre N., Rigg, Alice
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2508.18098
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Adaptive Circuit Behavior and Generalization in Mechanistic Interpretability
by: Nainani, Jatin, et al.
Published: (2024)

Evaluating Brain-Inspired Modular Training in Automated Circuit Discovery for Mechanistic Interpretability
by: Nainani, Jatin
Published: (2024)

Empowering Interdisciplinary Research with BERT-Based Models: An Approach Through SciBERT-CNN with Topic Modeling
by: Likhareva, Darya, et al.
Published: (2024)

Quantitative LLM Judges
by: Sahoo, Aishwarya, et al.
Published: (2025)

A Voter-Based Stochastic Rejection-Method Framework for Asymptotically Safe Language Model Outputs
by: Watts, Jake R., et al.
Published: (2024)

Internal Planning in Language Models: Characterizing Horizon and Branch Awareness
by: Ustaomeroglu, Muhammed, et al.
Published: (2025)

Timing Analysis Agent: Autonomous Multi-Corner Multi-Mode (MCMM) Timing Debugging with Timing Debug Relation Graph
by: Nainani, Jatin, et al.
Published: (2025)

Evaluating Spoken Language as a Biomarker for Automated Screening of Cognitive Impairment
by: Lima, Maria R., et al.
Published: (2025)

Converting MLPs into Polynomials in Closed Form
by: Belrose, Nora, et al.
Published: (2025)

Rotary Offset Features in Large Language Models
by: Jonasson, André
Published: (2025)

Invariant Features in Language Models: Geometric Characterization and Model Attribution
by: Dasgupta, Agnibh, et al.
Published: (2026)

Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation
by: Balestriero, Randall, et al.
Published: (2023)

CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints
by: Atmakuru, Anirudh, et al.
Published: (2024)

An Empirical Study of Multi-Generation Sampling for Jailbreak Detection in Large Language Models
by: Luo, Hanrui, et al.
Published: (2026)

Reasoning Planning for Language Models
by: Nguyen, Bao, et al.
Published: (2025)

TextLap: Customizing Language Models for Text-to-Layout Planning
by: Chen, Jian, et al.
Published: (2024)

Need a Small Specialized Language Model? Plan Early!
by: Grangier, David, et al.
Published: (2024)

Interactive and Expressive Code-Augmented Planning with Large Language Models
by: Liu, Anthony Z., et al.
Published: (2024)

DE-COP: Detecting Copyrighted Content in Language Models Training Data
by: Duarte, André V., et al.
Published: (2024)

Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information?
by: Sun, Albert Yu, et al.
Published: (2023)

Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning
by: Lovelace, Justin, et al.
Published: (2026)

Improving Detection of Watermarked Language Models
by: Bahri, Dara, et al.
Published: (2025)

Leveraging Language Models to Detect Greenwashing
by: Vinella, Avalon, et al.
Published: (2023)

Characterizing the Accuracy -- Efficiency Trade-off of Low-rank Decomposition in Language Models
by: Moar, Chakshu, et al.
Published: (2024)

Can Large Language Models Reason and Plan?
by: Kambhampati, Subbarao
Published: (2024)

Guiding Language Model Reasoning with Planning Tokens
by: Wang, Xinyi, et al.
Published: (2023)

Mechanistic Anomaly Detection for "Quirky" Language Models
by: Johnston, David O., et al.
Published: (2025)

TempTest: Local Normalization Distortion and the Detection of Machine-generated Text
by: Kempton, Tom, et al.
Published: (2025)

ADaPT: As-Needed Decomposition and Planning with Language Models
by: Prasad, Archiki, et al.
Published: (2023)

Exploring and Benchmarking the Planning Capabilities of Large Language Models
by: Bohnet, Bernd, et al.
Published: (2024)

Prompting Large Language Models to Detect Dementia Family Caregivers
by: Biswas, Md Badsha, et al.
Published: (2025)

Detecting and Pruning Prominent but Detrimental Neurons in Large Language Models
by: Ali, Ameen, et al.
Published: (2025)

Leveraging Graph Structures to Detect Hallucinations in Large Language Models
by: Nonkes, Noa, et al.
Published: (2024)

Sharpness-Aware Pretraining Mitigates Catastrophic Forgetting
by: Watts, Ishaan, et al.
Published: (2026)

KScope: A Framework for Characterizing the Knowledge Status of Language Models
by: Xiao, Yuxin, et al.
Published: (2025)

ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models
by: Wang, Siwei, et al.
Published: (2024)

Mastering Board Games by External and Internal Planning with Language Models
by: Schultz, John, et al.
Published: (2024)

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning
by: Lin, Fangru, et al.
Published: (2024)

Multi-Species Object Detection in Drone Imagery for Population Monitoring of Endangered Animals
by: Sankaran, Sowmya
Published: (2024)

LMD3: Language Model Data Density Dependence
by: Kirchenbauer, John, et al.
Published: (2024)