:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ramnauth, Rebecca, Scassellati, Brian
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.28639
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

More than Chit-Chat: Developing Robots for Small-Talk Interactions
by: Ramnauth, Rebecca, et al.
Published: (2024)

Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2026)

A Grounded Observer Framework for Establishing Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2024)

A Robot-Assisted Approach to Small Talk Training for Adults with ASD
by: Ramnauth, Rebecca, et al.
Published: (2025)

Gaze Behavior During a Long-Term, In-Home, Social Robot Intervention for Children with ASD
by: Ramnauth, Rebecca, et al.
Published: (2025)

Don't Think of the White Bear: Ironic Negation in Transformer Models Under Cognitive Load
by: Mann, Logan, et al.
Published: (2025)

SignAttention: On the Interpretability of Transformer Models for Sign Language Translation
by: Bianco, Pedro Alejandro Dal, et al.
Published: (2024)

Attention Sinks in Diffusion Language Models
by: Rulli, Maximo Eduardo, et al.
Published: (2025)

Memorization in Attention-only Transformers
by: Dana, Léo, et al.
Published: (2024)

Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention
by: Bae, Jeongin, et al.
Published: (2026)

Efficient Streaming Language Models with Attention Sinks
by: Xiao, Guangxuan, et al.
Published: (2023)

Cross-Attention Watermarking of Large Language Models
by: Baldassini, Folco Bertini, et al.
Published: (2024)

Attention-Aligned Reasoning for Large Language Models
by: Zhang, Hongxiang, et al.
Published: (2025)

$π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling
by: Liu, Dong, et al.
Published: (2025)

Weighted Grouped Query Attention in Transformers
by: Chinnakonduru, Sai Sena, et al.
Published: (2024)

Latent Multi-Head Attention for Small Language Models
by: Mehta, Sushant, et al.
Published: (2025)

ShishuLM : Achieving Optimal and Efficient Parameterization with Low Attention Transformer Models
by: Kumar, Shivanshu, et al.
Published: (2025)

Focusing on Language: Revealing and Exploiting Language Attention Heads in Multilingual Large Language Models
by: Liu, Xin, et al.
Published: (2025)

On Active Privacy Auditing in Supervised Fine-tuning for White-Box Language Models
by: Sun, Qian, et al.
Published: (2024)

Word Meanings in Transformer Language Models
by: Grindrod, Jumbly, et al.
Published: (2025)

Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements
by: Haller, Patrick, et al.
Published: (2025)

Efficient Attention Mechanisms for Large Language Models: A Survey
by: Sun, Yutao, et al.
Published: (2025)

Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models
by: Zhao, Yida, et al.
Published: (2024)

Crisp Attention: Regularizing Transformers via Structured Sparsity
by: Gandhi, Sagar, et al.
Published: (2025)

Intra-Layer Recurrence in Transformers for Language Modeling
by: Nguyen, Anthony, et al.
Published: (2025)

Dynamic Topic Evolution with Temporal Decay and Attention in Large Language Models
by: Wu, Di, et al.
Published: (2025)

Attention Basin: Why Contextual Position Matters in Large Language Models
by: Yi, Zihao, et al.
Published: (2025)

Self-Selected Attention Span for Accelerating Large Language Model Inference
by: Jin, Tian, et al.
Published: (2024)

Selective Attention Improves Transformer
by: Leviathan, Yaniv, et al.
Published: (2024)

Transformer-based Causal Language Models Perform Clustering
by: Wu, Xinbo, et al.
Published: (2024)

ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models
by: Chen, Hao, et al.
Published: (2025)

Exploring the Robustness of Language Models for Tabular Question Answering via Attention Analysis
by: Bhandari, Kushal Raj, et al.
Published: (2024)

Pre-Attention Expert Prediction and Prefetching for Mixture-of-Experts Large Language Models
by: Zhu, Shien, et al.
Published: (2025)

Falcon Mamba: The First Competitive Attention-free 7B Language Model
by: Zuo, Jingwei, et al.
Published: (2024)

From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modeling
by: Kabir, Mohsinul, et al.
Published: (2025)

Cognitive Effects in Large Language Models
by: Shaki, Jonathan, et al.
Published: (2023)

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
by: Tang, Zecheng, et al.
Published: (2026)

LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models
by: Rabern, Brian, et al.
Published: (2026)

A Practical Examination of AI-Generated Text Detectors for Large Language Models
by: Tufts, Brian, et al.
Published: (2024)

Transformer-based Language Models for Reasoning in the Description Logic ALCQ
by: Poulis, Angelos, et al.
Published: (2024)