Saved in:
| Main Authors: | Ramnauth, Rebecca, Scassellati, Brian |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.28639 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
More than Chit-Chat: Developing Robots for Small-Talk Interactions
by: Ramnauth, Rebecca, et al.
Published: (2024)
by: Ramnauth, Rebecca, et al.
Published: (2024)
Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2026)
by: Ramnauth, Rebecca, et al.
Published: (2026)
A Grounded Observer Framework for Establishing Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2024)
by: Ramnauth, Rebecca, et al.
Published: (2024)
A Robot-Assisted Approach to Small Talk Training for Adults with ASD
by: Ramnauth, Rebecca, et al.
Published: (2025)
by: Ramnauth, Rebecca, et al.
Published: (2025)
Gaze Behavior During a Long-Term, In-Home, Social Robot Intervention for Children with ASD
by: Ramnauth, Rebecca, et al.
Published: (2025)
by: Ramnauth, Rebecca, et al.
Published: (2025)
Don't Think of the White Bear: Ironic Negation in Transformer Models Under Cognitive Load
by: Mann, Logan, et al.
Published: (2025)
by: Mann, Logan, et al.
Published: (2025)
SignAttention: On the Interpretability of Transformer Models for Sign Language Translation
by: Bianco, Pedro Alejandro Dal, et al.
Published: (2024)
by: Bianco, Pedro Alejandro Dal, et al.
Published: (2024)
Attention Sinks in Diffusion Language Models
by: Rulli, Maximo Eduardo, et al.
Published: (2025)
by: Rulli, Maximo Eduardo, et al.
Published: (2025)
Memorization in Attention-only Transformers
by: Dana, Léo, et al.
Published: (2024)
by: Dana, Léo, et al.
Published: (2024)
Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention
by: Bae, Jeongin, et al.
Published: (2026)
by: Bae, Jeongin, et al.
Published: (2026)
Efficient Streaming Language Models with Attention Sinks
by: Xiao, Guangxuan, et al.
Published: (2023)
by: Xiao, Guangxuan, et al.
Published: (2023)
Cross-Attention Watermarking of Large Language Models
by: Baldassini, Folco Bertini, et al.
Published: (2024)
by: Baldassini, Folco Bertini, et al.
Published: (2024)
Attention-Aligned Reasoning for Large Language Models
by: Zhang, Hongxiang, et al.
Published: (2025)
by: Zhang, Hongxiang, et al.
Published: (2025)
$π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling
by: Liu, Dong, et al.
Published: (2025)
by: Liu, Dong, et al.
Published: (2025)
Weighted Grouped Query Attention in Transformers
by: Chinnakonduru, Sai Sena, et al.
Published: (2024)
by: Chinnakonduru, Sai Sena, et al.
Published: (2024)
Latent Multi-Head Attention for Small Language Models
by: Mehta, Sushant, et al.
Published: (2025)
by: Mehta, Sushant, et al.
Published: (2025)
ShishuLM : Achieving Optimal and Efficient Parameterization with Low Attention Transformer Models
by: Kumar, Shivanshu, et al.
Published: (2025)
by: Kumar, Shivanshu, et al.
Published: (2025)
Focusing on Language: Revealing and Exploiting Language Attention Heads in Multilingual Large Language Models
by: Liu, Xin, et al.
Published: (2025)
by: Liu, Xin, et al.
Published: (2025)
On Active Privacy Auditing in Supervised Fine-tuning for White-Box Language Models
by: Sun, Qian, et al.
Published: (2024)
by: Sun, Qian, et al.
Published: (2024)
Word Meanings in Transformer Language Models
by: Grindrod, Jumbly, et al.
Published: (2025)
by: Grindrod, Jumbly, et al.
Published: (2025)
Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements
by: Haller, Patrick, et al.
Published: (2025)
by: Haller, Patrick, et al.
Published: (2025)
Efficient Attention Mechanisms for Large Language Models: A Survey
by: Sun, Yutao, et al.
Published: (2025)
by: Sun, Yutao, et al.
Published: (2025)
Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models
by: Zhao, Yida, et al.
Published: (2024)
by: Zhao, Yida, et al.
Published: (2024)
Crisp Attention: Regularizing Transformers via Structured Sparsity
by: Gandhi, Sagar, et al.
Published: (2025)
by: Gandhi, Sagar, et al.
Published: (2025)
Intra-Layer Recurrence in Transformers for Language Modeling
by: Nguyen, Anthony, et al.
Published: (2025)
by: Nguyen, Anthony, et al.
Published: (2025)
Dynamic Topic Evolution with Temporal Decay and Attention in Large Language Models
by: Wu, Di, et al.
Published: (2025)
by: Wu, Di, et al.
Published: (2025)
Attention Basin: Why Contextual Position Matters in Large Language Models
by: Yi, Zihao, et al.
Published: (2025)
by: Yi, Zihao, et al.
Published: (2025)
Self-Selected Attention Span for Accelerating Large Language Model Inference
by: Jin, Tian, et al.
Published: (2024)
by: Jin, Tian, et al.
Published: (2024)
Selective Attention Improves Transformer
by: Leviathan, Yaniv, et al.
Published: (2024)
by: Leviathan, Yaniv, et al.
Published: (2024)
Transformer-based Causal Language Models Perform Clustering
by: Wu, Xinbo, et al.
Published: (2024)
by: Wu, Xinbo, et al.
Published: (2024)
ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models
by: Chen, Hao, et al.
Published: (2025)
by: Chen, Hao, et al.
Published: (2025)
Exploring the Robustness of Language Models for Tabular Question Answering via Attention Analysis
by: Bhandari, Kushal Raj, et al.
Published: (2024)
by: Bhandari, Kushal Raj, et al.
Published: (2024)
Pre-Attention Expert Prediction and Prefetching for Mixture-of-Experts Large Language Models
by: Zhu, Shien, et al.
Published: (2025)
by: Zhu, Shien, et al.
Published: (2025)
Falcon Mamba: The First Competitive Attention-free 7B Language Model
by: Zuo, Jingwei, et al.
Published: (2024)
by: Zuo, Jingwei, et al.
Published: (2024)
From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modeling
by: Kabir, Mohsinul, et al.
Published: (2025)
by: Kabir, Mohsinul, et al.
Published: (2025)
Cognitive Effects in Large Language Models
by: Shaki, Jonathan, et al.
Published: (2023)
by: Shaki, Jonathan, et al.
Published: (2023)
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
by: Tang, Zecheng, et al.
Published: (2026)
by: Tang, Zecheng, et al.
Published: (2026)
LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models
by: Rabern, Brian, et al.
Published: (2026)
by: Rabern, Brian, et al.
Published: (2026)
A Practical Examination of AI-Generated Text Detectors for Large Language Models
by: Tufts, Brian, et al.
Published: (2024)
by: Tufts, Brian, et al.
Published: (2024)
Transformer-based Language Models for Reasoning in the Description Logic ALCQ
by: Poulis, Angelos, et al.
Published: (2024)
by: Poulis, Angelos, et al.
Published: (2024)
Similar Items
-
More than Chit-Chat: Developing Robots for Small-Talk Interactions
by: Ramnauth, Rebecca, et al.
Published: (2024) -
Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2026) -
A Grounded Observer Framework for Establishing Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2024) -
A Robot-Assisted Approach to Small Talk Training for Adults with ASD
by: Ramnauth, Rebecca, et al.
Published: (2025) -
Gaze Behavior During a Long-Term, In-Home, Social Robot Intervention for Children with ASD
by: Ramnauth, Rebecca, et al.
Published: (2025)