Saved in:
| Main Author: | Seshadri, Amrit Diggavi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.14091 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
One Jump Is All You Need: Short-Cutting Transformers for Early Exit Prediction with One Jump to Fit All Exit Levels
by: Seshadri, Amrit Diggavi
Published: (2025)
by: Seshadri, Amrit Diggavi
Published: (2025)
Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type Recognition
by: Seshadri, Amrit Diggavi, et al.
Published: (2023)
by: Seshadri, Amrit Diggavi, et al.
Published: (2023)
More Than A Shortcut: A Hyperbolic Approach To Early-Exit Networks
by: Bhosale, Swapnil, et al.
Published: (2025)
by: Bhosale, Swapnil, et al.
Published: (2025)
From Narrow to Wide: Autoencoding Transformers for Ultrasound Bandwidth Recovery
by: KhakzadGharamaleki, Sepideh, et al.
Published: (2025)
by: KhakzadGharamaleki, Sepideh, et al.
Published: (2025)
Budgeted Attention Allocation: Cost-Conditioned Compute Control for Efficient Transformers
by: Nidhi, Amrit
Published: (2026)
by: Nidhi, Amrit
Published: (2026)
DyTTP: Trajectory Prediction with Normalization-Free Transformers
by: Zhu, JianLin, et al.
Published: (2025)
by: Zhu, JianLin, et al.
Published: (2025)
ADEPT: Adaptive Dynamic Early-Exit Process for Transformers
by: Yoo, Sangmin, et al.
Published: (2026)
by: Yoo, Sangmin, et al.
Published: (2026)
Narrow Transformer: StarCoder-Based Java-LM For Desktop
by: Rathinasamy, Kamalkumar, et al.
Published: (2024)
by: Rathinasamy, Kamalkumar, et al.
Published: (2024)
Emergent Misalignment is Easy, Narrow Misalignment is Hard
by: Soligo, Anna, et al.
Published: (2026)
by: Soligo, Anna, et al.
Published: (2026)
Step-GRPO: Internalizing Dynamic Early Exit for Efficient Reasoning
by: Chen, Benteng, et al.
Published: (2026)
by: Chen, Benteng, et al.
Published: (2026)
Early-Exit Neural Networks with Nested Prediction Sets
by: Jazbec, Metod, et al.
Published: (2023)
by: Jazbec, Metod, et al.
Published: (2023)
FlashThink: An Early Exit Method For Efficient Reasoning
by: Jiang, Guochao, et al.
Published: (2025)
by: Jiang, Guochao, et al.
Published: (2025)
Amortized-Precision Quantization for Early-Exit Vision Transformers
by: Fang, Rui, et al.
Published: (2026)
by: Fang, Rui, et al.
Published: (2026)
Narrow Secret Loyalty Dodges Black-Box Audits
by: Lamerton, Alfie, et al.
Published: (2026)
by: Lamerton, Alfie, et al.
Published: (2026)
Pushing the Limits of BFP on Narrow Precision LLM Inference
by: Wang, Hui, et al.
Published: (2025)
by: Wang, Hui, et al.
Published: (2025)
Learning Rate Transfer in Normalized Transformers
by: Shigida, Boris, et al.
Published: (2026)
by: Shigida, Boris, et al.
Published: (2026)
Rethinking Model Evaluation as Narrowing the Socio-Technical Gap
by: Liao, Q. Vera, et al.
Published: (2023)
by: Liao, Q. Vera, et al.
Published: (2023)
Narrow Fine-Tuning Erodes Safety Alignment in Vision-Language Agents
by: Gulati, Idhant, et al.
Published: (2026)
by: Gulati, Idhant, et al.
Published: (2026)
Assessing Domain-Level Susceptibility to Emergent Misalignment from Narrow Finetuning
by: Mishra, Abhishek, et al.
Published: (2026)
by: Mishra, Abhishek, et al.
Published: (2026)
Artificial Adaptive Intelligence: The Missing Stage Between Narrow and General Intelligence
by: Kriuk, Boris
Published: (2026)
by: Kriuk, Boris
Published: (2026)
Transformers without Normalization
by: Zhu, Jiachen, et al.
Published: (2025)
by: Zhu, Jiachen, et al.
Published: (2025)
Narrow Finetuning Leaves Clearly Readable Traces in Activation Differences
by: Minder, Julian, et al.
Published: (2025)
by: Minder, Julian, et al.
Published: (2025)
Improved GUI Grounding via Iterative Narrowing
by: Nguyen, Anthony
Published: (2024)
by: Nguyen, Anthony
Published: (2024)
BEExformer: A Fast Inferencing Binarized Transformer with Early Exits
by: Ansar, Wazib, et al.
Published: (2024)
by: Ansar, Wazib, et al.
Published: (2024)
Bridging Human and LLM Judgments: Understanding and Narrowing the Gap
by: Polo, Felipe Maia, et al.
Published: (2025)
by: Polo, Felipe Maia, et al.
Published: (2025)
Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability
by: Bajpai, Divya Jyoti, et al.
Published: (2025)
by: Bajpai, Divya Jyoti, et al.
Published: (2025)
Dynamic Early Exit in Reasoning Models
by: Yang, Chenxu, et al.
Published: (2025)
by: Yang, Chenxu, et al.
Published: (2025)
Stability of Transformers under Layer Normalization
by: Kan, Kelvin, et al.
Published: (2025)
by: Kan, Kelvin, et al.
Published: (2025)
SAC-ViT: Semantic-Aware Clustering Vision Transformer with Early Exit
by: Hu, Youbing, et al.
Published: (2025)
by: Hu, Youbing, et al.
Published: (2025)
Know What You Don't Know: Selective Prediction for Early Exit DNNs
by: Bajpai, Divya Jyoti, et al.
Published: (2025)
by: Bajpai, Divya Jyoti, et al.
Published: (2025)
From Narrow Unlearning to Emergent Misalignment: Causes, Consequences, and Containment in LLMs
by: Mushtaq, Erum, et al.
Published: (2025)
by: Mushtaq, Erum, et al.
Published: (2025)
Position: Universal Aesthetic Alignment Narrows Artistic Expression
by: Guo, Wenqi Marshall, et al.
Published: (2025)
by: Guo, Wenqi Marshall, et al.
Published: (2025)
Narrow Operator Models of Stellarator Equilibria in Fourier Zernike Basis
by: Thun, Timo, et al.
Published: (2025)
by: Thun, Timo, et al.
Published: (2025)
Feature Hedging: Correlated Features Break Narrow Sparse Autoencoders
by: Chanin, David, et al.
Published: (2025)
by: Chanin, David, et al.
Published: (2025)
Stronger Normalization-Free Transformers
by: Chen, Mingzhi, et al.
Published: (2025)
by: Chen, Mingzhi, et al.
Published: (2025)
Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization
by: Khader, Firas, et al.
Published: (2024)
by: Khader, Firas, et al.
Published: (2024)
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
by: Zhuo, Zhijian, et al.
Published: (2025)
by: Zhuo, Zhijian, et al.
Published: (2025)
nGPT: Normalized Transformer with Representation Learning on the Hypersphere
by: Loshchilov, Ilya, et al.
Published: (2024)
by: Loshchilov, Ilya, et al.
Published: (2024)
Parameter-Efficient Distributional RL via Normalizing Flows and a Geometry-Aware Cramér Surrogate
by: C., Simo Alami, et al.
Published: (2025)
by: C., Simo Alami, et al.
Published: (2025)
Uncovering the Deep Filter Bubble: Narrow Exposure in Short-Video Recommendation
by: Sukiennik, Nicholas, et al.
Published: (2024)
by: Sukiennik, Nicholas, et al.
Published: (2024)
Similar Items
-
One Jump Is All You Need: Short-Cutting Transformers for Early Exit Prediction with One Jump to Fit All Exit Levels
by: Seshadri, Amrit Diggavi
Published: (2025) -
Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type Recognition
by: Seshadri, Amrit Diggavi, et al.
Published: (2023) -
More Than A Shortcut: A Hyperbolic Approach To Early-Exit Networks
by: Bhosale, Swapnil, et al.
Published: (2025) -
From Narrow to Wide: Autoencoding Transformers for Ultrasound Bandwidth Recovery
by: KhakzadGharamaleki, Sepideh, et al.
Published: (2025) -
Budgeted Attention Allocation: Cost-Conditioned Compute Control for Efficient Transformers
by: Nidhi, Amrit
Published: (2026)