Saved in:
| Main Authors: | Du, Jason, Hong, Kelly, Imran, Alishba, Jahanparast, Erfan, Khfifi, Mehdi, Qiao, Kaichun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.07108 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Review of Lumber Disc Herniation and Sciatica
by: Alishba Imran, Alishba Imran
Published: (2025)
by: Alishba Imran, Alishba Imran
Published: (2025)
Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT
by: Zain, Noor Ul, et al.
Published: (2025)
by: Zain, Noor Ul, et al.
Published: (2025)
CommonKV: Compressing KV Cache with Cross-layer Parameter Sharing
by: Wang, Yixuan, et al.
Published: (2025)
by: Wang, Yixuan, et al.
Published: (2025)
TravelEval: A Comprehensive Benchmarking Framework for Evaluating LLM-Powered Travel Planning Agents
by: Chen, Weiyi, et al.
Published: (2026)
by: Chen, Weiyi, et al.
Published: (2026)
SHARP: Accelerating Language Model Inference by SHaring Adjacent layers with Recovery Parameters
by: Wang, Yiping, et al.
Published: (2025)
by: Wang, Yiping, et al.
Published: (2025)
Exploring the In-Context Learning Capabilities of LLMs for Money Laundering Detection in Financial Graphs
by: Pirmorad, Erfan
Published: (2025)
by: Pirmorad, Erfan
Published: (2025)
Efficient Noise Mitigation for Enhancing Inference Accuracy in DNNs on Mixed-Signal Accelerators
by: Azizi, Seyedarmin, et al.
Published: (2024)
by: Azizi, Seyedarmin, et al.
Published: (2024)
Enhancing Reinforcement learning in 3-Dimensional Hydrophobic-Polar Protein Folding Model with Attention-based layers
by: Liu, Peizheng, et al.
Published: (2025)
by: Liu, Peizheng, et al.
Published: (2025)
Deep-layer limit and stability analysis of the basic forward-backward-splitting induced network (II): learning problems
by: Lin, Xuan, et al.
Published: (2026)
by: Lin, Xuan, et al.
Published: (2026)
Krul: Efficient State Restoration for Multi-turn Conversations with Dynamic Cross-layer KV Sharing
by: Wen, Junyi, et al.
Published: (2025)
by: Wen, Junyi, et al.
Published: (2025)
Unmasking the giant: A comprehensive evaluation of ChatGPT's proficiency in coding algorithms and data structures
by: Arefin, Sayed Erfan, et al.
Published: (2023)
by: Arefin, Sayed Erfan, et al.
Published: (2023)
Fair Division of Multi-layered Cakes
by: Sanpui, Mohammad Azharuddin
Published: (2022)
by: Sanpui, Mohammad Azharuddin
Published: (2022)
GraphTrafficGPT: Enhancing Traffic Management Through Graph-Based AI Agent Coordination
by: Taleb, Nabil Abdelaziz Ferhat, et al.
Published: (2025)
by: Taleb, Nabil Abdelaziz Ferhat, et al.
Published: (2025)
A layered architecture for log analysis in complex IT systems
by: Wittkopp, Thorsten
Published: (2025)
by: Wittkopp, Thorsten
Published: (2025)
[Social] Allostasis: Or, How I Learned To Stop Worrying and Love The Noise
by: Khan, Imran
Published: (2025)
by: Khan, Imran
Published: (2025)
Unveiling User Perceptions in the Generative AI Era: A Sentiment-Driven Evaluation of AI Educational Apps' Role in Digital Transformation of e-Teaching
by: Mazaherian, Adeleh, et al.
Published: (2025)
by: Mazaherian, Adeleh, et al.
Published: (2025)
Comprehensive Modeling Approaches for Forecasting Bitcoin Transaction Fees: A Comparative Study
by: Ma, Jiangqin, et al.
Published: (2025)
by: Ma, Jiangqin, et al.
Published: (2025)
LLM Misalignment via Adversarial RLHF Platforms
by: Entezami, Erfan, et al.
Published: (2025)
by: Entezami, Erfan, et al.
Published: (2025)
Multi-layer attentive probing improves transfer of audio representations for bioacoustics
by: Miron, Marius, et al.
Published: (2026)
by: Miron, Marius, et al.
Published: (2026)
A Safe Exploration Strategy for Model-free Task Adaptation in Safety-constrained Grid Environments
by: Entezami, Erfan, et al.
Published: (2024)
by: Entezami, Erfan, et al.
Published: (2024)
Multi-layer random features and the approximation power of neural networks
by: Takhanov, Rustem
Published: (2024)
by: Takhanov, Rustem
Published: (2024)
Demystifying ChatGPT: How It Masters Genre Recognition
by: Raj, Subham, et al.
Published: (2025)
by: Raj, Subham, et al.
Published: (2025)
Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions
by: Suh, Joseph, et al.
Published: (2025)
by: Suh, Joseph, et al.
Published: (2025)
A Qualitative Study on Using ChatGPT for Software Security: Perception vs. Practicality
by: Kholoosi, M. Mehdi, et al.
Published: (2024)
by: Kholoosi, M. Mehdi, et al.
Published: (2024)
EventGPT: Capturing Player Impact from Team Action Sequences Using GPT-Based Framework
by: Hong, Miru, et al.
Published: (2025)
by: Hong, Miru, et al.
Published: (2025)
The role of gain neuromodulation in layer-5 pyramidal neurons
by: Rodriguez-Garcia, Alejandro, et al.
Published: (2025)
by: Rodriguez-Garcia, Alejandro, et al.
Published: (2025)
Unsupervised deep learning model for fast energy layer pre-selection of delivery-efficient proton arc therapy plan optimization of nasopharyngeal carcinoma
by: Yang, Bohan, et al.
Published: (2025)
by: Yang, Bohan, et al.
Published: (2025)
Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning
by: Chan, Yun-Hin, et al.
Published: (2023)
by: Chan, Yun-Hin, et al.
Published: (2023)
Multi-layer Sequence Labeling-based Joint Biomedical Event Extraction
by: Chen, Gongchi, et al.
Published: (2024)
by: Chen, Gongchi, et al.
Published: (2024)
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)
by: Merullo, Jack, et al.
Published: (2024)
On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD
by: Zhang, Tongcheng, et al.
Published: (2026)
by: Zhang, Tongcheng, et al.
Published: (2026)
Theoretical limitations of multi-layer Transformer
by: Chen, Lijie, et al.
Published: (2024)
by: Chen, Lijie, et al.
Published: (2024)
How good is GPT at writing political speeches for the White House?
by: Savoy, Jacques
Published: (2024)
by: Savoy, Jacques
Published: (2024)
AI in Oncology: Transforming Cancer Detection through Machine Learning and Deep Learning Applications
by: Aftab, Muhammad, et al.
Published: (2025)
by: Aftab, Muhammad, et al.
Published: (2025)
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
by: Xu, Yifang, et al.
Published: (2024)
by: Xu, Yifang, et al.
Published: (2024)
Single and bi-layered 2-D acoustic soft tactile skin (AST2)
by: Rajendran, Vishnu, et al.
Published: (2024)
by: Rajendran, Vishnu, et al.
Published: (2024)
Multi-layer Cross-attention is Provably Optimal for Multi-modal In-context Learning
by: Barnfield, Nicholas, et al.
Published: (2026)
by: Barnfield, Nicholas, et al.
Published: (2026)
ElicitationGPT: Text Elicitation Mechanisms via Language Models
by: Wu, Yifan, et al.
Published: (2024)
by: Wu, Yifan, et al.
Published: (2024)
Dynamic sparsity in tree-structured feed-forward layers at scale
by: Sedghi, Reza, et al.
Published: (2026)
by: Sedghi, Reza, et al.
Published: (2026)
AeroTherm-GPT: A Verification-Centered LLM Framework for Thermal Protection System Engineering Workflows
by: Qiao, Chuhan, et al.
Published: (2026)
by: Qiao, Chuhan, et al.
Published: (2026)
Similar Items
-
A Review of Lumber Disc Herniation and Sciatica
by: Alishba Imran, Alishba Imran
Published: (2025) -
Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT
by: Zain, Noor Ul, et al.
Published: (2025) -
CommonKV: Compressing KV Cache with Cross-layer Parameter Sharing
by: Wang, Yixuan, et al.
Published: (2025) -
TravelEval: A Comprehensive Benchmarking Framework for Evaluating LLM-Powered Travel Planning Agents
by: Chen, Weiyi, et al.
Published: (2026) -
SHARP: Accelerating Language Model Inference by SHaring Adjacent layers with Recovery Parameters
by: Wang, Yiping, et al.
Published: (2025)