:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Du, Jason, Hong, Kelly, Imran, Alishba, Jahanparast, Erfan, Khfifi, Mehdi, Qiao, Kaichun
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2501.07108
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Review of Lumber Disc Herniation and Sciatica
by: Alishba Imran, Alishba Imran
Published: (2025)

Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT
by: Zain, Noor Ul, et al.
Published: (2025)

CommonKV: Compressing KV Cache with Cross-layer Parameter Sharing
by: Wang, Yixuan, et al.
Published: (2025)

TravelEval: A Comprehensive Benchmarking Framework for Evaluating LLM-Powered Travel Planning Agents
by: Chen, Weiyi, et al.
Published: (2026)

SHARP: Accelerating Language Model Inference by SHaring Adjacent layers with Recovery Parameters
by: Wang, Yiping, et al.
Published: (2025)

Exploring the In-Context Learning Capabilities of LLMs for Money Laundering Detection in Financial Graphs
by: Pirmorad, Erfan
Published: (2025)

Efficient Noise Mitigation for Enhancing Inference Accuracy in DNNs on Mixed-Signal Accelerators
by: Azizi, Seyedarmin, et al.
Published: (2024)

Enhancing Reinforcement learning in 3-Dimensional Hydrophobic-Polar Protein Folding Model with Attention-based layers
by: Liu, Peizheng, et al.
Published: (2025)

Deep-layer limit and stability analysis of the basic forward-backward-splitting induced network (II): learning problems
by: Lin, Xuan, et al.
Published: (2026)

Krul: Efficient State Restoration for Multi-turn Conversations with Dynamic Cross-layer KV Sharing
by: Wen, Junyi, et al.
Published: (2025)

Unmasking the giant: A comprehensive evaluation of ChatGPT's proficiency in coding algorithms and data structures
by: Arefin, Sayed Erfan, et al.
Published: (2023)

Fair Division of Multi-layered Cakes
by: Sanpui, Mohammad Azharuddin
Published: (2022)

GraphTrafficGPT: Enhancing Traffic Management Through Graph-Based AI Agent Coordination
by: Taleb, Nabil Abdelaziz Ferhat, et al.
Published: (2025)

A layered architecture for log analysis in complex IT systems
by: Wittkopp, Thorsten
Published: (2025)

[Social] Allostasis: Or, How I Learned To Stop Worrying and Love The Noise
by: Khan, Imran
Published: (2025)

Unveiling User Perceptions in the Generative AI Era: A Sentiment-Driven Evaluation of AI Educational Apps' Role in Digital Transformation of e-Teaching
by: Mazaherian, Adeleh, et al.
Published: (2025)

Comprehensive Modeling Approaches for Forecasting Bitcoin Transaction Fees: A Comparative Study
by: Ma, Jiangqin, et al.
Published: (2025)

LLM Misalignment via Adversarial RLHF Platforms
by: Entezami, Erfan, et al.
Published: (2025)

Multi-layer attentive probing improves transfer of audio representations for bioacoustics
by: Miron, Marius, et al.
Published: (2026)

A Safe Exploration Strategy for Model-free Task Adaptation in Safety-constrained Grid Environments
by: Entezami, Erfan, et al.
Published: (2024)

Multi-layer random features and the approximation power of neural networks
by: Takhanov, Rustem
Published: (2024)

Demystifying ChatGPT: How It Masters Genre Recognition
by: Raj, Subham, et al.
Published: (2025)

Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions
by: Suh, Joseph, et al.
Published: (2025)

A Qualitative Study on Using ChatGPT for Software Security: Perception vs. Practicality
by: Kholoosi, M. Mehdi, et al.
Published: (2024)

EventGPT: Capturing Player Impact from Team Action Sequences Using GPT-Based Framework
by: Hong, Miru, et al.
Published: (2025)

The role of gain neuromodulation in layer-5 pyramidal neurons
by: Rodriguez-Garcia, Alejandro, et al.
Published: (2025)

Unsupervised deep learning model for fast energy layer pre-selection of delivery-efficient proton arc therapy plan optimization of nasopharyngeal carcinoma
by: Yang, Bohan, et al.
Published: (2025)

Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning
by: Chan, Yun-Hin, et al.
Published: (2023)

Multi-layer Sequence Labeling-based Joint Biomedical Event Extraction
by: Chen, Gongchi, et al.
Published: (2024)

Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)

On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD
by: Zhang, Tongcheng, et al.
Published: (2026)

Theoretical limitations of multi-layer Transformer
by: Chen, Lijie, et al.
Published: (2024)

How good is GPT at writing political speeches for the White House?
by: Savoy, Jacques
Published: (2024)

AI in Oncology: Transforming Cancer Detection through Machine Learning and Deep Learning Applications
by: Aftab, Muhammad, et al.
Published: (2025)

VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
by: Xu, Yifang, et al.
Published: (2024)

Single and bi-layered 2-D acoustic soft tactile skin (AST2)
by: Rajendran, Vishnu, et al.
Published: (2024)

Multi-layer Cross-attention is Provably Optimal for Multi-modal In-context Learning
by: Barnfield, Nicholas, et al.
Published: (2026)

ElicitationGPT: Text Elicitation Mechanisms via Language Models
by: Wu, Yifan, et al.
Published: (2024)

Dynamic sparsity in tree-structured feed-forward layers at scale
by: Sedghi, Reza, et al.
Published: (2026)

AeroTherm-GPT: A Verification-Centered LLM Framework for Thermal Protection System Engineering Workflows
by: Qiao, Chuhan, et al.
Published: (2026)