:: Library Catalog

Bibliographic Details
Main Authors:	Karim, Aabid; Karim, Abdul; Lohana, Bhoomika; Keon, Matt; Singh, Jaswinder; Sattar, Abdul
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2503.18018

Similar Items

Galton's Law of Mediocrity: Why Large Language Models Regress to the Mean and Fail at Creativity in Advertising
by: Keon, Matt, et al.
Published: (2025)

When Intelligence Fails: An Empirical Study on Why LLMs Struggle with Password Cracking
by: Rehman, Mohammad Abdul, et al.
Published: (2025)

FTT-GRU: A Hybrid Fast Temporal Transformer with GRU for Remaining Useful Life Prediction
by: Chirukiri, Varun Teja, et al.
Published: (2025)

FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
by: Karim, Mohammed Asad, et al.
Published: (2026)

Learning Bug Context for PyTorch-to-JAX Translation with LLMs
by: Phan, Hung, et al.
Published: (2025)

Dynamic Vocabulary Pruning in Early-Exit LLMs
by: Vincenti, Jort, et al.
Published: (2024)

TaoBench: Do Automated Theorem Prover LLMs Generalize Beyond MathLib?
by: Taylor, Alexander K, et al.
Published: (2026)

The Depth Delusion: Why Transformers Should Be Wider, Not Deeper
by: Fahim, Md Muhtasim Munif, et al.
Published: (2026)

Pairwise Difference Learning for Classification
by: Belaid, Mohamed Karim, et al.
Published: (2024)

Frontier LLMs Still Struggle with Simple Reasoning Tasks
by: Malek, Alan, et al.
Published: (2025)

The Struggles of LLMs in Cross-lingual Code Clone Detection
by: Moumoula, Micheline Bénédicte, et al.
Published: (2024)

Lost and Found in Translation: Variational Diagnostics for Neural Codebook Channels
by: Hayashi, Yusuke
Published: (2026)

LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
by: Pan, Zhuoshi, et al.
Published: (2025)

DAG-Math: Graph-of-Thought Guided Mathematical Reasoning in LLMs
by: Zhang, Yuanhe, et al.
Published: (2025)

Multi-Armed Bandits-Based Optimization of Decision Trees
by: Shanto, Hasibul Karim, et al.
Published: (2025)

Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
by: Prasanna, Sai, et al.
Published: (2024)

The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs
by: Deng, Yonghong, et al.
Published: (2026)

SELF-[IN]CORRECT: LLMs Struggle with Discriminating Self-Generated Responses
by: Jiang, Dongwei, et al.
Published: (2024)

Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm
by: Vakili, Sattar, et al.
Published: (2024)

Kernelized Reinforcement Learning with Order Optimal Regret Bounds
by: Vakili, Sattar, et al.
Published: (2023)

AMLNet: A Knowledge-Based Multi-Agent Framework to Generate and Detect Realistic Money Laundering Transactions
by: Huda, Sabin, et al.
Published: (2025)

Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models
by: Era, Jalisha Jashim, et al.
Published: (2025)

Safe Learning Under Irreversible Dynamics via Asking for Help
by: Plaut, Benjamin, et al.
Published: (2025)

CulturalBench: A Robust, Diverse, and Challenging Cultural Benchmark by Human-AI CulturalTeaming
by: Chiu, Yu Ying, et al.
Published: (2024)

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
by: Zhu, Yuqi, et al.
Published: (2025)

Do LLMs Encode Functional Importance of Reasoning Tokens?
by: Singh, Janvijay, et al.
Published: (2026)

Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning
by: Wahab, Abdul, et al.
Published: (2026)

Efficiently Deploying LLMs with Controlled Risk
by: Zellinger, Michael J., et al.
Published: (2024)

BertaQA: How Much Do Language Models Know About Local Culture?
by: Etxaniz, Julen, et al.
Published: (2024)

Enhancing Frame Detection with Retrieval Augmented Generation
by: Diallo, Papa Abdou Karim Karou, et al.
Published: (2025)

CultureLLM: Incorporating Cultural Differences into Large Language Models
by: Li, Cheng, et al.
Published: (2024)

ABS: Enforcing Constraint Satisfaction On Generated Sequences Via Automata-Guided Beam Search
by: Collura, Vincenzo, et al.
Published: (2025)

Constraint-Guided Prediction Refinement via Deterministic Diffusion Trajectories
by: Dogoulis, Pantelis, et al.
Published: (2025)

X-REFINE: XAI-based RElevance input-Filtering and archItecture fiNe-tuning for channel Estimation
by: Gizzini, Abdul Karim, et al.
Published: (2026)

BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
by: Petrov, Ivo, et al.
Published: (2025)

An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
by: Chen, Zui, et al.
Published: (2024)

Leveraging Intermediate Representations of Time Series Foundation Models for Anomaly Detection
by: Han, Chan Sik, et al.
Published: (2025)

eMargin: Revisiting Contrastive Learning with Margin-Based Separation
by: Shamba, Abdul-Kazeem, et al.
Published: (2025)

Machine Learning Risk Intelligence for Green Hydrogen Investment: Insights for Duqm R3 Auction
by: Nwafor, Obumneme, et al.
Published: (2025)

Generative Modeling with Flow-Guided Density Ratio Learning
by: Heng, Alvin, et al.
Published: (2023)