Saved in:
| Main Authors: | Karim, Aabid, Karim, Abdul, Lohana, Bhoomika, Keon, Matt, Singh, Jaswinder, Sattar, Abdul |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.18018 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Galton's Law of Mediocrity: Why Large Language Models Regress to the Mean and Fail at Creativity in Advertising
by: Keon, Matt, et al.
Published: (2025)
by: Keon, Matt, et al.
Published: (2025)
When Intelligence Fails: An Empirical Study on Why LLMs Struggle with Password Cracking
by: Rehman, Mohammad Abdul, et al.
Published: (2025)
by: Rehman, Mohammad Abdul, et al.
Published: (2025)
FTT-GRU: A Hybrid Fast Temporal Transformer with GRU for Remaining Useful Life Prediction
by: Chirukiri, Varun Teja, et al.
Published: (2025)
by: Chirukiri, Varun Teja, et al.
Published: (2025)
FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
by: Karim, Mohammed Asad, et al.
Published: (2026)
by: Karim, Mohammed Asad, et al.
Published: (2026)
Learning Bug Context for PyTorch-to-JAX Translation with LLMs
by: Phan, Hung, et al.
Published: (2025)
by: Phan, Hung, et al.
Published: (2025)
Dynamic Vocabulary Pruning in Early-Exit LLMs
by: Vincenti, Jort, et al.
Published: (2024)
by: Vincenti, Jort, et al.
Published: (2024)
TaoBench: Do Automated Theorem Prover LLMs Generalize Beyond MathLib?
by: Taylor, Alexander K, et al.
Published: (2026)
by: Taylor, Alexander K, et al.
Published: (2026)
The Depth Delusion: Why Transformers Should Be Wider, Not Deeper
by: Fahim, Md Muhtasim Munif, et al.
Published: (2026)
by: Fahim, Md Muhtasim Munif, et al.
Published: (2026)
Pairwise Difference Learning for Classification
by: Belaid, Mohamed Karim, et al.
Published: (2024)
by: Belaid, Mohamed Karim, et al.
Published: (2024)
Frontier LLMs Still Struggle with Simple Reasoning Tasks
by: Malek, Alan, et al.
Published: (2025)
by: Malek, Alan, et al.
Published: (2025)
The Struggles of LLMs in Cross-lingual Code Clone Detection
by: Moumoula, Micheline Bénédicte, et al.
Published: (2024)
by: Moumoula, Micheline Bénédicte, et al.
Published: (2024)
Lost and Found in Translation: Variational Diagnostics for Neural Codebook Channels
by: Hayashi, Yusuke
Published: (2026)
by: Hayashi, Yusuke
Published: (2026)
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
by: Pan, Zhuoshi, et al.
Published: (2025)
by: Pan, Zhuoshi, et al.
Published: (2025)
DAG-Math: Graph-of-Thought Guided Mathematical Reasoning in LLMs
by: Zhang, Yuanhe, et al.
Published: (2025)
by: Zhang, Yuanhe, et al.
Published: (2025)
Multi-Armed Bandits-Based Optimization of Decision Trees
by: Shanto, Hasibul Karim, et al.
Published: (2025)
by: Shanto, Hasibul Karim, et al.
Published: (2025)
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
by: Prasanna, Sai, et al.
Published: (2024)
by: Prasanna, Sai, et al.
Published: (2024)
The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs
by: Deng, Yonghong, et al.
Published: (2026)
by: Deng, Yonghong, et al.
Published: (2026)
SELF-[IN]CORRECT: LLMs Struggle with Discriminating Self-Generated Responses
by: Jiang, Dongwei, et al.
Published: (2024)
by: Jiang, Dongwei, et al.
Published: (2024)
Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm
by: Vakili, Sattar, et al.
Published: (2024)
by: Vakili, Sattar, et al.
Published: (2024)
Kernelized Reinforcement Learning with Order Optimal Regret Bounds
by: Vakili, Sattar, et al.
Published: (2023)
by: Vakili, Sattar, et al.
Published: (2023)
AMLNet: A Knowledge-Based Multi-Agent Framework to Generate and Detect Realistic Money Laundering Transactions
by: Huda, Sabin, et al.
Published: (2025)
by: Huda, Sabin, et al.
Published: (2025)
Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models
by: Era, Jalisha Jashim, et al.
Published: (2025)
by: Era, Jalisha Jashim, et al.
Published: (2025)
Safe Learning Under Irreversible Dynamics via Asking for Help
by: Plaut, Benjamin, et al.
Published: (2025)
by: Plaut, Benjamin, et al.
Published: (2025)
CulturalBench: A Robust, Diverse, and Challenging Cultural Benchmark by Human-AI CulturalTeaming
by: Chiu, Yu Ying, et al.
Published: (2024)
by: Chiu, Yu Ying, et al.
Published: (2024)
Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
by: Zhu, Yuqi, et al.
Published: (2025)
by: Zhu, Yuqi, et al.
Published: (2025)
Do LLMs Encode Functional Importance of Reasoning Tokens?
by: Singh, Janvijay, et al.
Published: (2026)
by: Singh, Janvijay, et al.
Published: (2026)
Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning
by: Wahab, Abdul, et al.
Published: (2026)
by: Wahab, Abdul, et al.
Published: (2026)
Efficiently Deploying LLMs with Controlled Risk
by: Zellinger, Michael J., et al.
Published: (2024)
by: Zellinger, Michael J., et al.
Published: (2024)
BertaQA: How Much Do Language Models Know About Local Culture?
by: Etxaniz, Julen, et al.
Published: (2024)
by: Etxaniz, Julen, et al.
Published: (2024)
Enhancing Frame Detection with Retrieval Augmented Generation
by: Diallo, Papa Abdou Karim Karou, et al.
Published: (2025)
by: Diallo, Papa Abdou Karim Karou, et al.
Published: (2025)
CultureLLM: Incorporating Cultural Differences into Large Language Models
by: Li, Cheng, et al.
Published: (2024)
by: Li, Cheng, et al.
Published: (2024)
ABS: Enforcing Constraint Satisfaction On Generated Sequences Via Automata-Guided Beam Search
by: Collura, Vincenzo, et al.
Published: (2025)
by: Collura, Vincenzo, et al.
Published: (2025)
Constraint-Guided Prediction Refinement via Deterministic Diffusion Trajectories
by: Dogoulis, Pantelis, et al.
Published: (2025)
by: Dogoulis, Pantelis, et al.
Published: (2025)
X-REFINE: XAI-based RElevance input-Filtering and archItecture fiNe-tuning for channel Estimation
by: Gizzini, Abdul Karim, et al.
Published: (2026)
by: Gizzini, Abdul Karim, et al.
Published: (2026)
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
by: Petrov, Ivo, et al.
Published: (2025)
by: Petrov, Ivo, et al.
Published: (2025)
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
by: Chen, Zui, et al.
Published: (2024)
by: Chen, Zui, et al.
Published: (2024)
Leveraging Intermediate Representations of Time Series Foundation Models for Anomaly Detection
by: Han, Chan Sik, et al.
Published: (2025)
by: Han, Chan Sik, et al.
Published: (2025)
eMargin: Revisiting Contrastive Learning with Margin-Based Separation
by: Shamba, Abdul-Kazeem, et al.
Published: (2025)
by: Shamba, Abdul-Kazeem, et al.
Published: (2025)
Machine Learning Risk Intelligence for Green Hydrogen Investment: Insights for Duqm R3 Auction
by: Nwafor, Obumneme, et al.
Published: (2025)
by: Nwafor, Obumneme, et al.
Published: (2025)
Generative Modeling with Flow-Guided Density Ratio Learning
by: Heng, Alvin, et al.
Published: (2023)
by: Heng, Alvin, et al.
Published: (2023)
Similar Items
-
Galton's Law of Mediocrity: Why Large Language Models Regress to the Mean and Fail at Creativity in Advertising
by: Keon, Matt, et al.
Published: (2025) -
When Intelligence Fails: An Empirical Study on Why LLMs Struggle with Password Cracking
by: Rehman, Mohammad Abdul, et al.
Published: (2025) -
FTT-GRU: A Hybrid Fast Temporal Transformer with GRU for Remaining Useful Life Prediction
by: Chirukiri, Varun Teja, et al.
Published: (2025) -
FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
by: Karim, Mohammed Asad, et al.
Published: (2026) -
Learning Bug Context for PyTorch-to-JAX Translation with LLMs
by: Phan, Hung, et al.
Published: (2025)