Saved in:
| Main Authors: | Mahdavi, Sadegh, Aoki, Raquel, Tang, Keyi, Cao, Yanshuai |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.12979 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits
by: Bayley, Adam, et al.
Published: (2026)
by: Bayley, Adam, et al.
Published: (2026)
Causal EpiNets: Precision-corrected Bounds on Individual Treatment Effects using Epistemic Neural Networks
by: Patil, Gandharv, et al.
Published: (2026)
by: Patil, Gandharv, et al.
Published: (2026)
Memorization Capacity of Multi-Head Attention in Transformers
by: Mahdavi, Sadegh, et al.
Published: (2023)
by: Mahdavi, Sadegh, et al.
Published: (2023)
End-to-end PDDL Planning with Hardcoded and Dynamic Agents
by: La Malfa, Emanuele, et al.
Published: (2025)
by: La Malfa, Emanuele, et al.
Published: (2025)
Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients
by: Thrampoulidis, Christos, et al.
Published: (2025)
by: Thrampoulidis, Christos, et al.
Published: (2025)
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation
by: Mahdavi, Sadegh, et al.
Published: (2025)
by: Mahdavi, Sadegh, et al.
Published: (2025)
Flora: Low-Rank Adapters Are Secretly Gradient Compressors
by: Hao, Yongchang, et al.
Published: (2024)
by: Hao, Yongchang, et al.
Published: (2024)
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks
by: Hao, Yongchang, et al.
Published: (2024)
by: Hao, Yongchang, et al.
Published: (2024)
Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks
by: Hao, Yongchang, et al.
Published: (2024)
by: Hao, Yongchang, et al.
Published: (2024)
From Graph Diffusion to Graph Classification
by: Xian, Jia Jun Cheng, et al.
Published: (2024)
by: Xian, Jia Jun Cheng, et al.
Published: (2024)
LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits
by: Mirzaei, Amir Reza, et al.
Published: (2025)
by: Mirzaei, Amir Reza, et al.
Published: (2025)
LassoFlexNet: Flexible Neural Architecture for Tabular Data
by: Lui, Kry Yik Chau, et al.
Published: (2026)
by: Lui, Kry Yik Chau, et al.
Published: (2026)
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
by: Carta, Thomas, et al.
Published: (2023)
by: Carta, Thomas, et al.
Published: (2023)
Jump Starting Bandits with LLM-Generated Prior Knowledge
by: Alamdari, Parand A., et al.
Published: (2024)
by: Alamdari, Parand A., et al.
Published: (2024)
Interactive and Expressive Code-Augmented Planning with Large Language Models
by: Liu, Anthony Z., et al.
Published: (2024)
by: Liu, Anthony Z., et al.
Published: (2024)
FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
by: Lin, Xiaohan, et al.
Published: (2024)
by: Lin, Xiaohan, et al.
Published: (2024)
No $D_{\text{train}}$: Model-Agnostic Counterfactual Explanations Using Reinforcement Learning
by: Sun, Xiangyu, et al.
Published: (2024)
by: Sun, Xiangyu, et al.
Published: (2024)
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
by: Wen, Yuqiao, et al.
Published: (2024)
by: Wen, Yuqiao, et al.
Published: (2024)
Exploring Model Invariance with Discrete Search for Ultra-Low-Bit Quantization
by: Wen, Yuqiao, et al.
Published: (2025)
by: Wen, Yuqiao, et al.
Published: (2025)
Harnessing Optimization Dynamics for Curvature-Informed Model Merging
by: Mahdavinia, Pouria, et al.
Published: (2025)
by: Mahdavinia, Pouria, et al.
Published: (2025)
Leveraging Language Models for Automated Patient Record Linkage
by: Beheshti, Mohammad, et al.
Published: (2025)
by: Beheshti, Mohammad, et al.
Published: (2025)
Parallel Differentiable Reachability for Learning and Planning with Certified Neural Dynamics and Controllers
by: Shen, Keyi, et al.
Published: (2026)
by: Shen, Keyi, et al.
Published: (2026)
Leveraging Foundation Language Models (FLMs) for Automated Cohort Extraction from Large EHR Databases
by: Mugambi, Purity, et al.
Published: (2024)
by: Mugambi, Purity, et al.
Published: (2024)
On Large-scale Evaluation of Embedding Models for Knowledge Graph Completion
by: Shirvani-Mahdavi, Nasim, et al.
Published: (2025)
by: Shirvani-Mahdavi, Nasim, et al.
Published: (2025)
Regression with Large Language Models for Materials and Molecular Property Prediction
by: Jacobs, Ryan, et al.
Published: (2024)
by: Jacobs, Ryan, et al.
Published: (2024)
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning
by: Pu, Yuan, et al.
Published: (2025)
by: Pu, Yuan, et al.
Published: (2025)
Towards Leveraging Large Language Models for Automated Medical Q&A Evaluation
by: Krolik, Jack, et al.
Published: (2024)
by: Krolik, Jack, et al.
Published: (2024)
Causal Effects with Unobserved Unit Types in Interacting Human-AI Systems
by: Overman, William, et al.
Published: (2026)
by: Overman, William, et al.
Published: (2026)
Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization
by: Tang, Zilu, et al.
Published: (2025)
by: Tang, Zilu, et al.
Published: (2025)
Integrating Large Language Models in Financial Investments and Market Analysis: A Survey
by: Mahdavi, Sedigheh, et al.
Published: (2025)
by: Mahdavi, Sedigheh, et al.
Published: (2025)
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
by: Aissi, Mohamed Salim, et al.
Published: (2024)
by: Aissi, Mohamed Salim, et al.
Published: (2024)
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models
by: Wu, Zhanglin, et al.
Published: (2025)
by: Wu, Zhanglin, et al.
Published: (2025)
Leveraging Large Language Models for Information Verification -- an Engineering Approach
by: Hung, Nguyen Nang, et al.
Published: (2025)
by: Hung, Nguyen Nang, et al.
Published: (2025)
Assessing the Latent Automated Program Repair Capabilities of Large Language Models using Round-Trip Translation
by: Ruiz, Fernando Vallecillos, et al.
Published: (2024)
by: Ruiz, Fernando Vallecillos, et al.
Published: (2024)
Evolutionary Large Language Model for Automated Feature Transformation
by: Gong, Nanxu, et al.
Published: (2024)
by: Gong, Nanxu, et al.
Published: (2024)
Model Merging via Multi-Teacher Knowledge Distillation
by: Dalili, Seyed Arshan, et al.
Published: (2025)
by: Dalili, Seyed Arshan, et al.
Published: (2025)
Leveraging Large Language Models and Topic Modeling for Toxicity Classification
by: Oskouie, Haniyeh Ehsani, et al.
Published: (2024)
by: Oskouie, Haniyeh Ehsani, et al.
Published: (2024)
Leveraging Large Language Models for Automated Causal Loop Diagram Generation: Enhancing System Dynamics Modeling through Curated Prompting Techniques
by: Liu, Ning-Yuan Georgia, et al.
Published: (2025)
by: Liu, Ning-Yuan Georgia, et al.
Published: (2025)
QoS-QoE Translation with Large Language Model
by: Yu, Yingjie, et al.
Published: (2026)
by: Yu, Yingjie, et al.
Published: (2026)
Merge before Forget: A Single LoRA Continual Learning via Continual Merging
by: Qiao, Fuli, et al.
Published: (2025)
by: Qiao, Fuli, et al.
Published: (2025)
Similar Items
-
Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits
by: Bayley, Adam, et al.
Published: (2026) -
Causal EpiNets: Precision-corrected Bounds on Individual Treatment Effects using Epistemic Neural Networks
by: Patil, Gandharv, et al.
Published: (2026) -
Memorization Capacity of Multi-Head Attention in Transformers
by: Mahdavi, Sadegh, et al.
Published: (2023) -
End-to-end PDDL Planning with Hardcoded and Dynamic Agents
by: La Malfa, Emanuele, et al.
Published: (2025) -
Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients
by: Thrampoulidis, Christos, et al.
Published: (2025)