Saved in:
| Main Authors: | Nekoei, Hadi, Jaiswal, Aman, Bechard, Patrice, Shliazhko, Oleh, Ayala, Orlando Marquez, Reymond, Mathieu, Caccia, Massimo, Drouin, Alexandre, Chandar, Sarath, Lacoste, Alexandre |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.04373 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reducing hallucination in structured outputs via Retrieval-Augmented Generation
by: Béchard, Patrice, et al.
Published: (2024)
by: Béchard, Patrice, et al.
Published: (2024)
Multi-task retriever fine-tuning for domain-specific and efficient RAG
by: Béchard, Patrice, et al.
Published: (2025)
by: Béchard, Patrice, et al.
Published: (2025)
Generating a Low-code Complete Workflow via Task Decomposition and RAG
by: Ayala, Orlando Marquez, et al.
Published: (2024)
by: Ayala, Orlando Marquez, et al.
Published: (2024)
A Generalist Hanabi Agent
by: Sudhakar, Arjun V, et al.
Published: (2025)
by: Sudhakar, Arjun V, et al.
Published: (2025)
Shielded Controller Units for RL with Operational Constraints Applied to Remote Microgrids
by: Nekoei, Hadi, et al.
Published: (2025)
by: Nekoei, Hadi, et al.
Published: (2025)
How to Train Your LLM Web Agent: A Statistical Diagnosis
by: Vattikonda, Dheeraj, et al.
Published: (2025)
by: Vattikonda, Dheeraj, et al.
Published: (2025)
Squeezing More from the Stream : Learning Representation Online for Streaming Reinforcement Learning
by: Nilaksh, et al.
Published: (2026)
by: Nilaksh, et al.
Published: (2026)
Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows
by: Ayala, Orlando Marquez, et al.
Published: (2025)
by: Ayala, Orlando Marquez, et al.
Published: (2025)
GRPO-$λ$: Credit Assignment improves LLM Reasoning
by: Parthasarathi, Prasanna, et al.
Published: (2025)
by: Parthasarathi, Prasanna, et al.
Published: (2025)
CoPeP: Benchmarking Continual Pretraining for Protein Language Models
by: Patil, Darshan, et al.
Published: (2026)
by: Patil, Darshan, et al.
Published: (2026)
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
by: Drouin, Alexandre, et al.
Published: (2024)
by: Drouin, Alexandre, et al.
Published: (2024)
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
by: Boisvert, Léo, et al.
Published: (2024)
by: Boisvert, Léo, et al.
Published: (2024)
LineRetriever: Planning-Aware Observation Reduction for Web Agents
by: Kerboua, Imene, et al.
Published: (2025)
by: Kerboua, Imene, et al.
Published: (2025)
CrystalGym: A New Benchmark for Materials Discovery Using Reinforcement Learning
by: Govindarajan, Prashant, et al.
Published: (2025)
by: Govindarajan, Prashant, et al.
Published: (2025)
Privileged Information Distillation for Language Models
by: Penaloza, Emiliano, et al.
Published: (2026)
by: Penaloza, Emiliano, et al.
Published: (2026)
FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents
by: Kerboua, Imene, et al.
Published: (2025)
by: Kerboua, Imene, et al.
Published: (2025)
Mem-$π$: Adaptive Memory through Learning When and What to Generate
by: Wang, Xiaoqiang, et al.
Published: (2026)
by: Wang, Xiaoqiang, et al.
Published: (2026)
CUBE: A Standard for Unifying Agent Benchmarks
by: Lacoste, Alexandre, et al.
Published: (2026)
by: Lacoste, Alexandre, et al.
Published: (2026)
Terminal Agents Suffice for Enterprise Automation
by: Bechard, Patrice, et al.
Published: (2026)
by: Bechard, Patrice, et al.
Published: (2026)
Sub-goal Distillation: A Method to Improve Small Language Agents
by: Hashemzadeh, Maryam, et al.
Published: (2024)
by: Hashemzadeh, Maryam, et al.
Published: (2024)
Dialectics of Alignment: Harnessing Unsafe Knowledge for Dynamic Safety Routing
by: Hashemzadeh, Maryam, et al.
Published: (2026)
by: Hashemzadeh, Maryam, et al.
Published: (2026)
The BrowserGym Ecosystem for Web Agent Research
by: De Chezelles, Thibault Le Sellier, et al.
Published: (2024)
by: De Chezelles, Thibault Le Sellier, et al.
Published: (2024)
Hinter der glitzernden Fassade
by: Safiyev, Rail
Published: (2026)
by: Safiyev, Rail
Published: (2026)
Hinter_Fragen der Erziehungswissenschaft
Published: (2022)
Published: (2022)
Faithfulness Measurable Masked Language Models
by: Madsen, Andreas, et al.
Published: (2023)
by: Madsen, Andreas, et al.
Published: (2023)
Are self-explanations from Large Language Models faithful?
by: Madsen, Andreas, et al.
Published: (2024)
by: Madsen, Andreas, et al.
Published: (2024)
On the Costs and Benefits of Adopting Lifelong Learning for Software Analytics -- Empirical Study on Brown Build and Risk Prediction
by: Olewicki, Doriane, et al.
Published: (2023)
by: Olewicki, Doriane, et al.
Published: (2023)
Generalization Bounds via Meta-Learned Model Representations: PAC-Bayes and Sample Compression Hypernetworks
by: Leblanc, Benjamin, et al.
Published: (2024)
by: Leblanc, Benjamin, et al.
Published: (2024)
Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models
by: Nilaksh, et al.
Published: (2026)
by: Nilaksh, et al.
Published: (2026)
Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain
by: Boisvert, Léo, et al.
Published: (2025)
by: Boisvert, Léo, et al.
Published: (2025)
Optimizing What Matters: AUC-Driven Learning for Robust Neural Retrieval
by: Sheikholeslami, Nima, et al.
Published: (2025)
by: Sheikholeslami, Nima, et al.
Published: (2025)
Knowing When Not to Answer: Evaluating Abstention in Multimodal Reasoning Systems
by: Madhusudhan, Nishanth, et al.
Published: (2026)
by: Madhusudhan, Nishanth, et al.
Published: (2026)
Promoting Exploration in Memory-Augmented Adam using Critical Momenta
by: Malviya, Pranshu, et al.
Published: (2023)
by: Malviya, Pranshu, et al.
Published: (2023)
LLMs Can't Play Hangman: On the Necessity of a Private Working Memory for Language Agents
by: Baldelli, Davide, et al.
Published: (2026)
by: Baldelli, Davide, et al.
Published: (2026)
Neural Coherence : Find higher performance to out-of-distribution tasks from few samples
by: Guiroy, Simon, et al.
Published: (2025)
by: Guiroy, Simon, et al.
Published: (2025)
Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
by: Prato, Gabriele, et al.
Published: (2025)
by: Prato, Gabriele, et al.
Published: (2025)
The Expressive Limits of Diagonal SSMs for State-Tracking
by: Shakerinava, Mehran, et al.
Published: (2026)
by: Shakerinava, Mehran, et al.
Published: (2026)
Intelligent Switching for Reset-Free RL
by: Patil, Darshan, et al.
Published: (2024)
by: Patil, Darshan, et al.
Published: (2024)
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
by: Huang, Jerry, et al.
Published: (2024)
by: Huang, Jerry, et al.
Published: (2024)
Interpretability Needs a New Paradigm
by: Madsen, Andreas, et al.
Published: (2024)
by: Madsen, Andreas, et al.
Published: (2024)
Similar Items
-
Reducing hallucination in structured outputs via Retrieval-Augmented Generation
by: Béchard, Patrice, et al.
Published: (2024) -
Multi-task retriever fine-tuning for domain-specific and efficient RAG
by: Béchard, Patrice, et al.
Published: (2025) -
Generating a Low-code Complete Workflow via Task Decomposition and RAG
by: Ayala, Orlando Marquez, et al.
Published: (2024) -
A Generalist Hanabi Agent
by: Sudhakar, Arjun V, et al.
Published: (2025) -
Shielded Controller Units for RL with Operational Constraints Applied to Remote Microgrids
by: Nekoei, Hadi, et al.
Published: (2025)