:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Nekoei, Hadi, Jaiswal, Aman, Bechard, Patrice, Shliazhko, Oleh, Ayala, Orlando Marquez, Reymond, Mathieu, Caccia, Massimo, Drouin, Alexandre, Chandar, Sarath, Lacoste, Alexandre
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.04373
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Reducing hallucination in structured outputs via Retrieval-Augmented Generation
by: Béchard, Patrice, et al.
Published: (2024)

Multi-task retriever fine-tuning for domain-specific and efficient RAG
by: Béchard, Patrice, et al.
Published: (2025)

Generating a Low-code Complete Workflow via Task Decomposition and RAG
by: Ayala, Orlando Marquez, et al.
Published: (2024)

A Generalist Hanabi Agent
by: Sudhakar, Arjun V, et al.
Published: (2025)

Shielded Controller Units for RL with Operational Constraints Applied to Remote Microgrids
by: Nekoei, Hadi, et al.
Published: (2025)

How to Train Your LLM Web Agent: A Statistical Diagnosis
by: Vattikonda, Dheeraj, et al.
Published: (2025)

Squeezing More from the Stream : Learning Representation Online for Streaming Reinforcement Learning
by: Nilaksh, et al.
Published: (2026)

Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows
by: Ayala, Orlando Marquez, et al.
Published: (2025)

GRPO-$λ$: Credit Assignment improves LLM Reasoning
by: Parthasarathi, Prasanna, et al.
Published: (2025)

CoPeP: Benchmarking Continual Pretraining for Protein Language Models
by: Patil, Darshan, et al.
Published: (2026)

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
by: Drouin, Alexandre, et al.
Published: (2024)

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
by: Boisvert, Léo, et al.
Published: (2024)

LineRetriever: Planning-Aware Observation Reduction for Web Agents
by: Kerboua, Imene, et al.
Published: (2025)

CrystalGym: A New Benchmark for Materials Discovery Using Reinforcement Learning
by: Govindarajan, Prashant, et al.
Published: (2025)

Privileged Information Distillation for Language Models
by: Penaloza, Emiliano, et al.
Published: (2026)

FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents
by: Kerboua, Imene, et al.
Published: (2025)

Mem-$π$: Adaptive Memory through Learning When and What to Generate
by: Wang, Xiaoqiang, et al.
Published: (2026)

CUBE: A Standard for Unifying Agent Benchmarks
by: Lacoste, Alexandre, et al.
Published: (2026)

Terminal Agents Suffice for Enterprise Automation
by: Bechard, Patrice, et al.
Published: (2026)

Sub-goal Distillation: A Method to Improve Small Language Agents
by: Hashemzadeh, Maryam, et al.
Published: (2024)

Dialectics of Alignment: Harnessing Unsafe Knowledge for Dynamic Safety Routing
by: Hashemzadeh, Maryam, et al.
Published: (2026)

The BrowserGym Ecosystem for Web Agent Research
by: De Chezelles, Thibault Le Sellier, et al.
Published: (2024)

Hinter der glitzernden Fassade
by: Safiyev, Rail
Published: (2026)

Hinter_Fragen der Erziehungswissenschaft
Published: (2022)

Faithfulness Measurable Masked Language Models
by: Madsen, Andreas, et al.
Published: (2023)

Are self-explanations from Large Language Models faithful?
by: Madsen, Andreas, et al.
Published: (2024)

On the Costs and Benefits of Adopting Lifelong Learning for Software Analytics -- Empirical Study on Brown Build and Risk Prediction
by: Olewicki, Doriane, et al.
Published: (2023)

Generalization Bounds via Meta-Learned Model Representations: PAC-Bayes and Sample Compression Hypernetworks
by: Leblanc, Benjamin, et al.
Published: (2024)

Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models
by: Nilaksh, et al.
Published: (2026)

Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain
by: Boisvert, Léo, et al.
Published: (2025)

Optimizing What Matters: AUC-Driven Learning for Robust Neural Retrieval
by: Sheikholeslami, Nima, et al.
Published: (2025)

Knowing When Not to Answer: Evaluating Abstention in Multimodal Reasoning Systems
by: Madhusudhan, Nishanth, et al.
Published: (2026)

Promoting Exploration in Memory-Augmented Adam using Critical Momenta
by: Malviya, Pranshu, et al.
Published: (2023)

LLMs Can't Play Hangman: On the Necessity of a Private Working Memory for Language Agents
by: Baldelli, Davide, et al.
Published: (2026)

Neural Coherence : Find higher performance to out-of-distribution tasks from few samples
by: Guiroy, Simon, et al.
Published: (2025)

Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
by: Prato, Gabriele, et al.
Published: (2025)

The Expressive Limits of Diagonal SSMs for State-Tracking
by: Shakerinava, Mehran, et al.
Published: (2026)

Intelligent Switching for Reset-Free RL
by: Patil, Darshan, et al.
Published: (2024)

Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
by: Huang, Jerry, et al.
Published: (2024)

Interpretability Needs a New Paradigm
by: Madsen, Andreas, et al.
Published: (2024)