Saved in:
| Main Authors: | Roth, K, Gupta, Rushil, Halle, Simon, Liu, Bang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.01344 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Benchmark for Procedural Memory Retrieval in Language Agents
by: Kohar, Ishant, et al.
Published: (2025)
by: Kohar, Ishant, et al.
Published: (2025)
Memp: Exploring Agent Procedural Memory
by: Fang, Runnan, et al.
Published: (2025)
by: Fang, Runnan, et al.
Published: (2025)
Towards an Action-Centric Ontology for Cooking Procedures Using Temporal Graphs
by: Kumbhakern, Aarush, et al.
Published: (2025)
by: Kumbhakern, Aarush, et al.
Published: (2025)
Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution
by: Cao, Zouying, et al.
Published: (2025)
by: Cao, Zouying, et al.
Published: (2025)
Transparent and Coherent Procedural Mistake Detection
by: Storks, Shane, et al.
Published: (2024)
by: Storks, Shane, et al.
Published: (2024)
InfiniteScienceGym: An Unbounded, Procedurally-Generated Benchmark for Scientific Analysis
by: Bentham, Oliver, et al.
Published: (2026)
by: Bentham, Oliver, et al.
Published: (2026)
PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games
by: Buongiorno, Steph, et al.
Published: (2024)
by: Buongiorno, Steph, et al.
Published: (2024)
IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation
by: Baek, In-Chang, et al.
Published: (2025)
by: Baek, In-Chang, et al.
Published: (2025)
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
by: An, Bang, et al.
Published: (2025)
by: An, Bang, et al.
Published: (2025)
Graph Guided Question Answer Generation for Procedural Question-Answering
by: Pham, Hai X., et al.
Published: (2024)
by: Pham, Hai X., et al.
Published: (2024)
MAC: Multi-Agent Constitution Learning
by: Thareja, Rushil, et al.
Published: (2026)
by: Thareja, Rushil, et al.
Published: (2026)
SOPBench: Evaluating Language Agents at Following Standard Operating Procedures and Constraints
by: Li, Zekun, et al.
Published: (2025)
by: Li, Zekun, et al.
Published: (2025)
Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models
by: Dahl, Matthew
Published: (2025)
by: Dahl, Matthew
Published: (2025)
Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models
by: Li, Zhuoqun, et al.
Published: (2024)
by: Li, Zhuoqun, et al.
Published: (2024)
Behavior-Aware Item Modeling via Dynamic Procedural Solution Representations for Knowledge Tracing
by: Seo, Jun, et al.
Published: (2026)
by: Seo, Jun, et al.
Published: (2026)
Mem-$π$: Adaptive Memory through Learning When and What to Generate
by: Wang, Xiaoqiang, et al.
Published: (2026)
by: Wang, Xiaoqiang, et al.
Published: (2026)
ProcBench: Benchmark for Multi-Step Reasoning and Following Procedure
by: Fujisawa, Ippei, et al.
Published: (2024)
by: Fujisawa, Ippei, et al.
Published: (2024)
R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory
by: Li, Maoyuan, et al.
Published: (2025)
by: Li, Maoyuan, et al.
Published: (2025)
Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation
by: Barakat, Mariam, et al.
Published: (2026)
by: Barakat, Mariam, et al.
Published: (2026)
R$^3$Mem: Bridging Memory Retention and Retrieval via Reversible Compression
by: Wang, Xiaoqiang, et al.
Published: (2025)
by: Wang, Xiaoqiang, et al.
Published: (2025)
Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment
by: Pei, Jiahuan, et al.
Published: (2025)
by: Pei, Jiahuan, et al.
Published: (2025)
DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data Augmentation
by: Heil, Maximilian, et al.
Published: (2025)
by: Heil, Maximilian, et al.
Published: (2025)
LLMs for Bayesian Optimization in Scientific Domains: Are We There Yet?
by: Gupta, Rushil, et al.
Published: (2025)
by: Gupta, Rushil, et al.
Published: (2025)
Imagine How To Change: Explicit Procedure Modeling for Change Captioning
by: Sun, Jiayang, et al.
Published: (2026)
by: Sun, Jiayang, et al.
Published: (2026)
Multilingual LLMs Are Not Multilingual Thinkers: Evidence from Hindi Analogy Evaluation
by: Gupta, Ashray, et al.
Published: (2025)
by: Gupta, Ashray, et al.
Published: (2025)
Clustered Retrieved Augmented Generation (CRAG)
by: Akesson, Simon, et al.
Published: (2024)
by: Akesson, Simon, et al.
Published: (2024)
RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering
by: Wu, Zhuoyu, et al.
Published: (2026)
by: Wu, Zhuoyu, et al.
Published: (2026)
GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation
by: Rappazzo, Brendan Hogan, et al.
Published: (2024)
by: Rappazzo, Brendan Hogan, et al.
Published: (2024)
GameTileNet: A Semantic Dataset for Low-Resolution Game Art in Procedural Content Generation
by: Chen, Yi-Chun, et al.
Published: (2025)
by: Chen, Yi-Chun, et al.
Published: (2025)
Routing by Analogy: kNN-Augmented Expert Assignment for Mixture-of-Experts
by: Lyu, Boxuan, et al.
Published: (2026)
by: Lyu, Boxuan, et al.
Published: (2026)
Human Evaluation of Procedural Knowledge Graph Extraction from Text with Large Language Models
by: Carriero, Valentina Anita, et al.
Published: (2024)
by: Carriero, Valentina Anita, et al.
Published: (2024)
DP-Fusion: Token-Level Differentially Private Inference for Large Language Models
by: Thareja, Rushil, et al.
Published: (2025)
by: Thareja, Rushil, et al.
Published: (2025)
Out-of-Context Abduction: LLMs Make Inferences About Procedural Data Leveraging Declarative Facts in Earlier Training Data
by: Imran, Sohaib, et al.
Published: (2025)
by: Imran, Sohaib, et al.
Published: (2025)
BioRAGent: A Retrieval-Augmented Generation System for Showcasing Generative Query Expansion and Domain-Specific Search for Scientific Q&A
by: Ateia, Samy, et al.
Published: (2024)
by: Ateia, Samy, et al.
Published: (2024)
Q-Mirror: Unlocking the Multi-Modal Potential of Scientific Text-Only QA Pairs
by: Wang, Junying, et al.
Published: (2025)
by: Wang, Junying, et al.
Published: (2025)
Memory-Augmented Agent Training for Business Document Understanding
by: Liu, Jiale, et al.
Published: (2024)
by: Liu, Jiale, et al.
Published: (2024)
MADial-Bench: Towards Real-world Evaluation of Memory-Augmented Dialogue Generation
by: He, Junqing, et al.
Published: (2024)
by: He, Junqing, et al.
Published: (2024)
MemLong: Memory-Augmented Retrieval for Long Text Modeling
by: Liu, Weijie, et al.
Published: (2024)
by: Liu, Weijie, et al.
Published: (2024)
High-Quality Data Augmentation for Low-Resource NMT: Combining a Translation Memory, a GAN Generator, and Filtering
by: Liu, Hengjie, et al.
Published: (2024)
by: Liu, Hengjie, et al.
Published: (2024)
PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning
by: Brahman, Faeze, et al.
Published: (2023)
by: Brahman, Faeze, et al.
Published: (2023)
Similar Items
-
A Benchmark for Procedural Memory Retrieval in Language Agents
by: Kohar, Ishant, et al.
Published: (2025) -
Memp: Exploring Agent Procedural Memory
by: Fang, Runnan, et al.
Published: (2025) -
Towards an Action-Centric Ontology for Cooking Procedures Using Temporal Graphs
by: Kumbhakern, Aarush, et al.
Published: (2025) -
Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution
by: Cao, Zouying, et al.
Published: (2025) -
Transparent and Coherent Procedural Mistake Detection
by: Storks, Shane, et al.
Published: (2024)