:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Enomoto, Masafumi, Takeoka, Kunihiro, Akimoto, Kosuke, Gashteovski, Kiril, Oyamada, Masafumi
Formato:	Preprint
Publicado:	2024
Materias:	Computation and Language
Acceso en línea:	https://arxiv.org/abs/2406.12494
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering
por: Akimoto, Kosuke, et al.
Publicado: (2024)

$M^3$ Scaling Law: Optimizing Multi-Epoch, Multi-Lingual, and Multi-Stage Training for Low-Resource Language Models
por: Akimoto, Kosuke, et al.
Publicado: (2024)

Revisiting Observation Reduction for Web Agents: Comprehensive Evaluation with a Lightweight Framework
por: Enomoto, Masafumi, et al.
Publicado: (2026)

cotomi Act: Learning to Automate Work by Watching You
por: Oyamada, Masafumi, et al.
Publicado: (2026)

Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems
por: Kusano, Genki, et al.
Publicado: (2024)

On Synthesizing Data for Context Attribution in Question Answering
por: Radevski, Gorjan, et al.
Publicado: (2025)

Read More, Think More: Revisiting Observation Reduction for Web Agents
por: Enomoto, Masafumi, et al.
Publicado: (2026)

Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance
por: Tamura, Takuya, et al.
Publicado: (2025)

LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries
por: Abe, Kenya, et al.
Publicado: (2025)

Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection
por: Dukić, David, et al.
Publicado: (2023)

Can Large Language Models Invent Algorithms to Improve Themselves?: Algorithm Discovery for Recursive Self-Improvement through Reinforcement Learning
por: Ishibashi, Yoichi, et al.
Publicado: (2024)

An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability
por: Yamauchi, Yusuke, et al.
Publicado: (2025)

LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents
por: Yano, Taro, et al.
Publicado: (2025)

Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning
por: Ishibashi, Yoichi, et al.
Publicado: (2025)

Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation
por: Kusano, Genki, et al.
Publicado: (2025)

Effective Harness Engineering for Algorithm Discovery with Coding Agents
por: Ishibashi, Yoichi, et al.
Publicado: (2026)

Jellyfish: A Large Language Model for Data Preprocessing
por: Zhang, Haochen, et al.
Publicado: (2023)

Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain
por: Ozaki, Shintaro, et al.
Publicado: (2024)

Robust Text Classification: Analyzing Prototype-Based Networks
por: Sourati, Zhivar, et al.
Publicado: (2023)

TextMineX: Data, Evaluation Framework and Ontology-guided LLM Pipeline for Humanitarian Mine Action
por: Zhou, Chenyue, et al.
Publicado: (2025)

Compositional Steering of Large Language Models with Steering Tokens
por: Radevski, Gorjan, et al.
Publicado: (2026)

Beyond Independent Passages: Adaptive Passage Combination Retrieval for Retrieval Augmented Open-Domain Question Answering
por: Ko, Ting-Wen, et al.
Publicado: (2025)

MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis
por: Rose, Daniel, et al.
Publicado: (2025)

AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents
por: Gioacchini, Luca, et al.
Publicado: (2024)

DAPR: A Benchmark on Document-Aware Passage Retrieval
por: Wang, Kexin, et al.
Publicado: (2023)

Dense Passage Retrieval: Is it Retrieving?
por: Reichman, Benjamin, et al.
Publicado: (2024)

Best-of-$\infty$ -- Asymptotic Performance of Test-Time LLM Ensembling
por: Komiyama, Junpei, et al.
Publicado: (2025)

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
por: Lin, Chyi-Jiunn, et al.
Publicado: (2024)

Evaluating Language Models as Synthetic Data Generators
por: Kim, Seungone, et al.
Publicado: (2024)

SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation
por: Ueda, Nobuhiro, et al.
Publicado: (2025)

QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs
por: Kim, Minsang, et al.
Publicado: (2024)

On The Persona-based Summarization of Domain-Specific Documents
por: Mullick, Ankan, et al.
Publicado: (2024)

Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval
por: Pavlova, Vera
Publicado: (2025)

Cohort Retrieval using Dense Passage Retrieval
por: Jadhav, Pranav
Publicado: (2025)

Scaling Evaluation-time Compute with Reasoning Models as Evaluators
por: Kim, Seungone, et al.
Publicado: (2025)

Stent Retrieval Technique Using a Basket Catheter With a Rotation Function for Retrieval of Thread‐Attached Stent
por: Masafumi Watanabe, et al.
Publicado: (2025)

Domain Adaptation of LLMs for Process Data
por: Oyamada, Rafael Seidi, et al.
Publicado: (2025)

Do Multi-Document Summarization Models Synthesize?
por: DeYoung, Jay, et al.
Publicado: (2023)

Control Token with Dense Passage Retrieval
por: Lee, Juhwan, et al.
Publicado: (2024)

Summarization-Based Document IDs for Generative Retrieval with Language Models
por: Li, Haoxin, et al.
Publicado: (2023)