Saved in:
| Main Authors: | Eisenstein, Jacob, Andor, Daniel, Bohnet, Bernd, Collins, Michael, Mimno, David |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2210.02498 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation
by: Bohnet, Bernd, et al.
Published: (2024)
by: Bohnet, Bernd, et al.
Published: (2024)
Stronger Random Baselines for In-Context Learning
by: Yauney, Gregory, et al.
Published: (2024)
by: Yauney, Gregory, et al.
Published: (2024)
HonestLLM: Toward an Honest and Helpful Large Language Model
by: Gao, Chujie, et al.
Published: (2024)
by: Gao, Chujie, et al.
Published: (2024)
Do Language Models Encode Semantic Relations? Probing and Sparse Feature Analysis
by: Diera, Andor, et al.
Published: (2026)
by: Diera, Andor, et al.
Published: (2026)
Lost in Space: Finding the Right Tokens for Structured Output
by: Hamilton, Sil, et al.
Published: (2025)
by: Hamilton, Sil, et al.
Published: (2025)
End-to-end Contrastive Language-Speech Pretraining Model For Long-form Spoken Question Answering
by: Hu, Jiliang, et al.
Published: (2025)
by: Hu, Jiliang, et al.
Published: (2025)
Parametric Knowledge is Not All You Need: Toward Honest Large Language Models via Retrieval of Pretraining Data
by: Kusuma, Christopher Adrian, et al.
Published: (2026)
by: Kusuma, Christopher Adrian, et al.
Published: (2026)
Analysis of Optimality of Large Language Models on Planning Problems
by: Bohnet, Bernd, et al.
Published: (2026)
by: Bohnet, Bernd, et al.
Published: (2026)
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
by: Jacovi, Alon, et al.
Published: (2024)
by: Jacovi, Alon, et al.
Published: (2024)
Toward Honest Language Models for Deductive Reasoning
by: Liu, Jiarui, et al.
Published: (2025)
by: Liu, Jiarui, et al.
Published: (2025)
Elias in the Lighthouse, Again? Diagnosing Low Diversity in LLM Stories
by: Hamilton, Sil, et al.
Published: (2026)
by: Hamilton, Sil, et al.
Published: (2026)
Pretraining Vision-Language Model for Difference Visual Question Answering in Longitudinal Chest X-rays
by: Cho, Yeongjae, et al.
Published: (2024)
by: Cho, Yeongjae, et al.
Published: (2024)
Efficient Continual Learning for Small Language Models with a Discrete Key-Value Bottleneck
by: Diera, Andor, et al.
Published: (2024)
by: Diera, Andor, et al.
Published: (2024)
A Knowledge-Injected Curriculum Pretraining Framework for Question Answering
by: Lin, Xin, et al.
Published: (2024)
by: Lin, Xin, et al.
Published: (2024)
Interpretable Question Answering with Knowledge Graphs
by: Aneja, Kartikeya, et al.
Published: (2025)
by: Aneja, Kartikeya, et al.
Published: (2025)
TOP-Training: Target-Oriented Pretraining for Medical Extractive Question Answering
by: Sengupta, Saptarshi, et al.
Published: (2023)
by: Sengupta, Saptarshi, et al.
Published: (2023)
Interpretable LLM-based Table Question Answering
by: Nguyen, Giang, et al.
Published: (2024)
by: Nguyen, Giang, et al.
Published: (2024)
The Zero Body Problem: Probing LLM Use of Sensory Language
by: Hicke, Rebecca M. M., et al.
Published: (2025)
by: Hicke, Rebecca M. M., et al.
Published: (2025)
[Lions: 1] and [Tigers: 2] and [Bears: 3], Oh My! Literary Coreference Annotation with LLMs
by: Hicke, Rebecca M. M., et al.
Published: (2024)
by: Hicke, Rebecca M. M., et al.
Published: (2024)
Identifying and Answering Questions with False Assumptions: An Interpretable Approach
by: Wang, Zijie, et al.
Published: (2025)
by: Wang, Zijie, et al.
Published: (2025)
Show or Tell? Modeling the evolution of request-making in Human-LLM conversations
by: Zhu, Shengqi, et al.
Published: (2025)
by: Zhu, Shengqi, et al.
Published: (2025)
BeHonest: Benchmarking Honesty in Large Language Models
by: Chern, Steffi, et al.
Published: (2024)
by: Chern, Steffi, et al.
Published: (2024)
How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs
by: Wen-Yi, Andrea W, et al.
Published: (2024)
by: Wen-Yi, Andrea W, et al.
Published: (2024)
Are Large Language Models More Honest in Their Probabilistic or Verbalized Confidence?
by: Ni, Shiyu, et al.
Published: (2024)
by: Ni, Shiyu, et al.
Published: (2024)
Looking for the Inner Music: Probing LLMs' Understanding of Literary Style
by: Hicke, Rebecca M. M., et al.
Published: (2025)
by: Hicke, Rebecca M. M., et al.
Published: (2025)
Priming, Path-dependence, and Plasticity: Understanding the molding of user-LLM interaction and its implications from (many) chat logs in the wild
by: Zhu, Shengqi, et al.
Published: (2026)
by: Zhu, Shengqi, et al.
Published: (2026)
Learning from Natural Language Feedback for Personalized Question Answering
by: Salemi, Alireza, et al.
Published: (2025)
by: Salemi, Alireza, et al.
Published: (2025)
A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction
by: Ying, Huaiyuan, et al.
Published: (2024)
by: Ying, Huaiyuan, et al.
Published: (2024)
On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data
by: Ruiz, Alfredo Garrachón, et al.
Published: (2025)
by: Ruiz, Alfredo Garrachón, et al.
Published: (2025)
Towards Reliable and Interpretable Document Question Answering via VLMs
by: Chen, Alessio, et al.
Published: (2025)
by: Chen, Alessio, et al.
Published: (2025)
Predicting the Target Word of Game-playing Conversations using a Low-Rank Dialect Adapter for Decoder Models
by: Srirag, Dipankar, et al.
Published: (2024)
by: Srirag, Dipankar, et al.
Published: (2024)
A Study on Large Language Models' Limitations in Multiple-Choice Question Answering
by: Khatun, Aisha, et al.
Published: (2024)
by: Khatun, Aisha, et al.
Published: (2024)
Calibrated Large Language Models for Binary Question Answering
by: Giovannotti, Patrizio, et al.
Published: (2024)
by: Giovannotti, Patrizio, et al.
Published: (2024)
Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering
by: Tilli, Pascal, et al.
Published: (2024)
by: Tilli, Pascal, et al.
Published: (2024)
Discrete Subgraph Sampling for Interpretable Graph based Visual Question Answering
by: Tilli, Pascal, et al.
Published: (2024)
by: Tilli, Pascal, et al.
Published: (2024)
Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering
by: Yu, Zeping, et al.
Published: (2024)
by: Yu, Zeping, et al.
Published: (2024)
Question: How do Large Language Models perform on the Question Answering tasks? Answer:
by: Fischer, Kevin, et al.
Published: (2024)
by: Fischer, Kevin, et al.
Published: (2024)
Question-Answering (QA) Model for a Personalized Learning Assistant for Arabic Language
by: Sammoudi, Mohammad, et al.
Published: (2024)
by: Sammoudi, Mohammad, et al.
Published: (2024)
Benchmarking Uncertainty Calibration in Large Language Model Long-Form Question Answering
by: Müller, Philip, et al.
Published: (2026)
by: Müller, Philip, et al.
Published: (2026)
Tasks and Roles in Legal AI: Data Curation, Annotation, and Verification
by: Koenecke, Allison, et al.
Published: (2025)
by: Koenecke, Allison, et al.
Published: (2025)
Similar Items
-
Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation
by: Bohnet, Bernd, et al.
Published: (2024) -
Stronger Random Baselines for In-Context Learning
by: Yauney, Gregory, et al.
Published: (2024) -
HonestLLM: Toward an Honest and Helpful Large Language Model
by: Gao, Chujie, et al.
Published: (2024) -
Do Language Models Encode Semantic Relations? Probing and Sparse Feature Analysis
by: Diera, Andor, et al.
Published: (2026) -
Lost in Space: Finding the Right Tokens for Structured Output
by: Hamilton, Sil, et al.
Published: (2025)