:: Library Catalog

Bibliographic Details
Main Authors:	Voronov, Anton; Wolf, Lena; Ryabinin, Max
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2401.06766

Similar Items

SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
by: Svirschevski, Ruslan, et al.
Published: (2024)

AutoJudge: Judge Decoding Without Manual Annotation
by: Garipov, Roman, et al.
Published: (2025)

Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
by: Wang, Jiayi, et al.
Published: (2024)

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
by: Chen, Zhuoming, et al.
Published: (2024)

DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
by: Cui, Wendi, et al.
Published: (2024)

Can (A)I Change Your Mind?
by: Havin, Miriam, et al.
Published: (2025)

Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning
by: Wagner, Eitan, et al.
Published: (2024)

Multilingual Language Model Pretraining using Machine-translated Data
by: Wang, Jiayi, et al.
Published: (2025)

Assessing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
by: Arita, Takaya, et al.
Published: (2025)

Asynchronous Reasoning: Training-Free Interactive Thinking LLMs
by: Yakushev, George, et al.
Published: (2025)

Towards Emotion Consistency Analysis of Large Language Models in Emotional Conversational Contexts
by: Oram, Sneha, et al.
Published: (2026)

Towards Safety Evaluations of Theory of Mind in Large Language Models
by: Aoshima, Tatsuhiro, et al.
Published: (2025)

MIST: Towards Multi-dimensional Implicit BiaS Evaluation of LLMs for Theory of Mind
by: Li, Yanlin, et al.
Published: (2025)

Towards a Principled Evaluation of Knowledge Editors
by: Pohl, Sebastian, et al.
Published: (2025)

MindShift: Analyzing Language Models' Reactions to Psychological Prompts
by: Vasiliuk, Anton, et al.
Published: (2025)

How Persuasive is Your Context?
by: Nguyen, Tu, et al.
Published: (2025)

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking
by: Ghadia, Ravi, et al.
Published: (2026)

Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models
by: Song, Yuda, et al.
Published: (2024)

YourBench: Easy Custom Evaluation Sets for Everyone
by: Shashidhar, Sumuk, et al.
Published: (2025)

Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
by: Xiao, Yang, et al.
Published: (2025)

Mitigating Copy Bias in In-Context Learning through Neuron Pruning
by: Ali, Ameen, et al.
Published: (2024)

Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground
by: Soubki, Adil, et al.
Published: (2024)

Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents
by: Santosh, T. Y. S. S, et al.
Published: (2024)

Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs
by: Podolak, Jakub, et al.
Published: (2025)

Mind Your Moras: Orthography-Aware Error Analysis of Neural Japanese Morphological Generation
by: Zhang, Wen
Published: (2026)

Leveraging In-Context Learning for Political Bias Testing of LLMs
by: Haller, Patrick, et al.
Published: (2025)

Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images
by: Rykov, Elisei, et al.
Published: (2025)

LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
by: Ying, Jiahao, et al.
Published: (2024)

RULER: What's the Real Context Size of Your Long-Context Language Models?
by: Hsieh, Cheng-Ping, et al.
Published: (2024)

Minding the Politeness Gap in Cross-cultural Communication
by: Machino, Yuka, et al.
Published: (2025)

Bootstrap Your Own Context Length
by: Wang, Liang, et al.
Published: (2024)

LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
by: Long, Do Xuan, et al.
Published: (2024)

On Many-Shot In-Context Learning for Long-Context Evaluation
by: Zou, Kaijian, et al.
Published: (2024)

Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
by: Hu, Xinshuo, et al.
Published: (2024)

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs
by: Wolf, Lorenz, et al.
Published: (2025)

Evaluating the Consistency of LLM Evaluators
by: Lee, Noah, et al.
Published: (2024)

Exploring Format Consistency for Instruction Tuning
by: Liang, Shihao, et al.
Published: (2023)

Learning-to-Context Slope: Evaluating In-Context Learning Effectiveness Beyond Performance Illusions
by: Wang, Dingzirui, et al.
Published: (2025)

ParaICL: Towards Parallel In-Context Learning
by: Li, Xingxuan, et al.
Published: (2024)

Where to show Demos in Your Prompt: A Positional Bias of In-Context Learning
by: Cobbina, Kwesi, et al.
Published: (2025)