Saved in:
| Main Authors: | Voronov, Anton, Wolf, Lena, Ryabinin, Max |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.06766 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
by: Svirschevski, Ruslan, et al.
Published: (2024)
by: Svirschevski, Ruslan, et al.
Published: (2024)
AutoJudge: Judge Decoding Without Manual Annotation
by: Garipov, Roman, et al.
Published: (2025)
by: Garipov, Roman, et al.
Published: (2025)
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
by: Wang, Jiayi, et al.
Published: (2024)
by: Wang, Jiayi, et al.
Published: (2024)
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
by: Chen, Zhuoming, et al.
Published: (2024)
by: Chen, Zhuoming, et al.
Published: (2024)
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
by: Cui, Wendi, et al.
Published: (2024)
by: Cui, Wendi, et al.
Published: (2024)
Can (A)I Change Your Mind?
by: Havin, Miriam, et al.
Published: (2025)
by: Havin, Miriam, et al.
Published: (2025)
Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning
by: Wagner, Eitan, et al.
Published: (2024)
by: Wagner, Eitan, et al.
Published: (2024)
Multilingual Language Model Pretraining using Machine-translated Data
by: Wang, Jiayi, et al.
Published: (2025)
by: Wang, Jiayi, et al.
Published: (2025)
Assessing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
by: Arita, Takaya, et al.
Published: (2025)
by: Arita, Takaya, et al.
Published: (2025)
Asynchronous Reasoning: Training-Free Interactive Thinking LLMs
by: Yakushev, George, et al.
Published: (2025)
by: Yakushev, George, et al.
Published: (2025)
Towards Emotion Consistency Analysis of Large Language Models in Emotional Conversational Contexts
by: Oram, Sneha, et al.
Published: (2026)
by: Oram, Sneha, et al.
Published: (2026)
Towards Safety Evaluations of Theory of Mind in Large Language Models
by: Aoshima, Tatsuhiro, et al.
Published: (2025)
by: Aoshima, Tatsuhiro, et al.
Published: (2025)
MIST: Towards Multi-dimensional Implicit BiaS Evaluation of LLMs for Theory of Mind
by: Li, Yanlin, et al.
Published: (2025)
by: Li, Yanlin, et al.
Published: (2025)
Towards a Principled Evaluation of Knowledge Editors
by: Pohl, Sebastian, et al.
Published: (2025)
by: Pohl, Sebastian, et al.
Published: (2025)
MindShift: Analyzing Language Models' Reactions to Psychological Prompts
by: Vasiliuk, Anton, et al.
Published: (2025)
by: Vasiliuk, Anton, et al.
Published: (2025)
How Persuasive is Your Context?
by: Nguyen, Tu, et al.
Published: (2025)
by: Nguyen, Tu, et al.
Published: (2025)
Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking
by: Ghadia, Ravi, et al.
Published: (2026)
by: Ghadia, Ravi, et al.
Published: (2026)
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models
by: Song, Yuda, et al.
Published: (2024)
by: Song, Yuda, et al.
Published: (2024)
YourBench: Easy Custom Evaluation Sets for Everyone
by: Shashidhar, Sumuk, et al.
Published: (2025)
by: Shashidhar, Sumuk, et al.
Published: (2025)
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
by: Xiao, Yang, et al.
Published: (2025)
by: Xiao, Yang, et al.
Published: (2025)
Mitigating Copy Bias in In-Context Learning through Neuron Pruning
by: Ali, Ameen, et al.
Published: (2024)
by: Ali, Ameen, et al.
Published: (2024)
Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground
by: Soubki, Adil, et al.
Published: (2024)
by: Soubki, Adil, et al.
Published: (2024)
Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents
by: Santosh, T. Y. S. S, et al.
Published: (2024)
by: Santosh, T. Y. S. S, et al.
Published: (2024)
Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs
by: Podolak, Jakub, et al.
Published: (2025)
by: Podolak, Jakub, et al.
Published: (2025)
Mind Your Moras: Orthography-Aware Error Analysis of Neural Japanese Morphological Generation
by: Zhang, Wen
Published: (2026)
by: Zhang, Wen
Published: (2026)
Leveraging In-Context Learning for Political Bias Testing of LLMs
by: Haller, Patrick, et al.
Published: (2025)
by: Haller, Patrick, et al.
Published: (2025)
Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images
by: Rykov, Elisei, et al.
Published: (2025)
by: Rykov, Elisei, et al.
Published: (2025)
LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
by: Ying, Jiahao, et al.
Published: (2024)
by: Ying, Jiahao, et al.
Published: (2024)
RULER: What's the Real Context Size of Your Long-Context Language Models?
by: Hsieh, Cheng-Ping, et al.
Published: (2024)
by: Hsieh, Cheng-Ping, et al.
Published: (2024)
Minding the Politeness Gap in Cross-cultural Communication
by: Machino, Yuka, et al.
Published: (2025)
by: Machino, Yuka, et al.
Published: (2025)
Bootstrap Your Own Context Length
by: Wang, Liang, et al.
Published: (2024)
by: Wang, Liang, et al.
Published: (2024)
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
by: Long, Do Xuan, et al.
Published: (2024)
by: Long, Do Xuan, et al.
Published: (2024)
On Many-Shot In-Context Learning for Long-Context Evaluation
by: Zou, Kaijian, et al.
Published: (2024)
by: Zou, Kaijian, et al.
Published: (2024)
Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
by: Hu, Xinshuo, et al.
Published: (2024)
by: Hu, Xinshuo, et al.
Published: (2024)
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs
by: Wolf, Lorenz, et al.
Published: (2025)
by: Wolf, Lorenz, et al.
Published: (2025)
Evaluating the Consistency of LLM Evaluators
by: Lee, Noah, et al.
Published: (2024)
by: Lee, Noah, et al.
Published: (2024)
Exploring Format Consistency for Instruction Tuning
by: Liang, Shihao, et al.
Published: (2023)
by: Liang, Shihao, et al.
Published: (2023)
Learning-to-Context Slope: Evaluating In-Context Learning Effectiveness Beyond Performance Illusions
by: Wang, Dingzriui, et al.
Published: (2025)
by: Wang, Dingzriui, et al.
Published: (2025)
ParaICL: Towards Parallel In-Context Learning
by: Li, Xingxuan, et al.
Published: (2024)
by: Li, Xingxuan, et al.
Published: (2024)
Where to show Demos in Your Prompt: A Positional Bias of In-Context Learning
by: Cobbina, Kwesi, et al.
Published: (2025)
by: Cobbina, Kwesi, et al.
Published: (2025)
Similar Items
-
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
by: Svirschevski, Ruslan, et al.
Published: (2024) -
AutoJudge: Judge Decoding Without Manual Annotation
by: Garipov, Roman, et al.
Published: (2025) -
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
by: Wang, Jiayi, et al.
Published: (2024) -
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
by: Chen, Zhuoming, et al.
Published: (2024) -
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
by: Cui, Wendi, et al.
Published: (2024)