Saved in:
| Main Authors: | Akata, Elif, Schulz, Lion, Coda-Forno, Julian, Oh, Seong Joon, Bethge, Matthias, Schulz, Eric |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2305.16867 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Inducing anxiety in large language models can induce bias
by: Coda-Forno, Julian, et al.
Published: (2023)
by: Coda-Forno, Julian, et al.
Published: (2023)
CogBench: a large language model walks into a psychology lab
by: Coda-Forno, Julian, et al.
Published: (2024)
by: Coda-Forno, Julian, et al.
Published: (2024)
The Illusion of Latent Generalization: Bi-directionality and the Reversal Curse
by: Coda-Forno, Julian, et al.
Published: (2026)
by: Coda-Forno, Julian, et al.
Published: (2026)
Visual cognition in multimodal large language models
by: Buschoff, Luca M. Schulze, et al.
Published: (2023)
by: Buschoff, Luca M. Schulze, et al.
Published: (2023)
LLM generation novelty through the lens of semantic similarity
by: Davydov, Philipp, et al.
Published: (2025)
by: Davydov, Philipp, et al.
Published: (2025)
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
by: Jagadish, Akshay K., et al.
Published: (2024)
by: Jagadish, Akshay K., et al.
Published: (2024)
In-Context Function Learning in Large Language Models
by: Akata, Elif, et al.
Published: (2026)
by: Akata, Elif, et al.
Published: (2026)
WikiBigEdit: Understanding the Limits of Lifelong Knowledge Editing in LLMs
by: Thede, Lukas, et al.
Published: (2025)
by: Thede, Lukas, et al.
Published: (2025)
Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models
by: Buschoff, Luca M. Schulze, et al.
Published: (2025)
by: Buschoff, Luca M. Schulze, et al.
Published: (2025)
How to Merge Your Multimodal Models Over Time?
by: Dziadzio, Sebastian, et al.
Published: (2024)
by: Dziadzio, Sebastian, et al.
Published: (2024)
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
by: Puerto, Haritz, et al.
Published: (2024)
by: Puerto, Haritz, et al.
Published: (2024)
Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences
by: Wu, Shuchen, et al.
Published: (2024)
by: Wu, Shuchen, et al.
Published: (2024)
Calibrating Large Language Models Using Their Generations Only
by: Ulmer, Dennis, et al.
Published: (2024)
by: Ulmer, Dennis, et al.
Published: (2024)
Investigating Continual Pretraining in Large Language Models: Insights and Implications
by: Yıldız, Çağatay, et al.
Published: (2024)
by: Yıldız, Çağatay, et al.
Published: (2024)
Are We Done with Object-Centric Learning?
by: Rubinstein, Alexander, et al.
Published: (2025)
by: Rubinstein, Alexander, et al.
Published: (2025)
Meta-learning ecological priors from large language models explains human learning and decision making
by: Jagadish, Akshay K., et al.
Published: (2025)
by: Jagadish, Akshay K., et al.
Published: (2025)
Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models
by: Goel, Anmol, et al.
Published: (2026)
by: Goel, Anmol, et al.
Published: (2026)
A Systematic Study of In-the-Wild Model Merging for Large Language Models
by: Hitit, Oğuz Kağan, et al.
Published: (2025)
by: Hitit, Oğuz Kağan, et al.
Published: (2025)
Reference-Free Rating of LLM Responses via Latent Information
by: Girrbach, Leander, et al.
Published: (2025)
by: Girrbach, Leander, et al.
Published: (2025)
Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers
by: Green, Tommaso, et al.
Published: (2025)
by: Green, Tommaso, et al.
Published: (2025)
Mapping Post-Training Forgetting in Language Models at Scale
by: Harmon, Jackson, et al.
Published: (2025)
by: Harmon, Jackson, et al.
Published: (2025)
metabench -- A Sparse Benchmark of Reasoning and Knowledge in Large Language Models
by: Kipnis, Alex, et al.
Published: (2024)
by: Kipnis, Alex, et al.
Published: (2024)
Role-Playing Evaluation for Large Language Models
by: Boudouri, Yassine El, et al.
Published: (2025)
by: Boudouri, Yassine El, et al.
Published: (2025)
Ethical Considerations of Large Language Models in Game Playing
by: Zhang, Qingquan, et al.
Published: (2025)
by: Zhang, Qingquan, et al.
Published: (2025)
Structural Pruning of Large Vision Language Models: A Comprehensive Study on Pruning Dynamics, Recovery, and Data Efficiency
by: Huang, Yiran, et al.
Published: (2026)
by: Huang, Yiran, et al.
Published: (2026)
Round-Trip Translation Reveals What Frontier Multilingual Benchmarks Miss
by: Skorobogat, Ronald, et al.
Published: (2026)
by: Skorobogat, Ronald, et al.
Published: (2026)
Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
by: Huang, Yiran, et al.
Published: (2025)
by: Huang, Yiran, et al.
Published: (2025)
CiteME: Can Language Models Accurately Cite Scientific Claims?
by: Press, Ori, et al.
Published: (2024)
by: Press, Ori, et al.
Published: (2024)
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
by: Hochlehnert, Andreas, et al.
Published: (2025)
by: Hochlehnert, Andreas, et al.
Published: (2025)
A Practitioner's Guide to Continual Multimodal Pretraining
by: Roth, Karsten, et al.
Published: (2024)
by: Roth, Karsten, et al.
Published: (2024)
CPR: Mitigating Large Language Model Hallucinations with Curative Prompt Refinement
by: Shim, Jung-Woo, et al.
Published: (2025)
by: Shim, Jung-Woo, et al.
Published: (2025)
Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models
by: Shim, Jung-Woo, et al.
Published: (2025)
by: Shim, Jung-Woo, et al.
Published: (2025)
Machine Psychology
by: Hagendorff, Thilo, et al.
Published: (2023)
by: Hagendorff, Thilo, et al.
Published: (2023)
SemioLLM: Evaluating Large Language Models for Diagnostic Reasoning from Unstructured Clinical Narratives in Epilepsy
by: Dani, Meghal, et al.
Published: (2024)
by: Dani, Meghal, et al.
Published: (2024)
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
by: Bini, Massimo, et al.
Published: (2024)
by: Bini, Massimo, et al.
Published: (2024)
What Defines Good Reasoning in LLMs? Dissecting Reasoning Steps with Multi-Aspect Evaluation
by: Do, Heejin, et al.
Published: (2025)
by: Do, Heejin, et al.
Published: (2025)
MEME: Multi-entity & Evolving Memory Evaluation
by: Jung, Seokwon, et al.
Published: (2026)
by: Jung, Seokwon, et al.
Published: (2026)
Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models
by: Thede, Lukas, et al.
Published: (2024)
by: Thede, Lukas, et al.
Published: (2024)
Unraveling Syntax: How Language Models Learn Context-Free Grammars
by: Schulz, Laura Ying, et al.
Published: (2025)
by: Schulz, Laura Ying, et al.
Published: (2025)
C-SEO Bench: Does Conversational SEO Work?
by: Puerto, Haritz, et al.
Published: (2025)
by: Puerto, Haritz, et al.
Published: (2025)
Similar Items
-
Inducing anxiety in large language models can induce bias
by: Coda-Forno, Julian, et al.
Published: (2023) -
CogBench: a large language model walks into a psychology lab
by: Coda-Forno, Julian, et al.
Published: (2024) -
The Illusion of Latent Generalization: Bi-directionality and the Reversal Curse
by: Coda-Forno, Julian, et al.
Published: (2026) -
Visual cognition in multimodal large language models
by: Buschoff, Luca M. Schulze, et al.
Published: (2023) -
LLM generation novelty through the lens of semantic similarity
by: Davydov, Philipp, et al.
Published: (2025)