:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kudo, Keito, Aoki, Yoichi, Kuribayashi, Tatsuki, Sone, Shusaku, Taniguchi, Masaya, Brassard, Ana, Sakaguchi, Keisuke, Inui, Kentaro
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2412.01113
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning
by: Aoki, Yoichi, et al.
Published: (2024)

ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
by: Brassard, Ana, et al.
Published: (2024)

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models
by: Yoshida, Haruto, et al.
Published: (2026)

J-UniMorph: Japanese Morphological Annotation through the Universal Feature Schema
by: Matsuzaki, Kosuke, et al.
Published: (2024)

Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference
by: Kamoda, Go, et al.
Published: (2025)

FinchGPT: a Transformer based language model for birdsong analysis
by: Kobayashi, Kosei, et al.
Published: (2025)

Syntactic Learnability of Echo State Neural Language Models at Scale
by: Ueda, Ryo, et al.
Published: (2025)

Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps
by: Kobayashi, Goro, et al.
Published: (2023)

To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese
by: Ishizuki, Yukiko, et al.
Published: (2024)

Large Language Models Are Human-Like Internally
by: Kuribayashi, Tatsuki, et al.
Published: (2025)

RealTime QA: What's the Answer Right Now?
by: Kasai, Jungo, et al.
Published: (2022)

On Representational Dissociation of Language and Arithmetic in Large Language Models
by: Kisako, Riku, et al.
Published: (2025)

The Curse of Popularity: Popular Entities have Catastrophic Side Effects when Deleting Knowledge from Language Models
by: Takahashi, Ryosuke, et al.
Published: (2024)

Does Vision Accelerate Hierarchical Generalization in Neural Language Learners?
by: Kuribayashi, Tatsuki, et al.
Published: (2023)

Repetitive Infection Spreading and Directed Evolution in the Susceptible-Infected-Recovered-Susceptible Model
by: Sakaguchi, Hidetsugu, et al.
Published: (2024)

Can Language Models Handle a Non-Gregorian Calendar? The Case of the Japanese wareki
by: Sasaki, Mutsumi, et al.
Published: (2025)

Spelling-out is not Straightforward: LLMs' Capability of Tokenization from Token to Characters
by: Hiraoka, Tatsuya, et al.
Published: (2025)

A Multi-Agent Probabilistic Inference Framework Inspired by Kairanban-Style CoT System with IdoBata Conversation for Debiasing
by: Ueno, Takato, et al.
Published: (2025)

Annotating Errors in English Learners' Written Language Production: Advancing Automated Written Feedback Systems
by: Coyne, Steven, et al.
Published: (2025)

Psychometric Predictive Power of Large Language Models
by: Kuribayashi, Tatsuki, et al.
Published: (2023)

Reducing the Cost: Cross-Prompt Pre-Finetuning for Short Answer Scoring
by: Funayama, Hiroaki, et al.
Published: (2024)

Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
by: Niwa, Ayana, et al.
Published: (2025)

Which Word Orders Facilitate Length Generalization in LMs? An Investigation with GCG-Based Artificial Languages
by: El-Naggar, Nadine, et al.
Published: (2025)

What Kind of Language is Easy to Language-Model Under Curriculum Learning?
by: El-Naggar, Nadine, et al.
Published: (2026)

From Geometry to Culture: An Iterative VLM Layout Framework for Placing Objects in Complex 3D Scene Contexts
by: Asano, Yuto, et al.
Published: (2025)

Automatic Feedback Generation for Short Answer Questions using Answer Diagnostic Graphs
by: Furuhashi, Momoka, et al.
Published: (2025)

Repetition Neurons: How Do Language Models Produce Repetitions?
by: Hiraoka, Tatsuya, et al.
Published: (2024)

Monotonic Representation of Numeric Properties in Language Models
by: Heinzerling, Benjamin, et al.
Published: (2024)

TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification
by: Zheng, Tong, et al.
Published: (2024)

CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
by: Zhang, Bohan, et al.
Published: (2025)

Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces
by: He, Chen, et al.
Published: (2026)

Transformer Key-Value Memories Are Nearly as Interpretable as Sparse Autoencoders
by: Ye, Mengyu, et al.
Published: (2025)

Can Input Attributions Explain Inductive Reasoning in In-Context Learning?
by: Ye, Mengyu, et al.
Published: (2024)

Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation
by: Lee, Jaehyeok, et al.
Published: (2024)

Dual Alignment Between Language Model Layers and Human Sentence Processing
by: Kuribayashi, Tatsuki, et al.
Published: (2026)

From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
by: Deng, Yuntian, et al.
Published: (2024)

Decentralized Collective World Model for Emergent Communication and Coordination
by: Nomura, Kentaro, et al.
Published: (2025)

Prior-Free Sample Size Design for Test-and-Roll Experiments
by: Kawato, Kentaro, et al.
Published: (2026)

LLMs Can Compensate for Deficiencies in Visual Representations
by: Takishita, Sho, et al.
Published: (2025)

Reconsidering Positional Supervision in Masked Diffusion Language Model Training
by: Ye, Mengyu, et al.
Published: (2026)