Saved in:
| Main Authors: | Kudo, Keito, Aoki, Yoichi, Kuribayashi, Tatsuki, Sone, Shusaku, Taniguchi, Masaya, Brassard, Ana, Sakaguchi, Keisuke, Inui, Kentaro |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.01113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning
by: Aoki, Yoichi, et al.
Published: (2024)
by: Aoki, Yoichi, et al.
Published: (2024)
ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
by: Brassard, Ana, et al.
Published: (2024)
by: Brassard, Ana, et al.
Published: (2024)
Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models
by: Yoshida, Haruto, et al.
Published: (2026)
by: Yoshida, Haruto, et al.
Published: (2026)
J-UniMorph: Japanese Morphological Annotation through the Universal Feature Schema
by: Matsuzaki, Kosuke, et al.
Published: (2024)
by: Matsuzaki, Kosuke, et al.
Published: (2024)
Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference
by: Kamoda, Go, et al.
Published: (2025)
by: Kamoda, Go, et al.
Published: (2025)
FinchGPT: a Transformer based language model for birdsong analysis
by: Kobayashi, Kosei, et al.
Published: (2025)
by: Kobayashi, Kosei, et al.
Published: (2025)
Syntactic Learnability of Echo State Neural Language Models at Scale
by: Ueda, Ryo, et al.
Published: (2025)
by: Ueda, Ryo, et al.
Published: (2025)
Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps
by: Kobayashi, Goro, et al.
Published: (2023)
by: Kobayashi, Goro, et al.
Published: (2023)
To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese
by: Ishizuki, Yukiko, et al.
Published: (2024)
by: Ishizuki, Yukiko, et al.
Published: (2024)
Large Language Models Are Human-Like Internally
by: Kuribayashi, Tatsuki, et al.
Published: (2025)
by: Kuribayashi, Tatsuki, et al.
Published: (2025)
RealTime QA: What's the Answer Right Now?
by: Kasai, Jungo, et al.
Published: (2022)
by: Kasai, Jungo, et al.
Published: (2022)
On Representational Dissociation of Language and Arithmetic in Large Language Models
by: Kisako, Riku, et al.
Published: (2025)
by: Kisako, Riku, et al.
Published: (2025)
The Curse of Popularity: Popular Entities have Catastrophic Side Effects when Deleting Knowledge from Language Models
by: Takahashi, Ryosuke, et al.
Published: (2024)
by: Takahashi, Ryosuke, et al.
Published: (2024)
Does Vision Accelerate Hierarchical Generalization in Neural Language Learners?
by: Kuribayashi, Tatsuki, et al.
Published: (2023)
by: Kuribayashi, Tatsuki, et al.
Published: (2023)
Repetitive Infection Spreading and Directed Evolution in the Susceptible-Infected-Recovered-Susceptible Model
by: Sakaguchi, Hidetsugu, et al.
Published: (2024)
by: Sakaguchi, Hidetsugu, et al.
Published: (2024)
Can Language Models Handle a Non-Gregorian Calendar? The Case of the Japanese wareki
by: Sasaki, Mutsumi, et al.
Published: (2025)
by: Sasaki, Mutsumi, et al.
Published: (2025)
Spelling-out is not Straightforward: LLMs' Capability of Tokenization from Token to Characters
by: Hiraoka, Tatsuya, et al.
Published: (2025)
by: Hiraoka, Tatsuya, et al.
Published: (2025)
A Multi-Agent Probabilistic Inference Framework Inspired by Kairanban-Style CoT System with IdoBata Conversation for Debiasing
by: Ueno, Takato, et al.
Published: (2025)
by: Ueno, Takato, et al.
Published: (2025)
Annotating Errors in English Learners' Written Language Production: Advancing Automated Written Feedback Systems
by: Coyne, Steven, et al.
Published: (2025)
by: Coyne, Steven, et al.
Published: (2025)
Psychometric Predictive Power of Large Language Models
by: Kuribayashi, Tatsuki, et al.
Published: (2023)
by: Kuribayashi, Tatsuki, et al.
Published: (2023)
Reducing the Cost: Cross-Prompt Pre-Finetuning for Short Answer Scoring
by: Funayama, Hiroaki, et al.
Published: (2024)
by: Funayama, Hiroaki, et al.
Published: (2024)
Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
by: Niwa, Ayana, et al.
Published: (2025)
by: Niwa, Ayana, et al.
Published: (2025)
Which Word Orders Facilitate Length Generalization in LMs? An Investigation with GCG-Based Artificial Languages
by: El-Naggar, Nadine, et al.
Published: (2025)
by: El-Naggar, Nadine, et al.
Published: (2025)
What Kind of Language is Easy to Language-Model Under Curriculum Learning?
by: El-Naggar, Nadine, et al.
Published: (2026)
by: El-Naggar, Nadine, et al.
Published: (2026)
From Geometry to Culture: An Iterative VLM Layout Framework for Placing Objects in Complex 3D Scene Contexts
by: Asano, Yuto, et al.
Published: (2025)
by: Asano, Yuto, et al.
Published: (2025)
Automatic Feedback Generation for Short Answer Questions using Answer Diagnostic Graphs
by: Furuhashi, Momoka, et al.
Published: (2025)
by: Furuhashi, Momoka, et al.
Published: (2025)
Repetition Neurons: How Do Language Models Produce Repetitions?
by: Hiraoka, Tatsuya, et al.
Published: (2024)
by: Hiraoka, Tatsuya, et al.
Published: (2024)
Monotonic Representation of Numeric Properties in Language Models
by: Heinzerling, Benjamin, et al.
Published: (2024)
by: Heinzerling, Benjamin, et al.
Published: (2024)
TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification
by: Zheng, Tong, et al.
Published: (2024)
by: Zheng, Tong, et al.
Published: (2024)
CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
by: Zhang, Bohan, et al.
Published: (2025)
by: Zhang, Bohan, et al.
Published: (2025)
Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces
by: He, Chen, et al.
Published: (2026)
by: He, Chen, et al.
Published: (2026)
Transformer Key-Value Memories Are Nearly as Interpretable as Sparse Autoencoders
by: Ye, Mengyu, et al.
Published: (2025)
by: Ye, Mengyu, et al.
Published: (2025)
Can Input Attributions Explain Inductive Reasoning in In-Context Learning?
by: Ye, Mengyu, et al.
Published: (2024)
by: Ye, Mengyu, et al.
Published: (2024)
Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation
by: Lee, Jaehyeok, et al.
Published: (2024)
by: Lee, Jaehyeok, et al.
Published: (2024)
Dual Alignment Between Language Model Layers and Human Sentence Processing
by: Kuribayashi, Tatsuki, et al.
Published: (2026)
by: Kuribayashi, Tatsuki, et al.
Published: (2026)
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
by: Deng, Yuntian, et al.
Published: (2024)
by: Deng, Yuntian, et al.
Published: (2024)
Decentralized Collective World Model for Emergent Communication and Coordination
by: Nomura, Kentaro, et al.
Published: (2025)
by: Nomura, Kentaro, et al.
Published: (2025)
Prior-Free Sample Size Design for Test-and-Roll Experiments
by: Kawato, Kentaro, et al.
Published: (2026)
by: Kawato, Kentaro, et al.
Published: (2026)
LLMs Can Compensate for Deficiencies in Visual Representations
by: Takishita, Sho, et al.
Published: (2025)
by: Takishita, Sho, et al.
Published: (2025)
Reconsidering Positional Supervision in Masked Diffusion Language Model Training
by: Ye, Mengyu, et al.
Published: (2026)
by: Ye, Mengyu, et al.
Published: (2026)
Similar Items
-
First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning
by: Aoki, Yoichi, et al.
Published: (2024) -
ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
by: Brassard, Ana, et al.
Published: (2024) -
Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models
by: Yoshida, Haruto, et al.
Published: (2026) -
J-UniMorph: Japanese Morphological Annotation through the Universal Feature Schema
by: Matsuzaki, Kosuke, et al.
Published: (2024) -
Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference
by: Kamoda, Go, et al.
Published: (2025)