:: Library Catalog

Bibliographic Details
Main Authors:	Nishida, Yuto; Isonuma, Masaru; Oda, Yusuke
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2510.04848

Similar Items

Unlearning Traces the Influential Training Data of Language Models
by: Isonuma, Masaru, et al.
Published: (2024)

What's New in My Data? Novelty Exploration via Contrastive Generation
by: Isonuma, Masaru, et al.
Published: (2024)

Investigating Training and Generalization in Faithful Self-Explanations of Large Language Models
by: Doi, Tomoki, et al.
Published: (2025)

Comprehensive Evaluation of Large Language Models for Topic Modeling
by: Doi, Tomoki, et al.
Published: (2024)

Exclusive Unlearning
by: Sasaki, Mutsumi, et al.
Published: (2026)

Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
by: Ozaki, Shintaro, et al.
Published: (2025)

Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality
by: Harada, Yuto, et al.
Published: (2025)

UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation
by: Lu, Huimin, et al.
Published: (2025)

Towards Transfer Unlearning: Empirical Evidence of Cross-Domain Bias Mitigation
by: Lu, Huimin, et al.
Published: (2024)

How a Bilingual LM Becomes Bilingual: Tracing Internal Representations with Sparse Autoencoders
by: Inaba, Tatsuro, et al.
Published: (2025)

How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments
by: Ide, Yusuke, et al.
Published: (2024)

Scaling Laws for Downstream Task Performance of Large Language Models
by: Isik, Berivan, et al.
Published: (2024)

llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length
by: Sugiura, Issa, et al.
Published: (2025)

Generating Diverse Translation with Perturbed kNN-MT
by: Nishida, Yuto, et al.
Published: (2024)

Exploring the Impact of a Transformer's Latent Space Geometry on Downstream Task Performance
by: Marbut, Anna C., et al.
Published: (2024)

Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text
by: Jarca, Andrei, et al.
Published: (2025)

Contrastive Learning for Task-Independent SpeechLLM-Pretraining
by: Züfle, Maike, et al.
Published: (2024)

Understanding and Mitigating Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
by: Li, Miaomiao, et al.
Published: (2025)

Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?
by: Uchiyama, Fumiya, et al.
Published: (2024)

Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks
by: Shapira, Ori, et al.
Published: (2025)

An Evaluation of Sindhi Word Embedding in Semantic Analogies and Downstream Tasks
by: Ali, Wazir, et al.
Published: (2024)

Edit Distances and Their Applications to Downstream Tasks in Research and Commercial Contexts
by: Carmo, Félix do, et al.
Published: (2024)

Long-Tail Crisis in Nearest Neighbor Language Models
by: Nishida, Yuto, et al.
Published: (2025)

Llama-Mimi: Exploring the Limits of Flattened Speech Language Modeling
by: Sugiura, Issa, et al.
Published: (2025)

Vaporetto: Efficient Japanese Tokenization Based on Improved Pointwise Linear Classification
by: Akabe, Koichi, et al.
Published: (2024)

Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation
by: Iluz, Bar, et al.
Published: (2024)

FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom
by: He, Yuanqin, et al.
Published: (2024)

Forecasting Downstream Performance of LLMs With Proxy Metrics
by: Patel, Arkil, et al.
Published: (2026)

Quantifying the Importance of Data Alignment in Downstream Model Performance
by: Chawla, Krrish, et al.
Published: (2025)

Scaling Laws Are Unreliable for Downstream Tasks: A Reality Check
by: Lourie, Nicholas, et al.
Published: (2025)

Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
by: Riabi, Arij, et al.
Published: (2021)

How Does Code Pretraining Affect Language Model Task Performance?
by: Petty, Jackson, et al.
Published: (2024)

Adapting Decoder-Based Language Models for Diverse Encoder Downstream Tasks
by: Suganthan, Paul, et al.
Published: (2025)

Optimising Language Models for Downstream Tasks: A Post-Training Perspective
by: Shi, Zhengyan
Published: (2025)

Scaling Laws for Predicting Downstream Performance in LLMs
by: Chen, Yangyi, et al.
Published: (2024)

CoLA: Cross-Modal Low-rank Adaptation for Multimodal Downstream Tasks
by: Suharitdamrong, Wish, et al.
Published: (2026)

Personality as a Probe for LLM Evaluation: Method Trade-offs and Downstream Effects
by: Handa, Gunmay, et al.
Published: (2025)

Refactoring Programs Using Large Language Models with Few-Shot Examples
by: Shirafuji, Atsushi, et al.
Published: (2023)

Protoknowledge Shapes Behaviour of LLMs in Downstream Tasks: Memorization and Generalization with Knowledge Graphs
by: Ranaldi, Federico, et al.
Published: (2025)

Routing-Aligned Fine-Tuning for Multilingual Downstream Tasks in Mixture-of-Experts Models
by: Deng, Guanzhi, et al.
Published: (2026)