Saved in:
| Main Authors: | Gu, Jiayao, Chen, Liting, Li, Yihong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.03826 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LaMsS: When Large Language Models Meet Self-Skepticism
by: Wu, Yetao, et al.
Published: (2024)
by: Wu, Yetao, et al.
Published: (2024)
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation
by: Ranaldi, Federico, et al.
Published: (2024)
by: Ranaldi, Federico, et al.
Published: (2024)
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models
by: Dong, Yihong, et al.
Published: (2024)
by: Dong, Yihong, et al.
Published: (2024)
A Survey on Data Selection for Language Models
by: Albalak, Alon, et al.
Published: (2024)
by: Albalak, Alon, et al.
Published: (2024)
QuRating: Selecting High-Quality Data for Training Language Models
by: Wettig, Alexander, et al.
Published: (2024)
by: Wettig, Alexander, et al.
Published: (2024)
Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models
by: Tao, Yongding, et al.
Published: (2025)
by: Tao, Yongding, et al.
Published: (2025)
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
by: Li, Ming, et al.
Published: (2024)
by: Li, Ming, et al.
Published: (2024)
Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?
by: Sajith, Aryan, et al.
Published: (2024)
by: Sajith, Aryan, et al.
Published: (2024)
Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models
by: Lee, Joseph, et al.
Published: (2024)
by: Lee, Joseph, et al.
Published: (2024)
Decoding Uncertainty: The Impact of Decoding Strategies for Uncertainty Estimation in Large Language Models
by: Hashimoto, Wataru, et al.
Published: (2025)
by: Hashimoto, Wataru, et al.
Published: (2025)
Analyzing Similarity Metrics for Data Selection for Language Model Pretraining
by: Sam, Dylan, et al.
Published: (2025)
by: Sam, Dylan, et al.
Published: (2025)
Investigating the Impact of Model Instability on Explanations and Uncertainty
by: Marjanović, Sara Vera, et al.
Published: (2024)
by: Marjanović, Sara Vera, et al.
Published: (2024)
Reasoning-preserved Efficient Distillation of Large Language Models via Activation-aware Initialization
by: He, Junlin, et al.
Published: (2026)
by: He, Junlin, et al.
Published: (2026)
Investigating Data Contamination for Pre-training Language Models
by: Jiang, Minhao, et al.
Published: (2024)
by: Jiang, Minhao, et al.
Published: (2024)
Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
by: Ji, Luo, et al.
Published: (2024)
by: Ji, Luo, et al.
Published: (2024)
On the Limitations of Language Targeted Pruning: Investigating the Calibration Language Impact in Multilingual LLM Pruning
by: Kurz, Simon, et al.
Published: (2024)
by: Kurz, Simon, et al.
Published: (2024)
Can Perplexity Predict Fine-tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali
by: Luitel, Nishant, et al.
Published: (2024)
by: Luitel, Nishant, et al.
Published: (2024)
PoliTune: Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in Large Language Models
by: Agiza, Ahmed, et al.
Published: (2024)
by: Agiza, Ahmed, et al.
Published: (2024)
Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training
by: Li, Qingyang, et al.
Published: (2024)
by: Li, Qingyang, et al.
Published: (2024)
Investigating the Multilingual Calibration Effects of Language Model Instruction-Tuning
by: Huang, Jerry, et al.
Published: (2026)
by: Huang, Jerry, et al.
Published: (2026)
Investigating Symbolic Capabilities of Large Language Models
by: Dave, Neisarg, et al.
Published: (2024)
by: Dave, Neisarg, et al.
Published: (2024)
Investigating Layer Importance in Large Language Models
by: Zhang, Yang, et al.
Published: (2024)
by: Zhang, Yang, et al.
Published: (2024)
Active Model Selection for Large Language Models
by: Durmazkeser, Yavuz, et al.
Published: (2025)
by: Durmazkeser, Yavuz, et al.
Published: (2025)
Selective Generation for Controllable Language Models
by: Lee, Minjae, et al.
Published: (2023)
by: Lee, Minjae, et al.
Published: (2023)
Influential Language Data Selection via Gradient Trajectory Pursuit
by: Deng, Zhiwei, et al.
Published: (2024)
by: Deng, Zhiwei, et al.
Published: (2024)
Temperature-Dependent Performance of Prompting Strategies in Extended Reasoning Large Language Models
by: Salah, Mousa, et al.
Published: (2026)
by: Salah, Mousa, et al.
Published: (2026)
SeMe: Training-Free Language Model Merging via Semantic Alignment
by: Gu, Jian, et al.
Published: (2025)
by: Gu, Jian, et al.
Published: (2025)
MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models
by: Yu, Zichun, et al.
Published: (2024)
by: Yu, Zichun, et al.
Published: (2024)
Enhancing Inference Efficiency of Large Language Models: Investigating Optimization Strategies and Architectural Innovations
by: Tyukin, Georgy
Published: (2024)
by: Tyukin, Georgy
Published: (2024)
Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO
by: Zheng, Jinquan, et al.
Published: (2026)
by: Zheng, Jinquan, et al.
Published: (2026)
Can Large Language Models Still Explain Themselves? Investigating the Impact of Quantization on Self-Explanations
by: Wang, Qianli, et al.
Published: (2026)
by: Wang, Qianli, et al.
Published: (2026)
DIDS: Domain Impact-aware Data Sampling for Large Language Model Training
by: Shi, Weijie, et al.
Published: (2025)
by: Shi, Weijie, et al.
Published: (2025)
Query Performance Explanation through Large Language Model for HTAP Systems
by: Xiu, Haibo, et al.
Published: (2024)
by: Xiu, Haibo, et al.
Published: (2024)
Large Language Model Selection with Limited Annotations
by: Durmazkeser, Yavuz, et al.
Published: (2026)
by: Durmazkeser, Yavuz, et al.
Published: (2026)
Selective Neuron Amplification in Transformer Language Models
by: Akhtar, Ryyan, et al.
Published: (2026)
by: Akhtar, Ryyan, et al.
Published: (2026)
Automatic Prompt Selection for Large Language Models
by: Do, Viet-Tung, et al.
Published: (2024)
by: Do, Viet-Tung, et al.
Published: (2024)
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
by: Özeren, Enes, et al.
Published: (2025)
by: Özeren, Enes, et al.
Published: (2025)
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
by: Ye, Jiasheng, et al.
Published: (2023)
by: Ye, Jiasheng, et al.
Published: (2023)
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
by: Ye, Jiasheng, et al.
Published: (2024)
by: Ye, Jiasheng, et al.
Published: (2024)
A Semantic-Aware Layer-Freezing Approach to Computation-Efficient Fine-Tuning of Language Models
by: Gu, Jian, et al.
Published: (2024)
by: Gu, Jian, et al.
Published: (2024)
Similar Items
-
LaMsS: When Large Language Models Meet Self-Skepticism
by: Wu, Yetao, et al.
Published: (2024) -
Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation
by: Ranaldi, Federico, et al.
Published: (2024) -
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models
by: Dong, Yihong, et al.
Published: (2024) -
A Survey on Data Selection for Language Models
by: Albalak, Alon, et al.
Published: (2024) -
QuRating: Selecting High-Quality Data for Training Language Models
by: Wettig, Alexander, et al.
Published: (2024)