Saved in:
| Main Authors: | Williams, Miles, Aletras, Nikolaos |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.08708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Self-calibration for Language Model Quantization and Pruning
by: Williams, Miles, et al.
Published: (2024)
by: Williams, Miles, et al.
Published: (2024)
On the Impact of Calibration Data in Post-training Quantization and Pruning
by: Williams, Miles, et al.
Published: (2023)
by: Williams, Miles, et al.
Published: (2023)
Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models
by: Zhao, Zhixue, et al.
Published: (2024)
by: Zhao, Zhixue, et al.
Published: (2024)
Compressing Language Models for Specialized Domains
by: Williams, Miles, et al.
Published: (2025)
by: Williams, Miles, et al.
Published: (2025)
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization
by: Chrysostomou, George, et al.
Published: (2023)
by: Chrysostomou, George, et al.
Published: (2023)
An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Language Model Inference
by: Yamaguchi, Atsuki, et al.
Published: (2024)
by: Yamaguchi, Atsuki, et al.
Published: (2024)
How Can We Effectively Expand the Vocabulary of LLMs with 0.01GB of Target Language Text?
by: Yamaguchi, Atsuki, et al.
Published: (2024)
by: Yamaguchi, Atsuki, et al.
Published: (2024)
Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance
by: Alajrami, Ahmed, et al.
Published: (2025)
by: Alajrami, Ahmed, et al.
Published: (2025)
How Private are Language Models in Abstractive Summarization?
by: Hughes, Anthony, et al.
Published: (2024)
by: Hughes, Anthony, et al.
Published: (2024)
Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks
by: Yamaguchi, Atsuki, et al.
Published: (2026)
by: Yamaguchi, Atsuki, et al.
Published: (2026)
Deconstructing Attention: Investigating Design Principles for Effective Language Modeling
by: Xue, Huiyin, et al.
Published: (2025)
by: Xue, Huiyin, et al.
Published: (2025)
Adapting Chat Language Models Using Only Target Unlabeled Language Data
by: Yamaguchi, Atsuki, et al.
Published: (2024)
by: Yamaguchi, Atsuki, et al.
Published: (2024)
Incorporating Attribution Importance for Improving Faithfulness Metrics
by: Zhao, Zhixue, et al.
Published: (2023)
by: Zhao, Zhixue, et al.
Published: (2023)
Progressive Depth Up-scaling via Optimal Transport
by: Cao, Mingzi, et al.
Published: (2025)
by: Cao, Mingzi, et al.
Published: (2025)
Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision
by: Tan, Xingwei, et al.
Published: (2025)
by: Tan, Xingwei, et al.
Published: (2025)
Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets
by: Mu, Yida, et al.
Published: (2023)
by: Mu, Yida, et al.
Published: (2023)
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
by: Yamaguchi, Atsuki, et al.
Published: (2025)
by: Yamaguchi, Atsuki, et al.
Published: (2025)
PATCH: Mitigating PII Leakage in Language Models with Privacy-Aware Targeted Circuit PatcHing
by: Hughes, Anthony, et al.
Published: (2025)
by: Hughes, Anthony, et al.
Published: (2025)
MEDVOC: Vocabulary Adaptation for Fine-tuning Pre-trained Language Models on Medical Text Summarization
by: Balde, Gunjan, et al.
Published: (2024)
by: Balde, Gunjan, et al.
Published: (2024)
Where does output diversity collapse in post-training?
by: Karouzos, Constantinos, et al.
Published: (2026)
by: Karouzos, Constantinos, et al.
Published: (2026)
An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift
by: Karouzos, Constantinos, et al.
Published: (2026)
by: Karouzos, Constantinos, et al.
Published: (2026)
Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models
by: Villegas, Danae Sánchez, et al.
Published: (2026)
by: Villegas, Danae Sánchez, et al.
Published: (2026)
GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations
by: Chlapanis, Odysseas S., et al.
Published: (2025)
by: Chlapanis, Odysseas S., et al.
Published: (2025)
Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research
by: Mu, Yida, et al.
Published: (2024)
by: Mu, Yida, et al.
Published: (2024)
Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
by: Tan, Xingwei, et al.
Published: (2026)
by: Tan, Xingwei, et al.
Published: (2026)
We Need to Talk About Classification Evaluation Metrics in NLP
by: Vickers, Peter, et al.
Published: (2024)
by: Vickers, Peter, et al.
Published: (2024)
Can Confidence Estimates Decide When Chain-of-Thought Is Necessary for LLMs?
by: Lewis-Lim, Samuel, et al.
Published: (2025)
by: Lewis-Lim, Samuel, et al.
Published: (2025)
Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models
by: Cao, Mingzi, et al.
Published: (2026)
by: Cao, Mingzi, et al.
Published: (2026)
Speculative Decoding with a Speculative Vocabulary
by: Williams, Miles, et al.
Published: (2026)
by: Williams, Miles, et al.
Published: (2026)
Preference-grounded Token-level Guidance for Language Model Fine-tuning
by: Yang, Shentao, et al.
Published: (2023)
by: Yang, Shentao, et al.
Published: (2023)
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?
by: Lewis-Lim, Samuel, et al.
Published: (2025)
by: Lewis-Lim, Samuel, et al.
Published: (2025)
Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary Tasks
by: Villegas, Danae Sánchez, et al.
Published: (2023)
by: Villegas, Danae Sánchez, et al.
Published: (2023)
Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science
by: Mu, Yida, et al.
Published: (2023)
by: Mu, Yida, et al.
Published: (2023)
Fine-tuning Large Language Models with Sequential Instructions
by: Hu, Hanxu, et al.
Published: (2024)
by: Hu, Hanxu, et al.
Published: (2024)
Sparse Matrix in Large Language Model Fine-tuning
by: He, Haoze, et al.
Published: (2024)
by: He, Haoze, et al.
Published: (2024)
Who is bragging more online? A large scale analysis of bragging in social media
by: Jin, Mali, et al.
Published: (2024)
by: Jin, Mali, et al.
Published: (2024)
EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning
by: Huang, Dong, et al.
Published: (2024)
by: Huang, Dong, et al.
Published: (2024)
Investigating the Representation of Backchannels and Fillers in Fine-tuned Language Models
by: Wang, Yu, et al.
Published: (2025)
by: Wang, Yu, et al.
Published: (2025)
NeurIPS 2023 LLM Efficiency Fine-tuning Competition
by: Saroufim, Mark, et al.
Published: (2025)
by: Saroufim, Mark, et al.
Published: (2025)
Advancing Parameter Efficiency in Fine-tuning via Representation Editing
by: Wu, Muling, et al.
Published: (2024)
by: Wu, Muling, et al.
Published: (2024)
Similar Items
-
Self-calibration for Language Model Quantization and Pruning
by: Williams, Miles, et al.
Published: (2024) -
On the Impact of Calibration Data in Post-training Quantization and Pruning
by: Williams, Miles, et al.
Published: (2023) -
Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models
by: Zhao, Zhixue, et al.
Published: (2024) -
Compressing Language Models for Specialized Domains
by: Williams, Miles, et al.
Published: (2025) -
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization
by: Chrysostomou, George, et al.
Published: (2023)