Saved in:
| Main Authors: | Strangmann, Tobias, Purucker, Lennart, Franke, Jörg K. H., Rapant, Ivo, Ferreira, Fabio, Hutter, Frank |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.01195 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Ensembling Finetuned Language Models for Text Classification
by: Arango, Sebastian Pineda, et al.
Published: (2024)
by: Arango, Sebastian Pineda, et al.
Published: (2024)
Beyond Random Augmentations: Pretraining with Hard Views
by: Ferreira, Fabio, et al.
Published: (2023)
by: Ferreira, Fabio, et al.
Published: (2023)
Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities
by: Spiegelhalter, Urs, et al.
Published: (2025)
by: Spiegelhalter, Urs, et al.
Published: (2025)
Large Language Models Engineer Too Many Simple Features For Tabular Data
by: Küken, Jaris, et al.
Published: (2024)
by: Küken, Jaris, et al.
Published: (2024)
Early Stopping Tabular In-Context Learning
by: Küken, Jaris, et al.
Published: (2025)
by: Küken, Jaris, et al.
Published: (2025)
Causal Data Augmentation for Robust Fine-Tuning of Tabular Foundation Models
by: Bühler, Magnus, et al.
Published: (2026)
by: Bühler, Magnus, et al.
Published: (2026)
Quickly Tuning Foundation Models for Image Segmentation
by: Das, Breenda, et al.
Published: (2025)
by: Das, Breenda, et al.
Published: (2025)
Transfer Learning of Tabular Data by Finetuning Large Language Models
by: Rabbani, Shourav B., et al.
Published: (2025)
by: Rabbani, Shourav B., et al.
Published: (2025)
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues
by: Grazzi, Riccardo, et al.
Published: (2024)
by: Grazzi, Riccardo, et al.
Published: (2024)
Don't Waste Your Time: Early Stopping Cross-Validation
by: Bergman, Edward, et al.
Published: (2024)
by: Bergman, Edward, et al.
Published: (2024)
Agentic NL2SQL to Reduce Computational Costs
by: Jehle, Dominik, et al.
Published: (2025)
by: Jehle, Dominik, et al.
Published: (2025)
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
by: Sukthanker, Rhea Sanjay, et al.
Published: (2024)
by: Sukthanker, Rhea Sanjay, et al.
Published: (2024)
MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image
by: Arazi, Alan, et al.
Published: (2026)
by: Arazi, Alan, et al.
Published: (2026)
How Usable is Automated Feature Engineering for Tabular Data?
by: Schäfer, Bastian, et al.
Published: (2025)
by: Schäfer, Bastian, et al.
Published: (2025)
nanoTabPFN: A Lightweight and Educational Reimplementation of TabPFN
by: Pfefferle, Alexander, et al.
Published: (2025)
by: Pfefferle, Alexander, et al.
Published: (2025)
Large Language Models to Diffusion Finetuning
by: Cetin, Edoardo, et al.
Published: (2025)
by: Cetin, Edoardo, et al.
Published: (2025)
Revisiting Dynamic Evaluation: Online Adaptation for Large Language Models
by: Rannen-Triki, Amal, et al.
Published: (2024)
by: Rannen-Triki, Amal, et al.
Published: (2024)
Towards Benchmarking Foundation Models for Tabular Data With Text
by: Mráz, Martin, et al.
Published: (2025)
by: Mráz, Martin, et al.
Published: (2025)
ApiQ: Finetuning of 2-Bit Quantized Large Language Model
by: Liao, Baohao, et al.
Published: (2024)
by: Liao, Baohao, et al.
Published: (2024)
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models
by: Gong, Zi, et al.
Published: (2024)
by: Gong, Zi, et al.
Published: (2024)
MuonAll: Muon Variant for Efficient Finetuning of Large Language Models
by: Page, Saurabh, et al.
Published: (2025)
by: Page, Saurabh, et al.
Published: (2025)
Towards Scaling Laws for Symbolic Regression
by: Otte, David, et al.
Published: (2025)
by: Otte, David, et al.
Published: (2025)
Differentially Private Zeroth-Order Methods for Scalable Large Language Model Finetuning
by: Liu, Z, et al.
Published: (2024)
by: Liu, Z, et al.
Published: (2024)
Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How
by: Arango, Sebastian Pineda, et al.
Published: (2023)
by: Arango, Sebastian Pineda, et al.
Published: (2023)
Power-Law Decay Loss for Large Language Model Finetuning: A Theory Perspective
by: Shao, Jintian
Published: (2025)
by: Shao, Jintian
Published: (2025)
Finetuning Language Models to Emit Linguistic Expressions of Uncertainty
by: Chaudhry, Arslan, et al.
Published: (2024)
by: Chaudhry, Arslan, et al.
Published: (2024)
Task Matrices: Linear Maps for Cross-Model Finetuning Transfer
by: Brien, Darrin O', et al.
Published: (2025)
by: Brien, Darrin O', et al.
Published: (2025)
Towards Active Synthetic Data Generation for Finetuning Language Models
by: Kessler, Samuel, et al.
Published: (2025)
by: Kessler, Samuel, et al.
Published: (2025)
An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
by: Bhatt, Gantavya, et al.
Published: (2024)
by: Bhatt, Gantavya, et al.
Published: (2024)
Delta Activations: A Representation for Finetuned Large Language Models
by: Xu, Zhiqiu, et al.
Published: (2025)
by: Xu, Zhiqiu, et al.
Published: (2025)
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
by: Zhang, Biao, et al.
Published: (2024)
by: Zhang, Biao, et al.
Published: (2024)
ReFT: Representation Finetuning for Language Models
by: Wu, Zhengxuan, et al.
Published: (2024)
by: Wu, Zhengxuan, et al.
Published: (2024)
Vanishing Gradients in Reinforcement Finetuning of Language Models
by: Razin, Noam, et al.
Published: (2023)
by: Razin, Noam, et al.
Published: (2023)
Learning Rate Scaling across LoRA Ranks and Transfer to Full Finetuning
by: Chen, Nan, et al.
Published: (2026)
by: Chen, Nan, et al.
Published: (2026)
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
by: Zhang, Andi, et al.
Published: (2024)
by: Zhang, Andi, et al.
Published: (2024)
Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model
by: Liang, Jing, et al.
Published: (2025)
by: Liang, Jing, et al.
Published: (2025)
CARE-RFT: Confidence-Anchored Reinforcement Finetuning for Reliable Reasoning in Large Language Models
by: Li, Shuozhe, et al.
Published: (2026)
by: Li, Shuozhe, et al.
Published: (2026)
PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models
by: Hayou, Soufiane, et al.
Published: (2025)
by: Hayou, Soufiane, et al.
Published: (2025)
Improving Deep Learning Optimization through Constrained Parameter Regularization
by: Franke, Jörg K. H., et al.
Published: (2023)
by: Franke, Jörg K. H., et al.
Published: (2023)
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
by: Xie, Wanyun, et al.
Published: (2025)
by: Xie, Wanyun, et al.
Published: (2025)
Similar Items
-
Ensembling Finetuned Language Models for Text Classification
by: Arango, Sebastian Pineda, et al.
Published: (2024) -
Beyond Random Augmentations: Pretraining with Hard Views
by: Ferreira, Fabio, et al.
Published: (2023) -
Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities
by: Spiegelhalter, Urs, et al.
Published: (2025) -
Large Language Models Engineer Too Many Simple Features For Tabular Data
by: Küken, Jaris, et al.
Published: (2024) -
Early Stopping Tabular In-Context Learning
by: Küken, Jaris, et al.
Published: (2025)