| Main Authors: | Mazzawi, Hanna; Awasthi, Pranjal; Gonzalvo, Xavi; Ramalingam, Srikumar |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.05033 |
Similar Items
Deep Fusion: Efficient Network Training via Pre-trained Initializations
by: Mazzawi, Hanna, et al.
Published: (2023)
Grow, Don't Overwrite: Fine-tuning Without Forgetting
by: Adila, Dyah, et al.
Published: (2026)
On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions
by: Böther, Maximilian, et al.
Published: (2024)
Transmuting prompts into weights
by: Mazzawi, Hanna, et al.
Published: (2025)
Learning without training: The implicit dynamics of in-context learning
by: Dherin, Benoit, et al.
Published: (2025)
How iteration order influences convergence and stability in deep learning
by: Dherin, Benoit, et al.
Published: (2025)
Learning by solving differential equations
by: Dherin, Benoit, et al.
Published: (2025)
The Limits of Preference Data for Post-Training
by: Zhao, Eric, et al.
Published: (2025)
Sample-Efficient Optimization over Generative Priors via Coarse Learnability
by: Awasthi, Pranjal, et al.
Published: (2025)
Analyzing Similarity Metrics for Data Selection for Language Model Pretraining
by: Sam, Dylan, et al.
Published: (2025)
Leveraging GANs For Active Appearance Models Optimized Model Fitting
by: Awasthi, Anurag
Published: (2025)
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification
by: Zhao, Eric, et al.
Published: (2025)
From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning
by: Zhao, Eric, et al.
Published: (2025)
Agnostic Learning of General ReLU Activation Using Gradient Descent
by: Awasthi, Pranjal, et al.
Published: (2022)
Stacking as Accelerated Gradient Descent
by: Agarwal, Naman, et al.
Published: (2024)
Learning Neural Networks with Sparse Activations
by: Awasthi, Pranjal, et al.
Published: (2024)
GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility
by: Fahrbach, Matthew, et al.
Published: (2024)
Uncertainty-Aware Tabular Prediction: Evaluating VBLL-Enhanced TabPFN in Safety-Critical Medical Data
by: Ramalingam, Madhushan
Published: (2025)
Language verY Rare for All
by: Merad, Ibrahim, et al.
Published: (2024)
Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure
by: Cho, Hanseul, et al.
Published: (2024)
HQFS: Hybrid Quantum Classical Financial Security with VQC Forecasting, QUBO Annealing, and Audit-Ready Post-Quantum Signing
by: Nayak, Srikumar
Published: (2026)
Named Entity Recognition for Payment Data Using NLP
by: Nayak, Srikumar
Published: (2026)
Calibrated Credit Intelligence: Shift-Robust and Fair Risk Scoring with Bayesian Uncertainty and Gradient Boosting
by: Nayak, Srikumar
Published: (2026)
Learnware of Language Models: Specialized Small Language Models Can Do Big
by: Tan, Zhi-Hao, et al.
Published: (2025)
A Margin-based Multiclass Generalization Bound via Geometric Complexity
by: Munn, Michael, et al.
Published: (2024)
The Impact of Geometric Complexity on Neural Collapse in Transfer Learning
by: Munn, Michael, et al.
Published: (2024)
RLShield: Practical Multi-Agent RL for Financial Cyber Defense with Attack-Surface MDPs and Real-Time Response Orchestration
by: Nayak, Srikumar
Published: (2026)
Big2Small: A Unifying Neural Network Framework for Model Compression
by: Liao, Jing-Xiao, et al.
Published: (2026)
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference
by: Jin, Zewen, et al.
Published: (2025)
Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation
by: Shah, Aayush, et al.
Published: (2024)
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
by: Aggarwal, Pranjal, et al.
Published: (2025)
Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in Product QA Agents
by: Lewis, Ashley, et al.
Published: (2025)
Training-Free Generative Modeling via Kernelized Stochastic Interpolants
by: Coeurdoux, Florentin, et al.
Published: (2026)
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
by: Rawat, Ankit Singh, et al.
Published: (2024)
Fractional Heat Kernel for Semi-Supervised Graph Learning with Small Training Sample Size
by: Bozorgnia, Farid, et al.
Published: (2025)
A Machine learning and Empirical Bayesian Approach for Predictive Buying in B2B E-commerce
by: De, Tuhin Subhra, et al.
Published: (2024)
Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational Training
by: Sahin, M. Emre, et al.
Published: (2024)
A Survey of Reinforcement Learning For Economics
by: Rawat, Pranjal
Published: (2026)
Leveraging Kernel Symmetry for Joint Compression and Error Mitigation in Edge Model Transfer
by: Hamadouche, Anis, et al.
Published: (2026)
A Post-Training Enhanced Optimization Approach for Small Language Models
by: Zhai, Keke
Published: (2024)