:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mazzawi, Hanna, Awasthi, Pranjal, Gonzalvo, Xavi, Ramalingam, Srikumar
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.05033
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Deep Fusion: Efficient Network Training via Pre-trained Initializations
by: Mazzawi, Hanna, et al.
Published: (2023)

Grow, Don't Overwrite: Fine-tuning Without Forgetting
by: Adila, Dyah, et al.
Published: (2026)

On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions
by: Böther, Maximilian, et al.
Published: (2024)

Transmuting prompts into weights
by: Mazzawi, Hanna, et al.
Published: (2025)

Learning without training: The implicit dynamics of in-context learning
by: Dherin, Benoit, et al.
Published: (2025)

How iteration order influences convergence and stability in deep learning
by: Dherin, Benoit, et al.
Published: (2025)

Learning by solving differential equations
by: Dherin, Benoit, et al.
Published: (2025)

The Limits of Preference Data for Post-Training
by: Zhao, Eric, et al.
Published: (2025)

Sample-Efficient Optimization over Generative Priors via Coarse Learnability
by: Awasthi, Pranjal, et al.
Published: (2025)

Analyzing Similarity Metrics for Data Selection for Language Model Pretraining
by: Sam, Dylan, et al.
Published: (2025)

Leveraging GANs For Active Appearance Models Optimized Model Fitting
by: Awasthi, Anurag
Published: (2025)

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification
by: Zhao, Eric, et al.
Published: (2025)

From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning
by: Zhao, Eric, et al.
Published: (2025)

Agnostic Learning of General ReLU Activation Using Gradient Descent
by: Awasthi, Pranjal, et al.
Published: (2022)

Stacking as Accelerated Gradient Descent
by: Agarwal, Naman, et al.
Published: (2024)

Learning Neural Networks with Sparse Activations
by: Awasthi, Pranjal, et al.
Published: (2024)

GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility
by: Fahrbach, Matthew, et al.
Published: (2024)

Uncertainty-Aware Tabular Prediction: Evaluating VBLL-Enhanced TabPFN in Safety-Critical Medical Data
by: Ramalingam, Madhushan
Published: (2025)

Language verY Rare for All
by: Merad, Ibrahim, et al.
Published: (2024)

Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure
by: Cho, Hanseul, et al.
Published: (2024)

HQFS: Hybrid Quantum Classical Financial Security with VQC Forecasting, QUBO Annealing, and Audit-Ready Post-Quantum Signing
by: Nayak, Srikumar
Published: (2026)

Named Entity Recognition for Payment Data Using NLP
by: Nayak, Srikumar
Published: (2026)

Calibrated Credit Intelligence: Shift-Robust and Fair Risk Scoring with Bayesian Uncertainty and Gradient Boosting
by: Nayak, Srikumar
Published: (2026)

Learnware of Language Models: Specialized Small Language Models Can Do Big
by: Tan, Zhi-Hao, et al.
Published: (2025)

A Margin-based Multiclass Generalization Bound via Geometric Complexity
by: Munn, Michael, et al.
Published: (2024)

The Impact of Geometric Complexity on Neural Collapse in Transfer Learning
by: Munn, Michael, et al.
Published: (2024)

RLShield: Practical Multi-Agent RL for Financial Cyber Defense with Attack-Surface MDPs and Real-Time Response Orchestration
by: Nayak, Srikumar
Published: (2026)

Big2Small: A Unifying Neural Network Framework for Model Compression
by: Liao, Jing-Xiao, et al.
Published: (2026)

BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference
by: Jin, Zewen, et al.
Published: (2025)

Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation
by: Shah, Aayush, et al.
Published: (2024)

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
by: Aggarwal, Pranjal, et al.
Published: (2025)

Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in Product QA Agents
by: Lewis, Ashley, et al.
Published: (2025)

Training-Free Generative Modeling via Kernelized Stochastic Interpolants
by: Coeurdoux, Florentin, et al.
Published: (2026)

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
by: Rawat, Ankit Singh, et al.
Published: (2024)

Fractional Heat Kernel for Semi-Supervised Graph Learning with Small Training Sample Size
by: Bozorgnia, Farid, et al.
Published: (2025)

A Machine learning and Empirical Bayesian Approach for Predictive Buying in B2B E-commerce
by: De, Tuhin Subhra, et al.
Published: (2024)

Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational Training
by: Sahin, M. Emre, et al.
Published: (2024)

A Survey of Reinforcement Learning For Economics
by: Rawat, Pranjal
Published: (2026)

Leveraging Kernel Symmetry for Joint Compression and Error Mitigation in Edge Model Transfer
by: Hamadouche, Anis, et al.
Published: (2026)

A Post-Training Enhanced Optimization Approach for Small Language Models
by: Zhai, Keke
Published: (2024)