Saved in:
| Main Authors: | Joarder, Shourov, Sikdar, Diganta, Akash, Ahsan Habib, Bhattarai, Binod, Gyawali, Prashnna |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.22620 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NERO: Explainable Out-of-Distribution Detection with Neuron-level Relevance
by: Chhetri, Anju, et al.
Published: (2025)
by: Chhetri, Anju, et al.
Published: (2025)
Multimodal Federated Learning With Missing Modalities through Feature Imputation Network
by: Poudel, Pranav, et al.
Published: (2025)
by: Poudel, Pranav, et al.
Published: (2025)
FedVG: Gradient-Guided Aggregation for Enhanced Federated Learning
by: Devkota, Alina, et al.
Published: (2026)
by: Devkota, Alina, et al.
Published: (2026)
Addressing Bias in VLMs for Glaucoma Detection Without Protected Attribute Supervision
by: Akash, Ahsan Habib, et al.
Published: (2025)
by: Akash, Ahsan Habib, et al.
Published: (2025)
Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer's Disease Progression Analysis
by: Thrasher, Jacob, et al.
Published: (2026)
by: Thrasher, Jacob, et al.
Published: (2026)
Multimodal Federated Learning in Healthcare: a Review
by: Thrasher, Jacob, et al.
Published: (2023)
by: Thrasher, Jacob, et al.
Published: (2023)
Local K-Similarity Constraint for Federated Learning with Label Noise
by: Amgain, Sanskar, et al.
Published: (2025)
by: Amgain, Sanskar, et al.
Published: (2025)
Two are better than one: Context window extension with multi-grained self-injection
by: Han, Wei, et al.
Published: (2024)
by: Han, Wei, et al.
Published: (2024)
Federated Foundation Model for GI Endoscopy Images
by: Devkota, Alina, et al.
Published: (2025)
by: Devkota, Alina, et al.
Published: (2025)
TE-SSL: Time and Event-aware Self Supervised Learning for Alzheimer's Disease Progression Analysis
by: Thrasher, Jacob, et al.
Published: (2024)
by: Thrasher, Jacob, et al.
Published: (2024)
Dimension Mixer: Group Mixing of Input Dimensions for Efficient Function Approximation
by: Sapkota, Suman, et al.
Published: (2023)
by: Sapkota, Suman, et al.
Published: (2023)
Input Invex Neural Network
by: Sapkota, Suman, et al.
Published: (2021)
by: Sapkota, Suman, et al.
Published: (2021)
Semantic similarity prediction is better than other semantic similarity measures
by: Herbold, Steffen
Published: (2023)
by: Herbold, Steffen
Published: (2023)
Language models are better than humans at next-token prediction
by: Shlegeris, Buck, et al.
Published: (2022)
by: Shlegeris, Buck, et al.
Published: (2022)
TTA-OOD: Test-time Augmentation for Improving Out-of-Distribution Detection in Gastrointestinal Vision
by: Pokhrel, Sandesh, et al.
Published: (2024)
by: Pokhrel, Sandesh, et al.
Published: (2024)
Effect of Data Augmentation on Conformal Prediction for Diabetic Retinopathy
by: Ahamed, Rizwan, et al.
Published: (2025)
by: Ahamed, Rizwan, et al.
Published: (2025)
Supervised Pretraining for Material Property Prediction
by: Rahman, Chowdhury Mohammad Abid, et al.
Published: (2025)
by: Rahman, Chowdhury Mohammad Abid, et al.
Published: (2025)
Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise
by: Khanal, Bidur, et al.
Published: (2024)
by: Khanal, Bidur, et al.
Published: (2024)
Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography
by: Joarder, Shourov, et al.
Published: (2025)
by: Joarder, Shourov, et al.
Published: (2025)
Reward Collapse in Aligning Large Language Models
by: Song, Ziang, et al.
Published: (2023)
by: Song, Ziang, et al.
Published: (2023)
CAR-MFL: Cross-Modal Augmentation by Retrieval for Multimodal Federated Learning with Missing Modalities
by: Poudel, Pranav, et al.
Published: (2024)
by: Poudel, Pranav, et al.
Published: (2024)
Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?
by: Nunez, Jeanmely Rojas, et al.
Published: (2026)
by: Nunez, Jeanmely Rojas, et al.
Published: (2026)
When Sharpening Becomes Collapse: Sampling Bias and Semantic Coupling in RL with Verifiable Rewards
by: Fan, Mingyuan, et al.
Published: (2026)
by: Fan, Mingyuan, et al.
Published: (2026)
Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards
by: Jiayang, Cheng, et al.
Published: (2026)
by: Jiayang, Cheng, et al.
Published: (2026)
Group Robust Preference Optimization in Reward-free RLHF
by: Ramesh, Shyam Sundhar, et al.
Published: (2024)
by: Ramesh, Shyam Sundhar, et al.
Published: (2024)
Reward-free Alignment for Conflicting Objectives
by: Chen, Peter, et al.
Published: (2026)
by: Chen, Peter, et al.
Published: (2026)
GRAM-R$^2$: Self-Training Generative Foundation Reward Models for Reward Reasoning
by: Wang, Chenglong, et al.
Published: (2025)
by: Wang, Chenglong, et al.
Published: (2025)
Consolidating Rewarded Perturbations for LLM Post-Training
by: Zhang, Zheyu, et al.
Published: (2026)
by: Zhang, Zheyu, et al.
Published: (2026)
Pre-Trained Policy Discriminators are General Reward Models
by: Dou, Shihan, et al.
Published: (2025)
by: Dou, Shihan, et al.
Published: (2025)
Training-free LLM Merging for Multi-task Learning
by: Fu, Zichuan, et al.
Published: (2025)
by: Fu, Zichuan, et al.
Published: (2025)
Scaling with Collapse: Efficient and Predictable Training of LLM Families
by: Bergsma, Shane, et al.
Published: (2025)
by: Bergsma, Shane, et al.
Published: (2025)
Scalable Multi-phase Word Embedding Using Conjunctive Propositional Clauses
by: Kadhim, Ahmed K., et al.
Published: (2025)
by: Kadhim, Ahmed K., et al.
Published: (2025)
LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models
by: Sikdar, Prateek Kumar
Published: (2026)
by: Sikdar, Prateek Kumar
Published: (2026)
Urdu Dependency Parsing and Treebank Development: A Syntactic and Morphological Perspective
by: Habib, Nudrat
Published: (2024)
by: Habib, Nudrat
Published: (2024)
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
by: Min, Do June, et al.
Published: (2024)
by: Min, Do June, et al.
Published: (2024)
Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization
by: Hajimolahoseini, Habib, et al.
Published: (2023)
by: Hajimolahoseini, Habib, et al.
Published: (2023)
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization
by: Bai, Yang, et al.
Published: (2026)
by: Bai, Yang, et al.
Published: (2026)
Training Data Efficiency in Multimodal Process Reward Models
by: Li, Jinyuan, et al.
Published: (2026)
by: Li, Jinyuan, et al.
Published: (2026)
Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs
by: Sahoo, Subramanyam
Published: (2026)
by: Sahoo, Subramanyam
Published: (2026)
A Multi-Level Framework for Accelerating Training Transformer Models
by: Zou, Longwei, et al.
Published: (2024)
by: Zou, Longwei, et al.
Published: (2024)
Similar Items
-
NERO: Explainable Out-of-Distribution Detection with Neuron-level Relevance
by: Chhetri, Anju, et al.
Published: (2025) -
Multimodal Federated Learning With Missing Modalities through Feature Imputation Network
by: Poudel, Pranav, et al.
Published: (2025) -
FedVG: Gradient-Guided Aggregation for Enhanced Federated Learning
by: Devkota, Alina, et al.
Published: (2026) -
Addressing Bias in VLMs for Glaucoma Detection Without Protected Attribute Supervision
by: Akash, Ahsan Habib, et al.
Published: (2025) -
Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer's Disease Progression Analysis
by: Thrasher, Jacob, et al.
Published: (2026)