:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Joarder, Shourov, Sikdar, Diganta, Akash, Ahsan Habib, Bhattarai, Binod, Gyawali, Prashnna
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Computation and Language
Online Access:	https://arxiv.org/abs/2605.22620
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

NERO: Explainable Out-of-Distribution Detection with Neuron-level Relevance
by: Chhetri, Anju, et al.
Published: (2025)

Multimodal Federated Learning With Missing Modalities through Feature Imputation Network
by: Poudel, Pranav, et al.
Published: (2025)

FedVG: Gradient-Guided Aggregation for Enhanced Federated Learning
by: Devkota, Alina, et al.
Published: (2026)

Addressing Bias in VLMs for Glaucoma Detection Without Protected Attribute Supervision
by: Akash, Ahsan Habib, et al.
Published: (2025)

Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer's Disease Progression Analysis
by: Thrasher, Jacob, et al.
Published: (2026)

Multimodal Federated Learning in Healthcare: a Review
by: Thrasher, Jacob, et al.
Published: (2023)

Local K-Similarity Constraint for Federated Learning with Label Noise
by: Amgain, Sanskar, et al.
Published: (2025)

Two are better than one: Context window extension with multi-grained self-injection
by: Han, Wei, et al.
Published: (2024)

Federated Foundation Model for GI Endoscopy Images
by: Devkota, Alina, et al.
Published: (2025)

TE-SSL: Time and Event-aware Self Supervised Learning for Alzheimer's Disease Progression Analysis
by: Thrasher, Jacob, et al.
Published: (2024)

Dimension Mixer: Group Mixing of Input Dimensions for Efficient Function Approximation
by: Sapkota, Suman, et al.
Published: (2023)

Input Invex Neural Network
by: Sapkota, Suman, et al.
Published: (2021)

Semantic similarity prediction is better than other semantic similarity measures
by: Herbold, Steffen
Published: (2023)

Language models are better than humans at next-token prediction
by: Shlegeris, Buck, et al.
Published: (2022)

TTA-OOD: Test-time Augmentation for Improving Out-of-Distribution Detection in Gastrointestinal Vision
by: Pokhrel, Sandesh, et al.
Published: (2024)

Effect of Data Augmentation on Conformal Prediction for Diabetic Retinopathy
by: Ahamed, Rizwan, et al.
Published: (2025)

Supervised Pretraining for Material Property Prediction
by: Rahman, Chowdhury Mohammad Abid, et al.
Published: (2025)

Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise
by: Khanal, Bidur, et al.
Published: (2024)

Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography
by: Joarder, Shourov, et al.
Published: (2025)

Reward Collapse in Aligning Large Language Models
by: Song, Ziang, et al.
Published: (2023)

CAR-MFL: Cross-Modal Augmentation by Retrieval for Multimodal Federated Learning with Missing Modalities
by: Poudel, Pranav, et al.
Published: (2024)

Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?
by: Nunez, Jeanmely Rojas, et al.
Published: (2026)

When Sharpening Becomes Collapse: Sampling Bias and Semantic Coupling in RL with Verifiable Rewards
by: Fan, Mingyuan, et al.
Published: (2026)

Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards
by: Jiayang, Cheng, et al.
Published: (2026)

Group Robust Preference Optimization in Reward-free RLHF
by: Ramesh, Shyam Sundhar, et al.
Published: (2024)

Reward-free Alignment for Conflicting Objectives
by: Chen, Peter, et al.
Published: (2026)

GRAM-R$^2$: Self-Training Generative Foundation Reward Models for Reward Reasoning
by: Wang, Chenglong, et al.
Published: (2025)

Consolidating Rewarded Perturbations for LLM Post-Training
by: Zhang, Zheyu, et al.
Published: (2026)

Pre-Trained Policy Discriminators are General Reward Models
by: Dou, Shihan, et al.
Published: (2025)

Training-free LLM Merging for Multi-task Learning
by: Fu, Zichuan, et al.
Published: (2025)

Scaling with Collapse: Efficient and Predictable Training of LLM Families
by: Bergsma, Shane, et al.
Published: (2025)

Scalable Multi-phase Word Embedding Using Conjunctive Propositional Clauses
by: Kadhim, Ahmed K., et al.
Published: (2025)

LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models
by: Sikdar, Prateek Kumar
Published: (2026)

Urdu Dependency Parsing and Treebank Development: A Syntactic and Morphological Perspective
by: Habib, Nudrat
Published: (2024)

Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
by: Min, Do June, et al.
Published: (2024)

Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization
by: Hajimolahoseini, Habib, et al.
Published: (2023)

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization
by: Bai, Yang, et al.
Published: (2026)

Training Data Efficiency in Multimodal Process Reward Models
by: Li, Jinyuan, et al.
Published: (2026)

Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs
by: Sahoo, Subramanyam
Published: (2026)

A Multi-Level Framework for Accelerating Training Transformer Models
by: Zou, Longwei, et al.
Published: (2024)