Saved in:
| Main Authors: | Wandre, Rohan, Gajewar, Yash, Patel, Namrata, Dhalkari, Vivek |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.02371 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data
by: Bezirganyan, Grigor, et al.
Published: (2024)
by: Bezirganyan, Grigor, et al.
Published: (2024)
ROSFD: Robust Online Streaming Fraud Detection with Resilience to Concept Drift in Data Streams
by: Yelleti, Vivek
Published: (2025)
by: Yelleti, Vivek
Published: (2025)
Towards Provable Emergence of In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2025)
by: Wang, Jiuqi, et al.
Published: (2025)
Distribution-Free Robust Predict-Then-Optimize in Function Spaces
by: Patel, Yash, et al.
Published: (2026)
by: Patel, Yash, et al.
Published: (2026)
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)
by: Eysenbach, Benjamin, et al.
Published: (2024)
Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations
by: Chen, Letian, et al.
Published: (2022)
by: Chen, Letian, et al.
Published: (2022)
Lifelong Safety Alignment for Language Models
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
Direct Distributional Optimization for Provable Alignment of Diffusion Models
by: Kawata, Ryotaro, et al.
Published: (2025)
by: Kawata, Ryotaro, et al.
Published: (2025)
Controllable Video Generation with Provable Disentanglement
by: Shen, Yifan, et al.
Published: (2025)
by: Shen, Yifan, et al.
Published: (2025)
Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization
by: Patel, Yash, et al.
Published: (2025)
by: Patel, Yash, et al.
Published: (2025)
Continuum Transformers Perform In-Context Learning by Operator Gradient Descent
by: Mishra, Abhiti, et al.
Published: (2025)
by: Mishra, Abhiti, et al.
Published: (2025)
Ward: Provable RAG Dataset Inference via LLM Watermarks
by: Jovanović, Nikola, et al.
Published: (2024)
by: Jovanović, Nikola, et al.
Published: (2024)
Provably Convergent Primal-Dual DPO for Constrained LLM Alignment
by: Du, Yihan, et al.
Published: (2025)
by: Du, Yihan, et al.
Published: (2025)
Incremental Outlier Detection Modelling Using Streaming Analytics in Finance & Health Care
by: Yelleti, Vivek, et al.
Published: (2023)
by: Yelleti, Vivek, et al.
Published: (2023)
HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction
by: Sarmah, Bhaskarjit, et al.
Published: (2024)
by: Sarmah, Bhaskarjit, et al.
Published: (2024)
A Greedy PDE Router for Blending Neural Operators and Classical Methods
by: Rayan, Sahana, et al.
Published: (2025)
by: Rayan, Sahana, et al.
Published: (2025)
AltGDmin: Alternating GD and Minimization for Partly-Decoupled (Federated) Optimization
by: Vaswani, Namrata
Published: (2025)
by: Vaswani, Namrata
Published: (2025)
OntoRAG: Enhancing Question-Answering through Automated Ontology Derivation from Unstructured Knowledge Bases
by: Tiwari, Yash, et al.
Published: (2025)
by: Tiwari, Yash, et al.
Published: (2025)
Lifelong Learning of Video Diffusion Models From a Single Video Stream
by: Yoo, Jason, et al.
Published: (2024)
by: Yoo, Jason, et al.
Published: (2024)
Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation
by: Rivera, Eduardo Ochoa, et al.
Published: (2024)
by: Rivera, Eduardo Ochoa, et al.
Published: (2024)
VERSE: Virtual-Gradient Aware Streaming Lifelong Learning with Anytime Inference
by: Banerjee, Soumya, et al.
Published: (2023)
by: Banerjee, Soumya, et al.
Published: (2023)
One-Step Bellman Alignment Enables Provably Efficient Transfer in Online RL
by: Chen, Elynn, et al.
Published: (2026)
by: Chen, Elynn, et al.
Published: (2026)
DSCA: Dynamic Subspace Concept Alignment for Lifelong VLM Editing
by: Das, Gyanendra, et al.
Published: (2026)
by: Das, Gyanendra, et al.
Published: (2026)
Skill-CMIB: Multimodal Agent Skill for Consistent Action via Conditional Multimodal Information Bottleneck
by: Huang, Zihan, et al.
Published: (2026)
by: Huang, Zihan, et al.
Published: (2026)
Scalable Valuation of Human Feedback through Provably Robust Model Alignment
by: Fujisawa, Masahiro, et al.
Published: (2025)
by: Fujisawa, Masahiro, et al.
Published: (2025)
Decentralized and Lifelong-Adaptive Multi-Agent Collaborative Learning
by: Tang, Shuo, et al.
Published: (2024)
by: Tang, Shuo, et al.
Published: (2024)
DynaSTy: A Framework for SpatioTemporal Node Attribute Prediction in Dynamic Graphs
by: Banerji, Namrata, et al.
Published: (2026)
by: Banerji, Namrata, et al.
Published: (2026)
More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing
by: Ma, Xin, et al.
Published: (2026)
by: Ma, Xin, et al.
Published: (2026)
LeanAgent: Lifelong Learning for Formal Theorem Proving
by: Kumarappan, Adarsh, et al.
Published: (2024)
by: Kumarappan, Adarsh, et al.
Published: (2024)
Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment
by: Yang, Zhiqin, et al.
Published: (2026)
by: Yang, Zhiqin, et al.
Published: (2026)
Privacy-Aware Lifelong Learning
by: Özdenizci, Ozan, et al.
Published: (2025)
by: Özdenizci, Ozan, et al.
Published: (2025)
Leveraging RAG for Training-Free Alignment of LLMs
by: Halloran, John T.
Published: (2026)
by: Halloran, John T.
Published: (2026)
MimirRAG: A Multi-Agent RAG Framework for Financial Data Retrieval with Metadata Integration
by: Samuelsen, Magnus, et al.
Published: (2026)
by: Samuelsen, Magnus, et al.
Published: (2026)
From Equations to Insights: Unraveling Symbolic Structures in PDEs with LLMs
by: Bhatnagar, Rohan, et al.
Published: (2025)
by: Bhatnagar, Rohan, et al.
Published: (2025)
Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits
by: Lin, Jiabin, et al.
Published: (2024)
by: Lin, Jiabin, et al.
Published: (2024)
Hybrid Preference Optimization for Alignment: Provably Faster Convergence Rates by Combining Offline Preferences with Online Exploration
by: Bose, Avinandan, et al.
Published: (2024)
by: Bose, Avinandan, et al.
Published: (2024)
TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints
by: Chakraborty, Abhijit, et al.
Published: (2026)
by: Chakraborty, Abhijit, et al.
Published: (2026)
Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning
by: Freihaut, Till, et al.
Published: (2025)
by: Freihaut, Till, et al.
Published: (2025)
Provably Mitigating Corruption, Overoptimization, and Verbosity Simultaneously in Offline and Online RLHF/DPO Alignment
by: Chen, Ziyi, et al.
Published: (2025)
by: Chen, Ziyi, et al.
Published: (2025)
Byzantine-Resilient Federated PCA and Low Rank Column-wise Sensing
by: Singh, Ankit Pratap, et al.
Published: (2023)
by: Singh, Ankit Pratap, et al.
Published: (2023)
Similar Items
-
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data
by: Bezirganyan, Grigor, et al.
Published: (2024) -
ROSFD: Robust Online Streaming Fraud Detection with Resilience to Concept Drift in Data Streams
by: Yelleti, Vivek
Published: (2025) -
Towards Provable Emergence of In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2025) -
Distribution-Free Robust Predict-Then-Optimize in Function Spaces
by: Patel, Yash, et al.
Published: (2026) -
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)