| Main Author: | Madani, Omid |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.10142 |
Similar Items
Post-Training Probability Manifold Correction via Structured SVD Pruning and Self-Referential Distillation
by: Flouro, Aaron R., et al.
Published: (2026)
Dynamical Priors as a Training Objective in Reinforcement Learning
by: Subaharan, Sukesh
Published: (2026)
A social path to human-like artificial intelligence
by: Duéñez-Guzmán, Edgar A., et al.
Published: (2024)
Memory-efficient Continual Learning with Prototypical Exemplar Condensation
by: Nguyen, Minh-Duong, et al.
Published: (2026)
Quantifying First-Order Markov Violations in Noisy Reinforcement Learning: A Causal Discovery Approach
by: Mysore, Naveen
Published: (2025)
Neurosymbolic Association Rule Mining from Tabular Data
by: Karabulut, Erkan, et al.
Published: (2025)
Multi-Scale Graph Learning for Anti-Sparse Downscaling
by: Fan, Yingda, et al.
Published: (2025)
Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping
by: Lai, Guannan, et al.
Published: (2025)
A Practical Guide to Streaming Continual Learning
by: Cossu, Andrea, et al.
Published: (2026)
The Geometry of Persona: Disentangling Personality from Reasoning in Large Language Models
by: Wang, Zhixiang
Published: (2025)
Don't Look Back in Anger: MAGIC Net for Streaming Continual Learning with Temporal Dependence
by: Giannini, Federico, et al.
Published: (2026)
cPNN: Continuous Progressive Neural Networks for Evolving Streaming Time Series
by: Giannini, Federico, et al.
Published: (2026)
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
by: Lin, Junjie, et al.
Published: (2024)
SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library
by: Mishra, Satyam, et al.
Published: (2025)
Foundation Models as World Models: A Foundational Study in Text-Based GridWorlds
by: Sasso, Remo, et al.
Published: (2025)
Exploration with Foundation Models: Capabilities, Limitations, and Hybrid Approaches
by: Sasso, Remo, et al.
Published: (2025)
A Comparative Survey of PyTorch vs TensorFlow for Deep Learning: Usability, Performance, and Deployment Trade-offs
by: Alawi, Zakariya Ba
Published: (2025)
GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies
by: Zhang, He, et al.
Published: (2026)
MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
by: Zamaraeva, Elena, et al.
Published: (2025)
Reconstructing 12-Lead ECG from 3-Lead ECG using Variational Autoencoder to Improve Cardiac Disease Detection of Wearable ECG Devices
by: Guan, Xinyan, et al.
Published: (2025)
CVCM Track Circuits Pre-emptive Failure Diagnostics for Predictive Maintenance Using Deep Neural Networks
by: Mukherjee, Debdeep, et al.
Published: (2025)
Multi-Teacher Ensemble Distillation: A Mathematical Framework for Probability-Domain Knowledge Aggregation
by: Flouro, Aaron R., et al.
Published: (2026)
Streaming Continual Learning for Unified Adaptive Intelligence in Dynamic Environments
by: Giannini, Federico, et al.
Published: (2026)
Label Smoothing is a Pragmatic Information Bottleneck
by: Kudo, Sota
Published: (2025)
Hybrid Imbalanced Regression Through Unified Data-Level and Algorithm-Level Balancing
by: Shahbazi, Shermin, et al.
Published: (2026)
Hybrid Gated Flow (HGF): Stabilizing 1.58-bit LLMs via Selective Low-Rank Correction
by: Pizzo, David Alejandro Trejo
Published: (2026)
Track Component Failure Detection Using Data Analytics over existing STDS Track Circuit data
by: López, Francisco, et al.
Published: (2025)
Active Causal Experimentalist (ACE): Learning Intervention Strategies via Direct Preference Optimization
by: Cooper, Patrick, et al.
Published: (2026)
AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models
by: Zarei, Mohammad, et al.
Published: (2025)
Graph Transformers: A Survey
by: Shehzad, Ahsan, et al.
Published: (2024)
QGraphLIME - Explaining Quantum Graph Neural Networks
by: Jena, Haribandhu, et al.
Published: (2025)
SafeAnchor: Preventing Cumulative Safety Erosion in Continual Domain Adaptation of Large Language Models
by: Guo, Dongxin, et al.
Published: (2026)
Active Inference with a Self-Prior in the Mirror-Mark Task
by: Kim, Dongmin, et al.
Published: (2026)
Adaptive Bernstein Change Detector for High-Dimensional Data Streams
by: Heyden, Marco, et al.
Published: (2023)
WorkflowGen: an adaptive workflow generation mechanism driven by trajectory experience
by: Wei, Ruocan, et al.
Published: (2026)
Measuring and curing reasoning rigidity: from decorative chain-of-thought to genuine faithfulness
by: Basu, Abhinaba, et al.
Published: (2026)
The Mirror Loop: Recursive Non-Convergence in Generative Reasoning Systems
by: DeVilling, Bentley
Published: (2025)
Counterfactual Basis Extension and Representational Geometry: An MDL-Constrained Model of Conceptual Growth
by: Amornbunchornvej, Chainarong
Published: (2025)
From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning
by: Klačan, Ján, et al.
Published: (2026)
mHC-SSM: Manifold-Constrained Hyper-Connections for State Space Language Models with Stream-Specialized Adapters
by: Mutlu, Abdulvahap, et al.
Published: (2026)