:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Madani, Omid
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence 68T05 I.2.6
Online Access:	https://arxiv.org/abs/2402.10142
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Post-Training Probability Manifold Correction via Structured SVD Pruning and Self-Referential Distillation
by: Flouro, Aaron R., et al.
Published: (2026)

Dynamical Priors as a Training Objective in Reinforcement Learning
by: Subaharan, Sukesh
Published: (2026)

A social path to human-like artificial intelligence
by: Duéñez-Guzmán, Edgar A., et al.
Published: (2024)

Memory-efficient Continual Learning with Prototypical Exemplar Condensation
by: Nguyen, Minh-Duong, et al.
Published: (2026)

Quantifying First-Order Markov Violations in Noisy Reinforcement Learning: A Causal Discovery Approach
by: Mysore, Naveen
Published: (2025)

Neurosymbolic Association Rule Mining from Tabular Data
by: Karabulut, Erkan, et al.
Published: (2025)

Multi-Scale Graph Learning for Anti-Sparse Downscaling
by: Fan, Yingda, et al.
Published: (2025)

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping
by: Lai, Guannan, et al.
Published: (2025)

A Practical Guide to Streaming Continual Learning
by: Cossu, Andrea, et al.
Published: (2026)

The Geometry of Persona: Disentangling Personality from Reasoning in Large Language Models
by: Wang, Zhixiang
Published: (2025)

Don't Look Back in Anger: MAGIC Net for Streaming Continual Learning with Temporal Dependence
by: Giannini, Federico, et al.
Published: (2026)

cPNN: Continuous Progressive Neural Networks for Evolving Streaming Time Series
by: Giannini, Federico, et al.
Published: (2026)

RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
by: Lin, Junjie, et al.
Published: (2024)

SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library
by: Mishra, Satyam, et al.
Published: (2025)

Foundation Models as World Models: A Foundational Study in Text-Based GridWorlds
by: Sasso, Remo, et al.
Published: (2025)

Exploration with Foundation Models: Capabilities, Limitations, and Hybrid Approaches
by: Sasso, Remo, et al.
Published: (2025)

A Comparative Survey of PyTorch vs TensorFlow for Deep Learning: Usability, Performance, and Deployment Trade-offs
by: Alawi, Zakariya Ba
Published: (2025)

GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies
by: Zhang, He, et al.
Published: (2026)

MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
by: Zamaraeva, Elena, et al.
Published: (2025)

Reconstructing 12-Lead ECG from 3-Lead ECG using Variational Autoencoder to Improve Cardiac Disease Detection of Wearable ECG Devices
by: Guan, Xinyan, et al.
Published: (2025)

CVCM Track Circuits Pre-emptive Failure Diagnostics for Predictive Maintenance Using Deep Neural Networks
by: Mukherjee, Debdeep, et al.
Published: (2025)

Multi-Teacher Ensemble Distillation: A Mathematical Framework for Probability-Domain Knowledge Aggregation
by: Flouro, Aaron R., et al.
Published: (2026)

Streaming Continual Learning for Unified Adaptive Intelligence in Dynamic Environments
by: Giannini, Federico, et al.
Published: (2026)

Label Smoothing is a Pragmatic Information Bottleneck
by: Kudo, Sota
Published: (2025)

Hybrid Imbalanced Regression Through Unified Data-Level and Algorithm-Level Balancing
by: Shahbazi, Shermin, et al.
Published: (2026)

Hybrid Gated Flow (HGF): Stabilizing 1.58-bit LLMs via Selective Low-Rank Correction
by: Pizzo, David Alejandro Trejo
Published: (2026)

Track Component Failure Detection Using Data Analytics over existing STDS Track Circuit data
by: López, Francisco, et al.
Published: (2025)

Active Causal Experimentalist (ACE): Learning Intervention Strategies via Direct Preference Optimization
by: Cooper, Patrick, et al.
Published: (2026)

AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models
by: Zarei, Mohammad, et al.
Published: (2025)

Graph Transformers: A Survey
by: Shehzad, Ahsan, et al.
Published: (2024)

QGraphLIME - Explaining Quantum Graph Neural Networks
by: Jena, Haribandhu, et al.
Published: (2025)

SafeAnchor: Preventing Cumulative Safety Erosion in Continual Domain Adaptation of Large Language Models
by: Guo, Dongxin, et al.
Published: (2026)

Active Inference with a Self-Prior in the Mirror-Mark Task
by: Kim, Dongmin, et al.
Published: (2026)

Adaptive Bernstein Change Detector for High-Dimensional Data Streams
by: Heyden, Marco, et al.
Published: (2023)

WorkflowGen:an adaptive workflow generation mechanism driven by trajectory experience
by: Wei, Ruocan, et al.
Published: (2026)

Measuring and curing reasoning rigidity: from decorative chain-of-thought to genuine faithfulness
by: Basu, Abhinaba, et al.
Published: (2026)

The Mirror Loop: Recursive Non-Convergence in Generative Reasoning Systems
by: DeVilling, Bentley
Published: (2025)

Counterfactual Basis Extension and Representational Geometry: An MDL-Constrained Model of Conceptual Growth
by: Amornbunchornvej, Chainarong
Published: (2025)

From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning
by: Klačan, Ján, et al.
Published: (2026)

mHC-SSM: Manifold-Constrained Hyper-Connections for State Space Language Models with Stream-Specialized Adapters
by: Mutlu, Abdulvahap, et al.
Published: (2026)