Saved in:
| Main Authors: | Hosu, Ionel-Alexandru, Rebedea, Traian, Pascanu, Razvan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.13690 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Complexity-based code embeddings
by: Folea, Rares, et al.
Published: (2026)
by: Folea, Rares, et al.
Published: (2026)
Semi-Supervised Learning for Large Language Models Safety and Content Moderation
by: Dinuta, Eduard Stefan, et al.
Published: (2025)
by: Dinuta, Eduard Stefan, et al.
Published: (2025)
Lattice: Learning to Efficiently Compress the Memory
by: Karami, Mahdi, et al.
Published: (2025)
by: Karami, Mahdi, et al.
Published: (2025)
Revisiting Adam for Streaming Reinforcement Learning
by: Gogianu, Florin, et al.
Published: (2026)
by: Gogianu, Florin, et al.
Published: (2026)
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)
by: Lyle, Clare, et al.
Published: (2024)
Pluralistic Behavior Suite: Stress-Testing Multi-Turn Adherence to Custom Behavioral Policies
by: Varshney, Prasoon, et al.
Published: (2025)
by: Varshney, Prasoon, et al.
Published: (2025)
Softmax is not Enough (for Sharp Size Generalisation)
by: Veličković, Petar, et al.
Published: (2024)
by: Veličković, Petar, et al.
Published: (2024)
Hierarchical Meta-Reinforcement Learning via Automated Macro-Action Discovery
by: Cho, Minjae, et al.
Published: (2024)
by: Cho, Minjae, et al.
Published: (2024)
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
by: Schmied, Thomas, et al.
Published: (2024)
by: Schmied, Thomas, et al.
Published: (2024)
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
by: Schmied, Thomas, et al.
Published: (2025)
by: Schmied, Thomas, et al.
Published: (2025)
State Soup: In-Context Skill Learning, Retrieval and Mixing
by: Pióro, Maciej, et al.
Published: (2024)
by: Pióro, Maciej, et al.
Published: (2024)
MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification
by: Sirbu, Iustin, et al.
Published: (2025)
by: Sirbu, Iustin, et al.
Published: (2025)
A self-supervised neural-analytic method to predict the evolution of COVID-19 in Romania
by: Stochiţoiu, Radu D., et al.
Published: (2020)
by: Stochiţoiu, Radu D., et al.
Published: (2020)
Mining Generalizable Activation Functions
by: Vitvitskyi, Alex, et al.
Published: (2026)
by: Vitvitskyi, Alex, et al.
Published: (2026)
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
by: Schmied, Thomas, et al.
Published: (2024)
by: Schmied, Thomas, et al.
Published: (2024)
Perplexity Cannot Always Tell Right from Wrong
by: Veličković, Petar, et al.
Published: (2026)
by: Veličković, Petar, et al.
Published: (2026)
Layerwise LQR for Geometry-Aware Optimization of Deep Networks
by: Dufort-Labbé, Simon, et al.
Published: (2026)
by: Dufort-Labbé, Simon, et al.
Published: (2026)
Navigating Potholes with Geometry-Aware Sharpness Minimization
by: Dufort-Labbé, Simon, et al.
Published: (2026)
by: Dufort-Labbé, Simon, et al.
Published: (2026)
Asynchronous Algorithmic Alignment with Cocycles
by: Dudzik, Andrew, et al.
Published: (2023)
by: Dudzik, Andrew, et al.
Published: (2023)
Fine-Tuned In-Context Learners for Efficient Adaptation
by: Bornschein, Jorg, et al.
Published: (2025)
by: Bornschein, Jorg, et al.
Published: (2025)
Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
by: Dufort-Labbé, Simon, et al.
Published: (2024)
by: Dufort-Labbé, Simon, et al.
Published: (2024)
Kalman Filter for Online Classification of Non-Stationary Data
by: Titsias, Michalis K., et al.
Published: (2023)
by: Titsias, Michalis K., et al.
Published: (2023)
Torque-Aware Momentum
by: Malviya, Pranshu, et al.
Published: (2024)
by: Malviya, Pranshu, et al.
Published: (2024)
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
by: Galashov, Alexandre, et al.
Published: (2024)
by: Galashov, Alexandre, et al.
Published: (2024)
Improving Legal Judgement Prediction in Romanian with Long Text Encoders
by: Masala, Mihai, et al.
Published: (2024)
by: Masala, Mihai, et al.
Published: (2024)
Promoting Exploration in Memory-Augmented Adam using Critical Momenta
by: Malviya, Pranshu, et al.
Published: (2023)
by: Malviya, Pranshu, et al.
Published: (2023)
FedMSE: Semi-supervised federated learning approach for IoT network intrusion detection
by: Nguyen, Van Tuan, et al.
Published: (2024)
by: Nguyen, Van Tuan, et al.
Published: (2024)
On the generalization of language models from in-context learning and finetuning: a controlled study
by: Lampinen, Andrew K., et al.
Published: (2025)
by: Lampinen, Andrew K., et al.
Published: (2025)
Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models
by: Sreedhar, Makesh Narsimhan, et al.
Published: (2025)
by: Sreedhar, Makesh Narsimhan, et al.
Published: (2025)
From Markov to Laplace: How Mamba In-Context Learns Markov Chains
by: Bondaschi, Marco, et al.
Published: (2025)
by: Bondaschi, Marco, et al.
Published: (2025)
Air Pollution Forecasting in Bucharest
by: Şerban, Dragoş-Andrei, et al.
Published: (2025)
by: Şerban, Dragoş-Andrei, et al.
Published: (2025)
Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer
by: Xue, Ke, et al.
Published: (2024)
by: Xue, Ke, et al.
Published: (2024)
Towards Inference-time Category-wise Safety Steering for Large Language Models
by: Bhattacharjee, Amrita, et al.
Published: (2024)
by: Bhattacharjee, Amrita, et al.
Published: (2024)
How Overconfidence in Initial Choices and Underconfidence Under Criticism Modulate Change of Mind in Large Language Models
by: Kumaran, Dharshan, et al.
Published: (2025)
by: Kumaran, Dharshan, et al.
Published: (2025)
Look Every Frame All at Once: Video-Ma$^2$mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing
by: Lee, Hosu, et al.
Published: (2024)
by: Lee, Hosu, et al.
Published: (2024)
Decoding complexity: how machine learning is redefining scientific discovery
by: Vinuesa, Ricardo, et al.
Published: (2024)
by: Vinuesa, Ricardo, et al.
Published: (2024)
Gradients as an Action: Towards Communication-Efficient Federated Recommender Systems via Adaptive Action Sharing
by: Lu, Zhufeng, et al.
Published: (2025)
by: Lu, Zhufeng, et al.
Published: (2025)
Uncovering mesa-optimization algorithms in Transformers
by: von Oswald, Johannes, et al.
Published: (2023)
by: von Oswald, Johannes, et al.
Published: (2023)
Improving Traffic Flow Predictions with SGCN-LSTM: A Hybrid Model for Spatial and Temporal Dependencies
by: Cismaru, Alexandru T.
Published: (2024)
by: Cismaru, Alexandru T.
Published: (2024)
Credit Card Fraud Detection
by: Popova, Iva, et al.
Published: (2025)
by: Popova, Iva, et al.
Published: (2025)
Similar Items
-
Complexity-based code embeddings
by: Folea, Rares, et al.
Published: (2026) -
Semi-Supervised Learning for Large Language Models Safety and Content Moderation
by: Dinuta, Eduard Stefan, et al.
Published: (2025) -
Lattice: Learning to Efficiently Compress the Memory
by: Karami, Mahdi, et al.
Published: (2025) -
Revisiting Adam for Streaming Reinforcement Learning
by: Gogianu, Florin, et al.
Published: (2026) -
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)