Saved in:
| Main Authors: | Yang, Ze, Jin, Yihong, Liu, Juntian, Xu, Xinhe |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.07411 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Anomaly Detection and Early Warning Mechanism for Intelligent Monitoring Systems in Multi-Cloud Environments Based on LLM
by: Jin, Yihong, et al.
Published: (2025)
by: Jin, Yihong, et al.
Published: (2025)
Research on Large Language Model Cross-Cloud Privacy Protection and Collaborative Training based on Federated Learning
by: Yang, Ze, et al.
Published: (2025)
by: Yang, Ze, et al.
Published: (2025)
Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments
by: Jin, Yihong, et al.
Published: (2025)
by: Jin, Yihong, et al.
Published: (2025)
Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models
by: Yang, Ze, et al.
Published: (2025)
by: Yang, Ze, et al.
Published: (2025)
HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models
by: Yang, Ze, et al.
Published: (2024)
by: Yang, Ze, et al.
Published: (2024)
Scam Detection for Ethereum Smart Contracts: Leveraging Graph Representation Learning for Secure Blockchain
by: Jin, Yihong, et al.
Published: (2024)
by: Jin, Yihong, et al.
Published: (2024)
Cloud-Based AI Systems: Leveraging Large Language Models for Intelligent Fault Detection and Autonomous Self-Healing
by: Ji, Cheng, et al.
Published: (2025)
by: Ji, Cheng, et al.
Published: (2025)
Scalability Optimization in Cloud-Based AI Inference Services: Strategies for Real-Time Load Balancing and Automated Scaling
by: Jin, Yihong, et al.
Published: (2025)
by: Jin, Yihong, et al.
Published: (2025)
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
by: Guo, Yihong, et al.
Published: (2025)
by: Guo, Yihong, et al.
Published: (2025)
Can Large Models Teach Student Models to Solve Mathematical Problems Like Human Beings? A Reasoning Distillation Method via Multi-LoRA Interaction
by: Li, Xinhe, et al.
Published: (2025)
by: Li, Xinhe, et al.
Published: (2025)
When a Reinforcement Learning Agent Encounters Unknown Unknowns
by: Zhu, Juntian, et al.
Published: (2025)
by: Zhu, Juntian, et al.
Published: (2025)
Deep Reinforcement Learning for Traffic Light Control in Intelligent Transportation Systems
by: Zhu, Ming, et al.
Published: (2023)
by: Zhu, Ming, et al.
Published: (2023)
Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models
by: Tao, Yongding, et al.
Published: (2025)
by: Tao, Yongding, et al.
Published: (2025)
CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving
by: Guo, Yihong, et al.
Published: (2026)
by: Guo, Yihong, et al.
Published: (2026)
Self-Healing Agentic Orchestrators for Reliable Tool-Augmented Large Language Model Systems
by: Babu, Rahul Suresh, et al.
Published: (2026)
by: Babu, Rahul Suresh, et al.
Published: (2026)
Cross-Domain Energy-Guided Diffusion Generation for Off-Dynamics Reinforcement Learning
by: Yang, Yu, et al.
Published: (2026)
by: Yang, Yu, et al.
Published: (2026)
Large Language Model Interface for Home Energy Management Systems
by: Michelon, François, et al.
Published: (2025)
by: Michelon, François, et al.
Published: (2025)
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
by: Guo, Yihong, et al.
Published: (2024)
by: Guo, Yihong, et al.
Published: (2024)
GaRField++: Reinforced Gaussian Radiance Fields for Large-Scale 3D Scene Reconstruction
by: Zhang, Hanyue, et al.
Published: (2024)
by: Zhang, Hanyue, et al.
Published: (2024)
StackTrans: From Large Language Model to Large Pushdown Automata Model
by: Zhang, Kechi, et al.
Published: (2025)
by: Zhang, Kechi, et al.
Published: (2025)
Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight
by: Xie, Zhiqiang, et al.
Published: (2024)
by: Xie, Zhiqiang, et al.
Published: (2024)
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
by: Fang, Wenkai, et al.
Published: (2025)
by: Fang, Wenkai, et al.
Published: (2025)
Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms
by: Ji, Cheng, et al.
Published: (2025)
by: Ji, Cheng, et al.
Published: (2025)
Post-Training Large Language Models via Reinforcement Learning from Self-Feedback
by: van Niekerk, Carel, et al.
Published: (2025)
by: van Niekerk, Carel, et al.
Published: (2025)
Automated Processing of eXplainable Artificial Intelligence Outputs in Deep Learning Models for Fault Diagnostics of Large Infrastructures
by: Floreale, Giovanni, et al.
Published: (2025)
by: Floreale, Giovanni, et al.
Published: (2025)
Cross-Cloud Data Privacy Protection: Optimizing Collaborative Mechanisms of AI Systems by Integrating Federated Learning and LLMs
by: Luo, Huaiying, et al.
Published: (2025)
by: Luo, Huaiying, et al.
Published: (2025)
RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks
by: Yang, Xu, et al.
Published: (2024)
by: Yang, Xu, et al.
Published: (2024)
AI Learning Algorithms: Deep Learning, Hybrid Models, and Large-Scale Model Integration
by: Golilarz, Noorbakhsh Amiri, et al.
Published: (2024)
by: Golilarz, Noorbakhsh Amiri, et al.
Published: (2024)
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective
by: Yang, Xintong, et al.
Published: (2023)
by: Yang, Xintong, et al.
Published: (2023)
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models
by: Dong, Yihong, et al.
Published: (2024)
by: Dong, Yihong, et al.
Published: (2024)
Fault Diagnosis in Power Grids with Large Language Model
by: Jing, Liu, et al.
Published: (2024)
by: Jing, Liu, et al.
Published: (2024)
Benchmarking Benchmark Leakage in Large Language Models
by: Xu, Ruijie, et al.
Published: (2024)
by: Xu, Ruijie, et al.
Published: (2024)
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
by: Liao, Yi, et al.
Published: (2025)
by: Liao, Yi, et al.
Published: (2025)
A Lightweight, Transferable, and Self-Adaptive Framework for Intelligent DC Arc-Fault Detection in Photovoltaic Systems
by: Yang, Xiaoke, et al.
Published: (2026)
by: Yang, Xiaoke, et al.
Published: (2026)
Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models
by: Saad, Md, et al.
Published: (2026)
by: Saad, Md, et al.
Published: (2026)
On Predictability of Reinforcement Learning Dynamics for Large Language Models
by: Cai, Yuchen, et al.
Published: (2025)
by: Cai, Yuchen, et al.
Published: (2025)
Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
by: Yu, Song, et al.
Published: (2024)
by: Yu, Song, et al.
Published: (2024)
Self-Hinting Language Models Enhance Reinforcement Learning
by: Liao, Baohao, et al.
Published: (2026)
by: Liao, Baohao, et al.
Published: (2026)
Semantic Encryption: Secure and Effective Interaction with Cloud-based Large Language Models via Semantic Transformation
by: Chen, Dong, et al.
Published: (2025)
by: Chen, Dong, et al.
Published: (2025)
Self-Healing Software Systems: Lessons from Nature, Powered by AI
by: Baqar, Mohammad, et al.
Published: (2025)
by: Baqar, Mohammad, et al.
Published: (2025)
Similar Items
-
Anomaly Detection and Early Warning Mechanism for Intelligent Monitoring Systems in Multi-Cloud Environments Based on LLM
by: Jin, Yihong, et al.
Published: (2025) -
Research on Large Language Model Cross-Cloud Privacy Protection and Collaborative Training based on Federated Learning
by: Yang, Ze, et al.
Published: (2025) -
Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments
by: Jin, Yihong, et al.
Published: (2025) -
Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models
by: Yang, Ze, et al.
Published: (2025) -
HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models
by: Yang, Ze, et al.
Published: (2024)