Saved in:
| Main Authors: | Wang, Zepeng, Ma, Chao, Zhou, Linjiang, Wu, Libing, Yang, Lei, Shi, Xiaochuan, Peng, Guojun |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.05580 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AdaptGrad: Adaptive Sampling to Reduce Noise
by: Zhou, Linjiang, et al.
Published: (2024)
by: Zhou, Linjiang, et al.
Published: (2024)
Axiomatization of Gradient Smoothing in Neural Networks
by: Zhou, Linjiang, et al.
Published: (2024)
by: Zhou, Linjiang, et al.
Published: (2024)
MDSAM:Memory-Driven Sparse Attention Matrix for LVLMs Hallucination Mitigation
by: Lu, Shuaiye, et al.
Published: (2025)
by: Lu, Shuaiye, et al.
Published: (2025)
KGLens: Towards Efficient and Effective Knowledge Probing of Large Language Models with Knowledge Graphs
by: Zheng, Shangshang, et al.
Published: (2023)
by: Zheng, Shangshang, et al.
Published: (2023)
On Simplifying Large-Scale Spatial Vectors: Fast, Memory-Efficient, and Cost-Predictable k-means
by: Ji, Yushuai, et al.
Published: (2024)
by: Ji, Yushuai, et al.
Published: (2024)
SEG-Parking: Towards Safe, Efficient, and Generalizable Autonomous Parking via End-to-End Offline Reinforcement Learning
by: Yang, Zewei, et al.
Published: (2025)
by: Yang, Zewei, et al.
Published: (2025)
Evaluating the Effectiveness of Cost-Efficient Large Language Models in Benchmark Biomedical Tasks
by: Jahan, Israt, et al.
Published: (2025)
by: Jahan, Israt, et al.
Published: (2025)
General Humanoid Whole-Body Control via Pretraining and Fast Adaptation
by: Wang, Zepeng, et al.
Published: (2026)
by: Wang, Zepeng, et al.
Published: (2026)
A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows
by: Cao, Linjiang, et al.
Published: (2025)
by: Cao, Linjiang, et al.
Published: (2025)
AIBrix: Towards Scalable, Cost-Effective Large Language Model Inference Infrastructure
by: The AIBrix Team, et al.
Published: (2025)
by: The AIBrix Team, et al.
Published: (2025)
Counterfactually Safe Reinforcement Learning
by: Li, Jingyi, et al.
Published: (2026)
by: Li, Jingyi, et al.
Published: (2026)
Multi-round jailbreak attack on large language models
by: Zhou, Yihua, et al.
Published: (2024)
by: Zhou, Yihua, et al.
Published: (2024)
FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal
by: Xu, Hang, et al.
Published: (2025)
by: Xu, Hang, et al.
Published: (2025)
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models
by: Jiang, Guochao, et al.
Published: (2024)
by: Jiang, Guochao, et al.
Published: (2024)
Improving Visual Storytelling with Multimodal Large Language Models
by: Lin, Xiaochuan, et al.
Published: (2024)
by: Lin, Xiaochuan, et al.
Published: (2024)
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving
by: Chen, Xuesong, et al.
Published: (2025)
by: Chen, Xuesong, et al.
Published: (2025)
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction
by: Qian, Junlang, et al.
Published: (2025)
by: Qian, Junlang, et al.
Published: (2025)
Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models
by: Chen, Yushuo, et al.
Published: (2024)
by: Chen, Yushuo, et al.
Published: (2024)
Towards Efficient and Effective Alignment of Large Language Models
by: Jiang, Yuxin
Published: (2025)
by: Jiang, Yuxin
Published: (2025)
FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering
by: Zhou, Jingqiu, et al.
Published: (2025)
by: Zhou, Jingqiu, et al.
Published: (2025)
Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation
by: Xu, Hang, et al.
Published: (2025)
by: Xu, Hang, et al.
Published: (2025)
Enhancing User Intent for Recommendation Systems via Large Language Models
by: Xu, Xiaochuan, et al.
Published: (2025)
by: Xu, Xiaochuan, et al.
Published: (2025)
Designing Control Barrier Function via Probabilistic Enumeration for Safe Reinforcement Learning Navigation
by: Marzari, Luca, et al.
Published: (2025)
by: Marzari, Luca, et al.
Published: (2025)
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
by: Ying, Chengyang, et al.
Published: (2022)
by: Ying, Chengyang, et al.
Published: (2022)
Dual-Quadruped Collaborative Transportation in Narrow Environments via Safe Reinforcement Learning
by: Lei, Zhezhi, et al.
Published: (2026)
by: Lei, Zhezhi, et al.
Published: (2026)
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
by: Xiong, Guojun, et al.
Published: (2024)
by: Xiong, Guojun, et al.
Published: (2024)
Clay‐Enabled (quasi) Solid‐State Electrolytes for Metal Batteries: Toward Safe, Sustainable, and High‐Energy Storage
by: Zhangkuo Han, et al.
Published: (2025)
by: Zhangkuo Han, et al.
Published: (2025)
Exploration of the Effectiveness and Experience of AI‐Assisted Academic Reading
by: Xiaochuan Zheng, et al.
Published: (2024)
by: Xiaochuan Zheng, et al.
Published: (2024)
Towards Effective and Efficient Continual Pre-training of Large Language Models
by: Chen, Jie, et al.
Published: (2024)
by: Chen, Jie, et al.
Published: (2024)
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
by: Zhang, Zongmeng, et al.
Published: (2024)
by: Zhang, Zongmeng, et al.
Published: (2024)
A Status Quo Investigation of Large Language Models towards Cost-Effective CFD Automation with OpenFOAMGPT: ChatGPT vs. Qwen vs. Deepseek
by: Wang, Wenkang, et al.
Published: (2025)
by: Wang, Wenkang, et al.
Published: (2025)
SafeLawBench: Towards Safe Alignment of Large Language Models
by: Cao, Chuxue, et al.
Published: (2025)
by: Cao, Chuxue, et al.
Published: (2025)
Route Sparse Autoencoder to Interpret Large Language Models
by: Shi, Wei, et al.
Published: (2025)
by: Shi, Wei, et al.
Published: (2025)
EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models
by: Tan, Zheyue, et al.
Published: (2025)
by: Tan, Zheyue, et al.
Published: (2025)
A Simple Cost‐Effective Method to Fabricate Single Nanochannels by Embedding Electrospun Polyethylene Oxide Nanofibers
by: Lei Zhou, et al.
Published: (2024)
by: Lei Zhou, et al.
Published: (2024)
Towards Efficient and Effective Unlearning of Large Language Models for Recommendation
by: Wang, Hangyu, et al.
Published: (2024)
by: Wang, Hangyu, et al.
Published: (2024)
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
by: Zhou, Zhehua, et al.
Published: (2024)
by: Zhou, Zhehua, et al.
Published: (2024)
Single-Sample Black-Box Membership Inference Attack against Vision-Language Models via Cross-modal Semantic Alignment
by: Li, Jiaqing, et al.
Published: (2026)
by: Li, Jiaqing, et al.
Published: (2026)
Evaluation of the effect of ultrasound‐assisted hot air drying on the drying characteristics and physicochemical properties of cherries based on the entropy‐weighted TOPSIS method
by: Hongyang Lu, et al.
Published: (2024)
by: Hongyang Lu, et al.
Published: (2024)
Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
by: Ding, Zepeng, et al.
Published: (2024)
by: Ding, Zepeng, et al.
Published: (2024)
Similar Items
-
AdaptGrad: Adaptive Sampling to Reduce Noise
by: Zhou, Linjiang, et al.
Published: (2024) -
Axiomatization of Gradient Smoothing in Neural Networks
by: Zhou, Linjiang, et al.
Published: (2024) -
MDSAM:Memory-Driven Sparse Attention Matrix for LVLMs Hallucination Mitigation
by: Lu, Shuaiye, et al.
Published: (2025) -
KGLens: Towards Efficient and Effective Knowledge Probing of Large Language Models with Knowledge Graphs
by: Zheng, Shangshang, et al.
Published: (2023) -
On Simplifying Large-Scale Spatial Vectors: Fast, Memory-Efficient, and Cost-Predictable k-means
by: Ji, Yushuai, et al.
Published: (2024)