Guardado en:
| Autores principales: | Wan, Jiayong, Chen, Jiawei, Yin, Zhaoxia, Shuyuan, Liu, Su, Hang |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2605.27375 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO
por: Tian, Yu, et al.
Publicado: (2026)
por: Tian, Yu, et al.
Publicado: (2026)
Exploring the Robustness of Decision-Level Through Adversarial Attacks on LLM-Based Embodied Models
por: Liu, Shuyuan, et al.
Publicado: (2024)
por: Liu, Shuyuan, et al.
Publicado: (2024)
KG-DF: A Black-box Defense Framework against Jailbreak Attacks Based on Knowledge Graphs
por: Liu, Shuyuan, et al.
Publicado: (2025)
por: Liu, Shuyuan, et al.
Publicado: (2025)
Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis
por: Huang, Donghao, et al.
Publicado: (2026)
por: Huang, Donghao, et al.
Publicado: (2026)
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
por: He, Wei, et al.
Publicado: (2025)
por: He, Wei, et al.
Publicado: (2025)
Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts
por: Chen, Hongyu, et al.
Publicado: (2025)
por: Chen, Hongyu, et al.
Publicado: (2025)
TRUEBench: Can LLM Response Meet Real-world Constraints as Productivity Assistant?
por: Park, Jiho, et al.
Publicado: (2025)
por: Park, Jiho, et al.
Publicado: (2025)
Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
por: Qiu, Pengcheng, et al.
Publicado: (2025)
por: Qiu, Pengcheng, et al.
Publicado: (2025)
Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization
por: Su, Jianghao, et al.
Publicado: (2025)
por: Su, Jianghao, et al.
Publicado: (2025)
FaceCat: Enhancing Face Recognition Security with a Unified Diffusion Model
por: Chen, Jiawei, et al.
Publicado: (2024)
por: Chen, Jiawei, et al.
Publicado: (2024)
Safer Reasoning Traces: Measuring and Mitigating Chain-of-Thought Leakage in LLMs
por: Ahrend, Patrick, et al.
Publicado: (2026)
por: Ahrend, Patrick, et al.
Publicado: (2026)
STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
por: Wang, Zijun, et al.
Publicado: (2025)
por: Wang, Zijun, et al.
Publicado: (2025)
Position: The Real Barrier to LLM Agent Usability is Agentic ROI
por: Liu, Weiwen, et al.
Publicado: (2025)
por: Liu, Weiwen, et al.
Publicado: (2025)
TaskCraft: Automated Generation of Agentic Tasks
por: Shi, Dingfeng, et al.
Publicado: (2025)
por: Shi, Dingfeng, et al.
Publicado: (2025)
AutoBreach: Universal and Adaptive Jailbreaking with Efficient Wordplay-Guided Optimization
por: Chen, Jiawei, et al.
Publicado: (2024)
por: Chen, Jiawei, et al.
Publicado: (2024)
A Novel Hierarchical Multi-Agent System for Payments Using LLMs
por: Chua, Joon Kiat, et al.
Publicado: (2026)
por: Chua, Joon Kiat, et al.
Publicado: (2026)
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
por: Bonaldi, Helena, et al.
Publicado: (2024)
por: Bonaldi, Helena, et al.
Publicado: (2024)
Survey of Query-based Text Summarization
por: Yu, Hang, et al.
Publicado: (2022)
por: Yu, Hang, et al.
Publicado: (2022)
A-MEM: Agentic Memory for LLM Agents
por: Xu, Wujiang, et al.
Publicado: (2025)
por: Xu, Wujiang, et al.
Publicado: (2025)
SAMoRA: Semantic-Aware Mixture of LoRA Experts for Task-Adaptive Learning
por: Shi, Boyan, et al.
Publicado: (2026)
por: Shi, Boyan, et al.
Publicado: (2026)
Lightweight Multimodal LLM-Enabled Cost-Effective Defect Grading of Power Transmission Equipment
por: Wang, Tao, et al.
Publicado: (2026)
por: Wang, Tao, et al.
Publicado: (2026)
Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals
por: Chen, Cheng, et al.
Publicado: (2025)
por: Chen, Cheng, et al.
Publicado: (2025)
Agentic Society: Merging skeleton from real world and texture from Large Language Model
por: Bai, Yuqi, et al.
Publicado: (2024)
por: Bai, Yuqi, et al.
Publicado: (2024)
Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks
por: Lee, Yoonsang, et al.
Publicado: (2026)
por: Lee, Yoonsang, et al.
Publicado: (2026)
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
por: Zheng, Tong, et al.
Publicado: (2026)
por: Zheng, Tong, et al.
Publicado: (2026)
Evaluating the Challenges of LLMs in Real-world Medical Follow-up: A Comparative Study and An Optimized Framework
por: Liu, Jinyan, et al.
Publicado: (2025)
por: Liu, Jinyan, et al.
Publicado: (2025)
Evil Geniuses: Delving into the Safety of LLM-based Agents
por: Tian, Yu, et al.
Publicado: (2023)
por: Tian, Yu, et al.
Publicado: (2023)
AgenticMath: Enhancing LLM Reasoning via Agentic-based Math Data Generation
por: Liu, Xianyang, et al.
Publicado: (2025)
por: Liu, Xianyang, et al.
Publicado: (2025)
LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
por: Li, Haitao, et al.
Publicado: (2024)
por: Li, Haitao, et al.
Publicado: (2024)
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
por: Chen, Yanxu, et al.
Publicado: (2025)
por: Chen, Yanxu, et al.
Publicado: (2025)
PurpCode: Reasoning for Safer Code Generation
por: Liu, Jiawei, et al.
Publicado: (2025)
por: Liu, Jiawei, et al.
Publicado: (2025)
Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs
por: Lin, Shuyuan, et al.
Publicado: (2025)
por: Lin, Shuyuan, et al.
Publicado: (2025)
LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?
por: Sun, Lu, et al.
Publicado: (2025)
por: Sun, Lu, et al.
Publicado: (2025)
Pruning Unsafe Tickets: A Resource-Efficient Framework for Safer and More Robust LLMs
por: Si, Wai Man, et al.
Publicado: (2026)
por: Si, Wai Man, et al.
Publicado: (2026)
Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs
por: Mendu, Sai Krishna, et al.
Publicado: (2025)
por: Mendu, Sai Krishna, et al.
Publicado: (2025)
Are Smarter LLMs Safer? Exploring Safety-Reasoning Trade-offs in Prompting and Fine-Tuning
por: Li, Ang, et al.
Publicado: (2025)
por: Li, Ang, et al.
Publicado: (2025)
CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing
por: Xu, Zhipeng, et al.
Publicado: (2026)
por: Xu, Zhipeng, et al.
Publicado: (2026)
CiteLLM: An Agentic Platform for Trustworthy Scientific Reference Discovery
por: Hong, Mengze, et al.
Publicado: (2026)
por: Hong, Mengze, et al.
Publicado: (2026)
Towards Safer Large Language Models through Machine Unlearning
por: Liu, Zheyuan, et al.
Publicado: (2024)
por: Liu, Zheyuan, et al.
Publicado: (2024)
LLM Optimization Unlocks Real-Time Pairwise Reranking
por: Wu, Jingyu, et al.
Publicado: (2025)
por: Wu, Jingyu, et al.
Publicado: (2025)
Ejemplares similares
-
Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO
por: Tian, Yu, et al.
Publicado: (2026) -
Exploring the Robustness of Decision-Level Through Adversarial Attacks on LLM-Based Embodied Models
por: Liu, Shuyuan, et al.
Publicado: (2024) -
KG-DF: A Black-box Defense Framework against Jailbreak Attacks Based on Knowledge Graphs
por: Liu, Shuyuan, et al.
Publicado: (2025) -
Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis
por: Huang, Donghao, et al.
Publicado: (2026) -
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
por: He, Wei, et al.
Publicado: (2025)