:: Library Catalog

Bibliographic Details
Main authors:	Wan, Jiayong; Chen, Jiawei; Yin, Zhaoxia; Shuyuan, Liu; Su, Hang
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online access:	https://arxiv.org/abs/2605.27375

Similar items

Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO
by: Tian, Yu, et al.
Published: (2026)

Exploring the Robustness of Decision-Level Through Adversarial Attacks on LLM-Based Embodied Models
by: Liu, Shuyuan, et al.
Published: (2024)

KG-DF: A Black-box Defense Framework against Jailbreak Attacks Based on Knowledge Graphs
by: Liu, Shuyuan, et al.
Published: (2025)

Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis
by: Huang, Donghao, et al.
Published: (2026)

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
by: He, Wei, et al.
Published: (2025)

Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts
by: Chen, Hongyu, et al.
Published: (2025)

TRUEBench: Can LLM Response Meet Real-world Constraints as Productivity Assistant?
by: Park, Jiho, et al.
Published: (2025)

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
by: Qiu, Pengcheng, et al.
Published: (2025)

Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization
by: Su, Jianghao, et al.
Published: (2025)

FaceCat: Enhancing Face Recognition Security with a Unified Diffusion Model
by: Chen, Jiawei, et al.
Published: (2024)

Safer Reasoning Traces: Measuring and Mitigating Chain-of-Thought Leakage in LLMs
by: Ahrend, Patrick, et al.
Published: (2026)

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
by: Wang, Zijun, et al.
Published: (2025)

Position: The Real Barrier to LLM Agent Usability is Agentic ROI
by: Liu, Weiwen, et al.
Published: (2025)

TaskCraft: Automated Generation of Agentic Tasks
by: Shi, Dingfeng, et al.
Published: (2025)

AutoBreach: Universal and Adaptive Jailbreaking with Efficient Wordplay-Guided Optimization
by: Chen, Jiawei, et al.
Published: (2024)

A Novel Hierarchical Multi-Agent System for Payments Using LLMs
by: Chua, Joon Kiat, et al.
Published: (2026)

Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
by: Bonaldi, Helena, et al.
Published: (2024)

Survey of Query-based Text Summarization
by: Yu, Hang, et al.
Published: (2022)

A-MEM: Agentic Memory for LLM Agents
by: Xu, Wujiang, et al.
Published: (2025)

SAMoRA: Semantic-Aware Mixture of LoRA Experts for Task-Adaptive Learning
by: Shi, Boyan, et al.
Published: (2026)

Lightweight Multimodal LLM-Enabled Cost-Effective Defect Grading of Power Transmission Equipment
by: Wang, Tao, et al.
Published: (2026)

Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals
by: Chen, Cheng, et al.
Published: (2025)

Agentic Society: Merging skeleton from real world and texture from Large Language Model
by: Bai, Yuqi, et al.
Published: (2024)

Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks
by: Lee, Yoonsang, et al.
Published: (2026)

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
by: Zheng, Tong, et al.
Published: (2026)

Evaluating the Challenges of LLMs in Real-world Medical Follow-up: A Comparative Study and An Optimized Framework
by: Liu, Jinyan, et al.
Published: (2025)

Evil Geniuses: Delving into the Safety of LLM-based Agents
by: Tian, Yu, et al.
Published: (2023)

AgenticMath: Enhancing LLM Reasoning via Agentic-based Math Data Generation
by: Liu, Xianyang, et al.
Published: (2025)

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
by: Li, Haitao, et al.
Published: (2024)

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
by: Chen, Yanxu, et al.
Published: (2025)

PurpCode: Reasoning for Safer Code Generation
by: Liu, Jiawei, et al.
Published: (2025)

Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs
by: Lin, Shuyuan, et al.
Published: (2025)

LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?
by: Sun, Lu, et al.
Published: (2025)

Pruning Unsafe Tickets: A Resource-Efficient Framework for Safer and More Robust LLMs
by: Si, Wai Man, et al.
Published: (2026)

Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs
by: Mendu, Sai Krishna, et al.
Published: (2025)

Are Smarter LLMs Safer? Exploring Safety-Reasoning Trade-offs in Prompting and Fine-Tuning
by: Li, Ang, et al.
Published: (2025)

CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing
by: Xu, Zhipeng, et al.
Published: (2026)

CiteLLM: An Agentic Platform for Trustworthy Scientific Reference Discovery
by: Hong, Mengze, et al.
Published: (2026)

Towards Safer Large Language Models through Machine Unlearning
by: Liu, Zheyuan, et al.
Published: (2024)

LLM Optimization Unlocks Real-Time Pairwise Reranking
by: Wu, Jingyu, et al.
Published: (2025)