:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Zhou, Jinfeng, Chen, Yuxuan, Shi, Yihan, Zhang, Xuanming, Lei, Leqi, Feng, Yi, Xiong, Zexuan, Yan, Miao, Wang, Xunzhi, Cao, Yaru, Yin, Jianing, Wang, Shuai, Dai, Quanyu, Dong, Zhenhua, Wang, Hongning, Huang, Minlie
Formato:	Preprint
Publicado:	2025
Materias:	Computation and Language
Acceso en línea:	https://arxiv.org/abs/2506.00900
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Think Socially via Cognitive Reasoning
por: Zhou, Jinfeng, et al.
Publicado: (2025)

Grounding LLMs in Scientific Discovery via Embodied Actions
por: Zhang, Bo, et al.
Publicado: (2026)

SocialSim: Towards Socialized Simulation of Emotional Support Conversation
por: Chen, Zhuang, et al.
Publicado: (2025)

Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues
por: Zhou, Jinfeng, et al.
Publicado: (2025)

Towards Exception Safety Code Generation with Intermediate Representation Agents Framework
por: Zhang, Xuanming, et al.
Publicado: (2024)

ToMBench: Benchmarking Theory of Mind in Large Language Models
por: Chen, Zhuang, et al.
Publicado: (2024)

Data Selection via Optimal Control for Language Models
por: Gu, Yuxian, et al.
Publicado: (2024)

Language Model Decoding as Direct Metrics Optimization
por: Ji, Haozhe, et al.
Publicado: (2023)

Beyond Nash Equilibrium: Bounded Rationality of LLMs and humans in Strategic Decision-making
por: Zheng, Kehan, et al.
Publicado: (2025)

Cognition-of-Thought Elicits Social-Aligned Reasoning in Large Language Models
por: Zhang, Xuanming, et al.
Publicado: (2025)

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
por: Zhang, Xuanming, et al.
Publicado: (2025)

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
por: Wen, Bosi, et al.
Publicado: (2025)

Trust-Region Adaptive Policy Optimization
por: Su, Mingyu, et al.
Publicado: (2025)

Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
por: Yang, Junxiao, et al.
Publicado: (2025)

Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
por: Wen, Jiaxin, et al.
Publicado: (2024)

Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework
por: Zhang, Xuanming, et al.
Publicado: (2024)

Research on the application of graph data structure and graph neural network in node classification/clustering tasks
por: Wang, Yihan, et al.
Publicado: (2025)

Social Trust and the Disclosure of Letters to Shareholders
por: Anting Li, et al.
Publicado: (2025)

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement
por: Zhang, Zhexin, et al.
Publicado: (2025)

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
por: Zhang, Zhexin, et al.
Publicado: (2023)

Learning Task Decomposition to Assist Humans in Competitive Programming
por: Wen, Jiaxin, et al.
Publicado: (2024)

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
por: Guan, Jian, et al.
Publicado: (2024)

RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models
por: Feng, Andrew Zhuoer, et al.
Publicado: (2026)

Human Decision-making is Susceptible to AI-driven Manipulation
por: Sabour, Sahand, et al.
Publicado: (2025)

SkillEvolver: Skill Learning as a Meta-Skill
por: Zhang, Genrui, et al.
Publicado: (2026)

DoubleAgents: Human-Agent Alignment in a Socially Embedded Workflow
por: Long, Tao, et al.
Publicado: (2025)

Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!
por: Zhang, Zhexin, et al.
Publicado: (2025)

Agent-SafetyBench: Evaluating the Safety of LLM Agents
por: Zhang, Zhexin, et al.
Publicado: (2024)

Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level
por: Wang, Chenxu, et al.
Publicado: (2024)

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
por: Wen, Bosi, et al.
Publicado: (2026)

Improving Retrospective Language Agents via Joint Policy Gradient Optimization
por: Feng, Xueyang, et al.
Publicado: (2025)

The Influence of Rural Environment on Social Adaptation and the Health Status of Rural Older Adults in China: Evidence From the China Longitudinal Aging Social Survey
por: Zhenhua Zheng, et al.
Publicado: (2025)

Model of predicting fear of cancer recurrence in patients with digestive tract cancer: A cross‐sectional study
por: Yaru Li, et al.
Publicado: (2024)

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
por: Cheng, Jiale, et al.
Publicado: (2023)

Towards Efficient Exact Optimization of Language Model Alignment
por: Ji, Haozhe, et al.
Publicado: (2024)

From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks
por: Zhang, Zhexin, et al.
Publicado: (2024)

MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science
por: Zhu, Erle, et al.
Publicado: (2025)

IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation
por: Wen, Bosi, et al.
Publicado: (2025)

When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs' Toxicity
por: Cui, Shiyao, et al.
Publicado: (2025)

Make It Efficient: Dynamic Sparse Attention for Autoregressive Image Generation
por: Xiang, Xunzhi, et al.
Publicado: (2025)