| Main authors: | Zhou, Jinfeng; Chen, Yuxuan; Shi, Yihan; Zhang, Xuanming; Lei, Leqi; Feng, Yi; Xiong, Zexuan; Yan, Miao; Wang, Xunzhi; Cao, Yaru; Yin, Jianing; Wang, Shuai; Dai, Quanyu; Dong, Zhenhua; Wang, Hongning; Huang, Minlie |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online access: | https://arxiv.org/abs/2506.00900 |
Similar items
Think Socially via Cognitive Reasoning
by: Zhou, Jinfeng, et al.
Published: (2025)
Grounding LLMs in Scientific Discovery via Embodied Actions
by: Zhang, Bo, et al.
Published: (2026)
SocialSim: Towards Socialized Simulation of Emotional Support Conversation
by: Chen, Zhuang, et al.
Published: (2025)
Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues
by: Zhou, Jinfeng, et al.
Published: (2025)
Towards Exception Safety Code Generation with Intermediate Representation Agents Framework
by: Zhang, Xuanming, et al.
Published: (2024)
ToMBench: Benchmarking Theory of Mind in Large Language Models
by: Chen, Zhuang, et al.
Published: (2024)
Data Selection via Optimal Control for Language Models
by: Gu, Yuxian, et al.
Published: (2024)
Language Model Decoding as Direct Metrics Optimization
by: Ji, Haozhe, et al.
Published: (2023)
Beyond Nash Equilibrium: Bounded Rationality of LLMs and humans in Strategic Decision-making
by: Zheng, Kehan, et al.
Published: (2025)
Cognition-of-Thought Elicits Social-Aligned Reasoning in Large Language Models
by: Zhang, Xuanming, et al.
Published: (2025)
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
by: Zhang, Xuanming, et al.
Published: (2025)
HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
by: Wen, Bosi, et al.
Published: (2025)
Trust-Region Adaptive Policy Optimization
by: Su, Mingyu, et al.
Published: (2025)
Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
by: Yang, Junxiao, et al.
Published: (2025)
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
by: Wen, Jiaxin, et al.
Published: (2024)
Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework
by: Zhang, Xuanming, et al.
Published: (2024)
Research on the application of graph data structure and graph neural network in node classification/clustering tasks
by: Wang, Yihan, et al.
Published: (2025)
Social Trust and the Disclosure of Letters to Shareholders
by: Anting Li, et al.
Published: (2025)
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement
by: Zhang, Zhexin, et al.
Published: (2025)
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
by: Zhang, Zhexin, et al.
Published: (2023)
Learning Task Decomposition to Assist Humans in Competitive Programming
by: Wen, Jiaxin, et al.
Published: (2024)
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
by: Guan, Jian, et al.
Published: (2024)
RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models
by: Feng, Andrew Zhuoer, et al.
Published: (2026)
Human Decision-making is Susceptible to AI-driven Manipulation
by: Sabour, Sahand, et al.
Published: (2025)
SkillEvolver: Skill Learning as a Meta-Skill
by: Zhang, Genrui, et al.
Published: (2026)
DoubleAgents: Human-Agent Alignment in a Socially Embedded Workflow
by: Long, Tao, et al.
Published: (2025)
Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!
by: Zhang, Zhexin, et al.
Published: (2025)
Agent-SafetyBench: Evaluating the Safety of LLM Agents
by: Zhang, Zhexin, et al.
Published: (2024)
Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level
by: Wang, Chenxu, et al.
Published: (2024)
IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
by: Wen, Bosi, et al.
Published: (2026)
Improving Retrospective Language Agents via Joint Policy Gradient Optimization
by: Feng, Xueyang, et al.
Published: (2025)
The Influence of Rural Environment on Social Adaptation and the Health Status of Rural Older Adults in China: Evidence From the China Longitudinal Aging Social Survey
by: Zhenhua Zheng, et al.
Published: (2025)
Model of predicting fear of cancer recurrence in patients with digestive tract cancer: A cross‐sectional study
by: Yaru Li, et al.
Published: (2024)
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
by: Cheng, Jiale, et al.
Published: (2023)
Towards Efficient Exact Optimization of Language Model Alignment
by: Ji, Haozhe, et al.
Published: (2024)
From Theft to Bomb-Making: The Ripple Effect of Unlearning in Defending Against Jailbreak Attacks
by: Zhang, Zhexin, et al.
Published: (2024)
MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science
by: Zhu, Erle, et al.
Published: (2025)
IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation
by: Wen, Bosi, et al.
Published: (2025)
When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs' Toxicity
by: Cui, Shiyao, et al.
Published: (2025)
Make It Efficient: Dynamic Sparse Attention for Autoregressive Image Generation
by: Xiang, Xunzhi, et al.
Published: (2025)