Saved in:
| Main Authors: | Fu, Jiayi, Zhao, Xuandong, Yang, Ruihan, Zhang, Yuansen, Chen, Jiangjie, Xiao, Yanghua |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.12948 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Gumbel Machine: Counterfactual Student Writing Generation via Gumbel Noise Steering
by: McNichols, Hunter, et al.
Published: (2026)
by: McNichols, Hunter, et al.
Published: (2026)
Gumbel Counterfactual Generation From Language Models
by: Ravfogel, Shauli, et al.
Published: (2024)
by: Ravfogel, Shauli, et al.
Published: (2024)
Gumbel Distillation for Parallel Text Generation
by: Zhang, Chi, et al.
Published: (2026)
by: Zhang, Chi, et al.
Published: (2026)
ARIA: Training Language Agents with Intention-Driven Reward Aggregation
by: Yang, Ruihan, et al.
Published: (2025)
by: Yang, Ruihan, et al.
Published: (2025)
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
by: Yang, Ruihan, et al.
Published: (2024)
by: Yang, Ruihan, et al.
Published: (2024)
Gumbel Reranking: Differentiable End-to-End Reranker Optimization
by: Huang, Siyuan, et al.
Published: (2025)
by: Huang, Siyuan, et al.
Published: (2025)
Waste Not, Want Not; Recycled Gumbel Noise Improves Consistency in Natural Language Generation
by: de Mijolla, Damien, et al.
Published: (2025)
by: de Mijolla, Damien, et al.
Published: (2025)
Universal Adversarial Suffixes Using Calibrated Gumbel-Softmax Relaxation
by: Soor, Sampriti, et al.
Published: (2025)
by: Soor, Sampriti, et al.
Published: (2025)
Reward Shaping to Mitigate Reward Hacking in RLHF
by: Fu, Jiayi, et al.
Published: (2025)
by: Fu, Jiayi, et al.
Published: (2025)
Refined Detection for Gumbel Watermarking
by: Lattimore, Tor
Published: (2026)
by: Lattimore, Tor
Published: (2026)
Can LLM Infer Risk Information From MCP Server System Logs?
by: Fu, Jiayi, et al.
Published: (2025)
by: Fu, Jiayi, et al.
Published: (2025)
How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?
by: Wu, Siye, et al.
Published: (2024)
by: Wu, Siye, et al.
Published: (2024)
In-Context Watermarks for Large Language Models
by: Liu, Yepeng, et al.
Published: (2025)
by: Liu, Yepeng, et al.
Published: (2025)
TravelAgent: An AI Assistant for Personalized Travel Planning
by: Chen, Aili, et al.
Published: (2024)
by: Chen, Aili, et al.
Published: (2024)
ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base
by: Yuan, Siyu, et al.
Published: (2023)
by: Yuan, Siyu, et al.
Published: (2023)
GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling
by: Dadgarnia, Alireza, et al.
Published: (2026)
by: Dadgarnia, Alireza, et al.
Published: (2026)
Enhancing Language Agent Strategic Reasoning through Self-Play in Adversarial Games
by: Zhang, Yikai, et al.
Published: (2025)
by: Zhang, Yikai, et al.
Published: (2025)
TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation
by: Zhang, Yikai, et al.
Published: (2024)
by: Zhang, Yikai, et al.
Published: (2024)
Gumbel-MPNN: Graph Rewiring with Gumbel-Softmax
by: Hoffmann, Marcel, et al.
Published: (2025)
by: Hoffmann, Marcel, et al.
Published: (2025)
Past Meets Present: Creating Historical Analogy with Large Language Models
by: Li, Nianqi, et al.
Published: (2024)
by: Li, Nianqi, et al.
Published: (2024)
Dataset Protection via Watermarked Canaries in Retrieval-Augmented LLMs
by: Liu, Yepeng, et al.
Published: (2025)
by: Liu, Yepeng, et al.
Published: (2025)
Revealing the Barriers of Language Agents in Planning
by: Xie, Jian, et al.
Published: (2024)
by: Xie, Jian, et al.
Published: (2024)
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
by: Xie, Jian, et al.
Published: (2024)
by: Xie, Jian, et al.
Published: (2024)
Can LLMs Learn to Map the World from Local Descriptions?
by: Xia, Sirui, et al.
Published: (2025)
by: Xia, Sirui, et al.
Published: (2025)
Efficiently Identifying Watermarked Segments in Mixed-Source Texts
by: Zhao, Xuandong, et al.
Published: (2024)
by: Zhao, Xuandong, et al.
Published: (2024)
Recent Advancement of Emotion Cognition in Large Language Models
by: Chen, Yuyan, et al.
Published: (2024)
by: Chen, Yuyan, et al.
Published: (2024)
DEEPER Insight into Your User: Directed Persona Refinement for Dynamic Persona Modeling
by: Chen, Aili, et al.
Published: (2025)
by: Chen, Aili, et al.
Published: (2025)
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
by: Gu, Zhouhong, et al.
Published: (2024)
by: Gu, Zhouhong, et al.
Published: (2024)
Piecing Together Clues: A Benchmark for Evaluating the Detective Skills of Large Language Models
by: Gu, Zhouhong, et al.
Published: (2023)
by: Gu, Zhouhong, et al.
Published: (2023)
Curse of Knowledge: When Complex Evaluation Context Benefits yet Biases LLM Judges
by: Li, Weiyuan, et al.
Published: (2025)
by: Li, Weiyuan, et al.
Published: (2025)
From Persona to Personalization: A Survey on Role-Playing Language Agents
by: Chen, Jiangjie, et al.
Published: (2024)
by: Chen, Jiangjie, et al.
Published: (2024)
SEGMENT+: Long Text Processing with Short-Context Language Models
by: Shi, Wei, et al.
Published: (2024)
by: Shi, Wei, et al.
Published: (2024)
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
by: Kang, Zhewei, et al.
Published: (2025)
by: Kang, Zhewei, et al.
Published: (2025)
Position: LLM Watermarking Should Align Stakeholders' Incentives for Practical Adoption
by: Liu, Yepeng, et al.
Published: (2025)
by: Liu, Yepeng, et al.
Published: (2025)
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
by: Zhang, Yuansen, et al.
Published: (2024)
by: Zhang, Yuansen, et al.
Published: (2024)
A Practical Examination of AI-Generated Text Detectors for Large Language Models
by: Tufts, Brian, et al.
Published: (2024)
by: Tufts, Brian, et al.
Published: (2024)
SurveyAgent: A Conversational System for Personalized and Efficient Research Survey
by: Wang, Xintao, et al.
Published: (2024)
by: Wang, Xintao, et al.
Published: (2024)
The Exponentiated Generalized Gumbel Distribution
by: Thiago Andrade
Published: (2015)
by: Thiago Andrade
Published: (2015)
Questions of life / Nicky Gumbel
by: Gumbel, Nicky
by: Gumbel, Nicky
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model
by: Zhang, Yizhe, et al.
Published: (2023)
by: Zhang, Yizhe, et al.
Published: (2023)
Similar Items
-
Gumbel Machine: Counterfactual Student Writing Generation via Gumbel Noise Steering
by: McNichols, Hunter, et al.
Published: (2026) -
Gumbel Counterfactual Generation From Language Models
by: Ravfogel, Shauli, et al.
Published: (2024) -
Gumbel Distillation for Parallel Text Generation
by: Zhang, Chi, et al.
Published: (2026) -
ARIA: Training Language Agents with Intention-Driven Reward Aggregation
by: Yang, Ruihan, et al.
Published: (2025) -
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
by: Yang, Ruihan, et al.
Published: (2024)