:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yuan, Jiayi, Li, Hao, Ding, Xinheng, Xie, Wenya, Li, Yu-Jhe, Zhao, Wentian, Wan, Kun, Shi, Jing, Hu, Xia, Liu, Zirui
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.09501
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
by: Wang, Zhenting, et al.
Published: (2025)

Deterministic Inference across Tensor Parallel Sizes That Eliminates Training-Inference Mismatch
by: Zhang, Ziyang, et al.
Published: (2025)

Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach
by: Deng, Shijian, et al.
Published: (2024)

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
by: Xu, Wujiang, et al.
Published: (2025)

Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See
by: Zhang, Zeliang, et al.
Published: (2024)

DiFR: Inference Verification Despite Nondeterminism
by: Karvonen, Adam, et al.
Published: (2025)

Living Arrangements and Women's Household Decision‐Making Power in China
by: Xinheng Li
Published: (2026)

noDice: Inference for Discrete Probabilistic Programs with Nondeterminism and Conditioning
by: Gürtler, Tobias, et al.
Published: (2026)

Resolving Nondeterminism by Chance
by: Paul, Soumyajit, et al.
Published: (2025)

A Hierarchy of Nondeterminism
by: Radi, Bader Abu, et al.
Published: (2022)

Resolving Nondeterminism with Randomness
by: Henzinger, Thomas A., et al.
Published: (2025)

Link between cascade transitions and correlated Chern insulators in magic-angle twisted bilayer graphene
by: Hu, Qianying, et al.
Published: (2024)

A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
by: Yin, Yuan-Jhe, et al.
Published: (2024)

Prompt-Guided Mask Proposal for Two-Stage Open-Vocabulary Segmentation
by: Li, Yu-Jhe, et al.
Published: (2024)

Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
by: Yuan, Peiwen, et al.
Published: (2025)

When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs
by: Zeng, Yifan, et al.
Published: (2026)

Probability and Angelic Nondeterminism with Multiset Semantics
by: Ong, Shawn, et al.
Published: (2024)

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
by: Wang, Qinsi, et al.
Published: (2025)

“A Contribution Shared Is a Contribution Halved”: The Influence of Tie Strength‐Based Disclosure Strategy on Potential Donors' Donation Amounts
by: Jing Zhao, et al.
Published: (2025)

STARE at the Structure: Steering ICL Exemplar Selection with Structural Alignment
by: Li, Jiaqian, et al.
Published: (2025)

Characterization-Guided GPU Fault Resilience in NVIDIA MPS
by: Liu, Rixin, et al.
Published: (2026)

ZID-Net: Zero-Inference Diffusion Prior Decoupling Network for Single Image Dehazing
by: Li, Xinheng, et al.
Published: (2026)

Language Equivalence is Undecidable in VASS with Restricted Nondeterminism
by: Czerwiński, Wojciech, et al.
Published: (2025)

A Demonic Outcome Logic for Randomized Nondeterminism
by: Zilberstein, Noam, et al.
Published: (2024)

Optimistic Verifiable Training by Controlling Hardware Nondeterminism
by: Srivastava, Megha, et al.
Published: (2024)

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior
by: Xiong, Zidi, et al.
Published: (2025)

AttnRegDeepLab: A Two-Stage Decoupled Framework for Interpretable Embryo Fragmentation Grading
by: Lee, Ming-Jhe
Published: (2025)

Quantitative lower bound for solutions to the Boltzmann equation in non-convex domains
by: Su, Jhe-Kuan
Published: (2025)

NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model
by: Xie, Xinheng, et al.
Published: (2024)

Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
by: He, Shwai, et al.
Published: (2025)

Beyond Numeric Rewards: In-Context Dueling Bandits with LLM Agents
by: Xia, Fanzeng, et al.
Published: (2024)

Multi-Functional RIS-Enabled in SAGIN for IoT: A Hybrid Deep Reinforcement Learning Approach with Compressed Twin-Models
by: Shen, Li-Hsiang, et al.
Published: (2025)

Generating Mitigations for Downstream Projects to Neutralize Upstream Library Vulnerability
by: Chen, Zirui, et al.
Published: (2025)

From High to Low: Simulating Nondeterminism and State with State
by: Tang, Wenhao, et al.
Published: (2023)

Nondeterminism makes unary 1-limited automata concise
by: Guillon, Bruno, et al.
Published: (2025)

Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination
by: Chen, Yangneng, et al.
Published: (2026)

LPIPS-AttnWav2Lip: Generic Audio-Driven lip synchronization for Talking Head Generation in the Wild
by: Chen, Zhipeng, et al.
Published: (2026)

When Continual Learning Moves to Memory: A Study of Experience Reuse in LLM Agents
by: Hu, Qisheng, et al.
Published: (2026)

Multidimensional‐Encrypted Meta‐Optics Storage Empowered by Diffraction‐Order Decoupling
by: Zirui Zhao, et al.
Published: (2025)

Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly
by: Xie, Wenya, et al.
Published: (2025)