Saved in:
| Main Authors: | Yuan, Jiayi, Li, Hao, Ding, Xinheng, Xie, Wenya, Li, Yu-Jhe, Zhao, Wentian, Wan, Kun, Shi, Jing, Hu, Xia, Liu, Zirui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.09501 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
by: Wang, Zhenting, et al.
Published: (2025)
by: Wang, Zhenting, et al.
Published: (2025)
Deterministic Inference across Tensor Parallel Sizes That Eliminates Training-Inference Mismatch
by: Zhang, Ziyang, et al.
Published: (2025)
by: Zhang, Ziyang, et al.
Published: (2025)
Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach
by: Deng, Shijian, et al.
Published: (2024)
by: Deng, Shijian, et al.
Published: (2024)
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
by: Xu, Wujiang, et al.
Published: (2025)
by: Xu, Wujiang, et al.
Published: (2025)
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See
by: Zhang, Zeliang, et al.
Published: (2024)
by: Zhang, Zeliang, et al.
Published: (2024)
DiFR: Inference Verification Despite Nondeterminism
by: Karvonen, Adam, et al.
Published: (2025)
by: Karvonen, Adam, et al.
Published: (2025)
Living Arrangements and Women's Household Decision‐Making Power in China
by: Xinheng Li
Published: (2026)
by: Xinheng Li
Published: (2026)
noDice: Inference for Discrete Probabilistic Programs with Nondeterminism and Conditioning
by: Gürtler, Tobias, et al.
Published: (2026)
by: Gürtler, Tobias, et al.
Published: (2026)
Resolving Nondeterminism by Chance
by: Paul, Soumyajit, et al.
Published: (2025)
by: Paul, Soumyajit, et al.
Published: (2025)
A Hierarchy of Nondeterminism
by: Radi, Bader Abu, et al.
Published: (2022)
by: Radi, Bader Abu, et al.
Published: (2022)
Resolving Nondeterminism with Randomness
by: Henzinger, Thomas A., et al.
Published: (2025)
by: Henzinger, Thomas A., et al.
Published: (2025)
Link between cascade transitions and correlated Chern insulators in magic-angle twisted bilayer graphene
by: Hu, Qianying, et al.
Published: (2024)
by: Hu, Qianying, et al.
Published: (2024)
A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
by: Yin, Yuan-Jhe, et al.
Published: (2024)
by: Yin, Yuan-Jhe, et al.
Published: (2024)
Prompt-Guided Mask Proposal for Two-Stage Open-Vocabulary Segmentation
by: Li, Yu-Jhe, et al.
Published: (2024)
by: Li, Yu-Jhe, et al.
Published: (2024)
Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs
by: Zeng, Yifan, et al.
Published: (2026)
by: Zeng, Yifan, et al.
Published: (2026)
Probability and Angelic Nondeterminism with Multiset Semantics
by: Ong, Shawn, et al.
Published: (2024)
by: Ong, Shawn, et al.
Published: (2024)
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
by: Wang, Qinsi, et al.
Published: (2025)
by: Wang, Qinsi, et al.
Published: (2025)
“A Contribution Shared Is a Contribution Halved”: The Influence of Tie Strength‐Based Disclosure Strategy on Potential Donors' Donation Amounts
by: Jing Zhao, et al.
Published: (2025)
by: Jing Zhao, et al.
Published: (2025)
STARE at the Structure: Steering ICL Exemplar Selection with Structural Alignment
by: Li, Jiaqian, et al.
Published: (2025)
by: Li, Jiaqian, et al.
Published: (2025)
Characterization-Guided GPU Fault Resilience in NVIDIA MPS
by: Liu, Rixin, et al.
Published: (2026)
by: Liu, Rixin, et al.
Published: (2026)
ZID-Net: Zero-Inference Diffusion Prior Decoupling Network for Single Image Dehazing
by: Li, Xinheng, et al.
Published: (2026)
by: Li, Xinheng, et al.
Published: (2026)
Language Equivalence is Undecidable in VASS with Restricted Nondeterminism
by: Czerwiński, Wojciech, et al.
Published: (2025)
by: Czerwiński, Wojciech, et al.
Published: (2025)
A Demonic Outcome Logic for Randomized Nondeterminism
by: Zilberstein, Noam, et al.
Published: (2024)
by: Zilberstein, Noam, et al.
Published: (2024)
Optimistic Verifiable Training by Controlling Hardware Nondeterminism
by: Srivastava, Megha, et al.
Published: (2024)
by: Srivastava, Megha, et al.
Published: (2024)
How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior
by: Xiong, Zidi, et al.
Published: (2025)
by: Xiong, Zidi, et al.
Published: (2025)
AttnRegDeepLab: A Two-Stage Decoupled Framework for Interpretable Embryo Fragmentation Grading
by: Lee, Ming-Jhe
Published: (2025)
by: Lee, Ming-Jhe
Published: (2025)
Quantitative lower bound for solutions to the Boltzmann equation in non-convex domains
by: Su, Jhe-Kuan
Published: (2025)
by: Su, Jhe-Kuan
Published: (2025)
NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model
by: Xie, Xinheng, et al.
Published: (2024)
by: Xie, Xinheng, et al.
Published: (2024)
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
by: He, Shwai, et al.
Published: (2025)
by: He, Shwai, et al.
Published: (2025)
Beyond Numeric Rewards: In-Context Dueling Bandits with LLM Agents
by: Xia, Fanzeng, et al.
Published: (2024)
by: Xia, Fanzeng, et al.
Published: (2024)
Multi-Functional RIS-Enabled in SAGIN for IoT: A Hybrid Deep Reinforcement Learning Approach with Compressed Twin-Models
by: Shen, Li-Hsiang, et al.
Published: (2025)
by: Shen, Li-Hsiang, et al.
Published: (2025)
Generating Mitigations for Downstream Projects to Neutralize Upstream Library Vulnerability
by: Chen, Zirui, et al.
Published: (2025)
by: Chen, Zirui, et al.
Published: (2025)
From High to Low: Simulating Nondeterminism and State with State
by: Tang, Wenhao, et al.
Published: (2023)
by: Tang, Wenhao, et al.
Published: (2023)
Nondeterminism makes unary 1-limited automata concise
by: Guillon, Bruno, et al.
Published: (2025)
by: Guillon, Bruno, et al.
Published: (2025)
Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination
by: Chen, Yangneng, et al.
Published: (2026)
by: Chen, Yangneng, et al.
Published: (2026)
LPIPS-AttnWav2Lip: Generic Audio-Driven lip synchronization for Talking Head Generation in the Wild
by: Chen, Zhipeng, et al.
Published: (2026)
by: Chen, Zhipeng, et al.
Published: (2026)
When Continual Learning Moves to Memory: A Study of Experience Reuse in LLM Agents
by: Hu, Qisheng, et al.
Published: (2026)
by: Hu, Qisheng, et al.
Published: (2026)
Multidimensional‐Encrypted Meta‐Optics Storage Empowered by Diffraction‐Order Decoupling
by: Zirui Zhao, et al.
Published: (2025)
by: Zirui Zhao, et al.
Published: (2025)
Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly
by: Xie, Wenya, et al.
Published: (2025)
by: Xie, Wenya, et al.
Published: (2025)
Similar Items
-
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
by: Wang, Zhenting, et al.
Published: (2025) -
Deterministic Inference across Tensor Parallel Sizes That Eliminates Training-Inference Mismatch
by: Zhang, Ziyang, et al.
Published: (2025) -
Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach
by: Deng, Shijian, et al.
Published: (2024) -
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
by: Xu, Wujiang, et al.
Published: (2025) -
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See
by: Zhang, Zeliang, et al.
Published: (2024)