| Main Authors: | Hsueh, Wei Lin., Hsu, Tsan Sheng. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.15547 |
Similar Items
Conditional Semi-Supervised Data Augmentation for Spam Message Detection with Low Resource Data
by: Nuha, Ulin, et al.
Published: (2024)
Tree-of-Text: A Tree-based Prompting Framework for Table-to-Text Generation in the Sports Domain
by: Chiang, Shang-Hsuan, et al.
Published: (2026)
Do Before You Judge: Self-Reference as a Pathway to Better LLM Evaluation
by: Lin, Wei-Hsiang, et al.
Published: (2025)
Team NYCU at Defactify4: Robust Detection and Source Identification of AI-Generated Images Using CNN and CLIP-Based Models
by: Yang, Tsan-Tsung, et al.
Published: (2025)
JADE: Expert-Grounded Dynamic Evaluation for Open-Ended Professional Tasks
by: Lin, Lanbo, et al.
Published: (2026)
Behavioral Embeddings of Programs: A Quasi-Dynamic Approach for Optimization Prediction
by: Pan, Haolin, et al.
Published: (2025)
Evaluating Agent-based Program Repair at Google
by: Rondon, Pat, et al.
Published: (2025)
From Programs to Poses: Factored Real-World Scene Generation via Learned Program Libraries
by: Hsu, Joy, et al.
Published: (2025)
Multi-Task Program Error Repair and Explanatory Diagnosis
by: Xu, Zhenyu, et al.
Published: (2024)
Understanding eGFR Trajectories and Kidney Function Decline via Large Multimodal Models
by: Li, Chih-Yuan, et al.
Published: (2024)
Emergence Transformer: Dynamical Temporal Attention Matters
by: Zhou, Zihan, et al.
Published: (2026)
On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes
by: Hau, Jia Lin, et al.
Published: (2023)
Online Reinforcement Learning-Based Dynamic Adaptive Evaluation Function for Real-Time Strategy Tasks
by: Yang, Weilong, et al.
Published: (2025)
Perish or Flourish? A Holistic Evaluation of Large Language Models for Code Generation in Functional Programming
by: Lang, Nguyet-Anh H., et al.
Published: (2026)
UINav: A Practical Approach to Train On-Device Automation Agents
by: Li, Wei, et al.
Published: (2023)
Unrolling Dynamic Programming via Graph Filters
by: Rozada, Sergio, et al.
Published: (2025)
Domain-Independent Dynamic Programming with Constraint Propagation
by: Marijnissen, Imko, et al.
Published: (2026)
TelTrans: Applying Multi-Type Telecom Data to Transportation Evaluation and Prediction via Multifaceted Graph Modeling
by: Lin, ChungYi, et al.
Published: (2024)
Integrating Various Software Artifacts for Better LLM-based Bug Localization and Program Repair
by: Feng, Qiong, et al.
Published: (2024)
Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair
by: Cheng, Runxiang, et al.
Published: (2026)
Program of Equations Thoughts to Solve Algebra Word Problems
by: Lin, Yunze
Published: (2025)
On the Fallacy of Global Token Perplexity in Spoken Language Model Evaluation
by: Hsu, Chan-Jan, et al.
Published: (2026)
CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models
by: Fu, Lingyue, et al.
Published: (2023)
Beyond Partner Diversity: An Influence-Based Team Steering Framework for Zero-Shot Human-Machine Teaming
by: Sheng, Wei, et al.
Published: (2026)
Causal Discovery as Dialectical Aggregation: A Quantitative Argumentation Framework
by: Wei, Sheng, et al.
Published: (2026)
Local Search for Integer Quadratic Programming
by: He, Xiang, et al.
Published: (2024)
Functional Stable Model Semantics and Answer Set Programming Modulo Theories
by: Bartholomew, Michael, et al.
Published: (2026)
It's About Time: The Temporal and Modal Dynamics of Copilot Usage
by: Costa-Gomes, Beatriz, et al.
Published: (2025)
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
by: Agarwal, Anisha, et al.
Published: (2024)
Evaluating LLMs for Answering Student Questions in Introductory Programming Courses
by: Van Mullem, Thomas, et al.
Published: (2026)
CODE-DITING: A Reasoning-Based Metric for Functional Alignment in Code Evaluation
by: Yang, Guang, et al.
Published: (2025)
Dynamic Programming Techniques for Enhancing Cognitive Representation in Knowledge Tracing
by: Xu, Lixiang, et al.
Published: (2025)
Auto-Formulating Dynamic Programming Problems with Large Language Models
by: Zhou, Chenyu, et al.
Published: (2025)
AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios
by: Shi, Kou, et al.
Published: (2026)
Performance Evaluation of Large Language Models in Statistical Programming
by: Song, Xinyi, et al.
Published: (2025)
"Set It Up!": Functional Object Arrangement with Compositional Generative Models
by: Xu, Yiqing, et al.
Published: (2024)
BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments
by: Li, Yuxuan, et al.
Published: (2026)
LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs
by: Guo, Pei-Fu, et al.
Published: (2025)
ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning
by: Yu, Xiaodong, et al.
Published: (2024)
Evaluating the Generalizability of LLMs in Automated Program Repair
by: Li, Fengjie, et al.
Published: (2025)