| Main Authors: | Liu, Weitang, Li, Ying Wai, Li, Yuelei, Wang, Zihan, You, Yi-Zhuang, Shang, Jingbo |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Online Access: | https://arxiv.org/abs/2312.03291 |
Similar Items
Model-diff: A Tool for Comparative Study of Language Models in the Input Space
by: Liu, Weitang, et al.
Published: (2024)
Learning a Decision Tree Algorithm with Transformers
by: Zhuang, Yufan, et al.
Published: (2024)
Toward Student-Oriented Teacher Network Training For Knowledge Distillation
by: Dong, Chengyu, et al.
Published: (2022)
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision
by: Wang, Zihan, et al.
Published: (2024)
Skill-R1: Agent Skill Evolution via Reinforcement Learning
by: Vishe, Yash, et al.
Published: (2026)
Language models are better than humans at next-token prediction
by: Shlegeris, Buck, et al.
Published: (2022)
Pretrained battery transformer (PBT): A foundation model for universal battery life prediction
by: Tan, Ruifeng, et al.
Published: (2025)
InSTA: Towards Internet-Scale Training For Agents
by: Trabucco, Brandon, et al.
Published: (2025)
The Price of Format: Diversity Collapse in LLMs
by: Yun, Longfei, et al.
Published: (2025)
SL-BiLEM: Structured Learnable Behavior-in-the-Loop Epidemic Modeling for Forecasting and Policy Evaluation
by: Wang, Haochun, et al.
Published: (2026)
Scale-Adaptive Power Flow Analysis with Local Topology Slicing and Multi-Task Graph Learning
by: Li, Yongzhe, et al.
Published: (2026)
GRC-Net: Gram Residual Co-attention Net for epilepsy prediction
by: You, Bihao, et al.
Published: (2025)
Parallel Test-Time Scaling for Latent Reasoning Models
by: You, Runyang, et al.
Published: (2025)
Personalized Treatment Outcome Prediction from Scarce Data via Dual-Channel Knowledge Distillation and Adaptive Fusion
by: Chen, Wenjie, et al.
Published: (2025)
Evaluating machine learning models for predicting pesticide toxicity to honey bees
by: Adamczyk, Jakub, et al.
Published: (2025)
Is One Score Enough? Rethinking the Evaluation of Sequentially Evolving LLM Memory
by: Dong, Songwei, et al.
Published: (2026)
Geometric and Dynamic Scaling in Deep Transformers
by: Su, Haoran, et al.
Published: (2026)
Large Language Models for Time Series: A Survey
by: Zhang, Xiyuan, et al.
Published: (2024)
am-ELO: A Stable Framework for Arena-based LLM Evaluation
by: Liu, Zirui, et al.
Published: (2025)
On the Effects of Data Scale on UI Control Agents
by: Li, Wei, et al.
Published: (2024)
Visual-information-driven model for crowd simulation using temporal convolutional network
by: Liang, Xuanwen, et al.
Published: (2023)
Evaluation-driven Scaling for Scientific Discovery
by: Ye, Haotian, et al.
Published: (2026)
From Static Analysis to Audience Dissemination: A Training-Free Multimodal Controversy Detection Multi-Agent Framework
by: Ding, Zihan, et al.
Published: (2026)
PrismAgent: Illuminating Harm in Memes via a Zero-Shot Interpretable Multi-Agent Framework
by: Ding, Zihan, et al.
Published: (2026)
TIER: Trajectory-Invariant Execution Rewards for Multi-Step Tool Composition
by: Kulkarni, Anay, et al.
Published: (2026)
IFG: Internet-Scale Guidance for Functional Grasping Generation
by: Liu, Ray Muxin, et al.
Published: (2025)
Collaborative Unlabeled Data Optimization
by: Shang, Xinyi, et al.
Published: (2025)
LoReC: Rethinking Large Language Models for Graph Data Analysis
by: Zhan, Hongyu, et al.
Published: (2026)
Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare
by: Shang, Tianqi, et al.
Published: (2024)
Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos
by: Ye, Weirui, et al.
Published: (2025)
UniMTS: Unified Pre-training for Motion Time Series
by: Zhang, Xiyuan, et al.
Published: (2024)
MSCMHMST: A traffic flow prediction model based on Transformer
by: Geng, Weiyang, et al.
Published: (2025)
RDPI: A Refine Diffusion Probability Generation Method for Spatiotemporal Data Imputation
by: Liu, Zijin, et al.
Published: (2024)
OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems
by: Li, Xiaozhe, et al.
Published: (2025)
Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators
by: Bai, Sikai, et al.
Published: (2023)
Addressing Long-Tail Noisy Label Learning Problems: a Two-Stage Solution with Label Refurbishment Considering Label Rarity
by: Wu, Ying-Hsuan, et al.
Published: (2024)
Learning Semantic Association Rules from Internet of Things Data
by: Karabulut, Erkan, et al.
Published: (2024)
MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation
by: Wang, Yutong, et al.
Published: (2025)
RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts
by: Wijk, Hjalmar, et al.
Published: (2024)
Class Unbiasing for Generalization in Medical Diagnosis
by: Zuo, Lishi, et al.
Published: (2025)