| Main Authors: | Wang, Dingzirui, Zhang, Xuanliang, Cao, Rongyu, Dou, Longxu, Luo, Xianzhen, Ma, Yingwei, Zhu, Qingfu, Che, Wanxiang, Li, Binhua, Huang, Fei, Li, Yongbin |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2506.23133 |
Similar Items
Enhancing Numerical Reasoning with the Guidance of Reliable Reasoning Processes
by: Wang, Dingzirui, et al.
Published: (2024)
In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks
by: Wang, Dingzirui, et al.
Published: (2024)
A Survey of Table Reasoning with Large Language Models
by: Zhang, Xuanliang, et al.
Published: (2024)
FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats
by: Zhang, Xuanliang, et al.
Published: (2024)
Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL
by: Wang, Dingzirui, et al.
Published: (2024)
DAC: Decomposed Automation Correction for Text-to-SQL
by: Wang, Dingzirui, et al.
Published: (2024)
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL
by: Zhang, Xuanliang, et al.
Published: (2024)
RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals
by: Zhang, Xuanliang, et al.
Published: (2025)
SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
by: Zhang, Xuanliang, et al.
Published: (2024)
LLMs as Continuous Learners: Improving the Reproduction of Defective Code in Software Issues
by: Lin, Yalan, et al.
Published: (2024)
Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration
by: Ma, Yingwei, et al.
Published: (2024)
Scaling Laws for Agent Harnesses via Effective Feedback Compute
by: Zhang, Xuanliang, et al.
Published: (2026)
How Do Language Models Understand Tables? A Mechanistic Analysis of Cell Location
by: Zhang, Xuanliang, et al.
Published: (2026)
Abacus-SQL: A Text-to-SQL System Empowering Cross-Domain and Open-Domain Database Retrieval
by: Xu, Keyan, et al.
Published: (2025)
MULTITAT: Benchmarking Multilingual Table-and-Text Question Answering
by: Zhang, Xuanliang, et al.
Published: (2025)
Bounds of Chain-of-Thought Robustness: Reasoning Steps, Embed Norms, and Beyond
by: Wang, Dingzirui, et al.
Published: (2025)
AlignEvoSkill: Towards Knowledge-Aware and Task-Aligned Agent Skill Evolution
by: Wang, Dingzirui, et al.
Published: (2025)
When Does Context Help? Error Dynamics of Contextual Information in Large Language Models
by: Wang, Dingzirui, et al.
Published: (2026)
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
by: Pan, Zhenyu, et al.
Published: (2024)
Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute
by: Ma, Yingwei, et al.
Published: (2025)
RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization
by: Dong, Yihong, et al.
Published: (2025)
Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning
by: Ma, Zhiyuan, et al.
Published: (2025)
Automated Snippet-Alignment Data Augmentation for Code Translation
by: Zhang, Zhiming, et al.
Published: (2025)
Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement
by: Ma, Yingwei, et al.
Published: (2024)
Do Code LLMs Understand Design Patterns?
by: Pan, Zhenyu, et al.
Published: (2025)
Multi-Layer Attention is the Amplifier of Demonstration Effectiveness
by: Wang, Dingzirui, et al.
Published: (2025)
Learning-to-Context Slope: Evaluating In-Context Learning Effectiveness Beyond Performance Illusions
by: Wang, Dingzirui, et al.
Published: (2025)
Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training
by: Wang, Yixuan, et al.
Published: (2024)
Semi-Instruct: Bridging Natural-Instruct and Self-Instruct for Code Large Language Models
by: Luo, Xianzhen, et al.
Published: (2024)
Scaling Laws for Code: A More Data-Hungry Regime
by: Luo, Xianzhen, et al.
Published: (2025)
CRVQ: Channel-Relaxed Vector Quantization for Extreme Compression of LLMs
by: Xu, Yuzhuang, et al.
Published: (2024)
To Diff or Not to Diff? Structure-Aware and Adaptive Output Formats for Efficient LLM-based Code Editing
by: Cheng, Wei, et al.
Published: (2026)
Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits
by: Li, Bohan, et al.
Published: (2024)
How Many Code and Test Cases Are Enough? Evaluating Test Cases Generation from a Binary-Matrix Perspective
by: Luo, Xianzhen, et al.
Published: (2025)
Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts
by: Luo, Xianzhen, et al.
Published: (2024)
Large Language Model Unlearning for Source Code
by: Jiang, Xue, et al.
Published: (2025)
Reasoning Does Not Necessarily Improve Role-Playing Ability
by: Feng, Xiachong, et al.
Published: (2025)
Is Compression Really Linear with Code Intelligence?
by: Xuyang, Shijie, et al.
Published: (2025)
Success is in the Details: Evaluate and Enhance Details Sensitivity of Code LLMs through Counterfactuals
by: Luo, Xianzhen, et al.
Published: (2025)
ChartREG++: Towards Benchmarking and Improving Chart Referring Expression Grounding under Diverse referring clues and Multi-Target Referring
by: Niu, Tianhao, et al.
Published: (2026)