| Main Authors: | Ni, Shiwen, Chen, Guhong, Li, Shuaimin, Chen, Xuanang, Li, Siyi, Wang, Bingli, Wang, Qiyao, Wang, Xingjian, Zhang, Yifan, Fan, Liyang, Li, Chengming, Xu, Ruifeng, Sun, Le, Yang, Min |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.15361 |
Similar Items
Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models
by: Ni, Shiwen, et al.
Published: (2023)
IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property
by: Wang, Qiyao, et al.
Published: (2025)
Layer-wise Regularized Dropout for Neural Language Models
by: Ni, Shiwen, et al.
Published: (2024)
Prompt4Vis: Prompting Large Language Models with Example Mining and Schema Filtering for Tabular Data Visualization
by: Li, Shuaimin, et al.
Published: (2024)
AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents
by: Chen, Guhong, et al.
Published: (2024)
Training on the Benchmark Is Not All You Need
by: Ni, Shiwen, et al.
Published: (2024)
xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
by: Lee, Sunbowen, et al.
Published: (2025)
MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property
by: Ni, Shiwen, et al.
Published: (2024)
Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates
by: Li, Shuaimin, et al.
Published: (2025)
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models
by: Hou, Jinchang, et al.
Published: (2024)
VisPoison: An Effective Backdoor Attack Framework for Tabular Data Visualization Models
by: Li, Shuaimin, et al.
Published: (2024)
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
by: Liu, Ziqiang, et al.
Published: (2024)
AutoPatent: A Multi-Agent Framework for Automatic Patent Generation
by: Wang, Qiyao, et al.
Published: (2024)
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration
by: Wang, Qiyao, et al.
Published: (2026)
CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs
by: Li, Siyi, et al.
Published: (2026)
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
by: Fang, Feiteng, et al.
Published: (2024)
History, Development, and Principles of Large Language Models-An Introductory Survey
by: Wang, Zichong, et al.
Published: (2024)
Expanding before Inferring: Enhancing Factuality in Large Language Models through Premature Layers Interpolation
by: Chen, Dingwei, et al.
Published: (2025)
Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching
by: Wang, Tianshu, et al.
Published: (2024)
Beyond Quantity: Trajectory Diversity Scaling for Code Agents
by: Chen, Guhong, et al.
Published: (2026)
MultiTEND: A Multilingual Benchmark for Natural Language to NoSQL Query Translation
by: Qin, Zhiqian, et al.
Published: (2025)
Spatial Reasoning in Multimodal Large Language Models: A Survey of Tasks, Benchmarks and Methods
by: Liu, Weichen, et al.
Published: (2025)
MGCR-Net: Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection
by: Wang, Chengming, et al.
Published: (2025)
Zero-Shot Neural Architecture Search with Weighted Response Correlation
by: Jing, Kun, et al.
Published: (2025)
An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection
by: Tang, Qiyao, et al.
Published: (2025)
Leveraging Large Vision Language Model For Better Automatic Web GUI Testing
by: Wang, Siyi, et al.
Published: (2024)
EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
by: Chen, Yi, et al.
Published: (2023)
READoc: A Unified Benchmark for Realistic Document Structured Extraction
by: Li, Zichao, et al.
Published: (2024)
Personalized Large Language Model Assistant with Evolving Conditional Memory
by: Yuan, Ruifeng, et al.
Published: (2023)
Large Language Model Agent for Hyper-Parameter Optimization
by: Liu, Siyi, et al.
Published: (2024)
CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity
by: Yu, Zhengmin, et al.
Published: (2024)
On Concentration Inequality of the Laplacian Matrix of Erdős-Rényi Graphs
by: Chen, Yiming, et al.
Published: (2025)
Beyond Local Edits: Embedding-Virtualized Knowledge for Broader Evaluation and Preservation of Model Editing
by: Liu, Shuainan, et al.
Published: (2026)
TABLE 2 in Female association of seven species of the genus Amphinemura Ris, 1902 (Nemouridae: Amphinemurinae) in China based on morphological and molecular data
by: Wang, Bingli, et al.
Published: (2025)
Lower Layers Matter: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
by: Chen, Dingwei, et al.
Published: (2024)
Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration
by: Ni, Shiwen, et al.
Published: (2024)
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
by: Xu, Ancheng, et al.
Published: (2024)
Hybrid Offline-online Scheduling Method for Large Language Model Inference Optimization
by: Pang, Bowen, et al.
Published: (2025)
AI-Salesman: Towards Reliable Large Language Model Driven Telemarketing
by: Zhang, Qingyu, et al.
Published: (2025)
AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field
by: Liang, Chen, et al.
Published: (2025)