Saved in:
| Main Authors: | Li, Yinsheng, Dong, Zhen, Shao, Yi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.11527 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
IdeaBench: Benchmarking Large Language Models for Research Idea Generation
by: Guo, Sikun, et al.
Published: (2024)
by: Guo, Sikun, et al.
Published: (2024)
QuantBench: Benchmarking AI Methods for Quantitative Investment
by: Wang, Saizhuo, et al.
Published: (2025)
by: Wang, Saizhuo, et al.
Published: (2025)
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition
by: Liu, Yujie, et al.
Published: (2025)
by: Liu, Yujie, et al.
Published: (2025)
Automating Structural Engineering Workflows with Large Language Model Agents
by: Liang, Haoran, et al.
Published: (2025)
by: Liang, Haoran, et al.
Published: (2025)
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent
by: Li, Haohang, et al.
Published: (2024)
by: Li, Haohang, et al.
Published: (2024)
Enterprise Benchmarks for Large Language Model Evaluation
by: Zhang, Bing, et al.
Published: (2024)
by: Zhang, Bing, et al.
Published: (2024)
FinBen: A Holistic Financial Benchmark for Large Language Models
by: Xie, Qianqian, et al.
Published: (2024)
by: Xie, Qianqian, et al.
Published: (2024)
Are LLMs Socially Adaptive? Contrasting Belief Evolution in Large Language Models and Humans
by: Lei, Yu, et al.
Published: (2024)
by: Lei, Yu, et al.
Published: (2024)
Opportunities for Large Language Models and Discourse in Engineering Design
by: Göpfert, Jan, et al.
Published: (2023)
by: Göpfert, Jan, et al.
Published: (2023)
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications
by: Cao, Yupeng, et al.
Published: (2025)
by: Cao, Yupeng, et al.
Published: (2025)
When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering
by: Choi, Stephen, et al.
Published: (2024)
by: Choi, Stephen, et al.
Published: (2024)
CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications
by: Cao, Yupeng, et al.
Published: (2024)
by: Cao, Yupeng, et al.
Published: (2024)
OceanGPT: A Large Language Model for Ocean Science Tasks
by: Bi, Zhen, et al.
Published: (2023)
by: Bi, Zhen, et al.
Published: (2023)
Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management
by: Wei, Lai, et al.
Published: (2024)
by: Wei, Lai, et al.
Published: (2024)
BikeBench: A Bicycle Design Benchmark for Generative Models with Objectives and Constraints
by: Regenwetter, Lyle, et al.
Published: (2025)
by: Regenwetter, Lyle, et al.
Published: (2025)
From Concept to Manufacturing: Evaluating Vision-Language Models for Engineering Design
by: Picard, Cyril, et al.
Published: (2023)
by: Picard, Cyril, et al.
Published: (2023)
MetaBench: A Multi-task Benchmark for Assessing LLMs in Metabolomics
by: Lu, Yuxing, et al.
Published: (2025)
by: Lu, Yuxing, et al.
Published: (2025)
ProtChatGPT: Towards Understanding Proteins with Large Language Models
by: Wang, Chao, et al.
Published: (2024)
by: Wang, Chao, et al.
Published: (2024)
LiveTradeBench: Seeking Real-World Alpha with Large Language Models
by: Yu, Haofei, et al.
Published: (2025)
by: Yu, Haofei, et al.
Published: (2025)
PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents
by: Yin, Xiangyu, et al.
Published: (2024)
by: Yin, Xiangyu, et al.
Published: (2024)
Diffusion Large Language Models for Black-Box Optimization
by: Yuan, Ye, et al.
Published: (2026)
by: Yuan, Ye, et al.
Published: (2026)
Empirical Asset Pricing with Large Language Model Agents
by: Cheng, Junyan, et al.
Published: (2024)
by: Cheng, Junyan, et al.
Published: (2024)
Assertion-Aware Test Code Summarization with Large Language Models
by: Mollah, Anamul Haque, et al.
Published: (2025)
by: Mollah, Anamul Haque, et al.
Published: (2025)
Compressing Large Language Models with PCA Without Performance Loss
by: Bengtsson, Magnus
Published: (2025)
by: Bengtsson, Magnus
Published: (2025)
Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks
by: Wang, Julian Junyan, et al.
Published: (2025)
by: Wang, Julian Junyan, et al.
Published: (2025)
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs
by: Lu, Guilong, et al.
Published: (2025)
by: Lu, Guilong, et al.
Published: (2025)
TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction
by: Zhou, Xingzhi, et al.
Published: (2024)
by: Zhou, Xingzhi, et al.
Published: (2024)
QuantMCP: Grounding Large Language Models in Verifiable Financial Reality
by: Zeng, Yifan
Published: (2025)
by: Zeng, Yifan
Published: (2025)
A Survey of Sustainability in Large Language Models: Applications, Economics, and Challenges
by: Singh, Aditi, et al.
Published: (2024)
by: Singh, Aditi, et al.
Published: (2024)
Assessing and Enhancing Large Language Models in Rare Disease Question-answering
by: Wang, Guanchu, et al.
Published: (2024)
by: Wang, Guanchu, et al.
Published: (2024)
Large Language Models for Bioinformatics
by: Ruan, Wei, et al.
Published: (2025)
by: Ruan, Wei, et al.
Published: (2025)
CAX-Agent: A Lightweight Agent Harness for Reliable APDL Automation
by: Lin, Chenying, et al.
Published: (2026)
by: Lin, Chenying, et al.
Published: (2026)
Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization
by: Yang, Shaoliang, et al.
Published: (2026)
by: Yang, Shaoliang, et al.
Published: (2026)
Reinforcement Learning of Large Language Models for Interpretable Credit Card Fraud Detection
by: Lin, Cooper, et al.
Published: (2026)
by: Lin, Cooper, et al.
Published: (2026)
FLAME: Financial Large-Language Model Assessment and Metrics Evaluation
by: Guo, Jiayu, et al.
Published: (2025)
by: Guo, Jiayu, et al.
Published: (2025)
Physics-Informed Large Language Models for HVAC Anomaly Detection with Autonomous Rule Generation
by: Lin, Subin, et al.
Published: (2025)
by: Lin, Subin, et al.
Published: (2025)
FinRule-Bench: A Benchmark for Joint Reasoning over Financial Tables and Principles
by: Malarkkan, Arun Vignesh, et al.
Published: (2026)
by: Malarkkan, Arun Vignesh, et al.
Published: (2026)
Ethereum Price Prediction Employing Large Language Models for Short-term and Few-shot Forecasting
by: Makri, Eftychia, et al.
Published: (2025)
by: Makri, Eftychia, et al.
Published: (2025)
EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods
by: Ding, Hongcheng, et al.
Published: (2024)
by: Ding, Hongcheng, et al.
Published: (2024)
Can Large Language Models Solve Engineering Equations? A Systematic Comparison of Direct Prediction and Solver-Assisted Approaches
by: Kodathala, Sai Varun, et al.
Published: (2026)
by: Kodathala, Sai Varun, et al.
Published: (2026)
Similar Items
-
IdeaBench: Benchmarking Large Language Models for Research Idea Generation
by: Guo, Sikun, et al.
Published: (2024) -
QuantBench: Benchmarking AI Methods for Quantitative Investment
by: Wang, Saizhuo, et al.
Published: (2025) -
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition
by: Liu, Yujie, et al.
Published: (2025) -
Automating Structural Engineering Workflows with Large Language Model Agents
by: Liang, Haoran, et al.
Published: (2025) -
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent
by: Li, Haohang, et al.
Published: (2024)