:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yoash, Noga Ben, Brief, Meni, Ovadia, Oded, Shenderovitz, Gil, Mishaeli, Moshik, Lemberg, Rachel, Sheetrit, Eitam
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Computational Engineering, Finance, and Science Computation and Language
Online Access:	https://arxiv.org/abs/2504.04596
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Mixing It Up: The Cocktail Effect of Multi-Task Fine-Tuning on LLM Performance -- A Case Study in Finance
by: Brief, Meni, et al.
Published: (2024)

Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions
by: Ovadia, Oded, et al.
Published: (2025)

ReMatch: Retrieval Enhanced Schema Matching with LLMs
by: Sheetrit, Eitam, et al.
Published: (2024)

Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
by: Ovadia, Oded, et al.
Published: (2023)

AI-Trader: Benchmarking Autonomous Agents in Real-Time Financial Markets
by: Fan, Tianyu, et al.
Published: (2025)

BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs
by: Lu, Guilong, et al.
Published: (2025)

Finance Agent Benchmark: Benchmarking LLMs on Real-world Financial Research Tasks
by: Bigeard, Antoine, et al.
Published: (2025)

Evaluation and Benchmarking Suite for Financial Large Language Models and Agents
by: Lin, Shengyuan, et al.
Published: (2026)

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
by: Yang, Yuzhe, et al.
Published: (2024)

FinSight: Towards Real-World Financial Deep Research
by: Jin, Jiajie, et al.
Published: (2025)

All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection
by: Jiang, Yuechen, et al.
Published: (2026)

Fin-RATE: A Real-world Financial Analytics and Tracking Evaluation Benchmark for LLMs on SEC Filings
by: Jiang, Yidong, et al.
Published: (2026)

Tokenized but Illiquid? Evidence from Real-World Asset Markets
by: Mafrur, Rischan
Published: (2026)

FinDocMRE: A Benchmark for Document-Level Financial Multimodal Reasoning Evaluation
by: Zhu, Jiayong, et al.
Published: (2026)

Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models
by: Wu, Xiaojun, et al.
Published: (2024)

INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent
by: Li, Haohang, et al.
Published: (2024)

Beyond TVL: An Explainable Risk Scoring Framework for Tokenized Real-World Assets
by: Mafrur, Rischan, et al.
Published: (2026)

EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements
by: Sugiura, Issa, et al.
Published: (2025)

MMFCTUB: Multi-Modal Financial Credit Table Understanding Benchmark
by: Yakun, Cui, et al.
Published: (2026)

BizCompass: Benchmarking the Reasoning Capabilities of LLMs in Business Knowledge and Applications
by: Hao, Jianing, et al.
Published: (2026)

FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs
by: Wang, Yan, et al.
Published: (2025)

FinMTM: A Multi-Turn Multimodal Benchmark for Financial Reasoning and Agent Evaluation
by: Zhang, Chenxi, et al.
Published: (2026)

Evaluating Large Language Models (LLMs) in Financial NLP: A Comparative Study on Financial Report Analysis
by: Mohsin, Md Talha
Published: (2025)

FinRL Contests: Benchmarking Data-driven Financial Reinforcement Learning Agents
by: Wang, Keyi, et al.
Published: (2025)

Identifying and Quantifying Financial Bubbles with the Hyped Log-Periodic Power Law Model
by: Cao, Zheng, et al.
Published: (2025)

Transformer-Based Financial Fraud Detection with Cloud-Optimized Real-Time Streaming
by: Deng, Tingting, et al.
Published: (2025)

FinanceReasoning: Benchmarking Financial Numerical Reasoning More Credible, Comprehensive and Challenging
by: Tang, Zichen, et al.
Published: (2025)

The Statistical Significance of the Inclusion of Graph Neural Networks in the Financial Time Series Forecasting Problem
by: Gregnanin, Marco, et al.
Published: (2026)

VisFinEval: A Scenario-Driven Chinese Multimodal Benchmark for Holistic Financial Understanding
by: Liu, Zhaowei, et al.
Published: (2025)

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
by: Huang, Jimin, et al.
Published: (2024)

No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks
by: Hu, Gang, et al.
Published: (2024)

SMARTFinRAG: Interactive Modularized Financial RAG Benchmark
by: Zha, Yiwei
Published: (2025)

Toward Engineering AGI: Benchmarking the Engineering Design Capabilities of LLMs
by: Guo, Xingang, et al.
Published: (2025)

SeQwen at the Financial Misinformation Detection Challenge Task: Sequential Learning for Claim Verification and Explanation Generation in Financial Domains
by: Purbey, Jebish, et al.
Published: (2024)

Financial Wind Tunnel: A Retrieval-Augmented Market Simulator
by: Cao, Bokai, et al.
Published: (2025)

FinTagging: Benchmarking LLMs for Extracting and Structuring Financial Information
by: Wang, Yan, et al.
Published: (2025)

Unlocking Noisy Real-World Corpora for Foundation Model Pre-Training via Quality-Aware Tokenization
by: Gollwitzer, Arvid E., et al.
Published: (2026)

FCMBench: The First Large-scale Financial Credit Multimodal Benchmark for Real-world Applications
by: Yang, Yehui, et al.
Published: (2026)

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models
by: Kuang, Ziyan, et al.
Published: (2025)

Modeling News Interactions and Influence for Financial Market Prediction
by: Wang, Mengyu, et al.
Published: (2024)