| Main Authors: | Yoash, Noga Ben, Brief, Meni, Ovadia, Oded, Shenderovitz, Gil, Mishaeli, Moshik, Lemberg, Rachel, Sheetrit, Eitam |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2504.04596 |
Similar Items
Mixing It Up: The Cocktail Effect of Multi-Task Fine-Tuning on LLM Performance -- A Case Study in Finance
by: Brief, Meni, et al.
Published: (2024)
Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions
by: Ovadia, Oded, et al.
Published: (2025)
ReMatch: Retrieval Enhanced Schema Matching with LLMs
by: Sheetrit, Eitam, et al.
Published: (2024)
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
by: Ovadia, Oded, et al.
Published: (2023)
AI-Trader: Benchmarking Autonomous Agents in Real-Time Financial Markets
by: Fan, Tianyu, et al.
Published: (2025)
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs
by: Lu, Guilong, et al.
Published: (2025)
Finance Agent Benchmark: Benchmarking LLMs on Real-world Financial Research Tasks
by: Bigeard, Antoine, et al.
Published: (2025)
Evaluation and Benchmarking Suite for Financial Large Language Models and Agents
by: Lin, Shengyuan, et al.
Published: (2026)
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
by: Yang, Yuzhe, et al.
Published: (2024)
FinSight: Towards Real-World Financial Deep Research
by: Jin, Jiajie, et al.
Published: (2025)
All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection
by: Jiang, Yuechen, et al.
Published: (2026)
Fin-RATE: A Real-world Financial Analytics and Tracking Evaluation Benchmark for LLMs on SEC Filings
by: Jiang, Yidong, et al.
Published: (2026)
Tokenized but Illiquid? Evidence from Real-World Asset Markets
by: Mafrur, Rischan
Published: (2026)
FinDocMRE: A Benchmark for Document-Level Financial Multimodal Reasoning Evaluation
by: Zhu, Jiayong, et al.
Published: (2026)
Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models
by: Wu, Xiaojun, et al.
Published: (2024)
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent
by: Li, Haohang, et al.
Published: (2024)
Beyond TVL: An Explainable Risk Scoring Framework for Tokenized Real-World Assets
by: Mafrur, Rischan, et al.
Published: (2026)
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements
by: Sugiura, Issa, et al.
Published: (2025)
MMFCTUB: Multi-Modal Financial Credit Table Understanding Benchmark
by: Yakun, Cui, et al.
Published: (2026)
BizCompass: Benchmarking the Reasoning Capabilities of LLMs in Business Knowledge and Applications
by: Hao, Jianing, et al.
Published: (2026)
FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs
by: Wang, Yan, et al.
Published: (2025)
FinMTM: A Multi-Turn Multimodal Benchmark for Financial Reasoning and Agent Evaluation
by: Zhang, Chenxi, et al.
Published: (2026)
Evaluating Large Language Models (LLMs) in Financial NLP: A Comparative Study on Financial Report Analysis
by: Mohsin, Md Talha
Published: (2025)
FinRL Contests: Benchmarking Data-driven Financial Reinforcement Learning Agents
by: Wang, Keyi, et al.
Published: (2025)
Identifying and Quantifying Financial Bubbles with the Hyped Log-Periodic Power Law Model
by: Cao, Zheng, et al.
Published: (2025)
Transformer-Based Financial Fraud Detection with Cloud-Optimized Real-Time Streaming
by: Deng, Tingting, et al.
Published: (2025)
FinanceReasoning: Benchmarking Financial Numerical Reasoning More Credible, Comprehensive and Challenging
by: Tang, Zichen, et al.
Published: (2025)
The Statistical Significance of the Inclusion of Graph Neural Networks in the Financial Time Series Forecasting Problem
by: Gregnanin, Marco, et al.
Published: (2026)
VisFinEval: A Scenario-Driven Chinese Multimodal Benchmark for Holistic Financial Understanding
by: Liu, Zhaowei, et al.
Published: (2025)
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
by: Huang, Jimin, et al.
Published: (2024)
No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks
by: Hu, Gang, et al.
Published: (2024)
SMARTFinRAG: Interactive Modularized Financial RAG Benchmark
by: Zha, Yiwei
Published: (2025)
Toward Engineering AGI: Benchmarking the Engineering Design Capabilities of LLMs
by: Guo, Xingang, et al.
Published: (2025)
SeQwen at the Financial Misinformation Detection Challenge Task: Sequential Learning for Claim Verification and Explanation Generation in Financial Domains
by: Purbey, Jebish, et al.
Published: (2024)
Financial Wind Tunnel: A Retrieval-Augmented Market Simulator
by: Cao, Bokai, et al.
Published: (2025)
FinTagging: Benchmarking LLMs for Extracting and Structuring Financial Information
by: Wang, Yan, et al.
Published: (2025)
Unlocking Noisy Real-World Corpora for Foundation Model Pre-Training via Quality-Aware Tokenization
by: Gollwitzer, Arvid E., et al.
Published: (2026)
FCMBench: The First Large-scale Financial Credit Multimodal Benchmark for Real-world Applications
by: Yang, Yehui, et al.
Published: (2026)
From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models
by: Kuang, Ziyan, et al.
Published: (2025)
Modeling News Interactions and Influence for Financial Market Prediction
by: Wang, Mengyu, et al.
Published: (2024)