Saved in:
| Main Authors: | Huang, Yu-Shiang, Lee, Yun-Yu, Chou, Tzu-Hsin, Lin, Che, Wang, Chuan-Ju |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.09997 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Decision-Oriented Text Evaluation
by: Huang, Yu-Shiang, et al.
Published: (2025)
by: Huang, Yu-Shiang, et al.
Published: (2025)
BERTScoreVisualizer: A Web Tool for Understanding Simplified Text Evaluation with BERTScore
by: Jaskowski, Sebastian, et al.
Published: (2024)
by: Jaskowski, Sebastian, et al.
Published: (2024)
Financial Risk Relation Identification through Dual-view Adaptation
by: Chiu, Wei-Ning, et al.
Published: (2025)
by: Chiu, Wei-Ning, et al.
Published: (2025)
Heritage identity and Indigenous language learning motivation: A case of Indigenous Taiwanese high school students
by: Hung Tzu Huang, et al.
Published: (2024)
by: Hung Tzu Huang, et al.
Published: (2024)
'Finance Wizard' at the FinLLM Challenge Task: Financial Text Summarization
by: Lee, Meisin, et al.
Published: (2024)
by: Lee, Meisin, et al.
Published: (2024)
A Survey of Large Language Models in Finance (FinLLMs)
by: Lee, Jean, et al.
Published: (2024)
by: Lee, Jean, et al.
Published: (2024)
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain
by: Hu, Tiansheng, et al.
Published: (2025)
by: Hu, Tiansheng, et al.
Published: (2025)
Enhancing Medication Recommendation with LLM Text Representation
by: Lee, Yu-Tzu
Published: (2024)
by: Lee, Yu-Tzu
Published: (2024)
FinHarness: An Inline Lifecycle Safety Harness for Finance LLM Agents
by: Jia, Haoxuan, et al.
Published: (2026)
by: Jia, Haoxuan, et al.
Published: (2026)
Resolving Regular Polysemy in Named Entities
by: Hsieh, Shu-Kai, et al.
Published: (2024)
by: Hsieh, Shu-Kai, et al.
Published: (2024)
The False Resonance: A Critical Examination of Emotion Embedding Similarity for Speech Generation Evaluation
by: Tsai, Yun-Shao, et al.
Published: (2026)
by: Tsai, Yun-Shao, et al.
Published: (2026)
Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings
by: Shor, Joel, et al.
Published: (2023)
by: Shor, Joel, et al.
Published: (2023)
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research
by: Huang, Sian-Yao, et al.
Published: (2024)
by: Huang, Sian-Yao, et al.
Published: (2024)
Fin-Bias: Comprehensive Evaluation for LLM Decision-Making under human bias in Finance Domain
by: Hu, Xiaoyu, et al.
Published: (2026)
by: Hu, Xiaoyu, et al.
Published: (2026)
FinGen: A Dataset for Argument Generation in Finance
by: Chen, Chung-Chi, et al.
Published: (2024)
by: Chen, Chung-Chi, et al.
Published: (2024)
FinMTEB: Finance Massive Text Embedding Benchmark
by: Tang, Yixuan, et al.
Published: (2025)
by: Tang, Yixuan, et al.
Published: (2025)
NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance
by: Su, Huan-Yi, et al.
Published: (2024)
by: Su, Huan-Yi, et al.
Published: (2024)
Semantic Similarity Matching for Patent Documents Using Ensemble BERT-related Model and Novel Text Processing Method
by: Yu, Liqiang, et al.
Published: (2024)
by: Yu, Liqiang, et al.
Published: (2024)
FinXABSA: Explainable Finance through Aspect-Based Sentiment Analysis
by: Ong, Keane, et al.
Published: (2023)
by: Ong, Keane, et al.
Published: (2023)
Improving Speech Emotion Recognition in Under-Resourced Languages via Speech-to-Speech Translation with Bootstrapping Data Selection
by: Lin, Hsi-Che, et al.
Published: (2024)
by: Lin, Hsi-Che, et al.
Published: (2024)
FinSafetyBench: Evaluating LLM Safety in Real-World Financial Scenarios
by: Hou, Yutao, et al.
Published: (2026)
by: Hou, Yutao, et al.
Published: (2026)
AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning
by: Lin, Tzu-Han, et al.
Published: (2025)
by: Lin, Tzu-Han, et al.
Published: (2025)
Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models
by: Li, Haoyang, et al.
Published: (2025)
by: Li, Haoyang, et al.
Published: (2025)
Evaluating AI for Finance: Is AI Credible at Assessing Investment Risk?
by: Chawla, Divij, et al.
Published: (2025)
by: Chawla, Divij, et al.
Published: (2025)
SemanticShield: LLM-Powered Audits Expose Shilling Attacks in Recommender Systems
by: Li, Kaihong, et al.
Published: (2025)
by: Li, Kaihong, et al.
Published: (2025)
FinBERT2: A Specialized Bidirectional Encoder for Bridging the Gap in Finance-Specific Deployment of Large Language Models
by: Xu, Xuan, et al.
Published: (2025)
by: Xu, Xuan, et al.
Published: (2025)
From Simulation to Strategy: Automating Personalized Interaction Planning for Conversational Agents
by: Chang, Wen-Yu, et al.
Published: (2025)
by: Chang, Wen-Yu, et al.
Published: (2025)
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging
by: Lin, Tzu-Han, et al.
Published: (2024)
by: Lin, Tzu-Han, et al.
Published: (2024)
Evaluating Large Language Models as Expert Annotators
by: Tseng, Yu-Min, et al.
Published: (2025)
by: Tseng, Yu-Min, et al.
Published: (2025)
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
by: Gan, Ziliang, et al.
Published: (2024)
by: Gan, Ziliang, et al.
Published: (2024)
CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition
by: Tsai, Yun-Shao, et al.
Published: (2025)
by: Tsai, Yun-Shao, et al.
Published: (2025)
LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard
by: Rao, Varun, et al.
Published: (2025)
by: Rao, Varun, et al.
Published: (2025)
NuBE
Published: (2023)
Published: (2023)
Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
by: Hsueh, Cheng-Hsun, et al.
Published: (2024)
by: Hsueh, Cheng-Hsun, et al.
Published: (2024)
TriBench-Ko: Evaluating LLM Risks in Judicial Workflows
by: Lee, Haesung, et al.
Published: (2026)
by: Lee, Haesung, et al.
Published: (2026)
A Survey of Generative Information Retrieval
by: Kuo, Tzu-Lin, et al.
Published: (2024)
by: Kuo, Tzu-Lin, et al.
Published: (2024)
How Small Transformation Expose the Weakness of Semantic Similarity Measures
by: Nikiema, Serge Lionel, et al.
Published: (2025)
by: Nikiema, Serge Lionel, et al.
Published: (2025)
FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models
by: Guo, Xin, et al.
Published: (2023)
by: Guo, Xin, et al.
Published: (2023)
Why Expert Alignment Is Hard: Evidence from Subjective Evaluation
by: Lin, Tzu-Mi, et al.
Published: (2026)
by: Lin, Tzu-Mi, et al.
Published: (2026)
EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition
by: Lin, Yi-Cheng, et al.
Published: (2025)
by: Lin, Yi-Cheng, et al.
Published: (2025)
Similar Items
-
Decision-Oriented Text Evaluation
by: Huang, Yu-Shiang, et al.
Published: (2025) -
BERTScoreVisualizer: A Web Tool for Understanding Simplified Text Evaluation with BERTScore
by: Jaskowski, Sebastian, et al.
Published: (2024) -
Financial Risk Relation Identification through Dual-view Adaptation
by: Chiu, Wei-Ning, et al.
Published: (2025) -
Heritage identity and Indigenous language learning motivation: A case of Indigenous Taiwanese high school students
by: Hung Tzu Huang, et al.
Published: (2024) -
'Finance Wizard' at the FinLLM Challenge Task: Financial Text Summarization
by: Lee, Meisin, et al.
Published: (2024)