Saved in:
| Main Authors: | Luo, Miaosen, Long, Jiesen, Li, Zequn, Yang, Yunying, Jiang, Yuncheng, Mai, Sijie |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.02429 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Explainable Fusion and Balanced Learning in Multimodal Sentiment Analysis
by: Luo, Miaosen, et al.
Published: (2025)
by: Luo, Miaosen, et al.
Published: (2025)
End-to-end Semantic-centric Video-based Multimodal Affective Computing
by: Lin, Ronghao, et al.
Published: (2024)
by: Lin, Ronghao, et al.
Published: (2024)
C2F-Thinker: Coarse-to-Fine Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis
by: Luo, Miaosen, et al.
Published: (2026)
by: Luo, Miaosen, et al.
Published: (2026)
MissMAC-Bench: Building Solid Benchmark for Missing Modality Issue in Robust Multimodal Affective Computing
by: Lin, Ronghao, et al.
Published: (2026)
by: Lin, Ronghao, et al.
Published: (2026)
SpatialText: A Pure-Text Cognitive Benchmark for Spatial Understanding in Large Language Models
by: Jiang, Peiyao, et al.
Published: (2026)
by: Jiang, Peiyao, et al.
Published: (2026)
Finetuning Generative Large Language Models with Discrimination Instructions for Knowledge Graph Completion
by: Liu, Yang, et al.
Published: (2024)
by: Liu, Yang, et al.
Published: (2024)
E2Edev: Benchmarking Large Language Models in End-to-End Software Development Task
by: Liu, Jingyao, et al.
Published: (2025)
by: Liu, Jingyao, et al.
Published: (2025)
SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models
by: Yin, Han, et al.
Published: (2025)
by: Yin, Han, et al.
Published: (2025)
AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models
by: Zhang, Wentao, et al.
Published: (2026)
by: Zhang, Wentao, et al.
Published: (2026)
Using Large Language Model for End-to-End Chinese ASR and NER
by: Li, Yuang, et al.
Published: (2024)
by: Li, Yuang, et al.
Published: (2024)
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
by: He, Hongliang, et al.
Published: (2024)
by: He, Hongliang, et al.
Published: (2024)
End-to-End Graph Flattening Method for Large Language Models
by: Hong, Bin, et al.
Published: (2024)
by: Hong, Bin, et al.
Published: (2024)
End-To-End Clinical Trial Matching with Large Language Models
by: Ferber, Dyke, et al.
Published: (2024)
by: Ferber, Dyke, et al.
Published: (2024)
MiCU: End-to-End Smart Home Command Understanding with Large Language Model
by: Han, Haowei, et al.
Published: (2026)
by: Han, Haowei, et al.
Published: (2026)
Large Language Models as End-to-end Combinatorial Optimization Solvers
by: Jiang, Xia, et al.
Published: (2025)
by: Jiang, Xia, et al.
Published: (2025)
LEAP: Learnable End-to-End Adaptive Pruning of Large Language Models
by: Mozaffari, Mohammad, et al.
Published: (2026)
by: Mozaffari, Mohammad, et al.
Published: (2026)
A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning
by: Cui, Yuanning, et al.
Published: (2024)
by: Cui, Yuanning, et al.
Published: (2024)
LMGenDrive: Bridging Multimodal Understanding and Generative World Modeling for End-to-End Driving
by: Shao, Hao, et al.
Published: (2026)
by: Shao, Hao, et al.
Published: (2026)
Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models
by: Jia, Yifan, et al.
Published: (2025)
by: Jia, Yifan, et al.
Published: (2025)
LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving
by: Qiao, Zhijie, et al.
Published: (2025)
by: Qiao, Zhijie, et al.
Published: (2025)
The End of Manual Decoding: Towards Truly End-to-End Language Models
by: Wang, Zhichao, et al.
Published: (2025)
by: Wang, Zhichao, et al.
Published: (2025)
AutoRD: An Automatic and End-to-End System for Rare Disease Knowledge Graph Construction Based on Ontologies-enhanced Large Language Models
by: Cao, Lang, et al.
Published: (2024)
by: Cao, Lang, et al.
Published: (2024)
Self-Retrieval: End-to-End Information Retrieval with One Large Language Model
by: Tang, Qiaoyu, et al.
Published: (2024)
by: Tang, Qiaoyu, et al.
Published: (2024)
Language Model Inversion through End-to-End Differentiation
by: Denamganaï, Kevin Yandoka, et al.
Published: (2026)
by: Denamganaï, Kevin Yandoka, et al.
Published: (2026)
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
by: Wang, Yuxuan, et al.
Published: (2025)
by: Wang, Yuxuan, et al.
Published: (2025)
ProjDevBench: Benchmarking AI Coding Agents on End-to-End Project Development
by: Lu, Pengrui, et al.
Published: (2026)
by: Lu, Pengrui, et al.
Published: (2026)
Reinforced Reasoning for End-to-End Retrosynthetic Planning
by: Zuo, Chenyang, et al.
Published: (2026)
by: Zuo, Chenyang, et al.
Published: (2026)
End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations
by: Luo, Lirui, et al.
Published: (2024)
by: Luo, Lirui, et al.
Published: (2024)
Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data
by: Lai, Longbin, et al.
Published: (2025)
by: Lai, Longbin, et al.
Published: (2025)
First Ask Then Answer: A Framework Design for AI Dialogue Based on Supplementary Questioning with Large Language Models
by: Fu, Chuanruo, et al.
Published: (2025)
by: Fu, Chuanruo, et al.
Published: (2025)
A Conflict-Aware Penalty and Statistical Loss Framework for Balancing Modalities and Enhancing Stability in Multimodal Sentiment Analysis
by: Dai, Jianheng, et al.
Published: (2026)
by: Dai, Jianheng, et al.
Published: (2026)
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues
by: Medjad, Maya, et al.
Published: (2025)
by: Medjad, Maya, et al.
Published: (2025)
In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach
by: Gao, Yiran, et al.
Published: (2026)
by: Gao, Yiran, et al.
Published: (2026)
MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation
by: Blandón, María Andrea Cruz, et al.
Published: (2025)
by: Blandón, María Andrea Cruz, et al.
Published: (2025)
MageBench: Bridging Large Multimodal Models to Agents
by: Zhang, Miaosen, et al.
Published: (2024)
by: Zhang, Miaosen, et al.
Published: (2024)
AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models
by: Chen, Guangke, et al.
Published: (2025)
by: Chen, Guangke, et al.
Published: (2025)
ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents
by: Lai, Hanyu, et al.
Published: (2025)
by: Lai, Hanyu, et al.
Published: (2025)
E2E-AFG: An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
by: Jiang, Yun, et al.
Published: (2024)
by: Jiang, Yun, et al.
Published: (2024)
Beyond Hungarian: Match-Free Supervision for End-to-End Object Detection
by: Qiu, Shoumeng, et al.
Published: (2026)
by: Qiu, Shoumeng, et al.
Published: (2026)
NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models
by: Wu, Kai, et al.
Published: (2024)
by: Wu, Kai, et al.
Published: (2024)
Similar Items
-
Towards Explainable Fusion and Balanced Learning in Multimodal Sentiment Analysis
by: Luo, Miaosen, et al.
Published: (2025) -
End-to-end Semantic-centric Video-based Multimodal Affective Computing
by: Lin, Ronghao, et al.
Published: (2024) -
C2F-Thinker: Coarse-to-Fine Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis
by: Luo, Miaosen, et al.
Published: (2026) -
MissMAC-Bench: Building Solid Benchmark for Missing Modality Issue in Robust Multimodal Affective Computing
by: Lin, Ronghao, et al.
Published: (2026) -
SpatialText: A Pure-Text Cognitive Benchmark for Spatial Understanding in Large Language Models
by: Jiang, Peiyao, et al.
Published: (2026)