Saved in:
| Main Authors: | Cao, Qian, Chen, Xu, Song, Ruihua, Jiang, Hao, Yang, Guang, Cao, Zhao |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2209.02427 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Causal Inspired Multi Modal Recommendation
by: Yang, Jie, et al.
Published: (2025)
by: Yang, Jie, et al.
Published: (2025)
Robust Motion Generation using Part-level Reliable Data from Videos
by: Li, Boyuan, et al.
Published: (2025)
by: Li, Boyuan, et al.
Published: (2025)
DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing
by: Cao, Qian, et al.
Published: (2026)
by: Cao, Qian, et al.
Published: (2026)
Enhancing Multimodal Large Language Models for Ancient Chinese Character Evolution Analysis via Glyph-Driven Fine-Tuning
by: Song, Rui, et al.
Published: (2026)
by: Song, Rui, et al.
Published: (2026)
Appformer: A Novel Framework for Mobile App Usage Prediction Leveraging Progressive Multi-Modal Data Fusion and Feature Extraction
by: Sun, Chuike, et al.
Published: (2024)
by: Sun, Chuike, et al.
Published: (2024)
PaperScope: A Multi-Modal Multi-Document Benchmark for Agentic Deep Research Across Massive Scientific Papers
by: Xiong, Lei, et al.
Published: (2026)
by: Xiong, Lei, et al.
Published: (2026)
Human-Inspired Multi-Level Reinforcement Learning
by: Wu, Mingkang, et al.
Published: (2025)
by: Wu, Mingkang, et al.
Published: (2025)
Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving
by: Lou, Yang, et al.
Published: (2023)
by: Lou, Yang, et al.
Published: (2023)
Seeing the Goal, Missing the Truth: Human Accountability for AI Bias
by: Cao, Sean, et al.
Published: (2026)
by: Cao, Sean, et al.
Published: (2026)
Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning
by: Zhou, Hao, et al.
Published: (2026)
by: Zhou, Hao, et al.
Published: (2026)
DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach
by: Tang, Xin, et al.
Published: (2024)
by: Tang, Xin, et al.
Published: (2024)
TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets
by: Chen, Jintai, et al.
Published: (2024)
by: Chen, Jintai, et al.
Published: (2024)
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment
by: Dai, Shenghong, et al.
Published: (2024)
by: Dai, Shenghong, et al.
Published: (2024)
Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling
by: Cao, Yang, et al.
Published: (2025)
by: Cao, Yang, et al.
Published: (2025)
APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning
by: Cao, Guiming, et al.
Published: (2024)
by: Cao, Guiming, et al.
Published: (2024)
Learning Multi-Modal Mobility Dynamics for Generalized Next Location Recommendation
by: Dai, Junshu, et al.
Published: (2025)
by: Dai, Junshu, et al.
Published: (2025)
DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation
by: Zhao, Wang, et al.
Published: (2024)
by: Zhao, Wang, et al.
Published: (2024)
Internalizing Agency from Reflective Experience
by: Ge, Rui, et al.
Published: (2026)
by: Ge, Rui, et al.
Published: (2026)
SliceGraph: Mapping Process Isomers in Multi-Run Chain-of-Thought Reasoning
by: Chen, Kang, et al.
Published: (2026)
by: Chen, Kang, et al.
Published: (2026)
Evaluating Frontier LLMs on PhD-Level Mathematical Reasoning: A Benchmark on a Textbook in Theoretical Computer Science about Randomized Algorithms
by: Cao, Yang, et al.
Published: (2025)
by: Cao, Yang, et al.
Published: (2025)
Cross-Modal Distillation For Widely Differing Modalities
by: Zhao, Cairong, et al.
Published: (2025)
by: Zhao, Cairong, et al.
Published: (2025)
Chinese Stock Prediction Based on a Multi-Modal Transformer Framework: Macro-Micro Information Fusion
by: AI, Lumen, et al.
Published: (2025)
by: AI, Lumen, et al.
Published: (2025)
MatterChat: A Multi-Modal LLM for Material Science
by: Tang, Yingheng, et al.
Published: (2025)
by: Tang, Yingheng, et al.
Published: (2025)
SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning
by: Li, Ruohao, et al.
Published: (2025)
by: Li, Ruohao, et al.
Published: (2025)
Harmony: A Unified Framework for Modality Incremental Learning
by: Song, Yaguang, et al.
Published: (2025)
by: Song, Yaguang, et al.
Published: (2025)
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
by: Cao, Yang, et al.
Published: (2025)
by: Cao, Yang, et al.
Published: (2025)
MARS: Co-evolving Dual-System Deep Research via Multi-Agent Reinforcement Learning
by: Chen, Guoxin, et al.
Published: (2025)
by: Chen, Guoxin, et al.
Published: (2025)
Creation of Novel Soft Robot Designs using Generative AI
by: Chan, Wee Kiat, et al.
Published: (2024)
by: Chan, Wee Kiat, et al.
Published: (2024)
Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs
by: Tan, Wenhui, et al.
Published: (2026)
by: Tan, Wenhui, et al.
Published: (2026)
ORMind: A Cognitive-Inspired End-to-End Reasoning Framework for Operations Research
by: Wang, Zhiyuan, et al.
Published: (2025)
by: Wang, Zhiyuan, et al.
Published: (2025)
Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond
by: Cao, Yang, et al.
Published: (2024)
by: Cao, Yang, et al.
Published: (2024)
Thermally Activated Dual-Modal Adversarial Clothing against AI Surveillance Systems
by: Long, Jiahuan, et al.
Published: (2025)
by: Long, Jiahuan, et al.
Published: (2025)
Multi-Modal Manipulation via Multi-Modal Policy Consensus
by: Chen, Haonan, et al.
Published: (2025)
by: Chen, Haonan, et al.
Published: (2025)
MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data
by: Zhu, Zhenghao, et al.
Published: (2025)
by: Zhu, Zhenghao, et al.
Published: (2025)
Contests with Spillovers: Incentivizing Content Creation with GenAI
by: Ohayon, Sagi, et al.
Published: (2026)
by: Ohayon, Sagi, et al.
Published: (2026)
StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management
by: Zhang, Ruizhe, et al.
Published: (2026)
by: Zhang, Ruizhe, et al.
Published: (2026)
MPE-TTS: Customized Emotion Zero-Shot Text-To-Speech Using Multi-Modal Prompt
by: Wu, Zhichao, et al.
Published: (2025)
by: Wu, Zhichao, et al.
Published: (2025)
PatentMind: A Multi-Aspect Reasoning Graph for Patent Similarity Evaluation
by: Yoo, Yongmin, et al.
Published: (2025)
by: Yoo, Yongmin, et al.
Published: (2025)
Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria
by: Cao, Yang, et al.
Published: (2025)
by: Cao, Yang, et al.
Published: (2025)
Sketch Then Paint: Hierarchical Reinforcement Learning for Diffusion Multi-Modal Large Language Models
by: Luo, Siqi, et al.
Published: (2026)
by: Luo, Siqi, et al.
Published: (2026)
Similar Items
-
Causal Inspired Multi Modal Recommendation
by: Yang, Jie, et al.
Published: (2025) -
Robust Motion Generation using Part-level Reliable Data from Videos
by: Li, Boyuan, et al.
Published: (2025) -
DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing
by: Cao, Qian, et al.
Published: (2026) -
Enhancing Multimodal Large Language Models for Ancient Chinese Character Evolution Analysis via Glyph-Driven Fine-Tuning
by: Song, Rui, et al.
Published: (2026) -
Appformer: A Novel Framework for Mobile App Usage Prediction Leveraging Progressive Multi-Modal Data Fusion and Feature Extraction
by: Sun, Chuike, et al.
Published: (2024)