:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Cao, Qian, Chen, Xu, Song, Ruihua, Jiang, Hao, Yang, Guang, Cao, Zhao
Format:	Preprint
Published:	2022
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2209.02427
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Causal Inspired Multi Modal Recommendation
by: Yang, Jie, et al.
Published: (2025)

Robust Motion Generation using Part-level Reliable Data from Videos
by: Li, Boyuan, et al.
Published: (2025)

DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing
by: Cao, Qian, et al.
Published: (2026)

Enhancing Multimodal Large Language Models for Ancient Chinese Character Evolution Analysis via Glyph-Driven Fine-Tuning
by: Song, Rui, et al.
Published: (2026)

Appformer: A Novel Framework for Mobile App Usage Prediction Leveraging Progressive Multi-Modal Data Fusion and Feature Extraction
by: Sun, Chuike, et al.
Published: (2024)

PaperScope: A Multi-Modal Multi-Document Benchmark for Agentic Deep Research Across Massive Scientific Papers
by: Xiong, Lei, et al.
Published: (2026)

Human-Inspired Multi-Level Reinforcement Learning
by: Wu, Mingkang, et al.
Published: (2025)

Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving
by: Lou, Yang, et al.
Published: (2023)

Seeing the Goal, Missing the Truth: Human Accountability for AI Bias
by: Cao, Sean, et al.
Published: (2026)

Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning
by: Zhou, Hao, et al.
Published: (2026)

DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach
by: Tang, Xin, et al.
Published: (2024)

TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets
by: Chen, Jintai, et al.
Published: (2024)

Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment
by: Dai, Shenghong, et al.
Published: (2024)

Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling
by: Cao, Yang, et al.
Published: (2025)

APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning
by: Cao, Guiming, et al.
Published: (2024)

Learning Multi-Modal Mobility Dynamics for Generalized Next Location Recommendation
by: Dai, Junshu, et al.
Published: (2025)

DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation
by: Zhao, Wang, et al.
Published: (2024)

Internalizing Agency from Reflective Experience
by: Ge, Rui, et al.
Published: (2026)

SliceGraph: Mapping Process Isomers in Multi-Run Chain-of-Thought Reasoning
by: Chen, Kang, et al.
Published: (2026)

Evaluating Frontier LLMs on PhD-Level Mathematical Reasoning: A Benchmark on a Textbook in Theoretical Computer Science about Randomized Algorithms
by: Cao, Yang, et al.
Published: (2025)

Cross-Modal Distillation For Widely Differing Modalities
by: Zhao, Cairong, et al.
Published: (2025)

Chinese Stock Prediction Based on a Multi-Modal Transformer Framework: Macro-Micro Information Fusion
by: AI, Lumen, et al.
Published: (2025)

MatterChat: A Multi-Modal LLM for Material Science
by: Tang, Yingheng, et al.
Published: (2025)

SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning
by: Li, Ruohao, et al.
Published: (2025)

Harmony: A Unified Framework for Modality Incremental Learning
by: Song, Yaguang, et al.
Published: (2025)

Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
by: Cao, Yang, et al.
Published: (2025)

MARS: Co-evolving Dual-System Deep Research via Multi-Agent Reinforcement Learning
by: Chen, Guoxin, et al.
Published: (2025)

Creation of Novel Soft Robot Designs using Generative AI
by: Chan, Wee Kiat, et al.
Published: (2024)

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs
by: Tan, Wenhui, et al.
Published: (2026)

ORMind: A Cognitive-Inspired End-to-End Reasoning Framework for Operations Research
by: Wang, Zhiyuan, et al.
Published: (2025)

Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond
by: Cao, Yang, et al.
Published: (2024)

Thermally Activated Dual-Modal Adversarial Clothing against AI Surveillance Systems
by: Long, Jiahuan, et al.
Published: (2025)

Multi-Modal Manipulation via Multi-Modal Policy Consensus
by: Chen, Haonan, et al.
Published: (2025)

MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data
by: Zhu, Zhenghao, et al.
Published: (2025)

Contests with Spillovers: Incentivizing Content Creation with GenAI
by: Ohayon, Sagi, et al.
Published: (2026)

StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management
by: Zhang, Ruizhe, et al.
Published: (2026)

MPE-TTS: Customized Emotion Zero-Shot Text-To-Speech Using Multi-Modal Prompt
by: Wu, Zhichao, et al.
Published: (2025)

PatentMind: A Multi-Aspect Reasoning Graph for Patent Similarity Evaluation
by: Yoo, Yongmin, et al.
Published: (2025)

Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria
by: Cao, Yang, et al.
Published: (2025)

Sketch Then Paint: Hierarchical Reinforcement Learning for Diffusion Multi-Modal Large Language Models
by: Luo, Siqi, et al.
Published: (2026)