| Main Authors: | Yuan, Zhengqing, Liu, Yixin, Cao, Yihan, Sun, Weixiang, Jia, Haolong, Chen, Ruoxi, Li, Zhaoxu, Lin, Bin, Yuan, Li, He, Lifang, Wang, Chi, Ye, Yanfang, Sun, Lichao |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2403.13248 |
Similar Items
Bora: Biomedical Generalist Video Generation Model
by: Sun, Weixiang, et al.
Published: (2024)
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
by: Yuan, Zhengqing, et al.
Published: (2023)
Horizon-LM: A RAM-Centric Architecture for LLM Training
by: Yuan, Zhengqing, et al.
Published: (2026)
ViT-1.58b: Mobile Vision Transformers in the 1-bit Era
by: Yuan, Zhengqing, et al.
Published: (2024)
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
by: Yuan, Zhengqing, et al.
Published: (2026)
TTT-Unet: Enhancing U-Net with Test-Time Training Layers for Biomedical Image Segmentation
by: Zhou, Rong, et al.
Published: (2024)
3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation
by: He, Yunhong, et al.
Published: (2025)
EfficientLLM: Efficiency in Large Language Models
by: Yuan, Zhengqing, et al.
Published: (2025)
ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter
by: Yuan, Zhengqing, et al.
Published: (2023)
Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata
by: Yuan, Zhengqing, et al.
Published: (2025)
Biomedical SAM 2: Segment Anything in Biomedical Images and Videos
by: Yan, Zhiling, et al.
Published: (2024)
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
by: Liu, Yixin, et al.
Published: (2024)
BLURR: A Boosted Low-Resource Inference for Vision-Language-Action Models
by: Ma, Xiaoyu, et al.
Published: (2025)
MLP-KAN: Unifying Deep Representation and Function Learning
by: He, Yunhong, et al.
Published: (2024)
Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations?
by: Huang, Yue, et al.
Published: (2024)
CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era
by: Shi, Kaiwen, et al.
Published: (2026)
Rethinking and Red-Teaming Protective Perturbation in Personalized Diffusion Models
by: Liu, Yixin, et al.
Published: (2024)
Medical Unlearnable Examples: Securing Medical Data from Unauthorized Training via Sparsity-Aware Local Masking
by: Sun, Weixiang, et al.
Published: (2024)
MetaAgents: Large Language Model Based Agents for Decision-Making on Teaming
by: Li, Yuan, et al.
Published: (2023)
Instruction Mining: Instruction Data Selection for Tuning Large Language Models
by: Cao, Yihan, et al.
Published: (2023)
MedGPT-oss: Training a General-Purpose Vision-Language Model for Biomedicine
by: Zhang, Kai, et al.
Published: (2026)
Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction
by: Bao, Han, et al.
Published: (2026)
Temporal Graph Pattern Machine
by: Ma, Yijun, et al.
Published: (2026)
NG-Router: Graph-Supervised Multi-Agent Collaboration for Nutrition Question Answering
by: Shi, Kaiwen, et al.
Published: (2025)
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
by: Chen, Ruoxi, et al.
Published: (2023)
NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes
by: Xu, Tianyang, et al.
Published: (2025)
Interpretable Graph-Language Modeling for Detecting Youth Illicit Drug Use
by: Li, Yiyang, et al.
Published: (2025)
AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering
by: Zhang, Zheyuan, et al.
Published: (2025)
Empirical Perturbation Analysis of Linear System Solvers from a Data Poisoning Perspective
by: Liu, Yixin, et al.
Published: (2024)
Large Language Models Often Say One Thing and Do Another
by: Xu, Ruoxi, et al.
Published: (2025)
Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing
by: Sun, Hanchi, et al.
Published: (2026)
NeuroClaw Technical Report
by: Wang, Cheng, et al.
Published: (2026)
SAMed-2: Selective Memory Enhanced Medical Segment Anything Model
by: Yan, Zhiling, et al.
Published: (2025)
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
by: Zhang, Kai, et al.
Published: (2023)
EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
by: Liao, Zeyi, et al.
Published: (2024)
MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling
by: Wang, Qian, et al.
Published: (2025)
End-to-End Deep Learning for Structural Brain Imaging: A Unified Framework
by: Su, Yao, et al.
Published: (2025)
Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents
by: Wang, Zehong, et al.
Published: (2026)
PreScam: A Benchmark for Predicting Scam Progression from Early Conversations
by: Sun, Weixiang, et al.
Published: (2026)
Stroke Modeling Enables Vectorized Character Generation with Large Vectorized Glyph Model
by: Zhang, Xinyue, et al.
Published: (2025)