| Main Authors: | Li, Xiang, Yao, Yiqun, Jiang, Xin, Fang, Xuezhi, Wang, Chao, Liu, Xinzhang, Wang, Zihan, Zhao, Yu, Wang, Xin, Huang, Yuyao, Song, Shuangyong, Li, Yongxiang, Zhang, Zheng, Zhao, Bo, Sun, Aixin, Wang, Yequan, He, Zhongjiang, Wang, Zhongyuan, Li, Xuelong, Huang, Tiejun |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.16645 |
Similar Items
52B to 1T: Lessons Learned via Tele-FLM Series
by: Li, Xiang, et al.
Published: (2024)
FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training
by: Yao, Yiqun, et al.
Published: (2025)
TeleChat Technical Report
by: He, Zhongjiang, et al.
Published: (2024)
Technical Report of TeleChat2, TeleChat2.5 and T1
by: Wang, Zihan, et al.
Published: (2025)
FLM-101B: An Open LLM and How to Train It with $100K Budget
by: Li, Xiang, et al.
Published: (2023)
RoboEgo System Card: An Omnimodal Model with Native Full Duplexity
by: Yao, Yiqun, et al.
Published: (2025)
Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization
by: Wang, Zihan, et al.
Published: (2024)
Emotional Support with LLM-based Empathetic Dialogue Generation
by: Wang, Shiquan, et al.
Published: (2025)
Open-domain Implicit Format Control for Large Language Model Generation
by: Yao, Yiqun, et al.
Published: (2024)
EgoMem: Lifelong Memory Agent for Full-duplex Omnimodal Models
by: Yao, Yiqun, et al.
Published: (2025)
Training Report of TeleChat3-MoE
by: Liu, Xinzhang, et al.
Published: (2025)
Sketch: A Toolkit for Streamlining LLM Operations
by: Jiang, Xin, et al.
Published: (2024)
Toward Embodied AGI: A Review of Embodied AI and the Road Ahead
by: Wang, Yequan, et al.
Published: (2025)
Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024)
If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs
by: Fan, Siqi, et al.
Published: (2025)
MR-UIE: Multi-Perspective Reasoning with Reinforcement Learning for Universal Information Extraction
by: Li, Zhongqiu, et al.
Published: (2025)
nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales
by: Yao, Yiqun, et al.
Published: (2023)
TableReasoner: Advancing Table Reasoning Framework with Large Language Models
by: Xiong, Sishi, et al.
Published: (2025)
RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL
by: Wu, Zhenhe, et al.
Published: (2024)
TableZoomer: A Collaborative Agent Framework for Large-scale Table Question Answering
by: Xiong, Sishi, et al.
Published: (2025)
Masked Structural Growth for 2x Faster Language Model Pre-training
by: Yao, Yiqun, et al.
Published: (2023)
GCRE-GPT: A Generative Model for Comparative Relation Extraction
by: Wang, Yequan, et al.
Published: (2023)
GOAT-SLM: A Spoken Language Model with Paralinguistic and Speaker Characteristic Awareness
by: Chen, Hongjie, et al.
Published: (2025)
Mutual Enhancement Between Global Tokens and Patch Tokens: From Theory to Practice
by: Huang, Xiusheng, et al.
Published: (2026)
TCNN: Triple Convolutional Neural Network Models for Retrieval-based Question Answering System in E-commerce
by: Song, Shuangyong, et al.
Published: (2020)
Introducing Visual Scenes and Reasoning: A More Realistic Benchmark for Spoken Language Understanding
by: Wu, Di, et al.
Published: (2025)
BoSS: Beyond-Semantic Speech
by: Wang, Qing, et al.
Published: (2025)
Table-R1: Region-based Reinforcement Learning for Table Understanding
by: Wu, Zhenhe, et al.
Published: (2025)
D-QRELO: Training- and Data-Free Delta Compression for Large Language Models via Quantization and Residual Low-Rank Approximation
by: Li, Junlin, et al.
Published: (2026)
Commonsense Knowledge Editing Based on Free-Text in LLMs
by: Huang, Xiusheng, et al.
Published: (2024)
Prompt-Level Reward Specifications for Open-Ended Post-Training
by: Weng, Zijun, et al.
Published: (2026)
TeleStyle: Content-Preserving Style Transfer in Images and Videos
by: Zhang, Shiwen, et al.
Published: (2026)
On Switched Event-triggered Full State-constrained Formation Control for Multi-vehicle Systems
by: Li, Zihan, et al.
Published: (2026)
Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models
by: Nie, Shuo, et al.
Published: (2026)
TelePhysics: Physics-Grounded Multi-Object Scene Generation from a Single Image with Real-Time Interaction
by: Zhang, Xin, et al.
Published: (2026)
Technical Report: Coopetition in Heterogeneous Cross-Silo Federated Learning
by: Huang, Chao, et al.
Published: (2024)
Light Field Transformation of Metasurface Based on Arbitrary Jones Matrix
by: Xiaotong Zhang, et al.
Published: (2024)
TeleWorld: Towards Dynamic Multimodal Synthesis with a 4D World Model
by: Chen, Yabo, et al.
Published: (2025)
Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models
by: Hu, Wentao, et al.
Published: (2025)
The multiple points of maps from sphere to Euclidean space
by: Wang, Jun, et al.
Published: (2021)