Saved in:
| Main Authors: | Yao, Yiqun, fan, Siqi, Huang, Xiusheng, Fang, Xuezhi, Li, Xiang, Ni, Ziyi, Jiang, Xin, Meng, Xuying, Han, Peng, Shang, Shuo, Liu, Kang, Sun, Aixin, Wang, Yequan |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2304.06875 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs
by: Fan, Siqi, et al.
Published: (2025)
by: Fan, Siqi, et al.
Published: (2025)
Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024)
by: Fan, Siqi, et al.
Published: (2024)
Open-domain Implicit Format Control for Large Language Model Generation
by: Yao, Yiqun, et al.
Published: (2024)
by: Yao, Yiqun, et al.
Published: (2024)
EgoMem: Lifelong Memory Agent for Full-duplex Omnimodal Models
by: Yao, Yiqun, et al.
Published: (2025)
by: Yao, Yiqun, et al.
Published: (2025)
GCRE-GPT: A Generative Model for Comparative Relation Extraction
by: Wang, Yequan, et al.
Published: (2023)
by: Wang, Yequan, et al.
Published: (2023)
RoboEgo System Card: An Omnimodal Model with Native Full Duplexity
by: Yao, Yiqun, et al.
Published: (2025)
by: Yao, Yiqun, et al.
Published: (2025)
The Price of a Second Thought: On the Evaluation of Reasoning Efficiency in Large Language Models
by: Fan, Siqi, et al.
Published: (2025)
by: Fan, Siqi, et al.
Published: (2025)
Sketch: A Toolkit for Streamlining LLM Operations
by: Jiang, Xin, et al.
Published: (2024)
by: Jiang, Xin, et al.
Published: (2024)
FLM-101B: An Open LLM and How to Train It with $100K Budget
by: Li, Xiang, et al.
Published: (2023)
by: Li, Xiang, et al.
Published: (2023)
FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training
by: Yao, Yiqun, et al.
Published: (2025)
by: Yao, Yiqun, et al.
Published: (2025)
Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency
by: Fan, Siqi, et al.
Published: (2025)
by: Fan, Siqi, et al.
Published: (2025)
Reasons and Solutions for the Decline in Model Performance after Editing
by: Huang, Xiusheng, et al.
Published: (2024)
by: Huang, Xiusheng, et al.
Published: (2024)
Commonsense Knowledge Editing Based on Free-Text in LLMs
by: Huang, Xiusheng, et al.
Published: (2024)
by: Huang, Xiusheng, et al.
Published: (2024)
Masked Structural Growth for 2x Faster Language Model Pre-training
by: Yao, Yiqun, et al.
Published: (2023)
by: Yao, Yiqun, et al.
Published: (2023)
Toward Embodied AGI: A Review of Embodied AI and the Road Ahead
by: Wang, Yequan, et al.
Published: (2025)
by: Wang, Yequan, et al.
Published: (2025)
Capability Localization: Capabilities Can be Localized rather than Individual Knowledge
by: Huang, Xiusheng, et al.
Published: (2025)
by: Huang, Xiusheng, et al.
Published: (2025)
Mutual Enhancement Between Global Tokens and Patch Tokens: From Theory to Practice
by: Huang, Xiusheng, et al.
Published: (2026)
by: Huang, Xiusheng, et al.
Published: (2026)
SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
by: Xing, Xingrun, et al.
Published: (2024)
by: Xing, Xingrun, et al.
Published: (2024)
Aligning the Spectrum: Hybrid Graph Pre-training and Prompt Tuning across Homophily and Heterophily
by: Luo, Haitong, et al.
Published: (2025)
by: Luo, Haitong, et al.
Published: (2025)
NetGPT: Generative Pretrained Transformer for Network Traffic
by: Meng, Xuying, et al.
Published: (2023)
by: Meng, Xuying, et al.
Published: (2023)
Theory-optimal Quantization Based on Flatness
by: Huang, Xiusheng, et al.
Published: (2026)
by: Huang, Xiusheng, et al.
Published: (2026)
Hint Tuning: Less Data Makes Better Reasoners
by: Fan, Siqi, et al.
Published: (2026)
by: Fan, Siqi, et al.
Published: (2026)
A Contrastive Pre-trained Foundation Model for Deciphering Imaging Noisomics across Modalities
by: Gu, Yuanjie, et al.
Published: (2026)
by: Gu, Yuanjie, et al.
Published: (2026)
Nethira: A Heterogeneity-aware Hierarchical Pre-trained Model for Network Traffic Classification
by: Lin, Chungang, et al.
Published: (2026)
by: Lin, Chungang, et al.
Published: (2026)
PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training
by: Yi, Rongjie, et al.
Published: (2024)
by: Yi, Rongjie, et al.
Published: (2024)
Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks
by: Tahir, Munief Hassan, et al.
Published: (2024)
by: Tahir, Munief Hassan, et al.
Published: (2024)
EAQEC codes from s-Galois hulls decomposition of linear codes
by: Li, Hui, et al.
Published: (2024)
by: Li, Hui, et al.
Published: (2024)
Linear complementary pairs of codes over a finite non-commutative Frobenius ring
by: Bhowmick, Sanjit, et al.
Published: (2024)
by: Bhowmick, Sanjit, et al.
Published: (2024)
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study
by: An, Keyu, et al.
Published: (2024)
by: An, Keyu, et al.
Published: (2024)
Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History
by: Zhong, Qishuai, et al.
Published: (2025)
by: Zhong, Qishuai, et al.
Published: (2025)
MUON+: Towards More Effective Muon via One Additional Normalization Step for LLM Pre-training
by: Zhang, Ruijie, et al.
Published: (2026)
by: Zhang, Ruijie, et al.
Published: (2026)
BiPFT: Binary Pre-trained Foundation Transformer with Low-rank Estimation of Binarization Residual Polynomials
by: Xing, Xingrun, et al.
Published: (2023)
by: Xing, Xingrun, et al.
Published: (2023)
Spectral-Based Graph Neural Networks for Complementary Item Recommendation
by: Luo, Haitong, et al.
Published: (2024)
by: Luo, Haitong, et al.
Published: (2024)
CrossPT-EEG: A Benchmark for Cross-Participant and Cross-Time Generalization of EEG-based Visual Decoding
by: Zhu, Shuqi, et al.
Published: (2024)
by: Zhu, Shuqi, et al.
Published: (2024)
A Closer Look at the Explainability of Contrastive Language-Image Pre-training
by: Li, Yi, et al.
Published: (2023)
by: Li, Yi, et al.
Published: (2023)
B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability
by: Wang, Yifan, et al.
Published: (2025)
by: Wang, Yifan, et al.
Published: (2025)
SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
by: Lin, Jingru, et al.
Published: (2024)
by: Lin, Jingru, et al.
Published: (2024)
52B to 1T: Lessons Learned via Tele-FLM Series
by: Li, Xiang, et al.
Published: (2024)
by: Li, Xiang, et al.
Published: (2024)
Tele-FLM Technical Report
by: Li, Xiang, et al.
Published: (2024)
by: Li, Xiang, et al.
Published: (2024)
Home After Loss: Housing Relocation and Affordability Stress Following Spousal Loss
by: Gum‐Ryeong Park
Published: (2026)
by: Gum‐Ryeong Park
Published: (2026)
Similar Items
-
If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs
by: Fan, Siqi, et al.
Published: (2025) -
Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024) -
Open-domain Implicit Format Control for Large Language Model Generation
by: Yao, Yiqun, et al.
Published: (2024) -
EgoMem: Lifelong Memory Agent for Full-duplex Omnimodal Models
by: Yao, Yiqun, et al.
Published: (2025) -
GCRE-GPT: A Generative Model for Comparative Relation Extraction
by: Wang, Yequan, et al.
Published: (2023)