:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yao, Yiqun, fan, Siqi, Huang, Xiusheng, Fang, Xuezhi, Li, Xiang, Ni, Ziyi, Jiang, Xin, Meng, Xuying, Han, Peng, Shang, Shuo, Liu, Kang, Sun, Aixin, Wang, Yequan
Format:	Preprint
Published:	2023
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2304.06875
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs
by: Fan, Siqi, et al.
Published: (2025)

Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024)

Open-domain Implicit Format Control for Large Language Model Generation
by: Yao, Yiqun, et al.
Published: (2024)

EgoMem: Lifelong Memory Agent for Full-duplex Omnimodal Models
by: Yao, Yiqun, et al.
Published: (2025)

GCRE-GPT: A Generative Model for Comparative Relation Extraction
by: Wang, Yequan, et al.
Published: (2023)

RoboEgo System Card: An Omnimodal Model with Native Full Duplexity
by: Yao, Yiqun, et al.
Published: (2025)

The Price of a Second Thought: On the Evaluation of Reasoning Efficiency in Large Language Models
by: Fan, Siqi, et al.
Published: (2025)

Sketch: A Toolkit for Streamlining LLM Operations
by: Jiang, Xin, et al.
Published: (2024)

FLM-101B: An Open LLM and How to Train It with $100K Budget
by: Li, Xiang, et al.
Published: (2023)

FLM-Audio: Natural Monologues Improves Native Full-Duplex Chatbots via Dual Training
by: Yao, Yiqun, et al.
Published: (2025)

Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency
by: Fan, Siqi, et al.
Published: (2025)

Reasons and Solutions for the Decline in Model Performance after Editing
by: Huang, Xiusheng, et al.
Published: (2024)

Commonsense Knowledge Editing Based on Free-Text in LLMs
by: Huang, Xiusheng, et al.
Published: (2024)

Masked Structural Growth for 2x Faster Language Model Pre-training
by: Yao, Yiqun, et al.
Published: (2023)

Toward Embodied AGI: A Review of Embodied AI and the Road Ahead
by: Wang, Yequan, et al.
Published: (2025)

Capability Localization: Capabilities Can be Localized rather than Individual Knowledge
by: Huang, Xiusheng, et al.
Published: (2025)

Mutual Enhancement Between Global Tokens and Patch Tokens: From Theory to Practice
by: Huang, Xiusheng, et al.
Published: (2026)

SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
by: Xing, Xingrun, et al.
Published: (2024)

Aligning the Spectrum: Hybrid Graph Pre-training and Prompt Tuning across Homophily and Heterophily
by: Luo, Haitong, et al.
Published: (2025)

NetGPT: Generative Pretrained Transformer for Network Traffic
by: Meng, Xuying, et al.
Published: (2023)

Theory-optimal Quantization Based on Flatness
by: Huang, Xiusheng, et al.
Published: (2026)

Hint Tuning: Less Data Makes Better Reasoners
by: Fan, Siqi, et al.
Published: (2026)

A Contrastive Pre-trained Foundation Model for Deciphering Imaging Noisomics across Modalities
by: Gu, Yuanjie, et al.
Published: (2026)

Nethira: A Heterogeneity-aware Hierarchical Pre-trained Model for Network Traffic Classification
by: Lin, Chungang, et al.
Published: (2026)

PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training
by: Yi, Rongjie, et al.
Published: (2024)

Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks
by: Tahir, Munief Hassan, et al.
Published: (2024)

EAQEC codes from s-Galois hulls decomposition of linear codes
by: Li, Hui, et al.
Published: (2024)

Linear complementary pairs of codes over a finite non-commutative Frobenius ring
by: Bhowmick, Sanjit, et al.
Published: (2024)

Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study
by: An, Keyu, et al.
Published: (2024)

Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History
by: Zhong, Qishuai, et al.
Published: (2025)

MUON+: Towards More Effective Muon via One Additional Normalization Step for LLM Pre-training
by: Zhang, Ruijie, et al.
Published: (2026)

BiPFT: Binary Pre-trained Foundation Transformer with Low-rank Estimation of Binarization Residual Polynomials
by: Xing, Xingrun, et al.
Published: (2023)

Spectral-Based Graph Neural Networks for Complementary Item Recommendation
by: Luo, Haitong, et al.
Published: (2024)

CrossPT-EEG: A Benchmark for Cross-Participant and Cross-Time Generalization of EEG-based Visual Decoding
by: Zhu, Shuqi, et al.
Published: (2024)

A Closer Look at the Explainability of Contrastive Language-Image Pre-training
by: Li, Yi, et al.
Published: (2023)

B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability
by: Wang, Yifan, et al.
Published: (2025)

SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
by: Lin, Jingru, et al.
Published: (2024)

52B to 1T: Lessons Learned via Tele-FLM Series
by: Li, Xiang, et al.
Published: (2024)

Tele-FLM Technical Report
by: Li, Xiang, et al.
Published: (2024)

Home After Loss: Housing Relocation and Affordability Stress Following Spousal Loss
by: Gum‐Ryeong Park
Published: (2026)