Saved in:
| Main Authors: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.21075 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866914605663518720 |
|---|---|
| author | Gunter, Tom Wang, Zirui Wang, Chong Pang, Ruoming Narayanan, Andy Zhang, Aonan Zhang, Bowen Chen, Chen Chiu, Chung-Cheng Qiu, David Gopinath, Deepak Yap, Dian Ang Yin, Dong Nan, Feng Weers, Floris Yin, Guoli Huang, Haoshuo Wang, Jianyu Lu, Jiarui Peebles, John Ye, Ke Lee, Mark Du, Nan Chen, Qibin Keunebroek, Quentin Wiseman, Sam Evans, Syd Lei, Tao Rathod, Vivek Kong, Xiang Du, Xianzhi Li, Yanghao Wang, Yongqiang Gao, Yuan Ahmed, Zaid Xu, Zhaoyang Lu, Zhiyun Rashid, Al Jose, Albin Madappally Doane, Alec Bencomo, Alfredo Vanderby, Allison Hansen, Andrew Jain, Ankur Anupama, Anupama Mann Kamal, Areeba Wu, Bugu Brum, Carolina Maalouf, Charlie Erdenebileg, Chinguun Dulhanty, Chris Parilla, Daniel Moritz, Dominik Kang, Doug Jimenez, Eduardo Ladd, Evan Shi, Fangping Bai, Felix Chu, Frank Hohman, Fred Kotek, Hadas Coleman, Hannah Gillis Li, Jane Bigham, Jeffrey Cao, Jeffery Lai, Jeff Cheung, Jessica Shan, Jiulong Zhou, Joe Li, John Qin, Jun Singh, Karanjeet Vega, Karla Zou, Kelvin Heckman, Laura Gardiner, Lauren Bowler, Margit Cordell, Maria Cao, Meng Hay, Nicole Shahdadpuri, Nilesh Godwin, Otto Dighe, Pranay Rachapudi, Pushyami Tantawi, Ramsey Frigg, Roman Davarnia, Sam Shah, Sanskruti Guha, Saptarshi Sirovica, Sasha Ma, Shen Ma, Shuang Wang, Simon Kim, Sulgi Jayaram, Suma Shankar, Vaishaal Paidi, Varsha Kumar, Vivek Wang, Xin Zheng, Xin Cheng, Walker Shrager, Yael Ye, Yang Tanaka, Yasu Guo, Yihao Meng, Yunsong Luo, Zhao Tang Ouyang, Zhi Aygar, Alp Wan, Alvin Walkingshaw, Andrew Narayanan, Andy Lin, Antonie Farooq, Arsalan Ramerth, Brent Reed, Colorado Bartels, Chris Chaney, Chris Riazati, David Yang, Eric Liang Feldman, Erin Hochstrasser, Gabriel Seguin, Guillaume Belousova, Irina Pelemans, Joris Yang, Karen Vahid, Keivan Alizadeh Cao, Liangliang Najibi, Mahyar Zuliani, Marco Horton, Max Cho, Minsik Bhendawade, Nikhil Dong, Patrick Maj, Piotr Agrawal, Pulkit Shan, Qi Fu, Qichen Poston, Regan Xu, Sam Liu, Shuangning Rao, Sushma Heeramun, Tashweena Merth, Thomas Rayala, Uday Cui, Victor Sridhar, Vivek Rangarajan Zhang, Wencong Zhang, Wenqi Wu, Wentao Zhou, Xingyu Liu, Xinwen Zhao, Yang Xia, Yin Ren, Zhile Ren, Zhongzheng |
| author_facet | Gunter, Tom Wang, Zirui Wang, Chong Pang, Ruoming Narayanan, Andy Zhang, Aonan Zhang, Bowen Chen, Chen Chiu, Chung-Cheng Qiu, David Gopinath, Deepak Yap, Dian Ang Yin, Dong Nan, Feng Weers, Floris Yin, Guoli Huang, Haoshuo Wang, Jianyu Lu, Jiarui Peebles, John Ye, Ke Lee, Mark Du, Nan Chen, Qibin Keunebroek, Quentin Wiseman, Sam Evans, Syd Lei, Tao Rathod, Vivek Kong, Xiang Du, Xianzhi Li, Yanghao Wang, Yongqiang Gao, Yuan Ahmed, Zaid Xu, Zhaoyang Lu, Zhiyun Rashid, Al Jose, Albin Madappally Doane, Alec Bencomo, Alfredo Vanderby, Allison Hansen, Andrew Jain, Ankur Anupama, Anupama Mann Kamal, Areeba Wu, Bugu Brum, Carolina Maalouf, Charlie Erdenebileg, Chinguun Dulhanty, Chris Parilla, Daniel Moritz, Dominik Kang, Doug Jimenez, Eduardo Ladd, Evan Shi, Fangping Bai, Felix Chu, Frank Hohman, Fred Kotek, Hadas Coleman, Hannah Gillis Li, Jane Bigham, Jeffrey Cao, Jeffery Lai, Jeff Cheung, Jessica Shan, Jiulong Zhou, Joe Li, John Qin, Jun Singh, Karanjeet Vega, Karla Zou, Kelvin Heckman, Laura Gardiner, Lauren Bowler, Margit Cordell, Maria Cao, Meng Hay, Nicole Shahdadpuri, Nilesh Godwin, Otto Dighe, Pranay Rachapudi, Pushyami Tantawi, Ramsey Frigg, Roman Davarnia, Sam Shah, Sanskruti Guha, Saptarshi Sirovica, Sasha Ma, Shen Ma, Shuang Wang, Simon Kim, Sulgi Jayaram, Suma Shankar, Vaishaal Paidi, Varsha Kumar, Vivek Wang, Xin Zheng, Xin Cheng, Walker Shrager, Yael Ye, Yang Tanaka, Yasu Guo, Yihao Meng, Yunsong Luo, Zhao Tang Ouyang, Zhi Aygar, Alp Wan, Alvin Walkingshaw, Andrew Narayanan, Andy Lin, Antonie Farooq, Arsalan Ramerth, Brent Reed, Colorado Bartels, Chris Chaney, Chris Riazati, David Yang, Eric Liang Feldman, Erin Hochstrasser, Gabriel Seguin, Guillaume Belousova, Irina Pelemans, Joris Yang, Karen Vahid, Keivan Alizadeh Cao, Liangliang Najibi, Mahyar Zuliani, Marco Horton, Max Cho, Minsik Bhendawade, Nikhil Dong, Patrick Maj, Piotr Agrawal, Pulkit Shan, Qi Fu, Qichen Poston, Regan Xu, Sam Liu, Shuangning Rao, Sushma Heeramun, Tashweena Merth, Thomas Rayala, Uday Cui, Victor Sridhar, Vivek Rangarajan Zhang, Wencong Zhang, Wenqi Wu, Wentao Zhou, Xingyu Liu, Xinwen Zhao, Yang Xia, Yin Ren, Zhile Ren, Zhongzheng |
| contents | We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2407_21075 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | Apple Intelligence Foundation Language Models Gunter, Tom Wang, Zirui Wang, Chong Pang, Ruoming Narayanan, Andy Zhang, Aonan Zhang, Bowen Chen, Chen Chiu, Chung-Cheng Qiu, David Gopinath, Deepak Yap, Dian Ang Yin, Dong Nan, Feng Weers, Floris Yin, Guoli Huang, Haoshuo Wang, Jianyu Lu, Jiarui Peebles, John Ye, Ke Lee, Mark Du, Nan Chen, Qibin Keunebroek, Quentin Wiseman, Sam Evans, Syd Lei, Tao Rathod, Vivek Kong, Xiang Du, Xianzhi Li, Yanghao Wang, Yongqiang Gao, Yuan Ahmed, Zaid Xu, Zhaoyang Lu, Zhiyun Rashid, Al Jose, Albin Madappally Doane, Alec Bencomo, Alfredo Vanderby, Allison Hansen, Andrew Jain, Ankur Anupama, Anupama Mann Kamal, Areeba Wu, Bugu Brum, Carolina Maalouf, Charlie Erdenebileg, Chinguun Dulhanty, Chris Parilla, Daniel Moritz, Dominik Kang, Doug Jimenez, Eduardo Ladd, Evan Shi, Fangping Bai, Felix Chu, Frank Hohman, Fred Kotek, Hadas Coleman, Hannah Gillis Li, Jane Bigham, Jeffrey Cao, Jeffery Lai, Jeff Cheung, Jessica Shan, Jiulong Zhou, Joe Li, John Qin, Jun Singh, Karanjeet Vega, Karla Zou, Kelvin Heckman, Laura Gardiner, Lauren Bowler, Margit Cordell, Maria Cao, Meng Hay, Nicole Shahdadpuri, Nilesh Godwin, Otto Dighe, Pranay Rachapudi, Pushyami Tantawi, Ramsey Frigg, Roman Davarnia, Sam Shah, Sanskruti Guha, Saptarshi Sirovica, Sasha Ma, Shen Ma, Shuang Wang, Simon Kim, Sulgi Jayaram, Suma Shankar, Vaishaal Paidi, Varsha Kumar, Vivek Wang, Xin Zheng, Xin Cheng, Walker Shrager, Yael Ye, Yang Tanaka, Yasu Guo, Yihao Meng, Yunsong Luo, Zhao Tang Ouyang, Zhi Aygar, Alp Wan, Alvin Walkingshaw, Andrew Narayanan, Andy Lin, Antonie Farooq, Arsalan Ramerth, Brent Reed, Colorado Bartels, Chris Chaney, Chris Riazati, David Yang, Eric Liang Feldman, Erin Hochstrasser, Gabriel Seguin, Guillaume Belousova, Irina Pelemans, Joris Yang, Karen Vahid, Keivan Alizadeh Cao, Liangliang Najibi, Mahyar Zuliani, Marco Horton, Max Cho, Minsik Bhendawade, Nikhil Dong, Patrick Maj, Piotr Agrawal, Pulkit Shan, Qi Fu, Qichen Poston, Regan Xu, Sam Liu, Shuangning Rao, Sushma Heeramun, Tashweena Merth, Thomas Rayala, Uday Cui, Victor Sridhar, Vivek Rangarajan Zhang, Wencong Zhang, Wenqi Wu, Wentao Zhou, Xingyu Liu, Xinwen Zhao, Yang Xia, Yin Ren, Zhile Ren, Zhongzheng Artificial Intelligence Computation and Language Machine Learning We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development. |
| title | Apple Intelligence Foundation Language Models |
| topic | Artificial Intelligence Computation and Language Machine Learning |
| url | https://arxiv.org/abs/2407.21075 |