Saved in:
Bibliographic Details
Main Authors: Gunter, Tom, Wang, Zirui, Wang, Chong, Pang, Ruoming, Narayanan, Andy, Zhang, Aonan, Zhang, Bowen, Chen, Chen, Chiu, Chung-Cheng, Qiu, David, Gopinath, Deepak, Yap, Dian Ang, Yin, Dong, Nan, Feng, Weers, Floris, Yin, Guoli, Huang, Haoshuo, Wang, Jianyu, Lu, Jiarui, Peebles, John, Ye, Ke, Lee, Mark, Du, Nan, Chen, Qibin, Keunebroek, Quentin, Wiseman, Sam, Evans, Syd, Lei, Tao, Rathod, Vivek, Kong, Xiang, Du, Xianzhi, Li, Yanghao, Wang, Yongqiang, Gao, Yuan, Ahmed, Zaid, Xu, Zhaoyang, Lu, Zhiyun, Rashid, Al, Jose, Albin Madappally, Doane, Alec, Bencomo, Alfredo, Vanderby, Allison, Hansen, Andrew, Jain, Ankur, Anupama, Anupama Mann, Kamal, Areeba, Wu, Bugu, Brum, Carolina, Maalouf, Charlie, Erdenebileg, Chinguun, Dulhanty, Chris, Parilla, Daniel, Moritz, Dominik, Kang, Doug, Jimenez, Eduardo, Ladd, Evan, Shi, Fangping, Bai, Felix, Chu, Frank, Hohman, Fred, Kotek, Hadas, Coleman, Hannah Gillis, Li, Jane, Bigham, Jeffrey, Cao, Jeffery, Lai, Jeff, Cheung, Jessica, Shan, Jiulong, Zhou, Joe, Li, John, Qin, Jun, Singh, Karanjeet, Vega, Karla, Zou, Kelvin, Heckman, Laura, Gardiner, Lauren, Bowler, Margit, Cordell, Maria, Cao, Meng, Hay, Nicole, Shahdadpuri, Nilesh, Godwin, Otto, Dighe, Pranay, Rachapudi, Pushyami, Tantawi, Ramsey, Frigg, Roman, Davarnia, Sam, Shah, Sanskruti, Guha, Saptarshi, Sirovica, Sasha, Ma, Shen, Ma, Shuang, Wang, Simon, Kim, Sulgi, Jayaram, Suma, Shankar, Vaishaal, Paidi, Varsha, Kumar, Vivek, Wang, Xin, Zheng, Xin, Cheng, Walker, Shrager, Yael, Ye, Yang, Tanaka, Yasu, Guo, Yihao, Meng, Yunsong, Luo, Zhao Tang, Ouyang, Zhi, Aygar, Alp, Wan, Alvin, Walkingshaw, Andrew, Lin, Antonie, Farooq, Arsalan, Ramerth, Brent, Reed, Colorado, Bartels, Chris, Chaney, Chris, Riazati, David, Yang, Eric Liang, Feldman, Erin, Hochstrasser, Gabriel, Seguin, Guillaume, Belousova, Irina, Pelemans, Joris, Yang, Karen, Vahid, Keivan Alizadeh, Cao, Liangliang, Najibi, Mahyar, Zuliani, Marco, Horton, Max, Cho, Minsik, Bhendawade, Nikhil, Dong, Patrick, Maj, Piotr, Agrawal, Pulkit, Shan, Qi, Fu, Qichen, Poston, Regan, Xu, Sam, Liu, Shuangning, Rao, Sushma, Heeramun, Tashweena, Merth, Thomas, Rayala, Uday, Cui, Victor, Sridhar, Vivek Rangarajan, Zhang, Wencong, Zhang, Wenqi, Wu, Wentao, Zhou, Xingyu, Liu, Xinwen, Zhao, Yang, Xia, Yin, Ren, Zhile, Ren, Zhongzheng
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.21075
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914605663518720
author Gunter, Tom
Wang, Zirui
Wang, Chong
Pang, Ruoming
Narayanan, Andy
Zhang, Aonan
Zhang, Bowen
Chen, Chen
Chiu, Chung-Cheng
Qiu, David
Gopinath, Deepak
Yap, Dian Ang
Yin, Dong
Nan, Feng
Weers, Floris
Yin, Guoli
Huang, Haoshuo
Wang, Jianyu
Lu, Jiarui
Peebles, John
Ye, Ke
Lee, Mark
Du, Nan
Chen, Qibin
Keunebroek, Quentin
Wiseman, Sam
Evans, Syd
Lei, Tao
Rathod, Vivek
Kong, Xiang
Du, Xianzhi
Li, Yanghao
Wang, Yongqiang
Gao, Yuan
Ahmed, Zaid
Xu, Zhaoyang
Lu, Zhiyun
Rashid, Al
Jose, Albin Madappally
Doane, Alec
Bencomo, Alfredo
Vanderby, Allison
Hansen, Andrew
Jain, Ankur
Anupama, Anupama Mann
Kamal, Areeba
Wu, Bugu
Brum, Carolina
Maalouf, Charlie
Erdenebileg, Chinguun
Dulhanty, Chris
Parilla, Daniel
Moritz, Dominik
Kang, Doug
Jimenez, Eduardo
Ladd, Evan
Shi, Fangping
Bai, Felix
Chu, Frank
Hohman, Fred
Kotek, Hadas
Coleman, Hannah Gillis
Li, Jane
Bigham, Jeffrey
Cao, Jeffery
Lai, Jeff
Cheung, Jessica
Shan, Jiulong
Zhou, Joe
Li, John
Qin, Jun
Singh, Karanjeet
Vega, Karla
Zou, Kelvin
Heckman, Laura
Gardiner, Lauren
Bowler, Margit
Cordell, Maria
Cao, Meng
Hay, Nicole
Shahdadpuri, Nilesh
Godwin, Otto
Dighe, Pranay
Rachapudi, Pushyami
Tantawi, Ramsey
Frigg, Roman
Davarnia, Sam
Shah, Sanskruti
Guha, Saptarshi
Sirovica, Sasha
Ma, Shen
Ma, Shuang
Wang, Simon
Kim, Sulgi
Jayaram, Suma
Shankar, Vaishaal
Paidi, Varsha
Kumar, Vivek
Wang, Xin
Zheng, Xin
Cheng, Walker
Shrager, Yael
Ye, Yang
Tanaka, Yasu
Guo, Yihao
Meng, Yunsong
Luo, Zhao Tang
Ouyang, Zhi
Aygar, Alp
Wan, Alvin
Walkingshaw, Andrew
Narayanan, Andy
Lin, Antonie
Farooq, Arsalan
Ramerth, Brent
Reed, Colorado
Bartels, Chris
Chaney, Chris
Riazati, David
Yang, Eric Liang
Feldman, Erin
Hochstrasser, Gabriel
Seguin, Guillaume
Belousova, Irina
Pelemans, Joris
Yang, Karen
Vahid, Keivan Alizadeh
Cao, Liangliang
Najibi, Mahyar
Zuliani, Marco
Horton, Max
Cho, Minsik
Bhendawade, Nikhil
Dong, Patrick
Maj, Piotr
Agrawal, Pulkit
Shan, Qi
Fu, Qichen
Poston, Regan
Xu, Sam
Liu, Shuangning
Rao, Sushma
Heeramun, Tashweena
Merth, Thomas
Rayala, Uday
Cui, Victor
Sridhar, Vivek Rangarajan
Zhang, Wencong
Zhang, Wenqi
Wu, Wentao
Zhou, Xingyu
Liu, Xinwen
Zhao, Yang
Xia, Yin
Ren, Zhile
Ren, Zhongzheng
author_facet Gunter, Tom
Wang, Zirui
Wang, Chong
Pang, Ruoming
Narayanan, Andy
Zhang, Aonan
Zhang, Bowen
Chen, Chen
Chiu, Chung-Cheng
Qiu, David
Gopinath, Deepak
Yap, Dian Ang
Yin, Dong
Nan, Feng
Weers, Floris
Yin, Guoli
Huang, Haoshuo
Wang, Jianyu
Lu, Jiarui
Peebles, John
Ye, Ke
Lee, Mark
Du, Nan
Chen, Qibin
Keunebroek, Quentin
Wiseman, Sam
Evans, Syd
Lei, Tao
Rathod, Vivek
Kong, Xiang
Du, Xianzhi
Li, Yanghao
Wang, Yongqiang
Gao, Yuan
Ahmed, Zaid
Xu, Zhaoyang
Lu, Zhiyun
Rashid, Al
Jose, Albin Madappally
Doane, Alec
Bencomo, Alfredo
Vanderby, Allison
Hansen, Andrew
Jain, Ankur
Anupama, Anupama Mann
Kamal, Areeba
Wu, Bugu
Brum, Carolina
Maalouf, Charlie
Erdenebileg, Chinguun
Dulhanty, Chris
Parilla, Daniel
Moritz, Dominik
Kang, Doug
Jimenez, Eduardo
Ladd, Evan
Shi, Fangping
Bai, Felix
Chu, Frank
Hohman, Fred
Kotek, Hadas
Coleman, Hannah Gillis
Li, Jane
Bigham, Jeffrey
Cao, Jeffery
Lai, Jeff
Cheung, Jessica
Shan, Jiulong
Zhou, Joe
Li, John
Qin, Jun
Singh, Karanjeet
Vega, Karla
Zou, Kelvin
Heckman, Laura
Gardiner, Lauren
Bowler, Margit
Cordell, Maria
Cao, Meng
Hay, Nicole
Shahdadpuri, Nilesh
Godwin, Otto
Dighe, Pranay
Rachapudi, Pushyami
Tantawi, Ramsey
Frigg, Roman
Davarnia, Sam
Shah, Sanskruti
Guha, Saptarshi
Sirovica, Sasha
Ma, Shen
Ma, Shuang
Wang, Simon
Kim, Sulgi
Jayaram, Suma
Shankar, Vaishaal
Paidi, Varsha
Kumar, Vivek
Wang, Xin
Zheng, Xin
Cheng, Walker
Shrager, Yael
Ye, Yang
Tanaka, Yasu
Guo, Yihao
Meng, Yunsong
Luo, Zhao Tang
Ouyang, Zhi
Aygar, Alp
Wan, Alvin
Walkingshaw, Andrew
Narayanan, Andy
Lin, Antonie
Farooq, Arsalan
Ramerth, Brent
Reed, Colorado
Bartels, Chris
Chaney, Chris
Riazati, David
Yang, Eric Liang
Feldman, Erin
Hochstrasser, Gabriel
Seguin, Guillaume
Belousova, Irina
Pelemans, Joris
Yang, Karen
Vahid, Keivan Alizadeh
Cao, Liangliang
Najibi, Mahyar
Zuliani, Marco
Horton, Max
Cho, Minsik
Bhendawade, Nikhil
Dong, Patrick
Maj, Piotr
Agrawal, Pulkit
Shan, Qi
Fu, Qichen
Poston, Regan
Xu, Sam
Liu, Shuangning
Rao, Sushma
Heeramun, Tashweena
Merth, Thomas
Rayala, Uday
Cui, Victor
Sridhar, Vivek Rangarajan
Zhang, Wencong
Zhang, Wenqi
Wu, Wentao
Zhou, Xingyu
Liu, Xinwen
Zhao, Yang
Xia, Yin
Ren, Zhile
Ren, Zhongzheng
contents We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development.
format Preprint
id arxiv_https___arxiv_org_abs_2407_21075
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Apple Intelligence Foundation Language Models
Gunter, Tom
Wang, Zirui
Wang, Chong
Pang, Ruoming
Narayanan, Andy
Zhang, Aonan
Zhang, Bowen
Chen, Chen
Chiu, Chung-Cheng
Qiu, David
Gopinath, Deepak
Yap, Dian Ang
Yin, Dong
Nan, Feng
Weers, Floris
Yin, Guoli
Huang, Haoshuo
Wang, Jianyu
Lu, Jiarui
Peebles, John
Ye, Ke
Lee, Mark
Du, Nan
Chen, Qibin
Keunebroek, Quentin
Wiseman, Sam
Evans, Syd
Lei, Tao
Rathod, Vivek
Kong, Xiang
Du, Xianzhi
Li, Yanghao
Wang, Yongqiang
Gao, Yuan
Ahmed, Zaid
Xu, Zhaoyang
Lu, Zhiyun
Rashid, Al
Jose, Albin Madappally
Doane, Alec
Bencomo, Alfredo
Vanderby, Allison
Hansen, Andrew
Jain, Ankur
Anupama, Anupama Mann
Kamal, Areeba
Wu, Bugu
Brum, Carolina
Maalouf, Charlie
Erdenebileg, Chinguun
Dulhanty, Chris
Parilla, Daniel
Moritz, Dominik
Kang, Doug
Jimenez, Eduardo
Ladd, Evan
Shi, Fangping
Bai, Felix
Chu, Frank
Hohman, Fred
Kotek, Hadas
Coleman, Hannah Gillis
Li, Jane
Bigham, Jeffrey
Cao, Jeffery
Lai, Jeff
Cheung, Jessica
Shan, Jiulong
Zhou, Joe
Li, John
Qin, Jun
Singh, Karanjeet
Vega, Karla
Zou, Kelvin
Heckman, Laura
Gardiner, Lauren
Bowler, Margit
Cordell, Maria
Cao, Meng
Hay, Nicole
Shahdadpuri, Nilesh
Godwin, Otto
Dighe, Pranay
Rachapudi, Pushyami
Tantawi, Ramsey
Frigg, Roman
Davarnia, Sam
Shah, Sanskruti
Guha, Saptarshi
Sirovica, Sasha
Ma, Shen
Ma, Shuang
Wang, Simon
Kim, Sulgi
Jayaram, Suma
Shankar, Vaishaal
Paidi, Varsha
Kumar, Vivek
Wang, Xin
Zheng, Xin
Cheng, Walker
Shrager, Yael
Ye, Yang
Tanaka, Yasu
Guo, Yihao
Meng, Yunsong
Luo, Zhao Tang
Ouyang, Zhi
Aygar, Alp
Wan, Alvin
Walkingshaw, Andrew
Narayanan, Andy
Lin, Antonie
Farooq, Arsalan
Ramerth, Brent
Reed, Colorado
Bartels, Chris
Chaney, Chris
Riazati, David
Yang, Eric Liang
Feldman, Erin
Hochstrasser, Gabriel
Seguin, Guillaume
Belousova, Irina
Pelemans, Joris
Yang, Karen
Vahid, Keivan Alizadeh
Cao, Liangliang
Najibi, Mahyar
Zuliani, Marco
Horton, Max
Cho, Minsik
Bhendawade, Nikhil
Dong, Patrick
Maj, Piotr
Agrawal, Pulkit
Shan, Qi
Fu, Qichen
Poston, Regan
Xu, Sam
Liu, Shuangning
Rao, Sushma
Heeramun, Tashweena
Merth, Thomas
Rayala, Uday
Cui, Victor
Sridhar, Vivek Rangarajan
Zhang, Wencong
Zhang, Wenqi
Wu, Wentao
Zhou, Xingyu
Liu, Xinwen
Zhao, Yang
Xia, Yin
Ren, Zhile
Ren, Zhongzheng
Artificial Intelligence
Computation and Language
Machine Learning
We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development.
title Apple Intelligence Foundation Language Models
topic Artificial Intelligence
Computation and Language
Machine Learning
url https://arxiv.org/abs/2407.21075