A geeky deep-dive into the deep research work that went into Apple’s new ‘foundation’ models – both on-device and out on their ‘private’ cloud:
In the following overview, we will detail how two of these models — a ~3 billion parameter on-device language model, and a larger server-based language model available with Private Cloud Compute and running on Apple silicon servers — have been built and adapted to perform specialized tasks efficiently, accurately, and responsibly. These two foundation models are part of a larger family of generative models created by Apple to support users and developers; this includes a coding model to build intelligence into Xcode, as well as a diffusion model to help users express themselves visually, for example, in the Messages app. We look forward to sharing more information soon on this broader set of models.
It’s state-of-the-art work. Apple haven’t been asleep.
Read it all here.