Introducing Apple’s On-Device and Server Foundation Models

A geeky deep-dive into the research behind Apple’s new ‘foundation’ models, both on-device and in their ‘private’ cloud:

In the following overview, we will detail how two of these models — a ~3 billion parameter on-device language model, and a larger server-based language model available with Private Cloud Compute and running on Apple silicon servers — have been built and adapted to perform specialized tasks efficiently, accurately, and responsibly. These two foundation models are part of a larger family of generative models created by Apple to support users and developers; this includes a coding model to build intelligence into Xcode, as well as a diffusion model to help users express themselves visually, for example, in the Messages app. We look forward to sharing more information soon on this broader set of models.

It’s state-of-the-art work. Apple haven’t been asleep.

Read it all here.
