MacRumors

Apple Publishes Details About New 'MM1' AI Model

Apple researchers have developed a new method for training large language models (LLMs) that seamlessly integrates both text and visual information.

The company's findings, detailed in a research paper titled "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training," showcase a new approach to creating more intelligent and flexible AI systems. By training on a diverse dataset comprising image-caption pairs, interleaved image-text documents, and text-only data, Apple claims that the MM1 model sets a new standard in AI's ability to perform tasks such as image captioning, visual question answering, and natural language inference with a high level of accuracy.

Apple's research focuses on the combination of different types of training data and model architectures, which enables the AI to understand and generate language based on a mix of visual and linguistic cues. This capability is vital for tasks that require a nuanced comprehension of the world, such as interpreting complex images or answering questions that involve visual elements.

The paper also highlights the MM1 model's exceptional in-context learning abilities, particularly in the largest, 30-billion-parameter configuration of the model. This version apparently exhibits remarkable capabilities for multi-step reasoning over multiple images using few-shot "chain-of-thought" prompting, a technique that allows the AI to perform complex, open-ended problem solving based on minimal examples.
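
For readers unfamiliar with the technique, here is a minimal Python sketch of what a few-shot chain-of-thought prompt over images might look like. It is an illustration only, not Apple's code or MM1's actual input format: the `<image:...>` placeholder syntax, the `Example` class, the `build_cot_prompt` function, and the file names are all hypothetical. The idea is simply that each worked example pairs images with a question, spelled-out reasoning, and an answer, and the final question is left open for the model to complete.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Example:
    """One worked example: images, a question, step-by-step reasoning, an answer."""
    image_refs: List[str]  # hypothetical image identifiers, not a real MM1 format
    question: str
    reasoning: str
    answer: str


def build_cot_prompt(examples: List[Example],
                     query_images: List[str],
                     query_question: str) -> str:
    """Interleave image placeholders and text into one few-shot prompt string."""
    parts: List[str] = []
    for ex in examples:
        # Each image is represented by a placeholder token (an assumption).
        parts.extend(f"<image:{ref}>" for ref in ex.image_refs)
        parts.append(f"Q: {ex.question}")
        # The reasoning is written out before the answer: the "chain of thought".
        parts.append(f"A: {ex.reasoning} So the answer is {ex.answer}.")
        parts.append("")
    # The new question ends with an open "A:" for the model to continue.
    parts.extend(f"<image:{ref}>" for ref in query_images)
    parts.append(f"Q: {query_question}")
    parts.append("A:")
    return "\n".join(parts)


# Made-up demonstration data: one worked example, then a new two-image question.
demo_examples = [
    Example(
        image_refs=["table.jpg", "menu.jpg"],
        question="How much do the drinks on the table cost in total?",
        reasoning="The table shows 2 beers. The menu lists beer at $6. 2 x 6 = 12.",
        answer="$12",
    )
]
demo_prompt = build_cot_prompt(
    demo_examples,
    query_images=["cart.jpg", "price_list.jpg"],
    query_question="How much do the apples in the cart cost in total?",
)
print(demo_prompt)
```

A multimodal model given a prompt shaped like this would be expected to imitate the worked example, reasoning across both new images before stating an answer.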

This research emerges as part of Apple's broader initiative to enhance its AI capabilities amid growing competition. Earlier today, Bloomberg's Mark Gurman reported that Apple is in discussions with Google to license Google's Gemini generative large language models to power new features coming to the iPhone as part of iOS 18.

Top Rated Comments

28 months ago
Apple has recently been killing it in the deep learning space. Apple released MM1, and they have released more information than most open source LLM companies. I have been testing MLX for some of my workflows; it's probably the fastest among Python libraries. It runs open source LLM models on my iPad Pro. Gonna be interesting once it gets to iPhone and other devices. With recent updates, I can run Falcon 180B on my M1 Max, and my Nvidia RTX 4090 GPU can only dream. I hope Apple keeps up with the releases.
Score: 24 Votes
28 months ago
Will be extremely funny to read all this PR from Apple about the big important ML research they are doing and then find out at WWDC they are just gonna outsource Siri to Microsoft, lmao
Score: 16 Votes
28 months ago
It'll be way ahead of Gemini if it simply acknowledges the existence of White people. The bar is pretty low, Apple. Let's see what you've got.
Score: 12 Votes
spazzcat
28 months ago

Again this is all propaganda, Apple is not going to release the key to their secret sauce until at least WWDC, and maybe not even then.

Cook needs any goodwill the media will give him. I see him running from department to department asking, "What can we release through the backdoor, I'm sinking here!!!!"
We have no idea what's going on at Apple or what Apple is planning for iOS 18, and everything you wrote is hyperbole. And I would take any new development in iOS 18 with a grain of salt; the feature set would have been locked down months ago.
Score: 10 Votes
Amazing Iceman
28 months ago

Will be extremely funny to read all this PR from Apple about the big important ML research they are doing and then find out at WWDC they are just gonna outsource Siri to Microsoft, lmao
Then the new Siri will be called Siritana
Score: 10 Votes
mdriftmeyer
28 months ago

Will be extremely funny to read all this PR from Apple about the big important ML research they are doing and then find out at WWDC they are just gonna outsource Siri to Microsoft, lmao
The former head of AI at Google has been an SVP at Apple for nearly a decade. He, like the team that came with the PA Semi purchase, has been hard at work for his respective team.

Apple hasn't become who they are with such a vast treasure trove of expertise and wealth by sheer dumb luck.
Score: 8 Votes