MacRumors

Apple Publishes Details About New 'MM1' AI Model

Apple researchers have developed a new method for training large language models (LLMs) that seamlessly integrates both text and visual information.

The company's findings, detailed in a research paper titled "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training," showcase a new approach to creating more intelligent and flexible AI systems. By using a diverse dataset comprising image-caption pairs, interleaved image-text documents, and text-only data, Apple claims that the MM1 model sets a new standard in AI's ability to perform tasks such as image captioning, visual question answering, and natural language inference with a high level of accuracy.

Apple's research focuses on the combination of different types of training data and model architectures, which enables the AI to understand and generate language based on a mix of visual and linguistic cues. This capability is vital for tasks that require a nuanced comprehension of the world, such as interpreting complex images or answering questions that involve visual elements.

The paper also highlights the MM1 model's exceptional in-context learning abilities, particularly in the largest, 30-billion-parameter configuration of the model. This version apparently exhibits remarkable capabilities for multi-step reasoning over multiple images using few-shot "chain-of-thought" prompting, a technique that allows the AI to perform complex, open-ended problem solving based on minimal examples.

This research emerges as part of Apple's broader initiative to enhance its AI capabilities amid growing competition. Earlier today, Bloomberg's Mark Gurman reported that Apple is in discussions with Google to license Google's Gemini generative large-language models to power new features coming to the iPhone as part of iOS 18.

Top Rated Comments

30 months ago
Apple has recently been killing it in the deep learning space. Apple released MM1, and they have released more information than most open source LLM companies. I have been testing MLX for some of my workflows; it’s probably the fastest among the Python libraries. It runs open source LLMs on my iPad Pro. Gonna be interesting once it gets to iPhone and other devices. With recent updates, I can run Falcon 180B on my M1 Max, and my Nvidia RTX 4090 GPU can only dream. I hope Apple keeps up with the releases.
Score: 24 Votes
30 months ago
Will be extremely funny to read all this PR from Apple about the big important ML research they are doing and then find out at WWDC they are just gonna outsource Siri to Microsoft, lmao
Score: 16 Votes
30 months ago
It'll be way ahead of Gemini if it simply acknowledges the existence of White people. The bar is pretty low, Apple. Let's see what you've got.
Score: 12 Votes
spazzcat
30 months ago

Again this is all propaganda, Apple is not going to release the key to their secret sauce until at least WWDC, and maybe not even then.

Cook needs any goodwill the media will give him. I see him running from department to department asking, "What can we release through the backdoor, I'm sinking here!!!!"
We have no idea what's going on at Apple or what Apple is planning for iOS 18, and everything you wrote is hyperbole. And I would take any new development in iOS 18 with a grain of salt; the feature set would have been locked down months ago.
Score: 10 Votes
Amazing Iceman
30 months ago

Will be extremely funny to read all this PR from Apple about the big important ML research they are doing and then find out at WWDC they are just gonna outsource Siri to Microsoft, lmao
Then the new Siri will be called Siritana
Score: 10 Votes
mdriftmeyer
30 months ago

Will be extremely funny to read all this PR from Apple about the big important ML research they are doing and then find out at WWDC they are just gonna outsource Siri to Microsoft, lmao
The former head of AI at Google has been an SVP at Apple for nearly the past decade. He and the team from the PA Semi purchase have been hard at work for their respective teams.

Apple hasn't become who they are with such a vast treasure trove of expertise and wealth by sheer dumb luck.
Score: 8 Votes