Apple Teams Up With NVIDIA to Speed Up AI Language Models
Apple has shared details on a collaboration with NVIDIA to greatly improve the performance of large language models (LLMs) by implementing a new text generation technique that offers substantial speed improvements for AI applications.

Apple earlier this year published and open-sourced Recurrent Drafter (ReDrafter), an approach that combines beam search and dynamic tree attention methods to accelerate text generation. Beam search explores multiple potential text sequences at once for better results, while tree attention organizes and removes redundant overlaps among these sequences to improve efficiency.
Apple has now integrated the technology into NVIDIA's TensorRT-LLM framework, which optimizes LLMs running on NVIDIA GPUs, where it achieved "state of the art performance," according to Apple. The integration saw the technique manage a 2.7x speed increase in tokens generated per second during testing with a production model containing tens of billions of parameters.
Apple says the improved performance not only reduces user-perceived latency but also leads to decreased GPU usage and power consumption. From Apple's Machine Learning Research blog:
"LLMs are increasingly being used to power production applications, and improving inference efficiency can both impact computational costs and reduce latency for users. With ReDrafter's novel approach to speculative decoding integrated into the NVIDIA TensorRT-LLM framework, developers can now benefit from faster token generation on NVIDIA GPUs for their production LLM applications."
Developers interested in implementing ReDrafter can find detailed information on both Apple's website and NVIDIA's developer blog.
Popular Stories
Bloomberg's Mark Gurman has high expectations for Apple's first foldable iPhone.
In his Power On newsletter today, he said the foldable iPhone will be "the most significant overhaul in the iPhone's history."
"iPhone 4, iPhone 6 and iPhone X were clearly a big deal, but this is a whole new design," he said.
Like Samsung's Galaxy Z Fold 7, the foldable iPhone will reportedly open up like ...
iOS 26.5 is now available for developers, and while it doesn't include any new Siri capabilities, there are some major changes for the European Union, and smaller tweaks for features available worldwide.
Suggested Places
In the Maps app, there's a new "Suggested Places" feature that recommends locations to visit based on trending places nearby and recent searches. When Apple launches ads in ...
Apple today added the MacBook Air (13-inch, 2017) to its "vintage" products list, meaning the device is now only eligible for repairs at Apple Stores and Apple Authorized Service Providers if parts remain available.
The MacBook Air (13-inch, 2017) was the final MacBook Air model released before Apple redesigned the laptop and gave it a Retina display in 2018.
Apple also added all iPad...
Popular Stories
Bloomberg's Mark Gurman has high expectations for Apple's first foldable iPhone.
In his Power On newsletter today, he said the foldable iPhone will be "the most significant overhaul in the iPhone's history."
"iPhone 4, iPhone 6 and iPhone X were clearly a big deal, but this is a whole new design," he said.
Like Samsung's Galaxy Z Fold 7, the foldable iPhone will reportedly open up like ...
iOS 26.5 is now available for developers, and while it doesn't include any new Siri capabilities, there are some major changes for the European Union, and smaller tweaks for features available worldwide.
Suggested Places
In the Maps app, there's a new "Suggested Places" feature that recommends locations to visit based on trending places nearby and recent searches. When Apple launches ads in ...
Apple today added the MacBook Air (13-inch, 2017) to its "vintage" products list, meaning the device is now only eligible for repairs at Apple Stores and Apple Authorized Service Providers if parts remain available.
The MacBook Air (13-inch, 2017) was the final MacBook Air model released before Apple redesigned the laptop and gave it a Retina display in 2018.
Apple also added all iPad...