Apple wants AI to run directly on its hardware instead of in the cloud

Dec 21, 2023

The iPhone 15 Pro. (credit: Apple)

Apple’s latest research about running large language models on smartphones offers the clearest signal yet that the iPhone maker plans to catch up with its Silicon Valley rivals in generative artificial intelligence.

The paper, entitled “LLM in a Flash,” offers a “solution to a current computational bottleneck,” its researchers write.

Its approach “paves the way for effective inference of LLMs on devices with limited memory,” they write. Inference refers to how large language models, the models trained on large data sets that power apps like ChatGPT, respond to users’ queries. Chatbots and LLMs normally run in vast data centers with much greater computing power than an iPhone.
