- cross-posted to:
- technology@beehaw.org
Apple wants AI to run directly on its hardware instead of in the cloud: iPhone maker wants to catch up to its rivals when it comes to AI.
Remember, this probably isn’t an either/or thing. Both Apple and Google have been offloading certain AI tasks to devices to speed up response time and to process certain requests offline.
Yep, though Google is happy to constantly process your data in the cloud, while Apple consistently tries to find ways to do it locally - which is generally better for privacy and security, and cheaper for them too.
Yeah, that’s why they look through your images for “cp”.
Who looks at images?
OK, I have to correct myself: they walked this back two weeks ago due to backlash, but I doubt they won’t do it in at least a similar way, or hidden, like they did when they reduced power on older devices to “save battery”. https://www.wired.com/story/apple-photo-scanning-csam-communication-safety-messages/
Google have been offloading certain AI tasks to devices
No, Google has simply been blatantly lying about this to convince you to buy new phones. It’s very easy to prove, because as soon as you disable any network connections, these functions cease to work.
Just because certain requests don’t work offline, that doesn’t mean that Google isn’t actually running models locally for many requests.
My pixel isn’t new enough to run nano. What are some examples of offline processing not working?
I wouldn’t be surprised if the handshake between Pro and Nano were intermingled for certain requests: some stuff done in the cloud and some done locally for speed - but if the internet is off, they kill the processing of the request entirely because half of the required platform isn’t available.
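Roughly the kind of dispatch logic I’m imagining, as a toy Python sketch - every name in here is made up for illustration, nothing to do with Google’s actual implementation:

```python
from enum import Enum, auto

class Route(Enum):
    ON_DEVICE = auto()  # simple request, the local Nano-class model handles it alone
    CLOUD = auto()      # complex request, the Pro-class model in a data center
    HYBRID = auto()     # split across both halves

def run_local(draft=None):
    return "local result"   # stand-in for on-device inference

def run_remote(draft=None):
    return "cloud result"   # stand-in for data-center inference

def handle_request(route, online):
    if route is Route.ON_DEVICE:
        return run_local()
    if not online:
        # half of the required platform is unavailable, so the whole
        # request is killed rather than partially processed
        raise ConnectionError("request needs the cloud half; device is offline")
    if route is Route.CLOUD:
        return run_remote()
    # HYBRID: quick local pass for speed, heavier cloud pass on top of it
    return run_remote(draft=run_local())

print(handle_request(Route.ON_DEVICE, online=False))  # still works offline
print(handle_request(Route.HYBRID, online=True))      # uses both halves
```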
Just because certain requests don’t work offline, that doesn’t mean that Google isn’t actually running models locally for many requests.
Yeah? It does.
What a thought-provoking reply.
I dunno what you expect me to say. It’s not complicated.
You’re really going to say that Google isn’t doing anything locally with Tensor? That’s just silly.
No that is not what I said.
How’s that supposed to work?
I’m picturing a backpack full of batteries and graphics cards. Maybe they’re talking about a more limited model?
This is a Financial Times article, regurgitated by Ars Technica. The article isn’t by a tech journalist, it’s by a business journalist, and their definition of “AI” is a lot looser than what you’re thinking of.
I’m pretty sure they’re talking about things that Apple is already doing, not just on current hardware but even on hardware from a few years ago. For example, the keyboard on iOS now uses pretty much the same technology as ChatGPT, but scaled way, way down - to the point where “Tiny Language Model” would probably be more accurate. I wouldn’t be surprised if the training data is as small as ten megabytes, compared to half a terabyte for ChatGPT.
The model will learn that you say “Fuck Yeah!” to one person and “That is interesting, thanks for sharing it with me.” to someone else. Very cool technology - but it’s not AI. The keyboard really will suggest swear words now by the way - if you’ve used them previously in a similar context to the current one. The old algorithmic keyboard had hardcoded “do not swear, ever” logic.
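To make that concrete, here’s a deliberately crude Python illustration - a per-recipient bigram counter, roughly the smallest possible version of the idea. The real keyboard model is presumably a neural network; this just shows the “learn per contact, no hardcoded filter” behaviour:

```python
from collections import defaultdict, Counter

class KeyboardModel:
    def __init__(self):
        # (recipient, previous word) -> counts of the words that followed it
        self.counts = defaultdict(Counter)

    def learn(self, recipient, message):
        words = message.lower().split()
        for prev, nxt in zip(words, words[1:]):
            self.counts[(recipient, prev)][nxt] += 1

    def suggest(self, recipient, prev_word, k=3):
        # no hardcoded "do not swear, ever" logic: suggestions mirror
        # whatever you actually typed to this person before
        return [w for w, _ in self.counts[(recipient, prev_word.lower())].most_common(k)]

kb = KeyboardModel()
kb.learn("mate", "oh fuck yeah")
kb.learn("boss", "oh that is interesting thanks")
print(kb.suggest("mate", "oh"))  # ['fuck']
print(kb.suggest("boss", "oh"))  # ['that']
```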
I’ve been playing with llama.cpp a bit for the last week and it’s surprisingly workable on a recent laptop just using the CPU. It’s not really hard to imagine Apple and others adding (more) AI accelerators on mobile.
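For anyone who wants to try it, here’s a minimal CPU-only example using the llama-cpp-python bindings (the model path is a placeholder - any quantized GGUF file should work):

```python
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-2-7b-chat.Q4_K_M.gguf",  # placeholder path
    n_ctx=2048,    # context window
    n_threads=8,   # CPU threads; no GPU needed
)

out = llm("Q: Why run language models on-device? A:", max_tokens=64, stop=["Q:"])
print(out["choices"][0]["text"])
```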
Google is already doing this with Gemini Nano. https://store.google.com/intl/en/ideas/articles/pixel-feature-drop-december-2023
They’re making their own silicone now. You can achieve a lot more efficiency when you’ve streamlined the whole way through.
silicone
It’s silicon. Silicon is a naturally occurring chemical element, whereas silicone is a synthetic substance.
Silicon is for computer chips, silicone is for boobies.
By making their own, you mean telling Taiwan Semiconductor Manufacturing Company, “hey, we are going to buy enough of these units that you have to give us the specs we chose at a better price than the competitors, and since we chose the specs off your manufacturing capacity sheets, we will say ‘engineered in Cupertino™’”.
Btw, I’m not shitting on Apple here. I love my M2 processor.
Sure, but being that pedantic is neither concise nor pertinent to the question at hand.
Apple is neither making nor engineering silicon for any of their products.
Yes, like Google is doing with their Tensor chips in the Pixels.
I’m going to blow your mind here… the “cloud” is just two or three data centres with replication turned on. It’s mostly a buzzword to charge a bit more.
Wait, so you mean it’s not actual rain clouds in the sky??
They looked into it, but apparently not.
Eh, it’s a bit more than that. I work on a private cloud; the implications of it being a cloud versus a traditional bare-metal or virtualization platform are around the APIs, quick spin-up/down cycles, fully integrated recovery, imaging and remote console systems, integration with automated deployment platforms, and more. It’s not just a buzzword.
Most of that’s available on any half-decent commercial server. You’re right, though - there are definitely some differences.
I actually worked on our corporate move from private servers (main, backup, and DR) to the Azure cloud, which had only two server locations (Melbourne and Sydney), and the mythology around cloud seemed a bit much.
Charging more for cloud? As if Apple isn’t looking for an excuse to charge even more for their overpriced phones by going offline.
This is the best summary I could come up with:
Apple’s latest research about running large language models on smartphones offers the clearest signal yet that the iPhone maker plans to catch up with its Silicon Valley rivals in generative artificial intelligence.
The paper was published on December 12 but caught wider attention after Hugging Face, a popular site for AI researchers to showcase their work, highlighted it late on Wednesday.
Device manufacturers and chipmakers are hoping that new AI features will help revive the smartphone market, which has had its worst year in a decade, with shipments falling an estimated 5 percent, according to Counterpoint Research.
Running the kind of large AI model that powers ChatGPT or Google’s Bard on a personal device brings formidable technical challenges, because smartphones lack the huge computing resources and energy available in a data center.
Apple tested its approach on models including Falcon 7B, a smaller version of an open source LLM originally developed by the Technology Innovation Institute in Abu Dhabi.
Academic papers are not a direct indicator of how Apple intends to add new features to its products, but they offer a rare glimpse into its secretive research labs and the company’s latest technical breakthroughs.
The original article contains 741 words, the summary contains 194 words. Saved 74%. I’m a bot and I’m open source!
Google is doing this exact same thing with Gemini, the platform behind Bard / Assistant.
Gemini has large-scale models that live in data centers and handle complex queries. They also have a “Nano” version of the model that can live on a phone and handle simpler on-device tasks.
The smaller models are great for things like natural language UI and smart home controls. It’s also way faster and capable of working offline. A big use case for offline AI has been hiking with the Apple Watch in areas with no reception.
Also battery management, background-task power distribution, and hardware energy efficiency. I mean, it would be great to have AI that adapted hardware energy consumption settings depending on my use case. Yes, I know algorithms already exist to do that, but it would be great to have a much more flexible, AI-based energy manager that accommodates and adapts to my use cases - something like the sketch below.
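A purely hypothetical Python sketch of what I mean: learn a typical load per app, then pick a power profile from it instead of using one hardcoded governor.

```python
from collections import defaultdict

class EnergyManager:
    def __init__(self):
        self.loads = defaultdict(list)  # app -> observed CPU loads (0.0-1.0)

    def observe(self, app, cpu_load):
        self.loads[app].append(cpu_load)

    def profile_for(self, app):
        history = self.loads[app]
        avg = sum(history) / len(history) if history else 0.5
        if avg < 0.2:
            return "powersave"    # e-reader, music playback
        if avg < 0.6:
            return "balanced"     # browsing, chat
        return "performance"      # games, video editing

mgr = EnergyManager()
mgr.observe("ebook", 0.05)
mgr.observe("game", 0.90)
print(mgr.profile_for("ebook"))  # powersave
print(mgr.profile_for("game"))   # performance
```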
AKA “we completely missed the boat on this thing and are going to pretend it was intentional by focusing on an inevitable inflection point a few years out from today instead.”
Also Apple sucks at cloud services.
Can’t wait for the Apple announcement, “AI features require iPhone 18 or later. Your older phone CPU just isn’t powerful enough.”
Which wouldn’t be a problem if there was a cloud option.
iPhone 18 Pro Max, actually - as they already did with the iPhone 15 Pro Max and console games, and it still overheats.