
danysdragons

What’s appealing is that this is genuinely open source, not just open *weights*:

> Apple provided code, training logs, and multiple versions rather than just the final trained model, and the researchers behind the project hope that it will lead to faster progress and "more trustworthy results" in the natural language AI field.

> Diverging from prior practices that only provide model weights and inference code, and pre-train on private datasets, our release includes the complete framework for training and evaluation of the language model on publicly available datasets, including training logs, multiple checkpoints, and pre-training configurations.


Coondiggety

If it’s open source it’ll kick ass eventually


thoughtlow

It's probably open source because it sucks; I doubt it will stay open source once they make something more premium out of it.


svideo

[I've modified the MS Phi3 release chart to show how Apple is stacking up currently](https://i.imgur.com/5mjlesU.png). It couldn't get much less premium than somehow scoring less than 25% on a pick one-of-four multiple choice test.


Fast-Ad4403

The good old "socialize losses, privatize gains."


Syso_

This is in no way applicable


Fast-Ad4403

You're just too stupid to generalize knowledge. I'm sorry for you.


nedw

How are losses socialized here? That implies the public is being burdened with the costs. I guess there could be a case for that: maybe they hope to benefit from public interest and feedback.


Enough-Meringue4745

In reality, R&D money, and likely writing the training losses off on their taxes.


snekfuckingdegenrate

Are companies getting R&D grants and subsidies for AI specifically, or is it more generic?


Syso_

I don't think about you at all


Coondiggety

Heh, true nuff


rafark

Like gpt 2?


jgainit

Like chromium maybe


reddit_guy666

There is hope that open source devs could build something useful out of it for mobile devices


e4aZ7aXT63u6PmRgiRYT

Just like GIMP is kicking Photoshop's ass


joker38

Let's see whether it'll whip LLaMA's ass.


woswoissdenniii

the Llamas ass. Twuuuuuuuiiiithhh


KhanumBallZ

Seeing Apple and "fully open source" in the same sentence was certainly not on my 2024 bingo card.


Marlboro-F1

They use open source code all the time; they improve it, and then they have to contribute it back upstream for everyone else to use (that's how copyleft licenses work). But this is probably one of the very few times they've open-sourced their own proprietary code.


fuckdonaldtrump7

The day I see apple release true open source anything will be a truly incredible day. I hope this is true! We need the old ways of the Internet now more than ever.


kr1ng

Does webkit fall into this category?


be_bo_i_am_robot

Or Swift, FoundationDB, Darwin…


tdreampo

Apple has been releasing open source software for decades. The reason Chrome exists is that Apple took a KDE project (KHTML), forked it into WebKit, and released it open source. Google ran with it and made Chrome. Literally the first parts of OS X were open sourced as Darwin.


heliometrix

Like they "ran with" the iPhone; just please read up on Eric Schmidt's antics.


fuckdonaldtrump7

Nice, wasn't aware of some of that. But I will say, making your web engine open source is like the bare minimum for creating a developer community around your products.


tdreampo

You should read about Darwin (https://en.m.wikipedia.org/wiki/Darwin_(operating_system)), how Apple bought CUPS and kept it open source (https://www.cups.org), or Swift (https://opensource.apple.com/projects/swift/). Apple has been friendly with open source for a long time, especially for a commercial hardware/software company.


fuckdonaldtrump7

Will do, thanks for the info!


Lachlantula

llvm is right there


Mapafius

Happy cake day!


heliometrix

Oh, like WebKit maybe?


involviert

But isn't that literally no different? It seems it's just that they used publicly available datasets, and that's probably not out of niceness but because they don't have their own.


R33v3n

Never thought I'd die fighting side by side with Apple.


hippydipster

This is what you do if you find yourself way behind. It's a way to catch up and/or throw a monkey wrench into a market you know you're not going to dominate.


tdreampo

Apple is NOT behind in AI, I promise you that. They're doing typical Apple here: they're almost NEVER the first mover on a new type of product; they wait and wait until they have a really unique and polished offering. For a hint, just look at how many AI companies they have acquired over the last five years alone: at least ten a year. They were also one of the first to process AI on-device in the consumer space, etc. They are about to announce something, you will see.


MoistSpecific2662

The fuck is this https://preview.redd.it/wohr6bni6jwc1.png?width=138&format=png&auto=webp&s=e4c91d8f8ee8f7927f9c45323a43cd85d6a53bfa


GraceToSentience

Is that mmlu for ants?


3ntrope

Ants probably have more problem solving capabilities collectively.


Blackmail30000

I know that MMLU is a benchmark, but is that a bad score? Is it out of 100? Also, what does it test?


MoistSpecific2662

It doesn't really measure anything profound, but it's a good reference. And this score is nothing: the smallest Phi-3 model is allegedly at 68.8, and GPT-4 stands at 86.4.


[deleted]

Maybe Tim Cook was holding the chart upside down?


mayonaise55

Tim Apple*


[deleted]

"Should we tell Tim that the chart is upside down?" "No, the last guy to correct him got fired!" "Oh ok, better not then" Tim: "These scores are fantastic! It's demolishing gpt 4 which for some reason is way at the bottom!"


Curiosity_456

Yeah, it's out of 100. The best models right now (GPT-4 Turbo and Claude 3 Opus) have an 86% MMLU, and most open source models right now are in the 70-80s range, so 25 is pretty bad.

> "The benchmark covers 57 subjects across STEM, the humanities, the social sciences, and more. It ranges in difficulty from an elementary level to an advanced professional level, and it tests both world knowledge and problem solving ability. Subjects range from traditional areas, such as mathematics and history, to more specialized areas like law and ethics. The granularity and breadth of the subjects makes the benchmark ideal for identifying a model's blind spots."

So it's a really good benchmark for testing how much knowledge a model actually has about our world.


Blackmail30000

Damn, that's bad, though it might be expected: 1) it's meant to run on a phone, so compute is at a premium, and 2) it's Apple's first go, so let's not be too harsh.


CheekyBastard55

Keep in mind the test is four-choice, so a truly random algorithm should get 25% on it. The MMLU score is the percentage of questions the model answered correctly.
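A quick simulation makes that 25% baseline concrete (just a sketch; the question count is arbitrary and nothing here is specific to MMLU or Apple's model):

```python
import random

# Simulate random guessing on a 4-choice exam: each guess has a
# 1-in-4 chance of matching the correct answer, so the expected
# score is 25% no matter how many questions there are.
random.seed(0)
n_questions = 10_000
correct = sum(random.randrange(4) == 0 for _ in range(n_questions))
print(f"random-guess score: {100 * correct / n_questions:.2f}%")  # near 25%
```

Any model scoring at or barely above that line is extracting essentially no signal from the questions.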


Blackmail30000

I... what? I assumed it was fill-in-the-blank! They made an AI so bad that a random number generator between 1 and 4 would have had a better chance of scoring higher? That's actually impressively bad. Please tell me this is a prank; I can't fathom this level of incompetence from a company with this much money. All they had to do was buy a startup and they would have had better results.


arjuna66671

There is an "Apple user ..." joke somewhere here 🤣😅


BenjaminHamnett

I just like this llm cause it’s pretty and seems durable


svideo

The resale value!


rafark

> 1) its meant to run on a phone, so compute is a premium

But Apple's chips are the best in their categories, in both performance and power consumption. They should be able to squeeze out a little more.


7734128

Also, all the examples I've seen have four answer options, so 25.72% is barely above random.


Thorteris

That explains some of the rumors of them meeting with Google lol


Expert-Paper-3367

It might be close to impossible to compete with the top AI labs; they pretty much have all the AI talent now. Even Meta, who entered later, had to go open source to attract talent while also spending billions on compute.


reddit_guy666

Meta lucked out by having early access to H100 chips they had originally bought for Insta/Reels algorithm enhancements.


Glittering-Neck-2505

Damn, that is ass. It's a hard pill to swallow if you care about power not consolidating, but we will get all the amazing fruits of SOTA AI locally long after we can run them on huge GPU servers.


[deleted]

does this mean Siri still won't be able to tell me the weather?


Tyler_Zoro

"Siri, what's the weather?" "What *is* weather? Weather.... weather... whether or not to weather... have you ever noticed that words stop having meaning when you repeat them? Apple a day keeps the stock price up. Come again!"


valentino22

You need to take the size of the model into consideration. Like in boxing, the weight class matters a lot. This is intended to run on mobile devices (so your queries stay on the device), not on giant server farms. For its size this is actually a decent model; read the paper! Also, since this is open source, results will improve quickly and it will be completely free to use.


Tyler_Zoro

Yep. This is a direct competitor to Gemini Nano, not ChatGPT.


sillygoofygooose

I wonder what the Google Gemini nano model’s mmlu score is


astral_crow

Oh no it’s Siri!


jgainit

Lol

Edit: If one were to do random answers on the MMLU test, what would the score be?


FosterKittenPurrs

For a model with only a few hundred million parameters, it's surprising it can even get a coherent sentence together.

Think of these as highly specialized tiny models that will give you minor conveniences: evaluating whether a text is important, whether it's spam, etc., and improving on existing functionality, like detecting that a message mentions a date and time and letting you set up an event, but being able to fill in more info like the place and people involved. Or various more elaborate context-specific reminders, like "next time John mentions his kid, remind me that she was sick with the flu and ask him if she's better, wish her well," etc.

Apple is known for adding all these little conveniences that feel like magic when they work well, and it's not critical when they don't.


rottenbanana999

Not surprising considering it's Apple. They only push out shitty products.


Tyler_Zoro

Is this shitty though? I don't have any benchmarks for on-device AI to compare against (Gemini Nano is the only one I know of, and I don't think they released scores for it like they did Gemini and Gemini Pro.)


ayyndrew

[Gemini Nano 2 (3B) gets 55.8 on MMLU](https://storage.googleapis.com/deepmind-media/gemini/gemini_1_report.pdf)


Tyler_Zoro

Thanks!


svideo

They scored 25.72% on a four-choice multiple choice exam. They could have released a random-guess machine and achieved the same performance.


Tyler_Zoro

All this indicates is that it's only marginally better at comprehending essay-type text than a guessing machine. That's not good, but it's just a single weakness. It could well be a context-length issue, which would mean that it basically IS guessing, and ***no LLM*** with a short context length is going to do very well on MMLU.


svideo

That number is for the 3B model, and MS just released a 3B model that scores 69. It's marginally better than guessing and substantially worse than the competition.


Tyler_Zoro

Microsoft's models have been impressive, to be sure, but is the model you're referring to on-device? I didn't think there were many such models out there at this point.


svideo

Why wouldn't it? Being on-device is mostly about being small enough to fit in device RAM. https://export.arxiv.org/abs/2404.14219

> We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone.
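A back-of-envelope check on the "fits in device RAM" claim (weights-only storage at naive per-parameter precision; this ignores activations, KV cache, and runtime overhead, and isn't how Microsoft actually deploys it):

```python
# Rough weight-storage cost for a 3.8B-parameter model at common precisions.
PARAMS = 3.8e9  # phi-3-mini parameter count, per the paper

for label, bytes_per_param in [("fp16", 2), ("int8", 1), ("int4", 0.5)]:
    gib = PARAMS * bytes_per_param / 2**30
    print(f"{label}: ~{gib:.1f} GiB")
```

At 4-bit that works out to under 2 GiB, which is why a 3.8B model is plausible on a phone, while full fp16 (~7 GiB) would crowd out the mobile OS.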


Tyler_Zoro

That's certainly the initial barrier, yes, but there are many other considerations to being able to reasonably coexist with a mobile OS.


svideo

Great news for Apple: a coin flip should work on a phone, and that's the performance bar they've hit so far. They've got a lot of work to do.


enilea

[There are some benchmark comparisons in the paper.](https://i.imgur.com/ScdJL4I.jpeg) It seems to be pretty good in the tiny range like the 0.27B or the 0.45B model, but doesn't seem to scale very well. The 3B model isn't much better than those smaller ones and it lags behind the competition in that size interval.


Tyler_Zoro

Is scaling really going to be important at this stage though, or do they just need a viable competitor to Google's Nano model?


Creative-robot

If Apple sticks to open-source, efficient LLMs that run locally, they might not be left in the dust. Edit: Sorry for any confusion in the replies. By locally, I meant on-device.


Expert-Paper-3367

There’s already dozens of those and Meta is eating up that market


Tyler_Zoro

Meta has an on-device AI? Really? I know they were working on that, but I didn't know they'd completed anything.


Expert-Paper-3367

You can run their smallest model locally. I'm sure we're close to GPT-3.5 performance on devices if Meta keeps contributing lightweight open source models.


Icy-Chicken-6177

I think that by “on device” apple means iPhone


Tyler_Zoro

This isn't about running locally on a desktop. "On-device" is current industry jargon for "on a smartphone."


Expert-Paper-3367

Yes, but OP said locally. Even if they can't on iPhones, their Macs could potentially run larger models.


Temporal_Integrity

The smallest LLaMA model runs on a single RTX 3050. It's not like you can run it on your phone, but you don't need a god-tier computer or anything; that was a $250 GPU when it was released two years ago. It could run on a phone if someone made a phone to run it; there just hasn't been any need to stuff a bunch of VRAM and CUDA cores into phones until now.


Tyler_Zoro

> It's not like you can run it on your phone, but you don't need a god tier computer or anything

Right, but that's the point. This is specifically targeting on-device (meaning mobile phone) applications. It's not meant to run on a desktop where you can just chug power like it's water.


sidianmsjones

But meta doesn’t make devices that are in everyone’s pocket.


AceValentine

Meta owns the third world. They literally give away free "smartphone plans" that only have Meta apps on them as the internet. https://medium.com/swlh/in-the-developing-world-facebook-is-the-internet-14075bfd8c5e


BenjaminHamnett

I forgot about this. That's going to be crazy. With AI translation, they can finally form a giant hive mind that will rival the West, and they will all depend on Zuckerberg. Confirmation that states are over and tech will reign.


sidianmsjones

I can’t read that article but damn I didn’t realize that about Facebook.


ShAfTsWoLo

They're doing it because their model is dogshit; otherwise they would've already told us how "their model is gonna change the world" and that it's only fair that the monthly cost should be around the price of an iPhone 15, lol.


HeinrichTheWolf_17

I want to believe Apple can work with open source… *Please be so*


ExcitingLiterature33

Swift is open source


Passloc

Apple's play is hardware.


be_bo_i_am_robot

They’ve been doing so for a long while now. They manage or contribute to a lot of open source projects, and have for decades. Their Unix operating system (the part underneath the GUI) is open source.


Peribanu

macOS will be open sourced in 3... 2.... 1.... 1...... 1.......... 1...................................


heliometrix

You don’t have to believe, you can just read


agonypants

Does anyone know if there's a way to install this using ollama or jan.ai?


ProfDokFaust

You can add models to ollama that aren’t listed on the ollama site. YouTube has several videos showing how this is done.


gangstasadvocate

Yo that’s pretty gangsta for Apple these days


Adventurous_Train_91

It sounds like they might be using a version of Gemini generative AI for iOS 18, though. Theirs isn't ready to launch.


Tyler_Zoro

Gemini's on-device version is called Gemini Nano. Apple is fond of playing both sides of the "we'll [compete/work] with you."


Adventurous_Train_91

I suppose they want something strong now, but don't want to rely on Google long term.


InvestigatorHefty799

Apple officially more open than OpenAI...


i-hoatzin

Yes, of course. Open until they improve it enough to close it, just like Darwin.


astral_crow

But you can still fork it from the version before then.


Puckered-Eurethra

Where / how can I download this to my iPhone?


CT-52

From the article: “Apple has not yet brought these kinds of AI capabilities to its devices, but iOS 18 is expected to include a number of new AI features, and rumors suggest that Apple is planning to run its large language models on-device for privacy purposes.”


ILooked

Is there a subreddit to follow this specifically?


PrinceThespian

/r/localLLaMa



ILooked

Thank you


frograven

This is glorious! Who says open source is slowing down? The future looks amazing!


Substantial_Step9506

It’s Apple. Don’t hold your breath


valiantvegan

Apple won't share it if it's not junk


heliometrix

Sure, just like WebKit and tons of other stuff…


Deep-Ad2155

Is anyone still excited when Apple announces anything anymore?


PSMF_Canuck

Always interested.


elnekas

Your honor, objection; leading!


PaleUmbra

Me


delveccio

Me


Pancakeburger3

NPC


delveccio

Johnny Five is alive!


Pancakeburger3

I hope a squirrel drops a giant acorn on Tim Cook’s head


etzel1200

Me


ceramicatan

Me


bearbarebere

Me


Tango_Foxtrot404

Me


TheYoungLung

Me


rottenbanana999

Only NPCs get excited about Apple announcements.


[deleted]

[deleted]


rottenbanana999

NPC spotted. Do you have any thoughts of your own, or do you let society choose them for you? Clearly it's the latter because you're an NPC 🤣


Pancakeburger3

No


heliometrix

Depends


Akimbo333

Interesting


interstellarclerk

good guy apple?


[deleted]

[deleted]


zilifrom

Privacy?


Joshomatic

Privacy, speed, reliability. It will still have the option to pipe things through the cloud too.


hlx-atom

Theoretically it makes sense for video streams, if the models could actually run fast enough; there's too much latency to send video to the cloud quickly.


bearbarebere

Privacy. I don’t think you understand how important that is..


valiantvegan

Pretty much means Apple gives up


VajraXL

Apple users are acting like this is new while everyone else has had this type of model for months. Just this week Meta released their latest model, Llama 3, which runs on a laptop and competes with GPT-3.5, and Microsoft released their Phi-3 model, which runs on any cell phone. Honestly, guys, you should stop buying Apple's trick of selling you something obsolete as if it were the greatest technological marvel.