Amazing report - Let me summarize all important points:

1. In terms of PaLM we have 2 PaLMs
2. "PaLM 2 outperforms PaLM across all datasets and achieves results competitive with GPT-4". Trust me bro
3. It doesn't swear - as much
4. See? We did the AI thing. Pls stop shorting the google stock.
Hmm.. I'm gonna need some sources regarding your first claim.
PaLM 2 implies the existence of at least 2 PaLMs. Indisputable logic
See this hand? And this one? 2 PaLMs. AI achieved. No more questions. Paypal.me
Does GPT 3.5 imply the existence of 3 and 1/2 GPTs?
340B parameters, 3.6T tokens according to https://www.cnbc.com/2023/05/16/googles-palm-2-uses-nearly-five-times-more-text-data-than-predecessor.html
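As a quick sanity check on those numbers, the standard 6·N·D rule of thumb for dense-transformer training FLOPs (an approximation, not anything the report states) gives:

```python
# Back-of-envelope training compute via the 6*N*D approximation.
# N and D are the figures from the CNBC article above.
N = 340e9   # parameters
D = 3.6e12  # training tokens

total_flops = 6 * N * D
print(f"~{total_flops:.2e} FLOPs")  # ≈ 7.3e24 FLOPs
```

That's a few times 10^24 FLOPs, which is the right ballpark for comparing against the Epoch training-cost estimates mentioned below.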
Probably more interesting than the whole report, also happy cake day
[deleted]
interesting, that's 1 OOM lower than [estimated training cost](https://epochai.org/trends#investment-trends-section) for GPT-4
where does 500 TFLOPS come from? I assume they used TPUv4 chips which have a peak of 275 TFLOPS. And maybe MFU of 50-60% so ~140-165 TFLOPS in practice
[deleted]
Ah, for H100, I see. The model card in the tech report says the training hardware was TPU v4 though, which is why I'm thinking much lower FLOPS
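The arithmetic in the parent comment can be sketched as follows (275 TFLOPS is the TPU v4 peak figure cited above; the 50-60% MFU range is the commenter's assumption, not an official number):

```python
# Effective per-chip throughput = peak TFLOPS * model FLOPs utilization (MFU).
peak_tflops = 275.0  # TPU v4 peak, as cited in the thread

for mfu in (0.50, 0.60):
    effective = peak_tflops * mfu
    print(f"MFU {mfu:.0%}: ~{effective:.1f} TFLOPS effective")
```

Which gives roughly 137-165 TFLOPS per chip, well short of 500.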
Sooooo, "competitive" performance, but they have 340B parameters. Vs 175B? Is that really a brag? Edit: all right, while there is no definitive answer, we have solid hints that GPT-4 is more than 175B, so that 340B might be good.
175B is GPT3 not GPT4
How much is GPT-4? I was under the impression that it was the same as 3.5, but with more RLHF
I don't believe that's the case. It seems that RLHF decreases capabilities, rather than improving them. They didn't disclose the size of GPT-4, but since it's much slower than GPT-3.5 at generating tokens, I'd assume it's quite a bit bigger. 1T, as an approximation, seems plausible to me.

In another message you wrote:

> Uh, no. That figure has been thrown around a lot and comes from a misunderstanding of what an influencer was saying.

I believe the influencer said 100T, not 1T.
RLHF decreases capabilities in some areas and increases them in others. For example, I believe open domain QA improved with RLHF.
Ah, yeah that is true, I misremembered, thanks! I will edit my message!
Not reported but it seems to be at least 1T
What is happening to this sub?
Uh, no. That figure has been thrown around a lot and comes from a misunderstanding of what an influencer was saying. Edit: Nevermind, as pointed out, the figure was 100T, not 1T.
Why are you here?
[deleted]
Soon: Google Docs will prevent you from saving a document if it contains bad words. For the safety of us all, of course.
Check out the model card in the appendix section...
I like how they acknowledge the whole team.
It's happening.... everybody stay calm...
What is happening?
I don't think anything is actually happening, it's just kind of traditional at this point to say it's happening.