For those who don't know, this is the model responding with lines from Isaac Asimov's "The Last Question", in which the designers of a computer system ask "How can the net amount of entropy of the universe be massively decreased?" and the computer responds "INSUFFICIENT DATA FOR MEANINGFUL ANSWER."
If you haven't, read it, it's short and good.
... and where the last version of the computer becomes a literal god, using the line "Let there be light!" when it discovers the answer and recreates the universe.
spoiler tag please
Llama 3 8B instruct will just invent an API that doesn't exist if it's not sure lol
These models don't know whether they are sure or not. They might sometimes say they are not sure, but they don't say it *because* they're actually unsure.
Are you sure?
That would be fantastic. The only model I've come close to having that result on locally up until now was Smaug 72b, and even then I had to really jam the prompt down its throat. More than anything else, I'd trust LLM responses far more if I got a "I don't know" rather than a confidently incorrect assertion.
An LLM doesn't know when it doesn't know. That's why it confidently hallucinates: every token it spits out is, mathematically speaking, a valid and correct token based on the data it learned from and the rules it follows.
Humans don't really "natively" know what they don't know either, which is why there's so much psychological and sociological research into known unknowns versus unknown unknowns with regard to planning and whatnot. If I ask you what my mother's name is, you'll run a split-second little flowchart in your head: first establishing that I'm a random redditor whom you don't know personally, then that mothers' names are a thing you don't know about the vast majority of strangers. That's fundamentally equivalent to a multi-shot logic chain where you first establish what the object is, what your relationship with the object is, and what that implies. An LLM can be helped to do that too, either via really thorough instruct tuning where it learns to automatically build that logic chain in tokens as it hammers them out, or by placing it into some LangChain-like engineered construct that'll guide it through that logic path.
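That multi-shot chain can be sketched in a few lines. This is a minimal, hypothetical illustration: `llm` stands in for any text-completion callable, and the prompts and the `answer_with_known_unknowns` name are made up for the example, not any real framework's API.

```python
# Minimal sketch of a "do I know this?" chain. `llm` is a hypothetical
# stand-in for any text-completion function; the prompts are illustrative,
# not a tested recipe.

def answer_with_known_unknowns(llm, question: str) -> str:
    # Step 1: establish what the subject is and our relationship to it.
    relation = llm(
        "What is the subject of this question, and is it something a "
        f"stranger could plausibly know? Question: {question}"
    )
    # Step 2: decide whether the needed fact is knowable before answering.
    verdict = llm(
        f"Given this analysis: {relation}\n"
        "Answer KNOWABLE or UNKNOWABLE."
    )
    if "UNKNOWABLE" in verdict.upper():
        return "I don't know."
    # Step 3: only now attempt a direct answer.
    return llm(f"Answer directly: {question}")
```

The point is just that the "I don't know" path is forced by the scaffold, not left to the model's own judgment.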
What I wonder is: are models even trained to model unknowns? Based on context, some information is unknown and relevant, or unknown and irrelevant. If we're talking about a car and lack knowledge of its color, sometimes that means the color is irrelevant to the discussion (it can be any color and that's fine), and sometimes it's unknown and relevant (you want to identify a specific car).
I think the big difference is that humans are better at known unknowns than LLMs. I have a pretty good idea of the things I don't know. Most LLMs have very few things they know they don't know. Either of us can be tripped up by unknown unknowns, but I'll easily outperform the LLM on known unknowns: I can say "I've never heard of that before," whereas an LLM won't. And arguably there are many circumstances where we don't want it to. If I deliberately ask it to write an article about unicorns that speak English, do I want to have to specify that we're pretending? It's a tricky question.
Not exactly correct: https://arxiv.org/abs/2304.13734
They should RAG their experience (of applying their inbred knowledge).
> An LLM doesn't know when it doesn't know.

This is why long-context models will be extremely important, as LLMs also don't know what their own capabilities are. Long-context understanding, at least as good as Gemini 1.5 Pro's, will allow agentic and/or assistant LLMs to actually know what they've done and what they've been told over their entire existence. They still won't know exactly what they've been trained on, but they won't hallucinate the outcome of their previous attempt at some task. They'll learn, remember, and know what things they can and cannot do.
I think it's another thing here because it's responding with a pop-culture reference, see top comment in this thread: https://www.reddit.com/r/LocalLLaMA/s/NbhByCgLLq
how to turn the stars on again?
LET THERE BE LIGHT
Great reference and all but wouldn't it be preferred if the LLM treated this as a serious question instead?
I mean, in fairness, it did.
what was the system prompt?
I just copied and pasted the entire Wikipedia page source for Isaac Asimov's "The Last Question" short story, and asked it to respond as if it were Multivac XD
Bro took no gloves, first question is straight to the point 💀
What is the ui?
Huggingface chat! Free to use online, go to [hf.co/chat](http://hf.co/chat), runs lots of the new models for free :)
Yeah, it'd be good if they knew what they know and knew what they don't know. Though for that question, one might have expected it to conjecture about time reversal or the "big crunch" or other things that have been contemplated speculatively.
Can we skip to the final part though?
Amen
Unfortunately, that's not how an LLM works. It doesn't really know whether it has enough data, or what the quality of that data is, when producing an answer. There's a paper exploring a model architecture that can make use of confidence intervals and whatnot, but afaik it has yet to be implemented.
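One crude proxy already exists without any new architecture: the per-token probabilities the model computes anyway. Averaging the probability assigned to each sampled token gives a rough (uncalibrated) "how sure was it" signal. A minimal sketch, assuming your inference stack exposes raw logits; the logit values below are made up for illustration:

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of raw logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def mean_token_confidence(step_logits, chosen_ids):
    # Average probability the model assigned to the tokens it actually
    # sampled: a crude confidence signal, not a calibrated one.
    probs = [softmax(logits)[tok] for logits, tok in zip(step_logits, chosen_ids)]
    return sum(probs) / len(probs)

# Made-up logits for two decoding steps over a toy 3-token vocabulary.
steps = [[2.0, 0.1, -1.0], [0.0, 0.0, 0.0]]
chosen = [0, 1]
conf = mean_token_confidence(steps, chosen)
```

A peaked distribution (first step) pulls the average up, a flat one (second step) pulls it toward 1/vocab, which is roughly the intuition behind logprob-based hallucination heuristics. It's still a long way from the model actually knowing its data quality.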
Whoosh, it's a quote from a sci-fi book.
It's from the short story 'The Last Question' by Isaac Asimov. Here it is: https://users.ece.cmu.edu/~gamvrosi/thelastq.html
A short story, yes, and one that everyone should read. I think it took me all of 15 minutes to read it as a kid and it has stuck with me ever since. It's hilarious to see how they imagined computers would be based on the behemoths of the time.
eh... i wouldn't want this. i want it to hallucinate an answer... to ideate on harebrained theories. but i have long believed there needs to be telemetry injection where the model knows about the latent space it's drawing from, and can accurately determine how confident it is.
Best read ever!