‘You Can’t Lick a Badger Twice’: Google Failures Highlight a Fundamental AI Flaw

Deebster@infosec.pub · 2 days ago

‘You Can’t Lick a Badger Twice’: Google Failures Highlight a Fundamental AI Flaw

chonglibloodsport@lemmy.world · 1 day ago

That’s because AI doesn’t know anything. All they do is make stuff up. This is called bullshitting and lots of people do it, even as a deliberate pastime. There was even a fantastic Star Trek TNG episode where Data learned to do it!

The key to bullshitting is to never look back. Just keep going forward! Constantly constructing sentences from the raw material of thought. Knowledge is something else entirely: justified true belief. It’s not sufficient to merely believe things, we need to have some justification (however flimsy). This means that true knowledge isn’t merely a feature of our brains, it includes a causal relation between ourselves and the world, however distant that may be.

A large language model at best could be said to have a lot of beliefs but zero justification. After all, no one has vetted the gargantuan training sets that go into an LLM to make sure only facts are incorporated into the model. Thus the only indicator of trustworthiness of a fact is that it’s repeated many times and in many different places in the training set. But that’s no help for obscure facts or widespread myths!

interdimensionalmeme@lemmy.ml · 1 day ago

Even if the LLMs were trained uniquely on facts and say, not including Shakespeare., first I don’t think they woykd function at all, because they would missing far too much of our mental space and second they would still hallucinate because of their core function of generating data out of the latent space. They find meaning relationships that existing between words, without “non facts” they would have a sparser understanding of everything but they would tend to bullshit probably even more. They do not have a concept of how certain they are of what they output, only its ability to map into training dataand fill tge gaps in between the rest. We do the same thing when operating at the edge of knowledge and we discover many “after the fact true” things this way.

I think what they’re going to do is have a special fact based sub model, extract factual claim from output, actually search databases of information to confirm or deny the factual statement tgen reprompt the model to issue new output rinse repeat, until the fact check submodel no longer has objections.

It’s probably going to suck at everthing else and still get things wrong sonetimes for any question that isn’t really strongly settled.

teft@lemmy.world · 1 day ago

60fps Next Generation makes my brain hurt. It’s like I’m watching a soap opera.

chonglibloodsport@lemmy.world · 1 day ago

Just gonna leave this here!