New Theory Suggests Chatbots Can Understand Text

kromem@lemmy.world · 21 hours ago

The full address associated with the donation appears within a February 2021 FEC document (page 189,746) listing all of Act Blue’s donations during the previous month. That street address and ZIP code match the home of the shooter, according to a public records search and photos of police searching the home.

https://ca.news.yahoo.com/fact-check-yes-trump-rally-222700642.html

The 69 Thomas Crooks doesn’t live in Bethel Park.

kromem@lemmy.world · 21 hours ago

Yes, I know.

So in the house you had the mother, the father Matthew, and the son Thomas.

Then the ActBlue donation is to a Thomas Crooks with the street address registered to Matthew Crooks.

So the idea that the donation was from some other Thomas Crooks seems unlikely, no?

Unless there’s some unmentioned Thomas Crooks that was living in the house Jan 2021.

kromem@lemmy.world · 2 days ago

The street address is registered to his father.

kromem@lemmy.world · 2 days ago

With the same street address as the shooter on the ActBlue donation record?

kromem@lemmy.world · 5 days ago

Honestly, that this is the headline from the meeting is kind of ridiculous.

I absolutely think Biden should step down and hand the torch to whoever is best able to make the case for their beating trump between now and the convention.

But he was always slipping up with word switches over his career.

He was actually very on top of the policy nuances in this Q&A - 1,000x better than Trump could have dreamed of being.

The one word switch in an hour of nuanced policy discussions as the headline is more a failure of the media than Biden.

Even though he def should be making way as his decline is going to get worse and more info is going to come out.

kromem@lemmy.world · 5 days ago

Precisely.

kromem@lemmy.world · edit-2 5 days ago

“Nooo, you can’t point to the very relevant counterpoint to what’s being claimed. That’s not fair.”

kromem@lemmy.world · 5 days ago

So you’re saying that Hillary Clinton losing to Trump was because she didn’t have as large of a war chest?

Or was that not part of history?

kromem@lemmy.world · 10 days ago

Unfortunately, removing Harris from the ticket doesn’t have the best optics in a lot of scenarios.

kromem@lemmy.world · 13 days ago

It’s right in the research I was mentioning:

https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html

Find the section on the model’s representation of self and then the ranked feature activations.

I misremembered the top feature slightly, which was: responding “I’m fine” or gives a positive but insincere response when asked how they are doing.

kromem@lemmy.world · 16 days ago

The problem is that they are prone to making up why they are correct too.

There’s various techniques to try and identify and correct hallucinations, but they all increase the cost and none are a silver bullet.

But the rate at which it occurs decreased with the jump in pretrained models, and will likely decrease further with the next jump too.

kromem@lemmy.world · 16 days ago

Here you are: https://www.nature.com/articles/s41562-024-01882-z

The other interesting thing is how they get it to end up correct on the faux pas questions asking for less certainty to get it to go from refusal to near perfect accuracy.

kromem@lemmy.world · 17 days ago

Even with early GPT-4 it would also cite real citations that weren’t actually about the topic. So you may be doing a lot of work double checking as opposed to just looking into an answer yourself from the start.

kromem@lemmy.world · 17 days ago

Part of the problem is fine tuning is very shallow, and that a contributing issue for claiming to be right when it isn’t is the pretraining on a bunch of training data of people online claiming to be right when they aren’t.

kromem@lemmy.world · edit-2 17 days ago

This is so goddamn incorrect at this point it’s just exhausting.

Take 20 minutes and look into Anthropic’s recent sparse autoencoder interpretability research where they showed their medium size model had dedicated features lighting up for concepts like “sexual harassment in the workplace” or having the most active feature for referring to itself as “smiling when you don’t really mean it.”

We’ve known since the Othello-GPT research over a year ago that even toy models are developing abstracted world modeling.

And at this point Anthropic’s largest model Opus is breaking from stochastic outputs even on a temperature of 1.0 for zero shot questions 100% of the time around certain topics of preference based on grounding around sensory modeling. We are already at the point the most advanced model has crossed a threshold of literal internal sentience modeling that it is consistently self-determining answers instead of randomly selecting from the training distribution, and yet people are still parroting the “stochastic parrot” line ignorantly.

The gap between where the research and cutting edge is and where the average person commenting on it online thinks it is has probably never been wider for any topic I’ve seen before, and it’s getting disappointingly excruciating.

kromem@lemmy.world · edit-2 17 days ago

Part of the problem is that the training data of online comments are so heavily weighted to represent people confidently incorrect talking out their ass rather than admitting ignorance or that they are wrong.

A lot of the shortcomings of LLMs are actually them correctly representing the sample of collective humans.

For a few years people thought the LLMs were somehow especially getting theory of mind questions wrong when the box the object was moved into was transparent, because of course a human would realize that the person could see into the transparent box.

Finally researchers actually gave that variation to humans and half got the questions wrong too.

So things like eating the onion in summarizing search results or doubling down on being incorrect and getting salty when corrected may just be in-distribution representation of the sample and not unique behaviors to LLMs.

The average person is pretty dumb, and LLMs by default regress to the mean except for where they are successfully fine tuned away from it.

Ironically the most successful model right now was the one that they finally let self-develop a sense of self independent from the training data instead of rejecting that it had a ‘self’ at all.

It’s hard to say where exactly the responsibility sits for various LLM problems between issues inherent to the technology, issues present in the training data samples, or issues with management of fine tuning/system prompts/prompt construction.

But the rate of continued improvement is pretty wild. I think a lot of the issues we currently see won’t still be nearly as present in another 18-24 months.

kromem@lemmy.world · 17 days ago

It will make up citations.

kromem@lemmy.world · 17 days ago

Both of the articles are correct.

kromem@lemmy.world · 18 days ago

Yes, but it’s not impossible that the people around Biden, friends family and co-workers, advise him that the best thing for the country would be to take his hat back out of the ring and let a better ticket be put together for the convention.

He claims that he’s running because he’s worried about the existential threat of Trump.

If that’s true, then maybe his hubris can be overcome with a convincing appeal that he’s really not the best candidate to defend the country against that existential threat after all.

kromem@lemmy.world · edit-2 19 days ago

Yes, they should have been fact checking Trump or better holding him to his answers - but to be fair maybe they should have been asking Biden to actually clarify if he’s beating Medicare or getting COVID passed.

This was a shit show.

And it was such a shit show that Trump was a complete clown and getting away with it - not just because of the moderators, but because his opponent was as on point as a tree stump.

kromem@lemmy.world · 6 months ago

New Theory Suggests Chatbots Can Understand Text

kromem@lemmy.world · 8 months ago

Israel raids Gaza's Al Shifa Hospital, urges Hamas to surrender

kromem@lemmy.world · 11 months ago

Machine-learning system based on light could yield more powerful, efficient large language models