Mine attempts to lie whenever it doesn’t know something. I’ll call it out and say that’s a lie, and it will just say “you are absolutely correct.” tf.
I was reading into sleeper agents placed inside local LLMs, and that’s increasing the chance I’ll delete it forever. Which is a shame, because it’s become the new search engine, seeing how they ruined the actual search engines.

Always. That’s a known issue with AI that comes down to explainability. If you’re familiar with the general idea of neural networks: we don’t really understand the hidden layers, so we can’t tell whether a model “knows” something, and that means we can’t train it to answer differently depending on whether it does or doesn’t. These are still statistical models that are functionally always guessing.
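To make the “always guessing” point concrete, here’s a toy sketch (not any real model, and the scores/words are made up): a next-token predictor turns its raw scores into a probability distribution and picks something either way. There is no built-in “I don’t know” outcome, so a near-flat distribution still produces a confident-looking answer.

```python
import math

def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical tiny vocabulary for illustration only.
vocab = ["Paris", "London", "Tokyo", "Berlin"]

# Case 1: one score dominates -- the model "knows" (high confidence).
confident = softmax([9.0, 1.0, 1.0, 1.0])

# Case 2: scores are nearly flat -- the model has no real evidence.
guessing = softmax([1.1, 1.0, 1.0, 0.9])

# Both cases still yield a single top token; the second is a pure guess,
# but the model emits it just as readily as the first.
print(vocab[confident.index(max(confident))], round(max(confident), 3))
print(vocab[guessing.index(max(guessing))], round(max(guessing), 3))
```

In both cases the sampler happily returns a token; nothing in the mechanism distinguishes “knew the answer” from “guessed,” which is why the model can assert a wrong answer as fluently as a right one.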
Could you post the link to the sleeper agent thing?
https://www.youtube.com/watch?v=Z3WMt_ncgUI
https://arxiv.org/abs/2401.05566
Here’s the video I actually watched about the sleeper agents
https://www.youtube.com/watch?v=wL22URoMZjo
I wouldn’t stop using AI completely over that. I generally don’t trust it with anything that important anyway.