#also this isn't really torment nexus shit imo
Explore tagged Tumblr posts
Text
"reward hacking" in AI research will never not be the funniest shit. one time iRobot stuck bumper sensors on the front of Roombas and set up their reward function so they lost points whenever they bumped into something, so the Roombas responded by learning to drive around backwards so when they did inevitably hit something it didn't trip the sensors. there's a lesson in here somewhere about human behavior but i'll leave it as an exercise to the reader
#also this isn't really torment nexus shit imo#this is called “instrumental convergence” and was described in 2003 as specifically something to be wary of with AI#also as per the source this specific incident literally did not happen
1K notes
·
View notes