Jascha's blog
Posts
2023-03-09
The hot mess theory of AI misalignment: More intelligent agents behave less coherently
2022-11-06
Too much efficiency makes everything worse: overfitting and the strong version of Goodhart's law