Media Outlet:
Wired Publication Date:
Description:
"The key thing about this process is that the neural network doesn’t even know whether it’s correctly identifying state/action pairs when it starts—it doesn’t know how to “read”—much less whether it has correctly interpreted the advice they convey (do you build near a river, or should you never build by a river?)."