How evaluation guides AI research | Synapse