Acknowledgements
This work was supported by the ERC Ad- vanced Grant SIMULACRON, by the Munich Center for Machine Learn- ing and by the EPSRC Programme Grant VisualAI EP/T028572/1. C. R. is supported by VisualAI EP/T028572/1 and ERC-UNION-CoG-101001212.