Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

Publication
Advances in Neural Information Processing Systems (NeurIPS)