it really is shown that the simple pre-education job of predicting which caption goes with which impression is an productive and scalable way to find out SOTA picture representations from scratch with a dataset of 400 https://k2spiceshop.com/product/liquid-k2-on-paper-online/