We use single TITAN RTX for training, but GPUs with less memory are still doable with smaller batch size (provided precomputed features).