DeepFloyd IF, which requires a GPU with at least 16GB of Memory to execute, was trained on a dataset of more than a billion images and texts. It can produce an image given a cue like "a teddy bear wearing a shirt that reads "Deep Floyd," optionally in a variety of styles.