Style
244 |
246 | Samples from our method with style conditioning compared against other methods. We used an empty prompt 247 | and only conditioned on the image. We generally perform on par with IP-Adapter and outperform it on some 248 | samples. Note that the third image from the left is less degraded, and the third image from the right 249 | captures the mane of the horse better. 250 |
251 |Structure
252 |
255 | Samples from our method with structural conditioning compared against other methods. Note that for our 256 | method, especially compared with T2I Adapter, the details of the images are substantially more closely 257 | aligned with the depth prompt (see e.g. the lamp in the background of the living room scene and the side 258 | table's legs, or the salad on the pizza) 259 |
260 |