Ing. Adéla Šubrtová

All publications

Diffusion Image Analogies

  • DOI: 10.1145/3588432.3591558
  • Link: https://doi.org/10.1145/3588432.3591558
  • Department: Department of Computer Graphics and Interaction, Visual Recognition Group
  • Abstract:
    In this paper we present Diffusion Image Analogies, an example-based image editing approach that builds upon the concept of image analogies originally introduced by Hertzmann et al. [2001]. Given a pair of images that specifies the intent of a specific transition, our approach makes it possible to modify the target image so that it follows the analogy specified by this exemplar. In contrast to previous techniques, which captured analogies mostly in low-level textural details, our approach also handles changes in higher-level semantics, including transitions of object domain, changes of facial expression, or stylization. Although similar modifications can be achieved using diffusion models guided by text prompts [Rombach et al. 2022], our approach can operate solely in the domain of images, without the need to specify the user's intent in textual form. We demonstrate the power of our approach in various challenging scenarios where the specified analogy would be difficult to transfer using previous techniques.
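
The analogy transfer can be pictured as vector arithmetic in an image-embedding space. The sketch below is a minimal Python illustration, assuming the Hugging Face transformers implementation of CLIP; the file names A.png, A_prime.png, and B.png are placeholders, and the diffusion sampling step that would consume the shifted embedding is only indicated in a comment, since it is specific to the paper's pipeline.

    import torch
    from PIL import Image
    from transformers import CLIPModel, CLIPProcessor

    # Assumed components: OpenAI CLIP weights served via Hugging Face.
    model = CLIPModel.from_pretrained("openai/clip-vit-large-patch14")
    processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")

    def embed(image):
        """Return the L2-normalized CLIP embedding of an image."""
        inputs = processor(images=image, return_tensors="pt")
        with torch.no_grad():
            feats = model.get_image_features(**inputs)
        return feats / feats.norm(dim=-1, keepdim=True)

    a = embed(Image.open("A.png"))              # exemplar: before
    a_prime = embed(Image.open("A_prime.png"))  # exemplar: after
    b = embed(Image.open("B.png"))              # target image

    direction = a_prime - a          # what the exemplar pair changes
    b_prime_target = b + direction   # where the edited target should land
    # A latent diffusion model conditioned on b_prime_target would then
    # synthesize B'; that guidance step is omitted here.

Because CLIP embeddings encode semantics rather than raw texture, the difference vector can represent high-level changes such as a domain transition or an expression change, which is what lets this style of analogy go beyond low-level texture transfer.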

ChunkyGAN: Real Image Inversion via Segments

  • DOI: 10.1007/978-3-031-20050-2_12
  • Link: https://doi.org/10.1007/978-3-031-20050-2_12
  • Department: Department of Cybernetics, Department of Computer Graphics and Interaction, Visual Recognition Group
  • Abstract:
    We present ChunkyGAN, a novel paradigm for modeling and editing images using generative adversarial networks. Unlike previous techniques seeking a global latent representation of the input image, our approach subdivides the input image into a set of smaller components (chunks) specified either manually or automatically using a pre-trained segmentation network. For each chunk, the latent code of a generative network is estimated locally with greater accuracy thanks to a smaller number of constraints. Moreover, during the optimization of latent codes, the segmentation can be further refined to improve matching quality. This process enables high-quality projection of the original image with spatial disentanglement that previous methods would find challenging to achieve. To demonstrate the advantage of our approach, we evaluated it quantitatively and qualitatively in various image editing scenarios that benefit from the higher reconstruction quality and local nature of the approach. Our method is flexible enough to manipulate even out-of-domain images that would be hard to reconstruct using global techniques.
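
To make the per-chunk inversion concrete, here is a minimal PyTorch sketch under stated assumptions: G is any pre-trained generator mapping a 512-dimensional latent code to an image, target is the image to invert, and masks is a list of binary chunk masks obtained manually or from a segmentation network. The paper additionally refines the segmentation during optimization and uses stronger losses; this sketch keeps only the core idea.

    import torch

    def invert_by_chunks(G, target, masks, steps=500, lr=0.05):
        # One independent latent code per chunk.
        codes = [torch.randn(1, 512, requires_grad=True) for _ in masks]
        opt = torch.optim.Adam(codes, lr=lr)
        for _ in range(steps):
            opt.zero_grad()
            # Each code only has to explain the pixels under its own mask,
            # so it faces fewer constraints than a single global code.
            loss = sum((((G(w) - target) ** 2) * m).mean()
                       for w, m in zip(codes, masks))
            loss.backward()
            opt.step()
        # Composite the reconstruction chunk by chunk.
        with torch.no_grad():
            out = sum(G(w) * m for w, m in zip(codes, masks))
        return out, codes

Editing is then local by construction: manipulating one chunk's latent code changes only the pixels under its mask, which is the spatial disentanglement the abstract refers to.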

Hairstyle Transfer between Face Images

  • DOI: 10.1109/FG52635.2021.9667038
  • Link: https://doi.org/10.1109/FG52635.2021.9667038
  • Department: Visual Recognition Group, Machine Learning
  • Abstract:
    We propose a neural network that takes two inputs, a hair image and a face image, and produces an output image in which the hair from the hair image is seamlessly merged with the inner face of the face image. Our architecture consists of neural networks mapping the input images into a latent code of a pretrained StyleGAN2, which generates the high-definition output image. We propose an algorithm for training the parameters of the architecture solely from synthetic images generated by StyleGAN2 itself, without the need for any annotations or an external dataset of hairstyle images. We empirically demonstrate the effectiveness of our method in applications including hairstyle transfer, hair generation for 3D morphable models, and hairstyle interpolation. The fidelity of the generated images is verified by a user study and by a novel hairstyle metric proposed in the paper.
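
The two-encoder design can be sketched as follows; everything below is illustrative rather than the paper's actual network: the backbone is a toy CNN, the fusion of the two predictions is a plain average instead of a learned mapping, and the frozen pre-trained StyleGAN2 generator G is assumed to be available separately (shapes follow StyleGAN2 at 1024x1024, i.e., 18 style vectors of 512 dimensions).

    import torch
    import torch.nn as nn

    class ImageToWPlus(nn.Module):
        """Toy encoder mapping an RGB image to a W+ latent (18 x 512)."""
        def __init__(self, num_styles=18, dim=512):
            super().__init__()
            self.backbone = nn.Sequential(
                nn.Conv2d(3, 64, 7, stride=2, padding=3), nn.ReLU(),
                nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            )
            self.head = nn.Linear(128, num_styles * dim)
            self.num_styles, self.dim = num_styles, dim

        def forward(self, x):
            w = self.head(self.backbone(x))
            return w.view(-1, self.num_styles, self.dim)

    hair_encoder, face_encoder = ImageToWPlus(), ImageToWPlus()
    hair_img = torch.randn(1, 3, 256, 256)  # placeholder inputs
    face_img = torch.randn(1, 3, 256, 256)
    # Hypothetical fusion: average the two W+ predictions.
    w_plus = 0.5 * (hair_encoder(hair_img) + face_encoder(face_img))
    # output = G.synthesis(w_plus)  # frozen StyleGAN2, not shown here

The annotation-free training described in the abstract fits this setup naturally: sample latent codes, let StyleGAN2 render the corresponding faces, and use those synthetic images as supervision, so no external hairstyle dataset is needed.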

Responsible for this page: Ing. Mgr. Radovan Suk