728x90

VToonify

  • paper https://arxiv.org/abs/2209.11224
    • ์›๋ณธ ์ด๋ฏธ์ง€์˜ input ๊ฐ’์„ ์–ด๋Š ์ •๋„ degree๋กœ ๋ณ€ํ™˜ํ• ์ง€ ๋ณ€๊ฒฝ ๊ฐ€๋Šฅ
    • ๊ธฐ์กด์˜ StyleGAN์—์„œ ์˜์ƒ์„ Toonify ์ง„ํ–‰์‹œ ๋ฐœ์ƒํ•˜๋Š” ๋‹จ์ ์ธ ํ•ด์ƒ๋„ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ–ˆ๋‹ค.
    • Toonify์™€ DualStyleGAN ๋ชจ๋ธ ๋‘˜๋‹ค ์ง€์› (www.mmlab-ntu.com/project/vtoonify/)
    • null
      • multi-scale content condition์œผ๋กœ ์›๋ณธ ์ด๋ฏธ์ง€์™€ stylegan์˜ ๋ณ€ํ™˜ ์ด๋ฏธ์ง€๋ฅผ ํ•ฉ์ณค๋‹ค.(Trainable Fusion Module ์‚ฌ์šฉ)
  • ๊ธฐ์กด StyleGAN์€ ์˜์ƒ ๋ณ€ํ™˜์‹œ ๋ถ€์ž์—ฐ์Šค๋Ÿฌ์›€์„ ๊ฐœ์„ 
  • reference style์— ๋”ฐ๋ผ ๋ณ€ํ™˜ ๊ฐ€๋Šฅ
  • pre-trained model ์ œ๊ณต (์ •๋ง ๊ฐ์‚ฌ!)
  • ์ƒ์—…์  ์‚ฌ์šฉ ๋ถˆ๊ฐ€ ์ฃผ์˜ 
  • null
    VToonify
    null

M1 Mac์—์„œ CPU๋กœ ๊ตฌ๋™

Github

  • M1 MAC์€ CUDA๊ฐ€ ์—†์œผ๋ฏ€๋กœ stylegan์—์„œ ์ œ๊ณตํ•˜๋Š” cpu ๋ฒ„์ „์˜ ๋ชจ๋“ˆ๋กœ ๋ฐ”๊ฟ”์ค€๋‹ค.
# vtoonify์˜ model.stylegan.model.py ํŒŒ์ผ์—์„œ metric ๋ถˆ๋Ÿฌ์˜ค๋Š” ๋ถ€๋ถ„์˜ ๋ชจ๋“ˆ์„ op_cpu๋กœ ๋ฐ”๊ฟ”์ค€๋‹ค.
# op -> op_cpu

from model.stylegan.op_cpu import FusedLeakyReLU, fused_leaky_relu, upfirdn2d, conv2d_gradfix

# parameter style_degree 0~1 Toonify ์ •๋„
optional arguments:
  -h, --help            show this help message and exit
  --content CONTENT     path of the content image/video
  --style_id STYLE_ID   the id of the style image
  --style_degree STYLE_DEGREE
                        style degree for VToonify-D
  --color_transfer      transfer the color of the style
  --ckpt CKPT           path of the saved model
  --output_path OUTPUT_PATH
                        path of the output images
  --scale_image         resize and crop the image to best fit the model
  --style_encoder_path STYLE_ENCODER_PATH
                        path of the style encoder
  --exstyle_path EXSTYLE_PATH
                        path of the extrinsic style code
  --faceparsing_path FACEPARSING_PATH
                        path of the face parsing model
  --video               if true, video stylization; if false, image stylization
  --cpu                 if true, only use cpu
  --backbone BACKBONE   dualstylegan | toonify
  --padding PADDING PADDING PADDING PADDING
                        left, right, top, bottom paddings to the face center
  --batch_size BATCH_SIZE
                        batch size of frames when processing video
  --parsing_map_path PARSING_MAP_PATH
                        path of the refined parsing map of the target video

 

Result

  • cartoon ๋นผ๊ณ ๋Š” ๋‹ค ์ง•๊ทธ๋Ÿฌ์›Œ ๋ณด์ธ๋‹ค.
  • ๋ชจ๋ธ๊ณผ train util ๋˜ํ•œ ์ œ๊ณต๋˜๋ฏ€๋กœ ๊ณ ํํ’~, spongebobํ’ ๋“ฑ dataset ์ ์šฉํ•ด ๋ณผ ์ˆ˜ ์žˆ์„ ๊ฑฐ ๊ฐ™๋‹ค.
  • style_degree ์กฐ์ ˆ ๊ฐ€๋Šฅ
    null
๋ฐ˜์‘ํ˜•

'๐Ÿ‘พ Deep Learning' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[ASR, ] Deepspeech2  (0) 2023.02.22
[NVIDIA RIVA] ngc ๋“ฑ๋ก  (0) 2023.01.27
ViT(Vision in Transformer) Review  (0) 2022.12.19
Tensor ๊ฐ€์ง€๊ณ  ๋†€๊ธฐ [Einsum + einops]  (0) 2022.12.17
[Computer Vision] Image Modul Pillow import Error  (0) 2022.12.14
๋‹คํ–ˆ๋‹ค