From 4ee0b459256f641c3d179162a86c18cdea140691 Mon Sep 17 00:00:00 2001
From: summertight
Date: Sat, 21 Sep 2024 16:45:43 +0000
Subject: [PATCH] 123

---
 index.html | 21 ++++++++++++---------
 1 file changed, 12 insertions(+), 9 deletions(-)

diff --git a/index.html b/index.html
index 3eb29a6..f973a83 100644
--- a/index.html
+++ b/index.html
@@ -227,15 +227,18 @@

Key Idea

Download PDF.

- (A) Conceptual comparison between prior works and our method. Prior works rely on a seesaw game of two potentially conflicting losses: reconstruction loss and identity loss.
- On the other hand, our method leverages a self-supervised approach with a clear ground truth, which allows for more stable training.
-
- (B) Comparing our base approach (Ours Base) with our enhanced method (Ours Full),
- which includes techniques like perforation confusion and random mesh scaling.
- Green masks represent target-posed source 3DMM masks, red masks indicate target 3DMM masks, and orange masks denote their intersection.
- The first row shows that when the source face is larger than the target’s, the jaw is cut off. The second row
- shows the opposite case, where the base model fails to inpaint the remaining regions
- effectively, while Ours Full generates realistic face-swapped outputs.
+
+ (A) Conceptual comparison between prior works and our method. Prior works rely on a seesaw game of two potentially conflicting losses: reconstruction loss and identity loss. On the other hand, our method leverages a self-supervised approach with a clear ground truth, which allows for more stable training.
+
+
+ (B) Comparing our base approach (Ours Base) with our enhanced method (Ours Full), which includes techniques like perforation confusion and random mesh scaling. Green masks represent target-posed source 3DMM masks, red masks indicate target 3DMM masks, and orange masks denote their intersection.
+
+
+ The first row shows that when the source face is larger than the target’s, the jaw is cut off. The second row shows the opposite case, where the base model fails to inpaint the remaining regions effectively, while Ours Full generates realistic face-swapped outputs.
+
+