FaceForensics++ is a forensics dataset consisting of 1000 original video sequences that have been manipulated with four automated face manipulation methods: Deepfakes, Face2Face, FaceSwap and NeuralTextures. The data has been sourced from 977 youtube videos, and all videos contain a trackable, mostly frontal face without occlusions, which enables automated tampering methods to generate realistic forgeries. As we provide binary masks, the data can be used for image and video classification as well as segmentation. In addition, we provide 1000 Deepfakes models to generate and augment new data.

For more information, please consult [our updated paper](https://arxiv.org/abs/1901.08971).

We are offering an [automated benchmark](http://kaldir.vc.in.tum.de/faceforensics_benchmark/) for facial manipulation detection in the presence of compression, based on our manipulation methods, which contains 1000 images. If you are interested in testing your approach on unseen data, check it out!
## What is new
- __DeepFakes Detection Dataset:__ We are hosting the DeepFakes Detection Dataset provided by Google & JigSaw. It contains over 3000 manipulated videos from 28 actors in various scenes. The dataset has a similar file structure and is downloaded by default together with the regular dataset.
- __Neural Textures:__ We included a fourth manipulation method that does face manipulation using GANs and [Neural Textures](https://arxiv.org/pdf/1904.12356.pdf). All results have been updated to incorporate the new manipulation method, and we have updated the benchmark as well. We refer to the paper for more information.

Unfortunately, we won't continue to support the old benchmark after this update, though you can still submit your models to the new benchmark by creating a new submission.
## Access
If you have not received a response within a week, it is likely that your email…

Once you obtain the download link, please head to the [download section](dataset/README.md). You can also find details about the generation of the dataset there.
## Original FaceForensics
You can view the original FaceForensics github [here](https://github.com/ondyari/FaceForensics/tree/original). Any request will also contain the download link to the original version of our dataset.

Please view our youtube video [here](https://www.youtube.com/watch?v=x2g48Q2I2ZQ).

We use the [faceswap implementation from the deepfakes github](https://github.com/deepfakes/faceswap) for our generated DeepFakes videos. We made some changes to their implementation to make it fully automatic for our extracted videos. If you are interested in their current status, please head to the corresponding github.

We provide the source code that was used for our experiments, as well as scripts to produce new videos and to recreate our manipulated videos using the provided models.

NeuralTextures videos are manipulated by using the Face2Face face model to track and render corresponding UV masks. Those masks are then fed into an encoder-decoder architecture which is optimized using Neural Textures. See [the appendix](https://arxiv.org/pdf/1901.08971.pdf) as well as [the original paper](https://arxiv.org/pdf/1904.12356.pdf) for more implementation details.
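
For intuition, here is a minimal, self-contained sketch of such an encoder-decoder (not the actual training code of this repository; the layer sizes and the rendered-UV input are illustrative assumptions):

```python
import torch
import torch.nn as nn

class ReenactmentNet(nn.Module):
    """Toy encoder-decoder: maps a rendered UV/texture crop to an RGB output."""
    def __init__(self):
        super().__init__()
        # Encoder: downsample the rendered input region.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(),
        )
        # Decoder: upsample back to image resolution.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 3, 4, stride=2, padding=1), nn.Tanh(),
        )

    def forward(self, uv_render):
        return self.decoder(self.encoder(uv_render))

net = ReenactmentNet()
out = net(torch.randn(1, 3, 256, 256))  # one hypothetical 256x256 rendered crop
print(out.shape)                        # torch.Size([1, 3, 256, 256])
```

The real pipeline additionally optimizes a neural texture and uses an adversarial loss; see the linked papers for details.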
## Example Video

## Disclaimer about Reenactment Quality
We manipulated every video in our dataset multiple times and chose the best visual result. This was needed because we trained our network using a generative adversarial loss, which sometimes corrupted the training and output process. At its core, NeuralTextures reenacts the face motions of an input video on a target video, though with this adversarial approach the reenactment may not always be precise. As we only evaluated for visual quality, such instances can occur, but visual quality is the more important factor for our main detection task.
## Masks
In comparison to FaceSwap or Face2Face, it is not straightforward what to select as the NeuralTextures manipulated area. We mainly manipulate the region below the mouth, though we feed the network a quadratic region around the Face2Face mask, scaled by a factor of 1.7. The network can introduce noise in that area, but does not necessarily do so. We provide the used Face2Face masks for now, but will include the UV masks as well in the near future.
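
As a rough sketch, such a scaled quadratic (square) region around a face bounding box could be computed as follows (a hypothetical helper, not the repository's tracking code):

```python
def scaled_square_crop(x, y, w, h, scale=1.7):
    """Return a square crop of side scale * max(w, h) centered on the box (x, y, w, h)."""
    cx, cy = x + w / 2, y + h / 2  # center of the face bounding box
    side = scale * max(w, h)       # quadratic region, scaled by the given factor
    x0 = int(cx - side / 2)
    y0 = int(cy - side / 2)
    # Remember to clamp (x0, y0, side, side) to the image bounds before cropping.
    return x0, y0, int(side), int(side)

print(scaled_square_crop(100, 80, 120, 150))  # (32, 27, 255, 255)
```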

If you would like to download the FaceForensics++ data, please fill out [this google form](https://docs.google.com/forms/d/e/1FAIpQLSdRRR3L5zAv6tQ_CKxmK4W96tAab_pfBu2EKAgQbeDVhmXagg/viewform); once accepted, we will send you the link to our download script, which will be used throughout this text to obtain the full dataset. This includes 977 downloaded youtube videos, 1000 original extracted sequences that contain an unobstructed face that can be easily tracked, as well as their manipulated versions using our four methods: Deepfakes, Face2Face, FaceSwap and NeuralTextures. We also provide all Deepfakes models.


(Example of a Face2Face manipulated video; videos of other methods can be found in their respective folders)

There are two ways to get the dataset: you can either use the script to download all images or videos, or generate most of the data on your own using the scripts provided in this folder, which saves quite a bit of bandwidth if you are interested in the raw image material. However, you will have to download the Face2Face manipulated videos/images, as there is no publicly available implementation to generate them from scratch.

The dataset has the following folder structure, which will be produced by either the download or generation scripts.

```
FaceForensics++ dataset
    < contains all original downloaded videos, video information files and their extracted
      sequences, which can be used to extract the original sequences used in the dataset >
|-- original_sequences
    |-- youtube
        < c0/raw original sequence images/videos of the FaceForensics++ dataset >
        < c23/hq original sequence images/videos >
        < c40/lq original sequence images/videos >
    |-- actors
        < images/videos from the DeepFakesDetection dataset >
|-- manipulated_sequences
    |-- Deepfakes
        < images/videos of all three compression degrees as well as models and masks after Poisson image editing >
    |-- DeepFakesDetection
        < images/videos ... as well as masks >
    |-- Face2Face
        < images/videos ... as well as masks >
    |-- FaceSwap
        < images/videos ... as well as masks >
    |-- NeuralTextures
        < images/videos ... as well as masks >
```
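
For instance, a path into this structure could be assembled as below; note that the `<compression>/<file type>` sub-layout (e.g., `c23/videos`) is an assumption based on the compression and file-type options of the download script:

```python
import os

def sequence_dir(root, method="Deepfakes", compression="c23", file_type="videos"):
    """Build the folder path for one manipulation method at one compression level."""
    return os.path.join(root, "manipulated_sequences", method, compression, file_type)

print(sequence_dir("FaceForensics++"))
# FaceForensics++/manipulated_sequences/Deepfakes/c23/videos
```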
### Original sequence filenames
- FaceForensics++: We renamed all original sequences saved in the `youtube` folder to integers between `0` and `999`. The original youtube ids can be recovered using `conversion_dict.json` (see the sketch below).
- DeepFakesDetection: The original DeepFakesDetection sequences are stored in the `actors` folder. The sequence filenames are of the form `<actor number>__<scene name>`.
### Manipulated sequence filenames
- FaceForensics++: All filenames are of the form `<target sequence>_<source sequence>`, so you can easily identify the sources.
- DeepFakesDetection: We employ a similar naming scheme here, though it is a little more involved. The naming scheme is `<target actor>_<source actor>__<sequence name>__<8 character long experiment id>`. The experiment id is necessary as some actor pairings have been recorded multiple times.
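
As a sketch, sources and original youtube ids can be recovered from these filenames like this (the sequence ids are made-up examples, and the snippet assumes `conversion_dict.json` maps our integer filenames to youtube ids):

```python
import json

# FaceForensics++ manipulated sequences: <target sequence>_<source sequence>
target, source = "183_253".split("_")  # example filename

# Recover the original youtube ids of both sequences.
with open("conversion_dict.json") as f:
    conversion = json.load(f)  # assumed mapping: sequence id -> youtube id
print(conversion[target], conversion[source])

# DeepFakesDetection: <target actor>_<source actor>__<sequence name>__<experiment id>
actors, scene, experiment_id = "01_02__example_scene__ABCDEFGH".split("__")  # example filename
target_actor, source_actor = actors.split("_")
```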
### Space requirement
Here is an overview of the space required to save/download the dataset:

- FaceForensics++
    - The original downloaded source videos from youtube: 38.5GB
    - All h264 compressed videos with compression rate factor
        - raw/0: ~500GB
        - c23: ~10GB
        - c40: ~2GB
    - All raw extracted images as pngs: ~2TB
- DeepFakesDetection
    - The 363 original source actor videos
        - raw/0: ~200GB
        - c23: ~3GB
        - c40: ~400MB
    - The 3068 manipulated videos
        - raw/0: ~1.6TB
        - c23: ~22GB
        - c40: ~3GB
## 1. Download script
### General usage
Please consult

`python download-FaceForensics_v4.py -h`

for a detailed overview of the download script's parameter choices and their respective defaults. The general usage is as follows:
```shell
python download-FaceForensics_v4.py
<output path>
-d <dataset type, e.g., Face2Face, original or all>
-c <compression quality, e.g., c23 or raw>
-t <file type, e.g., videos, masks or models>
```

We advise you to download the compressed videos and extract the frames on your own.
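
A minimal way to do that extraction yourself (not part of the download script; assumes OpenCV, i.e., `pip install opencv-python`) could look like:

```python
import os
import cv2

def extract_frames(video_path, out_dir):
    """Dump every frame of a video as a numbered png."""
    os.makedirs(out_dir, exist_ok=True)
    reader = cv2.VideoCapture(video_path)
    frame_num = 0
    while True:
        success, frame = reader.read()
        if not success:
            break
        cv2.imwrite(os.path.join(out_dir, "{:04d}.png".format(frame_num)), frame)
        frame_num += 1
    reader.release()

extract_frames("000.mp4", "000/")  # hypothetical sequence filename
```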
### Examples
In order to download all light compressed (i.e., a visually lossless compression rate factor of 23 using the h264 codec) original as well as altered videos of all four manipulation methods, use

`python download-FaceForensics_v4.py <output path> -d all -c c23 -t videos`

If you are only interested in a few samples of the dataset, say 10, append `--num_videos 10`.
For all raw/lossless compressed (i.e., a compression rate factor of 0) extracted original videos run
`python download-FaceForensics_v4.py <output path> -d original -c raw -t videos`
The DeepFakesDetection dataset videos can be obtained by running

`python download-FaceForensics_v4.py <output path> -d <DeepFakesDetection or DeepFakesDetection_original> -c raw -t videos`

The zipped file contains all downloaded videos in their original length as well as a json file containing the frames that were extracted for our dataset. If you are only interested in the frame locations and video information because you want to download the videos on your own, use the corresponding download-script option (see `-h`).

We only downloaded the source videos without audio. However, you can re-download the full videos using the provided video information files.

We provide binary masks for all our manipulation methods. For FaceSwap and Face2Face those masks are pretty self-explanatory. However, it is more difficult for DeepFakes and NeuralTextures (see the usage sketch after this list).
- Deepfakes: after we feed our face through the auto-encoder and warp it back into the image, we apply Poisson image editing. This process is done on a rectangular box around the face. Please consult the [DeepFakes readme](datasets/DeepFakes).
- NeuralTextures: NeuralTextures takes a 1.7-scaled part around the face bounding box of the Face2Face tracker as input and manipulates the whole region. However, the method has skip connections which allow it to directly copy pixel values from non-face areas of this crop. The NeuralTextures masks report the tracking results for those regions, though we will upload the manipulated regions as well and add more details to this process soon.
- DeepFakesDetection: masks are provided directly from the DeepFake output and are thus not rectangular like the Deepfakes masks provided in FaceForensics++. We will provide those in the near future.
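
As a usage sketch (file paths are hypothetical; assumes OpenCV and that mask frames are aligned with the manipulated frames), a binary mask can be thresholded and used to cut out the manipulated region, e.g., to build segmentation training data:

```python
import cv2

frame = cv2.imread("manipulated_frame.png")                # a manipulated frame
mask = cv2.imread("mask_frame.png", cv2.IMREAD_GRAYSCALE)  # its corresponding mask

# Binarize: manipulated pixels -> 255, everything else -> 0.
_, binary = cv2.threshold(mask, 127, 255, cv2.THRESH_BINARY)

# Keep only the manipulated region of the frame.
manipulated_region = cv2.bitwise_and(frame, frame, mask=binary)
cv2.imwrite("manipulated_region.png", manipulated_region)
```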