Skip to content

Latest commit

 

History

History
221 lines (204 loc) · 9.14 KB

README.md

File metadata and controls

221 lines (204 loc) · 9.14 KB

FFgan

A cutting-edge program that leverages the power of Generative Adversarial Networks (GANs) to generate synthetic faces.

Description

FFgan is an innovative program that harnesses the power of Generative Adversarial Networks (GANs) to generate synthetic faces. GANs consist of two neural networks, a generator, and a discriminator, engaged in a competitive learning process. The generator network learns to produce realistic synthetic faces, while the discriminator network learns to distinguish between real and synthetic faces.

By leveraging the advanced capabilities of the Dlib library for image processing and incorporating a specific DCGAN model for color image generation, FFgan offers a powerful and versatile platform for face generation. It enables users to create lifelike synthetic faces with exceptional realism and diversity.

This technology has significant implications for both research and model production. In research, FFgan provides a valuable tool for studying facial characteristics, exploring variations in facial expressions, and analyzing demographic trends without relying on real-world data. It offers researchers the ability to generate large datasets with controlled variables for training and evaluating facial recognition systems, expression analysis algorithms, and more.

FFgan opens up new possibilities for designing and iterating on novel face models. It allows designers and developers to experiment with different facial features, expressions, and attributes, accelerating the development of innovative applications in various fields, including computer graphics, entertainment, virtual reality, and character design.

An additional advantage of FFgan is its contribution to the protection of personal data. As the generation process relies solely on synthetic data, there is no need to use real faces or store large-scale personal datasets. This mitigates privacy concerns and reduces the risk of data breaches, making it a privacy-friendly solution for face-related research and applications.

Features

  • GAN-based face generation.
  • Integration of Dlib library for image processing and IA model (also the Boost library).
  • DCGAN for color image generation of medium resolution (up to 162 pixels).
  • Built-in internal web server for convenient model testing.
  • Ready to use without extensive setup (pre-computed model provided).

Installation

To install FFgan and recompile the program, follow these steps:

  1. Ensure that you have the latest version of Dlib (version 19.24) installed.
  2. Make sure you have a compatible platform, such as Windows 10, and the Microsoft Visual Studio 2022 (64-bit, Version 17.6.0) compiler.
  3. Download the FFgan source code from the GitHub repository.
  4. Open the project in Microsoft Visual Studio.
  5. Recompile the project using the provided source files.

The training process for FFgan involved utilizing a dataset of over 150,000 faces extracted from real photos captured from the internet. A dedicated crawler was created specifically for this purpose. The face extraction and alignment process used traditional techniques, including those demonstrated in the examples of the Dlib library.

FFgan designed from the example "dnn_dcgan_train_ex.cpp" (available at http://dlib.net/dnn_dcgan_train_ex.cpp.html). The CNN model has been adapted for generating high-quality color images.

Generator model:

Layer Output Shape
Input (1x100 Noise Tensor) (1, 100)
ReLU/BatchNorm (4, 4, 512)
ConvTranspose (10, 10, 256)
ReLU/BatchNorm (20, 20, 128)
ConvTranspose (40, 40, 128)
ReLU/BatchNorm (80, 80, 64)
ConvTranspose (162, 162, 3)
Output (162, 162, 3)
Sigmoid (1, 1)
FC (1, 1)

Discriminator model:

Layer Output Shape
Input: RGB Image (162, 162, 3)
Convolution (160, 160, 3)
LeakyReLU/BatchNorm/Dropout (160, 160, 3)
Convolution (80, 80, 512)
LeakyReLU/BatchNorm/Dropout (80, 80, 512)
Convolution (78, 78, 256)
LeakyReLU/BatchNorm/Dropout (78, 78, 256)
Convolution (39, 39, 128)
LeakyReLU/BatchNorm/Dropout (39, 39, 128)
Convolution (19, 19, 128)
LeakyReLU/BatchNorm/Dropout (19, 19, 128)
Convolution (9, 9, 64)
LeakyReLU/BatchNorm/Dropout (9, 9, 64)
FullyConnected (1, 1)
Output Real/Fake Classification

Usage

The first example demonstrates training the FFgan model using the images in the specified directory. The second example shows how to generate a specified number of images automatically using the trained model. In this case, the command FFgan --gen 10 generates 10 images.

FFgan --train <directory>

Description:
Trains or fine-tunes the FFgan model using the images provided in the specified directory. The directory should directly contain all the images or subdirectories containing the images. The images will be resized to 162x162 pixels during training. It is recommended that the images have a minimum size of 162 pixels on each side (note that the default face extraction modules in Dlib extract faces of 200 pixels on each side).

Arguments:

  • --train <directory>: Specifies the directory containing the training images. The directory should directly contain the images or subdirectories containing the images.


FFgan --gen <number>

Description:
Generates a specified number of images automatically and displays them in a window. The program performs a test using the discriminator to check if the generated image is an acceptable "candidate." If not, it iterates a certain number of times to try to find a higher-quality image.

Arguments:

  • --gen <number>: Generates a specified number of images automatically and displays them in a window. The program does not perform a test based on the specified number; instead, it allows the user to manually stop the generation process by closing the window where the generated faces are displayed or by using Ctrl+C in the execution console.


FFgan --web

Description:
Instantiates a local web server listening for requests on port 9190 and generates a face receiving a request from a Web browser.

Arguments:

  • --gen <number>: Allows direct access to the generated images via a web interface. Users can access the generated images by navigating to http://localhost:9190 in their web browser.

License

This program is licensed under the GNU General Public License (GPL). The GPL grants users the freedom to use, modify, and distribute the software. However, commercial usage is not allowed under this license. It will also be strongly appreciated that any external usage of this program, especially in academic area, includes proper attribution to the author and his work. Please provide a reference to Cydral and acknowledge its contributions when using the FFgan program for research or other purposes.

Acknowledgments

Special thanks to Davis E. King and all the contributors for the amazing Dlib library. Their dedication and hard work have made it possible to develop high-quality and efficient AI models using Dlib. We are grateful for the quality and speed of the AI models provided by Dlib, which greatly contribute to the success of the FFgan program.