
Exploring generative models with Python

Javier Jorge Cano

PyCon 2018

Problem definition

• Many successful applications of machine learning techniques rely on the large amounts of data currently available.

• But sometimes data is scarce, e.g. when it is difficult to collect (as in medical imaging).

• We can obtain more data by performing controlled distortions that do not modify the true nature of the sample; this is called data augmentation.
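A minimal sketch of such hand-crafted distortions on a grayscale image, using only NumPy (the function names and parameters here are illustrative, not from the talk's actual code):

```python
import numpy as np

def shift(img, dx, dy):
    """Translate the image by (dx, dy) pixels, wrapping around the edges."""
    return np.roll(np.roll(img, dy, axis=0), dx, axis=1)

def add_noise(img, scale=0.05, rng=None):
    """Perturb pixel values with small Gaussian noise, keeping them in [0, 1]."""
    if rng is None:
        rng = np.random.default_rng(0)
    return np.clip(img + rng.normal(0.0, scale, img.shape), 0.0, 1.0)

# One original sample becomes several augmented variants.
original = np.zeros((8, 8))
original[2:6, 2:6] = 1.0
augmented = [shift(original, 1, 0), shift(original, 0, -1),
             np.fliplr(original), add_noise(original)]
```

Each variant keeps the "true nature" of the sample (a blob of ink) while changing its position or pixel values slightly.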

Figure 1: Left to right: original image and different alterations.

Figure 2: Another example of simulating different positions and rotations with different transformations.

• These transformations are hand-crafted and problem-dependent. Could we provide a domain-agnostic approach to do this?

Generative models

• Neural Networks are good at classifying, meaning that they learn a “mapping” between the input and the output.

Figure 3: Example of DNN-based classifier.

• We can force this map to mirror the input, so we end up with a model that can reconstruct the sample.

Figure 4: Example of DNN-based reconstruction model.

• These models usually encode the input in some manifold, and then decode it to recover the original sample.

• These kinds of models are known as generative models.

• Could we use this model’s “inner representation” to create new outputs?

• Could these artificial samples be useful (e.g. for training classifiers)?

• Two techniques dominate recent approaches to these problems. One line of research is based on Generative Adversarial Networks (GAN) and the other is based on Variational Autoencoders (VAE).

• GAN: Based on a mixture of game theory and machine learning, where two networks (Generator and Discriminator) compete in a game in which one tries to fool the other.

Figure 5: The Discriminator learns to decide whether a sample xt is real or fake, while the Generator maps a latent vector zt to a generated sample x̂t, aiming to fool the Discriminator.

• VAE: Based on the encoding–decoding principle, these models aim to minimize the reconstruction error (e.g. mean squared error) while constraining the inner representation to be similar to a simple distribution (e.g. a Gaussian).

zt = µt + σt · ε,  with ε ∼ N(0, 1)

Figure 6: VAE architecture. The input xt is projected onto the manifold, obtaining the vector zt. The decoder takes this vector and produces the reconstructed input x̂t.
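The sampling step in Figure 6 is the so-called reparameterization trick, and can be sketched with NumPy alone. The values of µ and σ below are made up; in the real model they come from the encoder network:

```python
import numpy as np

rng = np.random.default_rng(0)

mu = np.array([0.5, -1.0])    # encoder's mean for one sample (illustrative)
sigma = np.array([0.1, 0.2])  # encoder's standard deviation (illustrative)

eps = rng.standard_normal(mu.shape)  # eps ~ N(0, 1)
z = mu + sigma * eps                 # z_t = mu_t + sigma_t * eps

# Sampling z this way keeps the path from (mu, sigma) to z
# differentiable, which is what lets the VAE be trained end to end.
```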

Figure 7: Left: 2D latent space learned by the model, trained on the MNIST dataset. Right: images obtained by sampling the 2D manifold.

• We are going to use VAEs, as they provide the manifold directly. With this manifold we can perform different modifications of the original data.

VAE for data augmentation

1 Train the VAE with your training set, using a validation set to monitor the model’s evolution.

2 Project your training set using the model, obtaining the projected version zt of each sample.

3 Alter these zt vectors, and then reconstruct them.
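The three steps above can be sketched with a toy linear encoder and decoder standing in for the trained VAE (a random matrix and its pseudo-inverse here are purely illustrative; in the real pipeline these are the VAE's networks):

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((2, 16))  # "encoder": 16-dim input -> 2-dim manifold
W_pinv = np.linalg.pinv(W)        # "decoder": back to the input space

def encode(x):
    return W @ x

def decode(z):
    return W_pinv @ z

x = rng.standard_normal(16)                   # one training sample
z = encode(x)                                 # step 2: project onto the manifold
z_altered = z + rng.normal(0.0, 0.1, z.shape) # step 3: alter the projection
x_new = decode(z_altered)                     # reconstruct an artificial sample
```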

Figure 8: Encoding the input, modifying it in the manifold, and then reconstructing the altered sample with the decoder.

Different methods to alter the projected samples:

Initial projection

Applying noise

Performing interpolation

Performing extrapolation

Figure 9: Example of these transformations in a 2D manifold.
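These three alterations can be sketched as follows, where z_i and z_j are the projections of two samples of the same class (the names and the 0.5 factor are illustrative choices, not the talk's exact settings):

```python
import numpy as np

rng = np.random.default_rng(0)
z_i = np.array([0.2, 0.8])   # projection of sample i (illustrative values)
z_j = np.array([1.0, -0.4])  # projection of a same-class sample j

noise = z_i + rng.normal(0.0, 0.1, z_i.shape)  # applying noise
interp = z_i + 0.5 * (z_j - z_i)               # interpolation: move towards z_j
extrap = z_j + 0.5 * (z_j - z_i)               # extrapolation: move past z_j
```

Decoding each altered vector yields a new artificial sample; extrapolation pushes beyond the observed samples, which is why it tends to produce the most novel variants.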

Figure 10: Real instances and the outputs after adding noise, interpolation, and extrapolation; each column corresponds to the same digit.

Figure 11: Smooth transformations that capture changes in scale and rotation, and even changes between classes.

Use case: UJIPEN dataset

Labels  Dim.              Tr.    Val.  Ts.
97      4,900 (70x70 px)  5,820  582   4,656

Table 1: Dataset distribution (Tr. = training, Val. = validation, Ts. = test samples).

Figure 12: Some samples from UJIPEN.

                                   Generation methods
Classifier               Baseline  Noise  Interpolation  Extrapolation
Support vector machines  54.79     59.09  63.48          63.19
K-nearest neighbors      31.86     39.29  53.15          46.74
Neural Network           63.99     57.20  65.60          64.09

Table 2: UJIPEN results, accuracy on the test set (%) [1].

Figure 13: Top row: samples after interpolating. Bottom row: samples after extrapolating.

Figure 14: Linear interpolation between samples from the same class.

Conclusions

• We used a generative model, a VAE in particular, to create synthetic samples.

• We performed different alterations of the inner representation that improved the accuracy of the selected classifiers.

• As future work, we want to apply this model to other kinds of data: text, speech, feature vectors, etc.

Python For Science: All the experiments were performed with:

• NumPy.
• TensorFlow.
• Matplotlib.

Reference

[1] J. Jorge, J. Vieco, R. Paredes, J. A. Sanchez, and J. M. Benedí. Empirical evaluation of variational autoencoders for data augmentation. In Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, pages 96–104. INSTICC, SciTePress, 2018.
