SVAE

This module is the implementation of the Hyperspherical VAE proposed in (https://arxiv.org/abs/1804.00891). This models uses a Hyperspherical latent space.

Available samplers

HypersphereUniformSampler

Sampling from uniform distribution on hypersphere.

class pythae.models.SVAEConfig(input_dim=None, latent_dim=10, uses_default_encoder=True, uses_default_decoder=True, reconstruction_loss='mse')[source]

\(\mathcal{S}\)-VAE model config config class

Parameters
  • input_dim (tuple) – The input_data dimension.

  • latent_dim (int) – The latent space dimension in which lives the hypersphere. Default: None.

  • reconstruction_loss (str) – The reconstruction loss to use [‘bce’, ‘mse’]. Default: ‘mse’

class pythae.models.SVAE(model_config, encoder=None, decoder=None)[source]

\(\mathcal{S}\)-VAE model.

Parameters
  • model_config (SVAEConfig) – The Variational Autoencoder configuration setting the main

  • model. (parameters of the) –

  • encoder (BaseEncoder) – An instance of BaseEncoder (inheriting from torch.nn.Module which plays the role of encoder. This argument allows you to use your own neural networks architectures if desired. If None is provided, a simple Multi Layer Preception (https://en.wikipedia.org/wiki/Multilayer_perceptron) is used. Default: None.

  • decoder (BaseDecoder) – An instance of BaseDecoder (inheriting from torch.nn.Module which plays the role of decoder. This argument allows you to use your own neural networks architectures if desired. If None is provided, a simple Multi Layer Preception (https://en.wikipedia.org/wiki/Multilayer_perceptron) is used. Default: None.

Note

For high dimensional data we advice you to provide you own network architectures. With the provided MLP you may end up with a MemoryError.

forward(inputs, **kwargs)[source]

The VAE model

Parameters

inputs (BaseDataset) – The training dataset with labels

Returns

An instance of ModelOutput containing all the relevant parameters

Return type

ModelOutput

get_nll(data, n_samples=1, batch_size=100)[source]

Function computed the estimate negative log-likelihood of the model. It uses importance sampling method with the approximate posterior distribution. This may take a while.

Parameters
  • data (torch.Tensor) – The input data from which the log-likelihood should be estimated. Data must be of shape [Batch x n_channels x …]

  • n_samples (int) – The number of importance samples to use for estimation

  • batch_size (int) – The batchsize to use to avoid memory issues