SVAE¶
This module is the implementation of the Hyperspherical VAE proposed in (https://arxiv.org/abs/1804.00891). This models uses a Hyperspherical latent space.
Available samplers¶
Sampling from uniform distribution on hypersphere. |
- class pythae.models.SVAEConfig(input_dim=None, latent_dim=10, uses_default_encoder=True, uses_default_decoder=True, reconstruction_loss='mse')[source]¶
\(\mathcal{S}\)-VAE model config config class
- class pythae.models.SVAE(model_config, encoder=None, decoder=None)[source]¶
\(\mathcal{S}\)-VAE model.
- Parameters
model_config (SVAEConfig) – The Variational Autoencoder configuration setting the main
model. (parameters of the) –
encoder (BaseEncoder) – An instance of BaseEncoder (inheriting from torch.nn.Module which plays the role of encoder. This argument allows you to use your own neural networks architectures if desired. If None is provided, a simple Multi Layer Preception (https://en.wikipedia.org/wiki/Multilayer_perceptron) is used. Default: None.
decoder (BaseDecoder) – An instance of BaseDecoder (inheriting from torch.nn.Module which plays the role of decoder. This argument allows you to use your own neural networks architectures if desired. If None is provided, a simple Multi Layer Preception (https://en.wikipedia.org/wiki/Multilayer_perceptron) is used. Default: None.
Note
For high dimensional data we advice you to provide you own network architectures. With the provided MLP you may end up with a
MemoryError.- forward(inputs, **kwargs)[source]¶
The VAE model
- Parameters
inputs (BaseDataset) – The training dataset with labels
- Returns
An instance of ModelOutput containing all the relevant parameters
- Return type
- get_nll(data, n_samples=1, batch_size=100)[source]¶
Function computed the estimate negative log-likelihood of the model. It uses importance sampling method with the approximate posterior distribution. This may take a while.
- Parameters
data (torch.Tensor) – The input data from which the log-likelihood should be estimated. Data must be of shape [Batch x n_channels x …]
n_samples (int) – The number of importance samples to use for estimation
batch_size (int) – The batchsize to use to avoid memory issues