This repository contains the Julia Flux implementation of the Invariant Statistical Loss (ISL) proposed in the paper Training Implicit Generative Models via an Invariant Statistical Loss, published in the AISTATS 2024 conference.
Please, if you use this code, cite the article:
@inproceedings{de2024training,
title={Training Implicit Generative Models via an Invariant Statistical Loss},
author={de Frutos, Jos{\'e} Manuel and Olmos, Pablo and Lopez, Manuel Alberto Vazquez and M{\'\i}guez, Joaqu{\'\i}n},
booktitle={International Conference on Artificial Intelligence and Statistics},
pages={2026--2034},
year={2024},
organization={PMLR}
}
The ISL
repository is organized into several directories that encapsulate different aspects of the project, ranging from the core source code and custom functionalities to examples demonstrating the application of the project's capabilities, as well as testing frameworks to ensure reliability.
-
CustomLossFunction.jl
: This file contains implementations of the ISL custom loss function tailored for the models developed within the repository. -
ISL.jl
: Serves as the main module file of the repository, this file aggregates and exports the functionalities developed inCustomLossFunction.jl
.
-
time_series_predictions/
: This subdirectory showcases how the ISL project's models can be applied to time series prediction tasks. -
Learning1d_distribution/
: Focuses on the task of learning 1D distributions with the ISL.
runtests.jl
: This script is responsible for running automated tests against theISL.jl
module.
To install ISL, simply use Julia's package manager. The module is not registered so you need to clone the repository and follow the following steps:
julia> push!(LOAD_PATH,pwd()) # You are in the ISL Repository
julia> using ISL
To reproduce the enviroment for compiling the repository:
(@v1.9) pkg> activate pathToRepository/ISL
If you want to use any utility subrepository like GAN or DeepAR, make sure it's within your path.
After installing the package, you can immediately start experimenting with the examples provided in the examples/
directory. These examples are designed to help you understand how to apply the package for different statistical learning tasks, including learning 1-D distributions and time series prediction.
The following example demonstrates how to learn a 1-D distribution from a benchmark dataset. It utilizes a simple neural network architecture for both the generator and the discriminator within the context of an Invariant Statistical Learning (ISL) framework.
# Example file: examples/Learning1d_distribution/benchmark_unimodal.jl
using ISL # Import the ISL module
include("../utils.jl") # Include necessary utilities
@test_experiments "N(0,1) to N(23,1)" begin
# Define the generator and discriminator networks with ELU activation
gen = Chain(Dense(1, 7), elu, Dense(7, 13), elu, Dense(13, 7), elu, Dense(7, 1))
# Set up the noise and target models
noise_model = Normal(0.0f0, 1.0f0)
target_model = Normal(4.0f0, 2.0f0)
n_samples = 10000
# Configure the hyperparameters for the ISL
hparams = AutoISLParams(;
max_k=10, samples=1000, epochs=1000, η=1e-2, transform=noise_model
)
# Prepare the dataset and the loader for training
train_set = Float32.(rand(target_model, hparams.samples))
loader = Flux.DataLoader(train_set; batchsize=-1, shuffle=true, partial=false)
# Train the model using the ISL
auto_invariant_statistical_loss(gen, loader, hparams)
# Visualize the learning results
plot_global(
x -> quantile.(-target_model, cdf(noise_model, x)),
noise_model,
target_model,
gen,
n_samples,
(-3:0.1:3),
(-2:0.1:10),
)
end
This section showcases two examples: predicting a univariate time series with an AutoRegressive model and forecasting electricity consumption. These examples illustrate the application of ISL for time series analysis.
- Example 1: AutoRegressive Model Prediction
@test_experiments "testing AutoRegressive Model 1" begin
# --- Model Parameters and Data Generation ---
# Define AR model parameters
ar_hparams = ARParams(;
ϕ=[0.5f0, 0.3f0, 0.2f0], # Autoregressive coefficients
x₁=rand(Normal(0.0f0, 1.0f0)), # Initial value from a Normal distribution
proclen=2000, # Length of the process
noise=Normal(0.0f0, 0.2f0), # Noise in the AR process
)
# Define the recurrent and generative models
recurrent_model = Chain(RNN(1 => 10, relu), RNN(10 => 10, relu))
generative_model = Chain(Dense(11, 16, relu), Dense(16, 1, identity))
# Generate training and testing data
n_series = 200 # Number of series to generate
loaderXtrain, loaderYtrain, loaderXtest, loaderYtest = generate_batch_train_test_data(
n_series, ar_hparams
)
# --- Training Configuration ---
# Define hyperparameters for time series prediction
ts_hparams = HyperParamsTS(;
seed=1234,
η=1e-3, # Learning rate
epochs=n_series,
window_size=1000, # Size of the window for prediction
K=10, # Hyperparameter K (if it has a specific use, add a comment)
)
# Train model and calculate loss
loss = ts_invariant_statistical_loss_one_step_prediction(
recurrent_model, generative_model, loaderXtrain, loaderYtrain, ts_hparams
)
# --- Visualization ---
# Plotting the time series prediction
plot_univariate_ts_prediction(
recurrent_model,
generative_model,
collect(loaderXtrain)[2], # Extract the first batch for plotting
collect(loaderXtest)[2], # Extract the first batch for plotting
ts_hparams;
n_average=1000, # Number of predictions to average
)
end
- Example 2: Electricity Consumption Forecasting
@test_experiments "testing electricity-c" begin
# Data loading and preprocessing
csv_file_path = "examples/time_series_predictions/data/LD2011_2014.txt"
# Specify custom types for certain columns to ensure proper data handling
column_types = Dict(
"MT_005" => Float32,
"MT_006" => Float32,
"MT_007" => Float32,
"MT_008" => Float32,
"MT_168" => Float32,
"MT_333" => Float32,
"MT_334" => Float32,
"MT_335" => Float32,
"MT_336" => Float32,
"MT_338" => Float32,
)
df = DataFrame(
CSV.File(csv_file_path; delim=';', header=true, decimal=',', types=column_types)
)
# Hyperparameters setup
hparams = HyperParamsTS(; seed=1234, η=1e-2, epochs=2000, window_size=1000, K=10)
# Model definition
rec = Chain(RNN(1 => 3, relu), RNN(3 => 3, relu))
gen = Chain(Dense(4, 10, identity), Dense(10, 1, identity))
# Training and testing data setup
start, num_training_data, num_test_data = 35040, 35040, 1000
# Aggregate time series data for training and testing
selected_columns = [
"MT_005",
"MT_006",
"MT_007",
"MT_008",
"MT_168",
"MT_333",
"MT_334",
"MT_335",
"MT_336",
"MT_338",
]
loader_xtrain, loader_ytrain, loader_xtest = aggregate_time_series_data(
df, selected_columns, start, num_training_data, num_test_data
)
# Model training
losses = []
@showprogress for _ in 1:100
loss = ts_invariant_statistical_loss(
rec, gen, loader_xtrain, loader_ytrain, hparams
)
append!(losses, loss)
end
# Forecasting
τ = 24
xtrain = collect(loaderXtrain)[1]
xtest = collect(loaderXtest)[1]
prediction, stds = ts_forecast(
rec, gen, xtrain, xtest, τ; n_average=1000, noise_model=Normal(0.0f0, 1.0f0)
)
plot(prediction[1:τ])
plot!(xtest[1:τ])
end
Prediction τ = 24 (1 day)
Prediction τ = 168 (7 days)
For further information: [email protected]