Commit ebad963
Author: Alexey Gavryushin
Initial commit (0 parents)
26 files changed: +4361, -0 lines

README.md (+52 lines)
# deep-clustering-recombination-framework

Recombination framework for deep-learning–based clustering methods, based on "An ontology for systematization and recombination of deep-learning–based clustering methods".

## How to use

1. Configure one or more deep-learning–based clustering methods using JSON files that follow the JSON schema in ``schema/method.json``. Instructions can be found in the ``description`` properties of the respective JSON properties and objects in the schema files. For definitions of the terms used there, and for further elaboration, see the aforementioned ontology.
2. Run ``main.py`` with the configuration files of the methods to process as arguments (all paths are interpreted relative to the location of ``main.py``). Any directories passed as arguments are searched non-recursively, and any JSON files found in them are processed.
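The directory-expansion behavior in step 2 can be sketched as follows; ``collect_config_paths`` is a hypothetical helper for illustration, not the actual code of ``main.py``:

```python
from pathlib import Path

def collect_config_paths(args):
    """Expand command-line arguments into a list of JSON configuration files.

    Directories are searched non-recursively (only JSON files directly
    inside them are picked up); plain file arguments are taken as-is.
    """
    paths = []
    for arg in args:
        p = Path(arg)
        if p.is_dir():
            # Non-recursive: glob, not rglob.
            paths.extend(sorted(p.glob("*.json")))
        else:
            paths.append(p)
    return paths
```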

## Arguments for ``main.py``

``--one_log_file``: path to a single log file to use (if omitted, a separate log file is created for each processed method)

``--no_log_files``: do not create log files

``--no_log_timestamps``: do not prefix log messages with timestamps

``--resume_on_error``: if an exception is raised while processing a method, continue with the next method instead of crashing

## Cluster assignment strategies

The following cluster assignment strategies are currently implemented:

* based on the output of a sample-space classifier
* based on the output of a feature-space classifier
* classical clustering in feature space after training (using k-means)
* based on feature-space centroids calculated during training (using soft assignments)
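For the centroid-based strategy, a minimal pure-Python sketch of soft assignment via the Student's t kernel (as used in DEC, https://arxiv.org/abs/1511.06335) looks as follows; the function name and ``alpha`` parameter are illustrative, not the framework's API:

```python
def soft_assignments(z, centroids, alpha=1.0):
    """Soft-assign one feature vector z to clusters with a Student's t kernel:
    q_j is proportional to (1 + ||z - mu_j||^2 / alpha) ** (-(alpha + 1) / 2),
    normalized so the assignments over all clusters sum to 1.
    """
    sims = []
    for mu in centroids:
        dist_sq = sum((zi - mi) ** 2 for zi, mi in zip(z, mu))
        sims.append((1.0 + dist_sq / alpha) ** (-(alpha + 1.0) / 2.0))
    total = sum(sims)
    return [s / total for s in sims]
```

A sample sitting on a centroid receives most of the assignment mass, while distant centroids receive heavy-tailed but small weights.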

## Trainable mappings

In general, trainable mappings can be constructed with layers from any class in the ``torch.nn`` module of the ``torch`` Python package.
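One way such a layer configuration entry might be turned into a ``torch.nn`` layer is a name lookup on the module, sketched below under the assumption that ``torch`` is installed; ``build_layer`` is a hypothetical illustration, not the framework's API:

```python
import torch.nn as nn

def build_layer(cfg):
    """Instantiate a torch.nn layer from a config dict such as
    {"type": "Linear", "in_features": 784, "out_features": 500}.
    Any class in torch.nn can be looked up by its "type" name.
    """
    kwargs = {k: v for k, v in cfg.items() if k not in ("type", "name")}
    # JSON lists become tuples where a size argument is expected.
    if "unflattened_size" in kwargs:
        kwargs["unflattened_size"] = tuple(kwargs["unflattened_size"])
    return getattr(nn, cfg["type"])(**kwargs)

# The first few layers of the encoder configured below:
encoder = nn.Sequential(
    build_layer({"type": "Flatten"}),
    build_layer({"type": "Linear", "in_features": 784, "out_features": 500}),
    build_layer({"type": "ReLU"}),
)
```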

## Design patterns

JSON schema definitions of design patterns, as described in the ontology, can be found in ``schema/design_patterns``. Currently, the following design patterns are implemented:

* training a feature extractor through reconstruction of samples
* training a feature extractor by using adversarial interpolation
* facilitating the training of a feature extractor by using layer-wise pretraining (variant using a denoising autoencoder)
* learning transformation-invariant feature representations by using contrastive learning and data augmentation (variant using SimCLR)
* learning invariance of soft assignments to transformations by using assignment statistics vectors and data augmentation
* encouraging cluster formation by minimizing the divergence between the current cluster assignment distribution and a derived target distribution
* encouraging cluster formation by reinforcing the current assignment of samples to clusters (variants in feature space, in sample space using a decoder, and based on soft assignments)
* preventing cluster degeneracy by maximizing the entropy of soft assignments
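The divergence-minimization pattern derives its target distribution by sharpening the current soft assignments, as in DEC. A minimal pure-Python sketch, with an illustrative function name:

```python
def target_distribution(q):
    """Derive the sharpened target distribution P from soft assignments Q
    (one row per sample, one column per cluster), DEC-style:
    p_ij = (q_ij**2 / f_j) / sum over j' of (q_ij'**2 / f_j'),
    where f_j = sum over i of q_ij (soft cluster frequencies).
    Minimizing KL(P || Q) then pulls samples toward confident assignments.
    """
    n_clusters = len(q[0])
    freq = [sum(row[j] for row in q) for j in range(n_clusters)]
    p = []
    for row in q:
        # Squaring emphasizes confident assignments; dividing by the
        # frequency counteracts the dominance of large clusters.
        weights = [row[j] ** 2 / freq[j] for j in range(n_clusters)]
        total = sum(weights)
        p.append([w / total for w in weights])
    return p
```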

## Datasets

Note that the recombination framework is currently limited to processing datasets from the ``torchvision.datasets`` module of the ``torchvision`` Python package.

## Exemplary method configurations

Some configurations of deep-learning–based clustering methods, as discussed in "An ontology for systematization and recombination of deep-learning–based clustering methods", can be found in subdirectories of the ``configurations`` directory. Details can be found in each method's configuration file.

* ``dec_cc_hybrid``: This method uses the standard ``784-500-500-2000-10`` encoder (introduced in https://arxiv.org/abs/1511.06335) as its feature extractor. As its cluster assignment strategy, it computes soft assignments based on a Student's t kernel measuring the similarity between feature representations and feature-space centroids. Furthermore, it uses the "training a feature extractor through reconstruction of samples" design pattern during its pretraining phase. During its finetuning phase, it uses the "learning transformation-invariant feature representations by using contrastive learning and data augmentation", "learning invariance of soft assignments to transformations by using assignment statistics vectors and data augmentation", and "preventing cluster degeneracy by maximizing the entropy of soft assignments" design patterns (all three from the method in https://arxiv.org/abs/2009.09687), as well as the "encouraging cluster formation by minimizing the divergence between the current cluster assignment distribution and a derived target distribution" design pattern.\
Achieves a performance of ACC 0.978 (97.8 ± 0.2), NMI 0.944 (94.4 ± 1.0), ARI 0.952 (95.2 ± 0.4) evaluated on MNIST-Test after training on MNIST-Train, and ACC 0.583 (58.3 ± 1.5), NMI 0.633 (63.3 ± 0.6), ARI 0.470 (47.0 ± 1.5) on Fashion-MNIST-Test after training on Fashion-MNIST-Train. The values in parentheses are means and standard deviations over 10 runs. Configuration files are provided for evaluation on both MNIST-Test and Fashion-MNIST-Test.
* ``deep_k_means``: Recreation of the DKM-a method (introduced in https://arxiv.org/abs/1806.10069) on MNIST. This method uses the aforementioned ``784-500-500-2000-10`` encoder as its feature extractor. As its cluster assignment strategy, it computes soft assignments based on a Gaussian kernel measuring the similarity between feature representations and feature-space centroids, using an annealed inverse temperature as described in https://arxiv.org/abs/1806.10069. Furthermore, it uses the feature-space–based variant of the "encouraging cluster formation by reinforcing the current assignment of samples to clusters" design pattern, as well as the "training a feature extractor through reconstruction of samples" design pattern, during its single training phase.
* ``layer_wise_pretraining``: A method trained solely using the denoising-autoencoder–based variant of the "facilitating the training of a feature extractor by using layer-wise pretraining" design pattern. The method uses the "classical clustering in feature space after training" cluster assignment strategy, with the aforementioned ``784-500-500-2000-10`` encoder as its feature extractor. It is intended to test the correct implementation of the layer-wise pretraining design pattern and can serve as a basis for constructing a method that follows a pretraining-finetuning training schedule.
* ``adversarial_interpolation_pretraining``: A method trained solely using the "training a feature extractor by using adversarial interpolation" design pattern, with the aforementioned ``784-500-500-2000-10`` encoder as its feature extractor. The method uses the "classical clustering in feature space after training" cluster assignment strategy. It is intended to test the correct implementation of the adversarial interpolation design pattern and can serve as a basis for constructing a method that follows a pretraining-finetuning training schedule.
@@ -0,0 +1,136 @@
{
  "$schema": "../../schema/method.json",
  "name": "adversarial_interpolation_pretraining",
  "mappings": [
    {
      "name": "encoder",
      "type": "feature_extractor",
      "layers": [
        { "name": "enc_flatten_1", "type": "Flatten" },
        { "name": "enc_linear_2", "type": "Linear", "in_features": 784, "out_features": 500 },
        { "name": "enc_relu_3", "type": "ReLU" },
        { "name": "enc_linear_4", "type": "Linear", "in_features": 500, "out_features": 500 },
        { "name": "enc_relu_5", "type": "ReLU" },
        { "name": "enc_linear_6", "type": "Linear", "in_features": 500, "out_features": 2000 },
        { "name": "enc_relu_7", "type": "ReLU" },
        { "name": "enc_linear_8", "type": "Linear", "in_features": 2000, "out_features": 10 }
      ]
    },
    {
      "name": "decoder",
      "type": "design_pattern_specific",
      "layers": [
        { "name": "dec_linear_1", "type": "Linear", "in_features": 10, "out_features": 2000 },
        { "name": "dec_relu_2", "type": "ReLU" },
        { "name": "dec_linear_3", "type": "Linear", "in_features": 2000, "out_features": 500 },
        { "name": "dec_relu_4", "type": "ReLU" },
        { "name": "dec_linear_5", "type": "Linear", "in_features": 500, "out_features": 500 },
        { "name": "dec_relu_6", "type": "ReLU" },
        { "name": "dec_linear_7", "type": "Linear", "in_features": 500, "out_features": 784 },
        { "name": "dec_unflatten_8", "type": "Unflatten", "dim": 1, "unflattened_size": [1, 28, 28] }
      ]
    },
    {
      "name": "critic",
      "type": "design_pattern_specific",
      "layers": [
        { "type": "Flatten" },
        { "type": "Linear", "in_features": 784, "out_features": 500 },
        { "type": "ReLU" },
        { "type": "Linear", "in_features": 500, "out_features": 500 },
        { "type": "ReLU" },
        { "type": "Linear", "in_features": 500, "out_features": 2000 },
        { "type": "ReLU" },
        { "type": "Linear", "in_features": 2000, "out_features": 10 },
        { "type": "Unflatten", "dim": 1, "unflattened_size": [1, 10] },
        { "type": "AvgPool1d", "kernel_size": 10 },
        { "type": "Flatten" }
      ]
    }
  ],
  "phases": [
    {
      "name": "pretraining",
      "order": 1,
      "exit_criteria": { "iterations": 50000 },
      "save_mapping_parameters": [
        { "mapping_name": "encoder", "saving_interval": 1000, "path_to_file_or_dir": "pretrained/", "keep_old_files": true },
        { "mapping_name": "decoder", "saving_interval": 1000, "path_to_file_or_dir": "pretrained/", "keep_old_files": true },
        { "mapping_name": "critic", "saving_interval": 1000, "path_to_file_or_dir": "pretrained/", "keep_old_files": true }
      ],
      "design_patterns": [
        {
          "pattern": "training_feature_extractor_through_reconstruction_of_samples",
          "encoder_name": "encoder",
          "decoder_name": "decoder",
          "loss_optimizer_group_name": "ae_optimizer_group",
          "loss_report_interval": 500
        },
        {
          "pattern": "training_feature_extractor_by_using_adversarial_interpolation",
          "encoder_name": "encoder",
          "decoder_name": "decoder",
          "critic_name": "critic",
          "autoencoder_loss_optimizer_group_name": "ae_optimizer_group",
          "critic_loss_optimizer_group_name": "critic_optimizer_group",
          "autoencoder_loss_weight": 0.5,
          "loss_report_interval": 500
        }
      ],
      "optimizers": [
        {
          "type": "SGD",
          "group_name": "ae_optimizer_group",
          "lr": 0.001,
          "momentum": 0.9,
          "trained_mappings": ["encoder", "decoder"]
        },
        {
          "type": "SGD",
          "group_name": "critic_optimizer_group",
          "lr": 0.001,
          "momentum": 0.9,
          "trained_mappings": ["critic"]
        }
      ]
    },
    {
      "name": "evaluation",
      "order": 2,
      "exit_criteria": { "iterations": 1 },
      "performance_evaluation_interval": 1
    }
  ],
  "cluster_assignment_strategy": {
    "type": "feature_representation_centroid_similarity",
    "similarity_measure": "student_t",
    "use_centroids_during_phases": ["evaluation"],
    "centroid_initialization_strategy": "classical_clustering",
    "centroid_initialization_classical_clustering_method": "k_means",
    "centroid_recalculation_strategy": "fixed_centroids"
  },
  "datasets": [
    {
      "name": "MNIST-Train",
      "dataset": "MNIST",
      "root": "../../datasets/mnist",
      "train": true,
      "download": true,
      "batch_size": 256,
      "num_clusters": 10,
      "phases": ["pretraining"]
    },
    {
      "name": "MNIST-Test",
      "dataset": "MNIST",
      "root": "../../datasets/mnist",
      "train": false,
      "download": true,
      "batch_size": 256,
      "num_clusters": 10,
      "phases": ["evaluation"],
      "reinitialize_mappings": false
    }
  ],
  "training_device": "cuda_if_available"
}
@@ -0,0 +1,182 @@
{
  "$schema": "../../schema/method.json",
  "name": "dec_cc_hybrid_fashion_mnist",
  "mappings": [
    {
      "name": "encoder",
      "type": "feature_extractor",
      "layers": [
        { "name": "enc_flatten_1", "type": "Flatten" },
        { "name": "enc_linear_2", "type": "Linear", "in_features": 784, "out_features": 500 },
        { "name": "enc_relu_3", "type": "ReLU" },
        { "name": "enc_linear_4", "type": "Linear", "in_features": 500, "out_features": 500 },
        { "name": "enc_relu_5", "type": "ReLU" },
        { "name": "enc_linear_6", "type": "Linear", "in_features": 500, "out_features": 2000 },
        { "name": "enc_relu_7", "type": "ReLU" },
        { "name": "enc_linear_8", "type": "Linear", "in_features": 2000, "out_features": 10 }
      ]
    },
    {
      "name": "decoder",
      "type": "design_pattern_specific",
      "layers": [
        { "name": "dec_linear_1", "type": "Linear", "in_features": 10, "out_features": 2000 },
        { "name": "dec_relu_2", "type": "ReLU" },
        { "name": "dec_linear_3", "type": "Linear", "in_features": 2000, "out_features": 500 },
        { "name": "dec_relu_4", "type": "ReLU" },
        { "name": "dec_linear_5", "type": "Linear", "in_features": 500, "out_features": 500 },
        { "name": "dec_relu_6", "type": "ReLU" },
        { "name": "dec_linear_7", "type": "Linear", "in_features": 500, "out_features": 784 },
        { "name": "dec_unflatten_8", "type": "Unflatten", "dim": 1, "unflattened_size": [1, 28, 28] }
      ]
    },
    {
      "name": "instance_level_contrastive_head",
      "type": "design_pattern_specific",
      "prior_mapping_name": "encoder",
      "layers": [
        { "type": "Linear", "in_features": 10, "out_features": 512 },
        { "type": "ReLU" },
        { "type": "Linear", "in_features": 512, "out_features": 128 }
      ]
    }
  ],
  "phases": [
    {
      "name": "pretraining",
      "order": 1,
      "exit_criteria": { "iterations": 50000 },
      "save_mapping_parameters": [
        { "mapping_name": "encoder", "saving_interval": 1000, "path_to_file_or_dir": "pretrained/", "keep_old_files": true },
        { "mapping_name": "decoder", "saving_interval": 1000, "path_to_file_or_dir": "pretrained/", "keep_old_files": true }
      ],
      "design_patterns": [
        {
          "pattern": "training_feature_extractor_through_reconstruction_of_samples",
          "encoder_name": "encoder",
          "decoder_name": "decoder",
          "loss_report_interval": 500
        }
      ],
      "optimizers": [
        {
          "type": "SGD",
          "lr": 0.001,
          "momentum": 0.9,
          "trained_mappings": ["encoder", "decoder"]
        }
      ]
    },
    {
      "name": "finetuning",
      "order": 2,
      "exit_criteria": { "iterations": 100000 },
      "performance_evaluation_interval": 500,
      "save_mapping_parameters": [
        { "mapping_name": "encoder", "saving_interval": 1000, "path_to_file_or_dir": "./", "keep_old_files": true },
        { "mapping_name": "instance_level_contrastive_head", "saving_interval": 1000, "path_to_file_or_dir": "./", "keep_old_files": true }
      ],
      "save_centroids": { "path_to_file_or_dir": "./", "saving_interval": 1000, "keep_old_files": true },
      "design_patterns": [
        {
          "pattern": "learning_feature_representations_by_using_contrastive_learning_and_data_augmentation",
          "contrastive_learning_head_name": "instance_level_contrastive_head",
          "batch_augmentation_name_1": "batch_augmentation_1",
          "batch_augmentation_name_2": "batch_augmentation_2",
          "temperature_parameter": 0.5,
          "loss_report_interval": 500
        },
        {
          "pattern": "learning_invariance_to_transformations_by_using_assignment_statistics_vectors_and_data_augmentation",
          "batch_augmentation_name_1": "batch_augmentation_1",
          "batch_augmentation_name_2": "batch_augmentation_2",
          "temperature_parameter": 1.0,
          "loss_report_interval": 500
        },
        {
          "pattern": "encouraging_cluster_formation_by_minimizing_divergence_between_current_and_target_cluster_assignment_distribution",
          "loss_weight": 0.5,
          "target_distribution_recalculation_interval": 140,
          "loss_report_interval": 500
        },
        {
          "pattern": "preventing_cluster_degeneracy_by_maximizing_entropy_of_soft_assignments",
          "batch_augmentation_name": "batch_augmentation_1",
          "loss_report_interval": 500
        },
        {
          "pattern": "preventing_cluster_degeneracy_by_maximizing_entropy_of_soft_assignments",
          "batch_augmentation_name": "batch_augmentation_2",
          "loss_report_interval": 500
        }
      ],
      "optimizers": [
        {
          "type": "SGD",
          "lr": 0.001,
          "momentum": 0.9,
          "trained_mappings": ["encoder", "instance_level_contrastive_head"],
          "optimizes_centroids": true
        }
      ]
    },
    {
      "name": "evaluation",
      "order": 3,
      "exit_criteria": { "iterations": 1 },
      "performance_evaluation_interval": 1
    }
  ],
  "cluster_assignment_strategy": {
    "type": "feature_representation_centroid_similarity",
    "similarity_measure": "student_t",
    "use_centroids_during_phases": ["finetuning", "evaluation"],
    "centroid_initialization_strategy": "classical_clustering",
    "centroid_initialization_classical_clustering_method": "k_means",
    "centroid_recalculation_strategy": "recalculation_by_design_pattern"
  },
  "datasets": [
    {
      "name": "Fashion-MNIST-Train",
      "dataset": "FashionMNIST",
      "root": "../../datasets/fashion_mnist",
      "train": true,
      "download": true,
      "batch_size": 256,
      "num_clusters": 10,
      "phases": ["pretraining", "finetuning"],
      "batch_augmentations": [
        {
          "name": "batch_augmentation_1",
          "transforms": [
            { "type": "RandomPerspective", "distortion_scale": 0.2, "p": 1.0 },
            { "type": "RandomAffine", "degrees": 0, "translate": [0.1, 0.1], "scale": [0.75, 1.25] },
            { "type": "ColorJitter", "brightness": [0.7, 1.15], "contrast": [0.85, 1.15] },
            { "type": "ToTensor" }
          ]
        },
        {
          "name": "batch_augmentation_2",
          "transforms": [
            { "type": "RandomPerspective", "distortion_scale": 0.2, "p": 1.0 },
            { "type": "RandomAffine", "degrees": 0, "translate": [0.1, 0.1], "scale": [0.75, 1.25] },
            { "type": "ColorJitter", "brightness": [0.7, 1.15], "contrast": [0.85, 1.15] },
            { "type": "ToTensor" }
          ]
        }
      ]
    },
    {
      "name": "Fashion-MNIST-Test",
      "dataset": "FashionMNIST",
      "root": "../../datasets/fashion_mnist",
      "train": false,
      "download": true,
      "batch_size": 256,
      "num_clusters": 10,
      "phases": ["evaluation"],
      "reinitialize_mappings": false
    }
  ],
  "training_device": "cuda_if_available"
}
