Implement #334: Integrate cross-project experiment into pipeline #346

M8is · 2018-04-17T06:45:55Z

Implement #334: Integrate cross-project experiment into pipeline

@salsolatragus I thought you may want to check the current state. The tasks are now integrated into the pipeline and should work the same way as before (plus pipeline features like the default config). I've set them up to be called separately, but that is easy to reconfigure.

salsolatragus

Provide option --with-xp for ex2 and ex3. If chosen, provide example projects as training data.

salsolatragus · 2018-04-19T12:52:31Z

mubench.pipeline/tasks/implementations/crossproject_prepare.py

+
+class CrossProjectPrepareTask:
+    MAX_SUBTYPES_SAMPLE_SIZE = 25
+    MAX_PROJECT_SAMPLE_SIZE = 50


Move these two constants to config parameters. Use same names.

salsolatragus · 2018-04-19T13:09:46Z

mubench.pipeline/utils/config_util.py

@@ -238,6 +249,32 @@ def __add_run_ex3_subprocess(available_detectors: List[str], available_datasets:
    __setup_run_arguments(experiment_parser, available_detectors)


+def __add_run_cross_project_create_index(available_datasets: List[str], subparsers) -> None:


Implicitly run create index with prepare.

salsolatragus · 2018-04-19T13:14:00Z

mubench.pipeline/utils/config_util.py

+    __setup_filter_arguments(parser, available_datasets)
+
+
+def __add_run_cross_project_prepare(subparsers) -> None:


Provide as top-level task ./mubench checkout-xp.

salsolatragus · 2018-05-03T13:03:23Z

Try collecting results from AsterskTasks for passing prepared xp data on to subsequent tasks. If this does not work (easily), we persist indexes version-wise and load these after preparation.

M8is · 2018-05-04T09:42:18Z

Turns out we really didn't need anything more since we can write one index per version. Woops.

Well, now that it's implemented we might as well keep it. It isn't that complicated and seems to be a useful feature. Essentially, we can now split something up into arbitrarily small tasks and just accumulate the results. At this point, I'm wondering if TaskRunner is Turing complete...

Anyway, we can now run ex2 and ex3 --with-xp, which prepares the cross project examples and passes all sources paths to the detector via the training sources argument.

M8is · 2018-05-04T09:45:19Z

@salsolatragus I just remembered that the solution I've implemented now is exactly what you suggested. There's definitely a lot of space for improvements, but it should work for now.

salsolatragus

Only 2 minors!

salsolatragus · 2018-05-14T12:09:58Z

mubench.pipeline/tasks/implementations/detect_all_findings.py

        return {
            key_detector_mode: DetectAllFindingsTask.__DETECTOR_MODE,
            key_target_src_paths: version_compile.original_sources_paths,
            key_target_classes_paths: version_compile.original_classes_paths,
-            key_dependency_classpath: version_compile.get_full_classpath()
+            key_dependency_classpath: version_compile.get_full_classpath(),
+            key_training_src_path: xp_sources_paths


What happens if there is no xp paths.

salsolatragus · 2018-05-14T12:11:03Z

mubench.pipeline/tasks/implementations/detect_provided_correct_usages.py

@@ -40,7 +47,7 @@ def _get_findings_path(self, detector: Detector, version: ProjectVersion, misuse
    def _get_detector_arguments(version_compile: VersionCompile, misuse_compile: MisuseCompile):
        return {
            key_detector_mode: DetectProvidedCorrectUsagesTask.__DETECTOR_MODE,
-            key_training_src_path: misuse_compile.correct_usage_sources_path,
+            key_training_src_path: [misuse_compile.correct_usage_sources_path],


Revert this.

M8is · 2018-05-15T07:16:11Z

This should be ready now.
Note: if you want to use this in the detectors, you'll have to catch the java.io.FileNotFoundException: no training source path provided. What is happening is pretty clear from the exception message at least. We'd have to release another mubench.cli version to add a default.

M8is self-assigned this Apr 17, 2018

M8is requested a review from salsolatragus April 17, 2018 06:45

salsolatragus reviewed Apr 19, 2018

View reviewed changes

M8is force-pushed the cross-project-experiment branch from e447a83 to 4cd544b Compare April 20, 2018 06:12

Mattis Manfred Kämmerer added 10 commits April 20, 2018 12:00

Integrate cross-project create index into pipeline

001d207

Integrate cross-project prepare into pipeline

3711fd4

Integrate cross-project create project list into pipeline

dd53a05

Check if file exists before reading

8ed94ad

Require boa credentials

c534a0c

Remove unused constant

a2b2f70

Setup cross-project parameters last

3a14e36

Pass sample size limit via config argument

30f2c98

Merge create index and project list into prepare

3c99b27

Add cross-project prepare as top-level task checkout-xp

bd16a52

M8is force-pushed the cross-project-experiment branch from 24009e2 to bd16a52 Compare April 20, 2018 10:00

salsolatragus changed the base branch from master-dev to master April 20, 2018 14:31

Mattis Manfred Kämmerer added 6 commits May 2, 2018 15:01

Move reading cross project index to a separate task

5216ee2

Add cross project option for ex1 and ex3

6248eb0

Require boa credentials only on --with-xp flag

251e60a

Add task for skipping cross project prepare

19fd900

Always provide CrossProjectSourcesPaths in ex1 and ex3

33153e8

Use cross project sources for detection in ex1

e39dbd2

Mattis Manfred Kämmerer added 7 commits May 4, 2018 10:27

Fix test setup and formatting

881429f

Accumulate results of leaf tasks

de0c1b6

Remove xp from ex1

006aed3

Remove obsolete code

49d2946

Add cross project support for ex2 and ex3

0350447

Remove cross project args from run ex1

1f8b6e3

Add cross project args to run ex2

c389c51

Mattis Manfred Kämmerer added 3 commits May 4, 2018 11:39

Collect misuses to create xp index

006f572

Write index file per project version

977f216

Fix a test

05b1720

salsolatragus requested changes May 14, 2018

View reviewed changes

Mattis Manfred Kämmerer added 2 commits May 15, 2018 08:43

Revert a change

0f0082d

Add training paths only if xp sources are provided

d7a8e32

salsolatragus approved these changes May 17, 2018

View reviewed changes

salsolatragus self-assigned this May 29, 2018

salsolatragus force-pushed the master branch 3 times, most recently from 8f1f1a5 to edd7d88 Compare September 20, 2019 10:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement #334: Integrate cross-project experiment into pipeline #346

Implement #334: Integrate cross-project experiment into pipeline #346

M8is commented Apr 17, 2018

salsolatragus left a comment

salsolatragus Apr 19, 2018

salsolatragus Apr 19, 2018

salsolatragus Apr 19, 2018

salsolatragus commented May 3, 2018

M8is commented May 4, 2018

M8is commented May 4, 2018

salsolatragus left a comment

salsolatragus May 14, 2018

salsolatragus May 14, 2018

M8is commented May 15, 2018

		@@ -238,6 +249,32 @@ def __add_run_ex3_subprocess(available_detectors: List[str], available_datasets:
		__setup_run_arguments(experiment_parser, available_detectors)


		def __add_run_cross_project_create_index(available_datasets: List[str], subparsers) -> None:

		__setup_filter_arguments(parser, available_datasets)


		def __add_run_cross_project_prepare(subparsers) -> None:

Implement #334: Integrate cross-project experiment into pipeline #346

Are you sure you want to change the base?

Implement #334: Integrate cross-project experiment into pipeline #346

Conversation

M8is commented Apr 17, 2018

salsolatragus left a comment

Choose a reason for hiding this comment

salsolatragus Apr 19, 2018

Choose a reason for hiding this comment

salsolatragus Apr 19, 2018

Choose a reason for hiding this comment

salsolatragus Apr 19, 2018

Choose a reason for hiding this comment

salsolatragus commented May 3, 2018

M8is commented May 4, 2018

M8is commented May 4, 2018

salsolatragus left a comment

Choose a reason for hiding this comment

salsolatragus May 14, 2018

Choose a reason for hiding this comment

salsolatragus May 14, 2018

Choose a reason for hiding this comment

M8is commented May 15, 2018