Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

End To End Test Suite extended #583

Merged
Merged
Show file tree
Hide file tree
Changes from 49 commits
Commits
Show all changes
61 commits
Select commit Hold shift + click to select a range
8089a0a
comparative values expanded
SuyDesignz Aug 7, 2022
36bdb44
Merge branch 'feature/EndToEndTesting' into feature/endToEndTestingCo…
SuyDesignz Aug 7, 2022
6616335
adding language specific mapper for the json result paths
SuyDesignz Aug 7, 2022
ab47b27
changed static identifier to hashCode. Resources loading made dynamic…
SuyDesignz Aug 7, 2022
36db863
implemented save temporary results with jackson json
SuyDesignz Aug 8, 2022
55191a9
Merge branch 'feature/EndToEndTesting' into feature/endToEndTestingCo…
SuyDesignz Aug 8, 2022
0ec54ed
Merge branch 'feature/EndToEndTesting' into feature/endToEndTestingCo…
SuyDesignz Aug 8, 2022
5485b5f
saving the temporary json results for comparison implemented
SuyDesignz Aug 8, 2022
207b28f
change test identifier from an int to string as SHA256 hash
SuyDesignz Aug 9, 2022
f511e61
mapper implemented for the paths
SuyDesignz Aug 9, 2022
373342e
implementation of storage testing
SuyDesignz Aug 9, 2022
4f633d6
Merge remote-tracking branch 'origin/master' into feature/endToEndTes…
SuyDesignz Aug 9, 2022
87b1133
insert comments and documentation for functions
SuyDesignz Aug 9, 2022
72e4042
Add comments
SuyDesignz Aug 9, 2022
19b9547
Code cleaned up and comments written. New mapper class implemented fo…
SuyDesignz Aug 10, 2022
f418266
removed unused code and functions
SuyDesignz Aug 10, 2022
7e06ab4
ReadMe file extended by "how to add new languages".
SuyDesignz Aug 10, 2022
a5833bd
Merge branch 'endToEndComparativeValueExtension' into feature/endToEn…
SuyDesignz Aug 10, 2022
ef789b7
fixed README file
SuyDesignz Aug 10, 2022
9249ef8
Changing the json format without DefaultPrettyPrinter() to exclude li…
SuyDesignz Aug 11, 2022
71a81e9
fixed formatting
SuyDesignz Aug 11, 2022
4f0080f
changed identifier to tested plagiatism file name
SuyDesignz Aug 14, 2022
7823d89
removed sonar cloud RELIABILITY Bugs
SuyDesignz Aug 14, 2022
c790015
Merge branch 'master' into feature/endToEndTestingComparativeValueExt…
SuyDesignz Aug 14, 2022
4fc1973
removed code smells detected from sonarcloud
SuyDesignz Aug 14, 2022
2f368a7
changed mapper class from HashMap to EnumMap
SuyDesignz Aug 14, 2022
280cec0
removed unused code and added commants for better maintainability
SuyDesignz Aug 14, 2022
cd0ed1b
implemented DynamicTests for OverAllTestCase
SuyDesignz Aug 14, 2022
4cef78e
Rework started over complete endToEndTestSuite
SuyDesignz Aug 15, 2022
3af1e95
added temporary save json and read json to module
SuyDesignz Aug 16, 2022
1a750b8
Merge remote-tracking branch 'origin/master' into feature/endToEndTes…
SuyDesignz Aug 21, 2022
e76365e
removed "result_" from json object
SuyDesignz Aug 21, 2022
d74b362
compare with results implemented.
SuyDesignz Aug 21, 2022
b9d6f51
cleaned code
SuyDesignz Aug 22, 2022
8120219
created comments for new classes and functions
SuyDesignz Aug 22, 2022
2be5a55
removed code smells and Bugs from code detected by sonarcloud
SuyDesignz Aug 22, 2022
96a4b47
changed code format
SuyDesignz Aug 22, 2022
2203e77
Removal of unused code and revision of the naming convention
SuyDesignz Aug 22, 2022
2fcb740
Changed testRun to add compaire errors to the list instead of assertE…
SuyDesignz Aug 22, 2022
9a64147
changed code formatting
SuyDesignz Aug 22, 2022
77e3ad4
removed record constructors and fixed jackson deserialization
SuyDesignz Aug 22, 2022
0fe94b1
added comments for public and private classes
SuyDesignz Aug 22, 2022
f38b95b
removed unnecessary code
SuyDesignz Aug 22, 2022
d0a1699
Merge remote-tracking branch 'origin/master' into feature/endToEndTes…
SuyDesignz Aug 22, 2022
25f7a09
removed code-smells and changed dokumentation in README.md file for n…
SuyDesignz Aug 22, 2022
4c23475
added comment
SuyDesignz Aug 22, 2022
893b264
removed code-smell
SuyDesignz Aug 22, 2022
9d4a120
Made name changes and removed code-smells
SuyDesignz Aug 23, 2022
f12b6b1
changed return type for helper function
SuyDesignz Aug 23, 2022
8a1fee7
Update jplag.endToEndTesting/src/main/java/de/jplag/end_to_end_testin…
SuyDesignz Aug 25, 2022
c3c7f5d
Update jplag.endToEndTesting/src/test/java/de/jplag/end_to_end_testin…
SuyDesignz Aug 25, 2022
4c24b9d
Update jplag.endToEndTesting/src/test/java/de/jplag/end_to_end_testin…
SuyDesignz Aug 25, 2022
8b24abd
Update jplag.endToEndTesting/src/main/java/de/jplag/end_to_end_testin…
SuyDesignz Aug 25, 2022
718c94b
Update jplag.endToEndTesting/src/main/java/de/jplag/end_to_end_testin…
SuyDesignz Aug 25, 2022
d51f7ac
Change Proposed Adjustments. Improvement of the code and readability
SuyDesignz Aug 25, 2022
a5084b5
Merge remote-tracking branch 'origin/master' into feature/endToEndTes…
SuyDesignz Aug 25, 2022
bb7316b
Modified language options class included in the endToEnd testing suite
SuyDesignz Aug 25, 2022
963f341
Changed README file
SuyDesignz Aug 25, 2022
4e87efb
fixed result path
SuyDesignz Aug 25, 2022
b657b4e
removed code smells detected by sonarcloud
SuyDesignz Aug 25, 2022
1efbf81
Functions improved and comments adjusted
SuyDesignz Aug 26, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
156 changes: 112 additions & 44 deletions jplag.endToEndTesting/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,18 +2,17 @@
With the help of the end-to-end module, changes to the detection of JPlag are to be tested.
With the help of elaborated plagiarisms, which have been worked out from suggestions in the literature on the topic of "plagiarism detection and avoidance", a wide range of detectable change can be covered. The selected plagiarisms are the decisive factor here as to whether a change in recognition can be perceived.

### References
## References
These elaborations provide basic ideas on how a modification of the plagiarized source code can look like or be adapted.
These code adaptations refer to a wide range of changes starting from
adding/removing comments to architectural changes in the deliverables.

The following elaborations were used to be able to create the plagiarisms with the largest coverage:
- [Mossad: defeating software plagiarism detection](https://dl.acm.org/doi/abs/10.1145/3428206 "Mossad: defeating software plagiarism detection")
- [Detecting Source Code Plagiarism on Introductory Programming Course Assignments Using a Bytecode Approach - Oscar Karnalim](https://ieeexplore.ieee.org/abstract/document/7910274 "Detecting Source Code Plagiarism on Introductory Programming Course Assignments Using a Bytecode Approach - Oscar Karnalim")
- [Detecting Disguised Plagiarism - Hatem A. Mahmoud](https://arxiv.org/abs/1711.02149 "Detecting Disguised Plagiarism - Hatem A. Mahmoud")
- [Instructor-centric source code plagiarism detection and plagiarism corpus](https://dl.acm.org/doi/abs/10.1145/2325296.2325328 "Instructor-centric source code plagiarism detection and plagiarism corpus")

### Steps Towards Plagiarism
## Steps Towards Plagiarism
The following changes were applied to sample tasks to create test cases:
<ul type="1">
<li>Inserting comments or empty lines (normalization level)</li>
Expand Down Expand Up @@ -47,59 +46,128 @@ More detailed information about the create as well as about the subject to the i

Software is according to [§ 2 of the copyright law](https://www.gesetze-im-internet.de/urhg/__2.html "§ 2 of the copyright law") a protected work which may not be plagiarized.

### JPlag - End To End TestSuite Structure
## JPlag - End To End TestSuite Structure
The construction of an end to end test is done with the help of the JPlag api.
The tests are generated dynamically according to the existing test data and allow the creation of endToEnd tests for all supported languages of JPlag without having to make any changes to the code.
The helper loads the existing test data from the designated directory and creates dynamic tests for the individual directories. It is therefore possible to create different test classes for the different languages.
- JPlagTestSuiteHelper:

``` java
[...]
/**
* This method creates the necessary results as well as models for a test run and summarizes them for a comparison.
* @param testClassNames Plagiarized classes names in the resource directorie which are needed for the test
* @param testIdentifier name of the testId to load and identify the stored results
* @throws IOException is thrown in case of problems with copying the plagiarism classes
* @throws ExitException in case the plagiarism detection with JPlag is preemptively terminated would be of the test.
*/
private void runJPlagTestSuite(String[] testClassNames, int testIdentifier) throws IOException, ExitException {
String functionName = StackWalker.getInstance().walk(stream -> stream.skip(1).findFirst().get()).getMethodName();
TestCaseModel testCaseModel = jplagTestSuiteHelper.createNewTestCase(testClassNames, functionName);
JPlagResult jplagResult = new JPlag(testCaseModel.getJPlagOptionsFromCurrentModel()).run();

for (JPlagComparison jPlagComparison : jplagResult.getAllComparisons()) {
assertEquals(testCaseModel.getCurrentJsonModel().getResultModelById(testIdentifier).getResultSimilarity(), jPlagComparison.similarity(),
"The JPlag results [similarity] do not match the stored values!");
}
}
[...]
public static Map<LanguageOption, Map<String, Path>> getAllLanguageResources()
```

The list of languages created in this way and their associated data are dynamically generated into test cases in the Test Suite.

``` java
Collection<DynamicTest> dynamicOverAllTest()
```
The created plagiarisms are copied with the JPlagTestSuiteHelper into temporary directories, which can then be checked with JPlag.

In order to be able to distinguish in which domain of the recognition changes have occurred, fine granular test cases are used. These are composed of the changes already mentioned above. The plagiarism is compared with the original delivery and thus it is possible to detect and test small sections of the recognition.

The values compared so far to detect a match are limited to the similarity of the JPlag scan. It is already planned to expand the comparative values in order to have more indications of a change.
The comparative values were discussed and tested. The following results of the JPlag scan are used for the comparison:
1. minimal similarity as `float`
2. maximum similarity as `float`
3. matched token numbe as `int`

The Current Discussion is already taking place and can be found at [End to end testing - "comparative values"](https://github.com/jplag/JPlag/issues/548 "End to end testing - \"comparative values\"").
The comparative values were disscussed and elaborated in the issue [End to end testing - "comparative values"](https://github.com/jplag/JPlag/issues/548 "End to end testing - \"comparative values\"").

Additionally it is possible to create several options for the test data. More information about the test options can be found at [JPlag - option variants for the endToEnd tests #590](https://github.com/jplag/JPlag/issues/590 "JPlag - option variants for the endToEnd tests #590"). Currently, various settings are supported by the `minimumTokenMatch`. This can be extended as desired in the record class `Options`.

The current JPlag scans will be compared with the stored ones.
This was done by storing the data in a *.json file which is read at the beginning of each test run.
This was done by storing the data in a *.json file which is read at the beginning of each test run.

``` json
{
[...]
"function_name": "normalizationLevelTest",
"test_results": [
{
"result_similarity": 100,
"test_identifier": 0
},
{
"result_similarity": 100,
"test_identifier": 1
},
{
"result_similarity": 100,
"test_identifier": 2
}
]
}
{
"options" : {
"minimum_token_match" : 1
},
"tests" : {
"SortAlgo-SortAlgo5" : {
"minimal_similarity" : 82.14286,
"maximum_similarity" : 82.14286,
"matched_token_number" : 46
},
"SortAlgo-SortAlgo6" : {
"minimal_similarity" : 83.58209,
"maximum_similarity" : 100.0,
"matched_token_number" : 56
},
"SortAlgo-SortAlgo7" : {
"minimal_similarity" : 96.42857,
"maximum_similarity" : 100.0,
"matched_token_number" : 54
},
[...]
```
---

## Create New Language End To End Tests

This section explains how to create new end to end tests in the existing test suite.
### Creating The Plagiarism
Before you add a new language to the end to end tests i would like to point out that the quality of the tests depends dreadfully on the plagiarism techniques you choose wicht were explaint in sechtion [Steps Towards Plagiarism](#steps-towards-plagiarism).
If you need more information about the creation of plans for this purpose, you can also read the elaborations that can be found under [References](#references).
The more and varied changes you apply, the more accurate the end-to-end tests for the language will be.

In the following an example is shown which is in the JavaEndToEnd tests and is used.

**Changing control structures for(…) to while(…):**

``` java
//base class
public class SortAlgo {
//[...]
public void BubbleSortWithoutRecursion(Integer arr[]) {
for(int i = arr.length; i > 1 ; i--) {
for(int innerCounter = 0; innerCounter < arr.length-1; innerCounter++)
{
if (arr[innerCounter] > arr[innerCounter + 1]) {
swap(arr, innerCounter, (innerCounter + 1));
}
}
}
}
//[...]
}
```

``` java
//created plagiarism
public class SortAlgo5{
//[...]
public void BubbleSortWithoutRecursion(Integer arr[]) {
int i = arr.length;
while(i > 1)
{
int innerCounter = 0;
while(innerCounter < arr.length -1)
{
if (arr[innerCounter] > arr[innerCounter + 1]) {
swap(arr, innerCounter, (innerCounter + 1));
}
innerCounter++;
}
i--;
}
}
//[...]
}
```
### Copying Plagiarism To The Resources

The plagiarisms created in [Creating The Plagiarism](#creating-the-plagiarism) must now be copied to the corresponding resources folder. It is important not to mix the languages of the plagiarisms or to copy the data into bottle resource paths.

- At the path `JPlag\jplag.endToEndTesting\src\test\resources\languageTestFiles` a new folder for the language should be created if it does not already exist. For example `[...]\resources\languageTestFiles\JAVA`. If you have plagiarized several different code samples, you can also create additional subfolders under the newly created folder for example `[...]\resources\languageTestFiles\JAVA\sortAlgo`.

It is important to note that the resource folder name must be exactly the same as the language option in JPlag. Otherwise the language option cannot be parsed correctly to the enum-type.
- JAVA
- PYTHON_3
- C_CPP
- C_SHARP
- GO_LANG
- KOTLIN
- R_LANG
- SCALA

Once the tests have been run for the first time, the information for the tests is stored in the folder `..\target\testing-directory-submission\LANGUAGE`. This data can be copied to the path `[...]\resources\results\LANGUAGE`. Each subdirectory gets its own result json file as `[...]\resources\results\JAVA\sortAlgo.json`. Once the test data has been copied, the endToEnd tests can be successfully tested. As soon as a change in the detection takes place, the results will differ from the stored results and the tests will fail if the results have changed.
Original file line number Diff line number Diff line change
Expand Up @@ -12,21 +12,24 @@ private TestDirectoryConstants() {
}

/**
* Name for the folder to copy the submissions to in order to test them with JPlag
* Create the complete path to the temporary result files. Here the temporary system path is extended with the
* "RESULT_DIRECTORY_NAME", which is predefined in this class.
*/
public static final String TEMPORARY_DIRECTORY_NAME = "testing-directory-submission";
public static final Path TEMPORARY_RESULT_DIRECTORY_NAME = Path.of("target", "testing-directory-temporary-result");

/**
* Create the complete path to the submission files. Here the temporary system path is extended with the
* "TEMPORARY_DIRECTORY_NAME", which is predefined in this class.
* Base path to the saved results
*/
public static final String TEMPORARY_SUBMISSION_DIRECTORY_NAME = Path.of("target", TEMPORARY_DIRECTORY_NAME).toString();
public static final Path BASE_PATH_TO_RESULT_JSON = Path.of("src", "test", "resources", "results");

/**
* Base path to the created plagiarism and the main file located in the project resources.
* Base path to the endToEnd testing resources
*/
public static final Path BASE_PATH_TO_JAVA_RESOURCES_SORTALGO = Path.of("src", "test", "resources", "java", "sortAlgo");
public static final Path BASE_PATH_TO_LANGUAGE_RESOURCES = Path.of("src", "test", "resources", "languageTestFiles");

/**
* Base path to the saved results of the previous tests in a *.json file
* Create the complete path to the submission files. Here the temporary system path is extended with the
* "SUBMISSION_DIRECTORY_NAME", which is predefined in this class.
*/
public static final Path BASE_PATH_TO_JAVA_RESULT_JSON = Path.of("src", "test", "resources", "results", "JavaResult.json");
public static final Path TEMPORARY_SUBMISSION_DIRECTORY_NAME = Path.of("target", "testing-directory-submission");
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,149 @@
package de.jplag.end_to_end_testing.helper;

import java.io.File;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;

import de.jplag.end_to_end_testing.constants.TestDirectoryConstants;
import de.jplag.options.LanguageOption;

/**
* Helper class to perform all necessary operations or functions on files or folders.
*/
public class FileHelper {

private FileHelper() {
// private constructor to prevent instantiation
}

/**
* Merges all contained filenames together without extension
* @param files whose names are to be merged
* @return merged filenames
*/
public static String getEnclosedFileNamesFromCollection(Collection<File> files) {
StringBuilder stringBuilder = new StringBuilder(files.size());
for (File file : files) {
String fileName = file.getName();
stringBuilder.append(fileName.substring(0, fileName.lastIndexOf('.')));
}

return stringBuilder.toString();
SuyDesignz marked this conversation as resolved.
Show resolved Hide resolved
}

/**
* Load all possible language is resource path
SuyDesignz marked this conversation as resolved.
Show resolved Hide resolved
* @param directoryNames folder names for which the language options should be listed.
* @return list of all LanguageOptions included in the resource path
*/
public static List<LanguageOption> getLanguageOptionsFromPath(String[] directoryNames) {
List<LanguageOption> returnList = new ArrayList<>();
for (String languageDirectoryName : directoryNames) {
returnList.add(LanguageOption.valueOf(languageDirectoryName));
}
return returnList;
SuyDesignz marked this conversation as resolved.
Show resolved Hide resolved
}

/**
* @param directorieRoot path from which all folders should be loaded
* @return all folders found in the specified path
*/
public static String[] getAllDirectoriesInPath(Path directorieRoot) {
return directorieRoot.toFile().list((dir, name) -> new File(dir, name).isDirectory());
}

/**
* Copies the passed filenames to a temporary path to use them in the tests
* @param classNames for which the test case is to be created
* @return paths created to the test submissions
* @throws IOException Exception can be thrown in cases that involve reading, copying or locating files.
*/
public static String[] createNewTestCaseDirectory(String[] classNames) throws IOException {
// Copy the resources data to the temporary path
String[] returnSubmissionPath = new String[classNames.length];
for (int counter = 0; counter < classNames.length; counter++) {
Path originalPath = Path.of(classNames[counter]);
returnSubmissionPath[counter] = Path
.of(TestDirectoryConstants.TEMPORARY_SUBMISSION_DIRECTORY_NAME.toString(), "submission" + (counter + 1)).toAbsolutePath()
.toString();
Path copiePath = Path.of(TestDirectoryConstants.TEMPORARY_SUBMISSION_DIRECTORY_NAME.toString(), "submission" + (counter + 1),
SuyDesignz marked this conversation as resolved.
Show resolved Hide resolved
originalPath.getFileName().toString());

File directory = new File(copiePath.toString());
if (!directory.exists()) {
directory.mkdirs();
}
Files.copy(originalPath, copiePath, StandardCopyOption.REPLACE_EXISTING);
}
return returnSubmissionPath;
}

/**
* Delete directory with including files
* @param folder Path to a folder or file to be deleted. This happens recursively to the path
* @throws IOException if an I/O error occurs
*/
public static void deleteCopiedFiles(File folder) throws IOException {
if (folder.exists()) {
File[] files = folder.listFiles();
if (files != null) { // some JVMs return null for empty dirs
for (File file : files) {
if (file.isDirectory()) {
deleteCopiedFiles(file);
} else {
Files.delete(file.toPath());
}
}
}
Files.delete(folder.toPath());
}
SuyDesignz marked this conversation as resolved.
Show resolved Hide resolved
}

/**
* Creates directory if it dose not exist
* @param directory to be created
* @throws IOException if the directory could not be created
*/
public static void createDirectoryIfItDoseNotExist(File directory) throws IOException {
SuyDesignz marked this conversation as resolved.
Show resolved Hide resolved
if (!directory.exists() && !directory.mkdirs()) {
throw new IOException(createNewIOExceptionStringForFileOrFOlderCreation(directory));
}
}

/**
* Creates file if it dose not exist
* @param file to be created
* @throws IOException if the file could not be created
*/
public static void createFileIfItDoseNotExist(File file) throws IOException {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Typo: 'does'

if (!file.exists() && !file.createNewFile()) {
throw new IOException(createNewIOExceptionStringForFileOrFOlderCreation(file));
}
}

/**
* @param resourcenPaths list of paths that lead to test resources
* @return all filenames contained in the paths
*/
public static String[] loadAllTestFileNames(Path resourcenPaths) {
var files = resourcenPaths.toFile().listFiles();
String[] fileNames = new String[files.length];
for (int i = 0; i < files.length; i++) {
fileNames[i] = files[i].getName();
}
return fileNames;
Comment on lines +129 to +133
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
String[] fileNames = new String[files.length];
for (int i = 0; i < files.length; i++) {
fileNames[i] = files[i].getName();
}
return fileNames;
return Arrays.stream(files).map(File::getName).toArray(length -> new String[length]);

}

/**
* @param file for which the exception text is to be created
* @return exception text for the specified file
*/
private static String createNewIOExceptionStringForFileOrFOlderCreation(File file) {
return "The file/folder at the location [" + file.toString() + "] could not be created!";
}
}
Loading