-
Notifications
You must be signed in to change notification settings - Fork 2
Home
Welcome to the rmc-workflow-scripts wiki!
The following is RMC's documentation on how to use the data triage scripts and what they are doing behind the scenes. RMC technicians perform this activity in our BitCurator environment.
-
Open a terminal window and navigate to where disk image triage scripts live.
```````````cd Desktop/rmc-workflow-scripts/disk-image-triage ``````````` -
Copy each disk image and log file into folder based on the media number.
-
In terminal window, run the following script. Replace RMA04133 with the collection number you've been working with.
```````````python3 organize-dirs.py ~/Desktop/RMA04133``````````` -
If they are large files (CDs, DVDs), this will take some time. When it is finished, spot check to make sure all files look like they've been successfully copied.
-
- Aggregate Guymager log files into a single csv file. CSV file is saved under directory "organized"
-
In terminal window, run the following script. Again, replace RMA04133 with the collection number you've been working with.
```````````python3 guymager-log-md.py ~/Desktop/RMA04133``````````` - CSV file is saved under directory "organized"
-
CSV output columns and where it’s pulling data from and meaning:
-
rmc_accession: guymager Case Number (technician inputted)
- rmc_item_number: guymager Evidence number (technician inputted)
- technician: guymager Examiner (technician inputted)
- file_path: guymager Image path (technician inputted)
- disk_image_format: guymager Format (technician selected)
- acquire_date: Ended (guymager calculates from time stamp of process end time)
- acquisition_time: how long it took to image and verify (guymager calculates and script translates)
- raw_md5_hash: MD5 hash (hash value of raw disk image, not of e0x file)
- number_sector_errors: if bad sector errors are listed
-
-
- Extract raw disk image from .e0x files, identify what file system(s) are represented on media, and dumps values into csv file
-
In terminal window, run the following script. Again, replace RMA04133 with the collection number you've been working with.
```````````python3 filesystem-id.py ~/Desktop/RMA04133``````````` - CSV file is saved under directory "organized"
-
- Run Bulk Extractor over raw disk images to identify potential level 1 data.
-
In terminal window, run the following script. Again, replace RMA04133 with the collection number you've been working with.
```````````python3 level1-data.py ~/Desktop/RMA04133``````````` - A folder gets created under each media folder for the bulk extractor output .txt files. These can be used to review potential level1 data.
- CSV file is saved under directory "organized"
- CSV output columns and where it’s pulling data from and meaning:
- alerts.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- ccn_track2.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- ccn.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- pii.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- telephone.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- level_1: 0 if alerts.txt, ccn_track2.txt, cnn.txt, or pii.txt are 0; otherwise 1 (means there may be Level 1 data and technician should review)
- level_2: 0 if telephone.txt is 0; otherwise 1 (means there may be telephone numbers and technician may wish to review)
-
-
Combine CSV output files into single file for inclusion into digital accession spreadsheet
-
In terminal window, run the following script. Again, replace RMA04133 with the collection number you've been working with.
```````````python3 merge-outputs.py ~/Desktop/RMA04133/``````````` -
CSV output columns and where it’s pulling data from and meaning:
-
rmc_accession: guymager Case Number (technician inputted)
- rmc_item_number: guymager Evidence number (technician inputted)
- technician: guymager Examiner (technician inputted)
- file_path: guymager Image path (technician inputted)
- disk_image_format: guymager Format (technician selected)
- acquire_date: Ended (guymager calculates from time stamp of process end time)
- acquisition_time: how long it took to image and verify (guymager calculates and script translates)
- raw_md5_hash: MD5 hash (hash value of raw disk image, not of e0x file)
- number_sector_errors: if bad sector errors are listed
- alerts.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- ccn_track2.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- ccn.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- pii.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- telephone.txt: 0 if file is 0 bytes; 1 if file is greater than 0 bytes
- level_1: 0 if alerts.txt, ccn_track2.txt, cnn.txt, or pii.txt are 0; otherwise 1 (means there may be Level 1 data and technician should review)
- level_2: 0 if telephone.txt is 0; otherwise 1 (means there may be telephone numbers and technician may wish to review)
- exx_total_size: size of .e0x file
-
-