
Data Sanitization with RPA

Client: Communications Security Establishment
Professor(s): SuCheng Lee, Adesh Shah
Program: Mobile Application Design & Development
Students: Keith Gardiner, Jesse McCoy, Aiden Carrie, Taranbeer Singh, Bharti Bharti, Harpreet Kaur

Project Description:

Our client wanted an accurate, automated way to redact sensitive information from classified documents. Our solution uses UiPath to build a robotic process automation (RPA) workflow that scans documents for keywords and redacts them. The keywords come from a blacklist file, in which the user lists every word they want found and redacted.
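As a rough illustration of the keyword redaction idea, here is a minimal Python sketch. It is not the UiPath workflow itself: the function name redact_keywords, the "[REDACTED]" mask, and the choice of case-insensitive whole-word matching are all assumptions made for this example.

```python
import re


def redact_keywords(text, blacklist, mask="[REDACTED]"):
    """Replace every whole-word, case-insensitive match of a blacklist term.

    Hypothetical helper: the matching rules are assumptions for this sketch,
    not the behaviour of the project's UiPath workflow.
    """
    # Drop empty entries and try longer terms first so that a phrase wins
    # over a shorter term it contains.
    terms = sorted({t.strip() for t in blacklist if t.strip()}, key=len, reverse=True)
    if not terms:
        return text
    # The lookarounds keep a term from matching inside a longer word.
    pattern = re.compile(
        r"(?<!\w)(?:" + "|".join(re.escape(t) for t in terms) + r")(?!\w)",
        re.IGNORECASE,
    )
    return pattern.sub(mask, text)
```

For example, redact_keywords("Project Falcon met agent Smith.", ["falcon", "Smith"]) masks both terms but would leave a word such as "Falconry" untouched.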
The process begins with the creation and encryption of the blacklist file. The user creates the blacklist file (both .txt and .csv formats are accepted), fills it with the words to be redacted, then runs the program and chooses the blacklist file to encrypt. Once the user enters the correct password, the program encrypts the file, and it is ready for use.
To run the automation, the user must first decrypt the blacklist file they wish to use. The program prompts the user to choose a blacklist file, and once they enter the correct password, the file is decrypted and loaded into the program.
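To make the loading step more concrete, the sketch below parses blacklist text that has already been decrypted; the decryption itself is deliberately left out. The layout is an assumption: one term per line for .txt and one term per cell for .csv. The name parse_blacklist is hypothetical.

```python
import csv
import io


def parse_blacklist(content, file_format):
    """Return the list of terms found in already-decrypted blacklist text.

    Assumed layout (not confirmed by the project): one term per line for
    "txt", one term per cell for "csv".
    """
    if file_format == "csv":
        rows = csv.reader(io.StringIO(content))
        terms = [cell for row in rows for cell in row]
    elif file_format == "txt":
        terms = content.splitlines()
    else:
        raise ValueError(f"unsupported blacklist format: {file_format}")

    # Trim whitespace, skip blanks, and drop case-insensitive duplicates
    # while keeping the order in which terms first appear.
    seen = set()
    result = []
    for term in terms:
        term = term.strip()
        if term and term.lower() not in seen:
            seen.add(term.lower())
            result.append(term)
    return result
```

Normalizing the list at load time keeps the redaction step simple, since it can then assume a clean list of non-empty terms.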
In addition to the blacklist, our program can redact images, names, and social insurance numbers (SINs). Name redaction works the same way as the blacklist: the user creates a names list file and enters the names they want redacted.
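One way SIN redaction could work is sketched below in Python. It assumes SINs appear as nine digits, optionally grouped in threes by spaces or hyphens, and it uses the Luhn checksum (which valid SINs satisfy) to avoid masking unrelated nine-digit numbers. Whether the project applies such a check is not stated; the names and the mask are hypothetical.

```python
import re

# Nine digits, optionally grouped 3-3-3 with spaces or hyphens, and not
# embedded in a longer run of digits.
SIN_PATTERN = re.compile(r"(?<!\d)\d{3}[- ]?\d{3}[- ]?\d{3}(?!\d)")


def passes_luhn(digits):
    """Return True if a string of digits satisfies the Luhn checksum."""
    total = 0
    for index, char in enumerate(reversed(digits)):
        value = int(char)
        if index % 2 == 1:  # double every second digit from the right
            value *= 2
            if value > 9:
                value -= 9
        total += value
    return total % 10 == 0


def redact_sins(text, mask="XXX-XXX-XXX"):
    """Mask nine-digit numbers that look like SINs and pass the Luhn check."""
    def replace(match):
        digits = re.sub(r"\D", "", match.group())
        return mask if passes_luhn(digits) else match.group()

    return SIN_PATTERN.sub(replace, text)
```

The checksum trades a little recall for fewer false positives: a mistyped SIN would be left in place, so a stricter policy might mask every nine-digit match instead.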
Once this is all done, the program is ready for redaction. It prompts the user to choose the file to redact; .txt, .docx, .pdf and .csv files are currently supported. The user is also given options to redact images, names, and SINs. Once everything is selected, the program opens the document and redacts anything entered in the blacklist file, along with whatever the selected options cover. The program then saves the result as a new, redacted copy of the original file.
Our program also supports redacting entire folders instead of single files. The process is exactly the same; the user just chooses the folder option instead of the file option. This feature lets the user redact multiple files of different formats without having to select them one by one.
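The folder option can be pictured as a loop over the supported file types. The Python sketch below is only an illustration: redact_folder, the "_redacted" naming scheme, and the redact_file callback (standing in for whatever performs the per-format redaction) are assumptions, and the scan is not recursive.

```python
from pathlib import Path

# File types the project description lists as supported.
SUPPORTED = {".txt", ".docx", ".pdf", ".csv"}


def redacted_path(source):
    """Hypothetical naming scheme: report.pdf -> report_redacted.pdf."""
    return source.with_name(f"{source.stem}_redacted{source.suffix}")


def redact_folder(folder, redact_file):
    """Call redact_file(source, destination) for each supported file.

    redact_file is a caller-supplied function (an assumption of this sketch)
    that writes a redacted copy of source to destination. Returns the list
    of destination paths.
    """
    outputs = []
    for source in sorted(Path(folder).iterdir()):
        if not source.is_file() or source.suffix.lower() not in SUPPORTED:
            continue
        # Skip copies produced by an earlier run.
        if source.stem.endswith("_redacted"):
            continue
        destination = redacted_path(source)
        redact_file(source, destination)
        outputs.append(destination)
    return outputs
```

Passing the per-file redaction in as a function keeps the folder logic independent of the file format, so the same loop works for every supported type.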
Features aside, one key aspect of our program is that it doesn't use a database or any other external service, which keeps it secure, lightweight, and easy to use. Not only does this solution save our client money, it also saves employees valuable time otherwise spent on mundane, repetitive tasks.

Short Description:

Our solution takes classified documents of varying types and sanitizes their contents using a custom encrypted blacklist file.
