There are four de-identification tools that are generally available. Beyond those four, the tools that exist are internal to organizations and therefore not generally available, or have been developed for personal use (by researchers) and therefore have not been applied broadly.
The four generally available de-identification tools are PARAT, CAT, mu-Argus, and the UTD toolbox.
The only tool that is commercially available and actively supported is PARAT from Privacy Analytics. Another useful point of comparison is that the algorithm implemented in PARAT has been shown in a recent article to perform better than the algorithm implemented in CAT (see http://www.jamia.org/cgi/content/short/16/5/670). Furthermore, the risk estimator used in PARAT has been shown to produce more accurate de-identification results than the one incorporated in mu-Argus (see http://www.jamia.org/cgi/content/abstract/15/5/627).
The UTD toolbox includes some of the same algorithms as CAT. It provides a set of capabilities rather than a tool that is ready for use by an end user (e.g., an analyst), and is therefore targeted more at developers.
We also spent some time evaluating the CAT tool. It has a significant number of usability issues (for example, we were unable to find where to define the value of k for the k-anonymity algorithm, it was not possible to view data by equivalence class, and the data views repeated the same record id every 60 records), and it was unable to import standard data files. The lack of documentation and support made the tool difficult to use. While it may have been adequate for completing a Master's thesis project, it clearly lacked important functionality for broader use.
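To make concrete what "defining k" and "viewing data by equivalence class" refer to, here is a minimal Python sketch of the general k-anonymity idea. It is not taken from CAT or any of the other tools: the function names, field names, and sample records are all hypothetical and chosen only for illustration. An equivalence class is taken here to be the group of records that share the same values on the quasi-identifiers, and a data set is treated as k-anonymous when every such class contains at least k records.

```python
from collections import defaultdict


def equivalence_classes(records, quasi_identifiers):
    """Group records by their combination of quasi-identifier values."""
    classes = defaultdict(list)
    for record in records:
        key = tuple(record[q] for q in quasi_identifiers)
        classes[key].append(record)
    return dict(classes)


def is_k_anonymous(records, quasi_identifiers, k):
    """Return True if every equivalence class has at least k records."""
    classes = equivalence_classes(records, quasi_identifiers)
    return all(len(members) >= k for members in classes.values())


# Hypothetical sample data, invented for this illustration only.
records = [
    {"age_range": "30-39", "postal_prefix": "K1H", "diagnosis": "asthma"},
    {"age_range": "30-39", "postal_prefix": "K1H", "diagnosis": "diabetes"},
    {"age_range": "40-49", "postal_prefix": "K2P", "diagnosis": "asthma"},
]
quasi_identifiers = ["age_range", "postal_prefix"]

# View the data by equivalence class, with the size of each class.
for key, members in equivalence_classes(records, quasi_identifiers).items():
    print(key, len(members))

# The user-chosen threshold k decides whether the data set passes.
print(is_k_anonymous(records, quasi_identifiers, k=2))
```

In this sample, the single record in the second class is what prevents the data set from meeting k = 2, which is the kind of information a per-class view would let an analyst inspect.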
Note that de-identification tools are different from masking tools. The attached document provides an overview of de-identification techniques and explains at some length the differences between these two approaches and when each is more suitable.
The author(s) retain all copyright to this knowledgebase article. Please include a citation to the web page if you reuse this material. More information is available at our lab web site: http://www.ehealthinformation.ca/.