V9 Shields Sensitive
Fields Simultaneously Transform and
Secure Transaction
Data
Do your files, tables or reports contain private
information about people, such as names, social security numbers,
account or phone numbers? Whether at rest or in motion, that data
may be at risk for identify theft.
To prevent the growing
problem of data privacy breaches, Innovative Routines International
(IRI), Inc. released CoSort Version
9 last quarter with
auditable, field-level
security for sensitive files.
CoSort's Sort Control
Language (SortCL) program adds strong encryption for fields and
records -- along with other protections -- to its parallel
data transformation and reporting functionality.
To accommodate your role-based access controls
(RBAC) framework, you can choose the
fields to protect, and the protection method for each field. For
example, you can sort on and encrypt the social security number
column, pseudonymize last names, group by and then mask the phone
number, and standardize the address field -- all in the same job
script and I/O pass. Consider how you can use the security features below in your
data integration, staging, and BI
operations:
Encryption &
Decryption AES-256 Is Just One of Many
Options in CoSort's SortCL
Tool
Encryption
is the process of transforming information into a meaningless
"ciphertext" form. Only those with the proper key can decrypt and
read the information. CoSort's SortCL tool now has the ability to
encrypt data at the field level while manipulating one or more
sequential files of any size or format. You can use SortCL's
built-in 256-bit Advanced Encryption Standard (AES) algorithm, or
link in your own encryption and decryption routine as a custom,
field-level transformation. You can use different encryption keys or
routines on different fields at the same time, as your security
rules require.
Masking: Anonymization &
Pseudonymization Obfuscate
Fields with Gibberish or Mask Them with Lookup
Values
Data masking refers generically
to any process that replaces real data with fake data.
CoSort’s SortCL tool can mask data through
anonymization and pseudonymization, as well as through data-type
conversion, alternate character masking, and custom,
field-level transformations.
Anonymization
is the process of removing information from a data source (e.g.
file) that can be used to backtrack to an actual person. Once
anonymized, the data cannot be linked to any source. Unlike
filtering and encryption, this technique allows the original field
layout (position, size, and data type) to remain the same, and look
realistic in test data environments. CoSort’s SortCL tool can
obfuscate fields using the other techniques listed above, as well as
through random (safe test) data generation, mathematical
expressions, character shifting and bit
manipulation.
Pseudonymization is the process of
replacing a personally identifiable field (like a patient's name) in
a data record with an artificial identifier (a pseudonym). Unlike
anonymization, this technique allows data to be traced back to its
origin, and the resulting hash or lookup value can appear very real.
SortCL uses lookup files to substitute pseudonyms for real field
values.
De-Identification &
Re-Identification CoSort's SortCL Also Includes
Field Encoding and
Decoding
De-identifcation
is the process of removing personally identifying data from a
record. In the medical and insurance industries, HIPAA regulations
require the protection of both obvious and remotely connected
patient information. CoSort's SortCL tool can
also de-identify field data with a special encoding routine where
the code holder can re-identify the field. To re-identify the field,
specify the same code in another job script.
De-identification can
also occur through the techniques above or through filtering
(described below).
Filtering
(Redaction) Just Don't Output the Sensitive
Fields
Filtering
is the process of removing (redacting) those fields or records
within your input files that do not need to be in the load files,
hand-offs, or custom report formats that SortCL can output. Specify
only the output fields you desire, or use condition logic to filter
records, according to you business
rules.
Safe Test
Data Randomly Generate or Select
Field Values
Instead
Safe
test data is different than traditional test data, which are
inherently unsafe snippets of real production files or tables. The
test data generated by CoSort V9's SortCL
tool, or the CoSort Test Data (RowGen)
product, looks real, but is safe because it comes from random value
generation and/or randomized lookups. Simultaneous transformation
and custom reporting of the test data is also possible, and the
metadata is the same between products. You can therefore use your
file layouts for both data transformation and test data
generation.
Contact
Us! Have a Question or Want a Free
30-Day
Trial?
Click here
to complete the on-line information and trial request form, or here
to reach an IRI
representative.
|
More
Data Privacy Advice
Click
here to read "5 Tips for Protecting Sensitve Data" by health
insurance data processing expert Darick Jarvis of EDIWatch,
Inc.
Click
here to read the IRI Article "Mitigating Data Risk: An
Introduction to Privacy Breach Prevention." An overview of the
risk and costs of private data exposure is combined with a number of
specific recommendations for protecting sensitive, personally
identifiable information.
Click
here to read the IRI White Paper "Making Data Safe for
Compliance and Outsourcing ... at the Field Level ... while
processing ... while presenting." This document contains a
consolidated working example (i.e. a SortCL job specification file)
that processes and protects data in 2 input files, using 1 set file,
and creating 5 output files in the same pass.
CoSort
Security in the News
A google
search shows the coverage we've been getting.
Unique
Data Security Benefits
In addition to a
choice of protections, there are several other
benefits to CoSort data privacy, including:
*
Cost - Field-level security ships with CoSort so
you don't pay additional fees, or need additional software or tools
to secure private data. * Precision -
Field-level security means your files and everything around them can
stay unencrypted and thus "open for business." *
Interoperability - CoSort's SortCL program runs on
all Unix, Linux and Windows platforms, via command-line, batch, API
call, or Java GUI. * Convenience - Field-level
privacy functions run in the same job script and I/O pass with
SortCL data transformation and reporting; i.e. you can protect your
data at its source, during processing and presentation. *
Flexibility - By working on portable flat files
(i.e. .txt, .csv, .sam), as well as index, LDIF and XML data, you
can protect data at rest and in motion, before and after data are
loaded into databases or reporting (BI) tools, or outsourced. *
Auditability - SortCL jobs can create an XML audit
trial so you can query and report on the application details to
prove how and when sensitive fields were protected, and thus verify
compliance with industry and government data privacy regulations
like HIPAA. * Prototyping - SortCL allows you to
continue processing and sharing protected files by displaying
ciphertext and other protected fields with printable characters, and
by generating (safe) test data in any field within real file
formats.
Supported
File Formats
CoSort secures columns in the following
file formats:
- ACUCOBOL-GT Vision
- CLF/ELF Web Logs
- Microsoft CSV
- LDIF (LDAP)
- MF COBOL (I-SAM & Variable)
- Line, Record and Variable Sequential
- Variable Block
- VSAM (in UniKix)
- XML (flat)
In order to protect, transform, and report
on flat file data using CoSort (or generate test data using RowGen),
you must specify the layout of your input and output files in the
field definition language of SortCL. Click
here for metadata conversion information if your files are
defined in other parameter formats. |