Email not displaying correctly? View it in your browser.
CoSort (IRI, Inc.)

Version 9 is Here!

Announcing a New Platform for High-Volume Data Transformation and Protection

Innovative Routines International (IRI), Inc. has released CoSort Version 9 for Unix, Linux and Windows servers. Major upgrades make Version 9 a single-pass platform for manipulating and managing large volumes of data, and a suite of solutions for: data transformation, reporting, security, conversion, governance; and, for speeding and testing applications. New features in CoSort Version 9 include:

1. Auditable Field-Level Protections for Compliance and Outsourcing
2. Up to 50% Improvement in High-Volume Sort Performance
3. Multi-File Joins and Dimensional Lookups
4. Multi-Byte (Asian) Character Collation and Conversion
5. Flat XML, LDIF, I-SAM and Other File Format Translations
6. Custom Field Functions (e.g. Data Cleansing)
7. Perl-Compatible Regular Expressions (PCRE)
8. Safe Test Data Generation in Real File/Report Formats
9. Dashboard Option for Business Intelligence

Version 9 users can direct CoSort’s main interface – SortCL – to re-host legacy sorts and data, and to replace less efficient methods of flat-file data processing (e.g. filters, transforms), presentation (reporting), protection (e.g. encryption, de-identification), and prototyping. SortCL can run all these functions, including most features listed above, in one pass. Also available are sort plug-ins and thread-safe API calls to accelerate other software.

Current CoSort users can obtain Version 9 release notes from their IRI representative. Version 9 is a major new release (chargeable upgrade). If you are not currently using CoSort, and are interested in a free 30-day evaluation, please click here.

 


Field Protections for Data at Risk

256-bit AES Is Just One of Many Data Security Options Built Into SortCL

Database or disk storage platforms are targets for hackers and malicious entities. However, operational and analytic data are exposed throughout IT operations. Data in motion between repositories is most at risk. According to Data Governance Institute President Gwen Thomas, "Companies have to protect their sensitive data. That means protecting it whether it's being stored, moved, or manipulated. Protecting data in motion can be a real challenge."

CoSort V9 delivers a panoply of solutions dedicated to the protection of data in flat files moving to and from databases, emails, spreadsheets, laptops, etc. As data architects filter, transform, manipulate and reformat files at the field-level, they can now protect data at risk with one or more integrated security features, including:

  • Anonymization/obfuscation
  • Encryption (256-bit AES and more)
  • De-identification
  • Redaction/filtering
  • Safe test data synthesis

Specifically, CoSort's "SortCL" tool allows you to specify native or custom security functions on each field, and at two different phases within each job script. Having CoSort V9 licensed on Unix, Linux and Windows machines offers many benefits to CISOs and compliance managers (and their IT staff), including:

  • A choice of data protections applied to fields on a need-to-know basis
  • Portable (flat-file) protection that is not limited to one database
  • Protection for data in motion and at rest (secure data to/from the DB)
  • Availability of non-sensitive database, file, and disk information
  • Combined protections in the same job script and I/O pass with high-volume data transformation, reporting, and test data synthesis
  • A query-ready XML audit trail to help verify compliance with privacy rules, perform limited data forensics, and support your risk and controls framework

See also: http://mcsv.net/cgi-bin/redir?MCid=ubl4bsBeWu[UNIQID]


Multi-File Joins and Lookups

First Again in the Sort Market

IRI first introduced join functionality to the sort market for data warehousing and reporting needs in 1999, with SortCL in CoSort V7. Since then, IRI delivered more speed and innovations like "JOIN ONLY". Now with CoSort V9, SortCL features:

  • Multi-file joins (3 or more extracted tables)
  • Joins over pre-sorted or unsorted files
  • Joins based on multiple conditions
  • Mix and match multiple join types
  • Multi-dimensional lookup (set) files

These powerful join features allow you to reduce database workloads and optimize large data warehouse ETL operations. SortCL continues to allow simultaneous aggregation, calculation, conversion, and multi-target output formatting.

V9's new field-matching functionality also includes multi-level value lookups on flat files. Lookup transformations during the action and output phases of your job script -- and value substitutions made from columns in delimited lookup files -- are designed to save time over direct computations and complex join operations.

See also: http://mcsv.net/cgi-bin/redir?MCid=g72Sbcq9lQ[UNIQID]
http://mcsv.net/cgi-bin/redir?MCid=Zx8Btecz6e[UNIQID]


Multi-Byte Collation & Conversion

Sorting Native Characters in Double-Byte Order

CoSort V9 now makes faster sorting and processing available for Asian data sets using double- and multi-byte characters. This is necessary for accurate permutation and conversion of large flat files containing Chinese, Japanese, and Korean key fields represented in native formats like BIG5, Shift_JIS and eHangul, respectively, among others. Please contact your IRI agent for more details.


Field-Level Functions, Custom Data Cleansing Support

User Libraries Enable Complex Transformations

CoSort's SortCL tool has always supported field filtering and scrubbing using built-in commands and user conditions. But now there are more possibilities for integrated, custom data cleansing. Data cleansing is important because it helps make field values and formats accurate and consistent with related data sets throughout your systems.

CoSort V9 now supports custom field-level transformations during the inrec and outfile phases of a SortCL job script. This means you can run custom transform functions - like data cleansing - on every field value, up to two times per job. Integrated field-level functions are important because they can - like so many other data manipulations - be combined in the same SortCL job script and I/O pass. This saves steps and time in the design and execution of high-volume transformation, reporting, and protection jobs.

Based on your business needs and modeling rules, you can plug in your own functions or those in data quality vendor libraries. Just specify your library paths at the top of the script, and your functions at the field level. You can also combine cleansing with scrubbing and bulk data reduction so you can simultaneously remove or save duplicate records, and include or omit others based on business and compliance rules.

See also: http://mcsv.net/cgi-bin/redir?MCid=XHvyDPFX5E[UNIQID]


Manipulate and Convert Large XML Files

Need XML Files From Other, Large Flat File Formats? Try CoSort 9

Although XML is an increasingly popular file interchange format, it has not been a practical structure for large files. That's because a 30GB XML file may contain 20GB worth of tags. And, until now, there simply has been no efficient way to rapidly convert, process, protect, or create large transaction files in XML. However, that changes with the new CoSort V9 package.

CoSort's SortCL tool can now process, convert, and create large, flat XML files. You can direct SortCL to filter, transform and output valid, well formed XML files that contain structured data from XML or any other supported flat file source -- of any size. The reverse is also true; SortCL can process huge, flat XML files and output the data in other formats, including ACUCOBOL Vision, CSV, ISAM, Sequential, and ELF.

See also: http://mcsv.net/cgi-bin/redir?MCid=uTlU0IwqG0[UNIQID]


Manipulate and Convert Large LDIF Files

CoSort Puts DAP Data Under Your Control

LDIF is an interchange format for representing LDAP (Lightweight Directory Access Protocol) contents and update requests. LDAP directories hold information with similar attributes, organized both logically and hierarchically -- e.g. an address book sorted by name, with email and phone data attached. While LDIF records may hold large volumes of customer and transaction information, LDIF records are in a format that most applications cannot readily import or process. CoSort's SortCL tool can now process LDIF data, and simultaneously convert files in LDIF to other file formats and vice versa. As usual, your SortCL job script would reference or specify your field layouts, and the input and output file types. For example, on input, the declaration under input file could be /PROCESS=LDIF, and on output it could be /PROCESS=CSV.

"The Comcast Data Engineering and Management Integration (DEMI) organization works with 10 terabytes of LDAP data on a daily basis as we work to distribute business critical information resources to the rest of the company. The fact is we would not be able to pull this off successfully without CoSort. It accurately and quickly processes billions of rows of DAP data and allows us to join and analyze this information in connection with our other data warehouse processes. No other tool gives us this much speed and flexibility and allows the processing of the volume of flat-file LDIF records to be achieved. The very talented CoSort team worked directly with us to develop their module and was able to turn it around very quickly. In turn they have developed a long-term customer relationship with America's largest Cable Operator and a large (50 TB) and growing data warehouse."

See also: http://mcsv.net/cgi-bin/redir?MCid=2uTpOoOGoK[UNIQID]


Create Safe Test Data in Production Formats

CoSort 9 Can Also Synthesize Prototype Files in SortCL Jobs

In addition to simultaneous data processing (transformations), presentation (reporting), and protection (field-level encryption, etc.) functionality, CoSort V9's SortCL tool also supports data prototyping.

SortCL can now generate safe test data in real file and report formats. You can create output fields in any supported data type and layout using randomly-generated or randomly-selected values. Test data allows you to:

  • Benchmark hardware and software
  • Populate test databases
  • Prototype and develop applications
  • Stress-test programs with fuller data ranges
  • Safely outsource real file formats


The CoSort Test Data Tool, RowGen, is also available for standalone generation, transformation, and presentation of safe test data in production file and report formats. RowGen is a lower cost product because it does not transform real data. But the same data definition and manipulation metadata work across both RowGen and SortCL, so you can move easily from prototyping to production later if you use only RowGen now.

See also: http://mcsv.net/cgi-bin/redir?MCid=mSwuPfwYoy[UNIQID]


New Dashboard Option Speeds Big Data Visualization

Marry Back-End Efficiency to Front-End Presentations

You may already use CoSort's SortCL tool to rapidly integrate, stage, and digest massive file volumes in ETL, federated data, or ODS operations that feed business intelligence tools and processes. Now you can also populate AND use a best-of-breed dashboard application to facilitate the development of business insight.

Specifically, SortCL can rapidly churn billions of records to produce output file subsets in XML or CSV formats that dashboards can import. The dashboard solution IRI now offers lets you cross-integrate that data with database, Excel, and other sources -- and then customize dynamic charts.

By combining SortCL with the CoSort Dashboard add-on option, you have a cost-effective, technically proven solution for saving time and money when transforming massive data volumes into drill-down dashboard charts.

See also: http://mcsv.net/cgi-bin/redir?MCid=Irj7t3F2dE[UNIQID]


CoSort Survey Thank You

Guess Who Won the Zune

If you received and read the 2006 CoSort Journal's 4th quarter edition, and completed the survey form at: http://mcsv.net/cgi-bin/redir?MCid=QJLZGSLoVU[UNIQID]
we want to thank you for your valuable input. We would also like to congratulate Robert Barnhurst, a Senior Consultant at BluePhoenix Solutions in South Bend, Indiana for winning the Microsoft Zune! Please visit the site above to complete the survey and register for your chance to win our next giveaway.
Consolidated Value Statement
... for CoSort V9's SortCL tool ...

"In a single pass through your single (file-based) source of the truth, CoSort field-level transformation software can: filter, migrate, integrate, stage, and present multiple, formatted views of big data; protect data on a need-to-know basis for outsourcing, compliance and test environments; and, prepare both real and test data for VLDB loads, analysis teams, and BI tools."



Surf IRI's New Web Site


You will find a broad range of solutions in the CoSort V9 environment, as well as updated product information. Links:
http://mcsv.net/cgi-bin/redir?MCid=rmHhPggKkS[UNIQID]
http://mcsv.net/cgi-bin/redir?MCid=50qMPScLbM[UNIQID]

Please use the page feedback links to help us improve content in your area(s) of interest.

We are in the process of updating the news and support sections of the site and appreciate your patience, questions and suggestions.

Please use this form:
http://mcsv.net/cgi-bin/redir?MCid=wR26s6UtCC[UNIQID]
You can also use this form to request a product evaluation, webinar, IRI's new white paper
"Making Data Safe for Compliance and Outsourcing," and so on.


Coming Soon

Fast Extract (FACT) for DB2
[[FACT for Oracle Now at v2.3]


Next Conference Exhibits
  • Micro Focus World, Orlando
  • Informatica World, Orlando
  • Oracle OpenWorld, San Francisco
You should be receiving this newsletter on an opt-in basis as part of your current or prospective business with IRI. If you wish to opt-out at any time, simply unsubscribe with the link provided. Opt-in and out actions are confirmed in writing. IRI does not disclose email addresses to third parties.

Unsubscribe info@iri.com from this list.

Our mailing address is:
CoSort / IRI, Inc.
2194 Highway A1A
Suite 303
Melbourne, FL 32937

Our telephone:
1.321.777.8889

Copyright (C) 2007 CoSort / IRI, Inc. All rights reserved.

Forward this email to a friend