Innovative Routines International (IRI), Inc. (The CoSORT Company) respects your time and privacy. You can stop or start future quarterly mailings at any time: click here or on the links at the bottom of this message. Please forward this newsletter to anyone interested in IRI, our high-performance data manipulation and management tools, or open systems data processing technology.

 

The CoSORT Journal: Data Sorting and ETL News
                                                                                  Quarter 2, 2006
In this issue:
Upcoming CoSORT Trade Exhibits
CoSORT Now in metaController: "An ETL Scheduler on Steroids"
IBM 'Chiphopper' Certifies IRI Tools
Tech Tip: Change Data Capture
Next CoSORT Exhibits: DAMA & IDUG (DB2) 

Visit the IRI booth, #119, at the:

18th Annual DAMA Int'l Symposium &
10th Annual Wilshire Meta-Data Conference
April 23 - 27, 2006
Hyatt Regency, Denver, Colorado

This year's conference will attract more than 1,000 professionals from around the world, and feature 130 world-class educational sessions and speakers on managing organizational data more effectively. IRI staff will demonstrate SortCL's explicit, reusable metadata framework and the functions used in CoSORT, FACT, and RowGen. Click here for conference details and registration.

Shortly after, the next CoSORT exhibit will be in Booth #500 at:

IDUG - North America
May 7 - 11, 2006
Tampa Convention Center
Tampa Bay, Florida
 

Our focus there will be the CoSORT Load Accelerator for DB2, SortCL's JCL and COBOL sort migrations, data migrations, and BI reporting, and RowGen's test data and file synthesis capabilities.

metaController: Scheduling CoSORT in ETL 

CoSORT's batch utilities, including of course SortCL job scripts, are routinely scheduled events, invoked by tools ranging from free cron calls to advanced third-party scheduling software. 

CoSORT SortCL (and FAst extraCT) jobs can now also be conditionally scheduled into existing DW operations via Full360's metaController task orchestration software. metaController drops into current environments to manage and launch jobs, and give data architects a drag-and-drop means to design and manage process flows between systems. This now includes the means to launch, validate, and modify SortCL and FACT process tasks along with other tasks in data warehousing applications like DataStage, Informatica, Essbase, etc. The application components are tied together in the context of upstream and downstream process steps, all without complex scripting.

For example, ETL and data architects can use metaController to build a process map that includes unloading, sorting, grouping, and cube-building steps. A data architect could then invoke a workflow process (using the built-in workflow tool or external workflow engine), which can be dragged and dropped onto the existing process map. When the flow reaches the workflow node, an e-mail can go to the business analyst to request a sign-off. This lets the business start processes, approve them at checkpoints, validate job streams, etc. metaController thus removes traditional schedulers' limitations of linearity and obscurity from complex data processing steps.

Click here for more information about metaController and CoSORT.

Linux CoSORT & RowGen Now "Chiphoppers"

IRI is pleased to announce that CoSORT and RowGen were just ported to Linux under IBM's 'Chiphopper' certification program. This means that IRI software is now more cross-compatible with RHEL and SLES Linux running on IBM's entire eServer platform line, which includes x, i, p, JS20, OpenPower and zSeries.

CoSORT has always been the first UNIX sort product to support different Linux and IBM Linux (and AIX) platforms. RowGen is also on board, and FACT is expected soon, allowing IRI and IBM customers to leverage all three solutions and their common SortCL metadata across the entire eServer line. Click here for a list of all CoSORT-supported UNIX platforms.

About CoSORT and The CoSORT Journal
  • CoSORT solutions serve data warehouse (ETL) architects, very large database (VLDB) administrators, mainframe sort migrators, and developers and independent software vendors (ISVs) building faster sorting and data transformation into their applications.
  • CoSORT delivers the IT industry's fastest parallel UNIX sort engine and one of its most powerful flat-file manipulation and reporting programs, SortCL, which combines: row filtering and conditional selection, sort/merge and joins, drill-down aggregation and cross-row calculation, conversion and collation of more than 100 data types, database sequencing, and multi-target, multi-level output reformatting for reports, hand-offs, and DB load utilities.
  • Other special CoSORT features include: coroutine sort architecture; fully tunable and scalable parallel sort performance on all multi-CPU UNIX and Windows servers; cross-calculation on aggregated values and aggregation on cross-calculated values; cross-table joins (matching) integrated with data conversion and expression logic; multinational date and timestamp support; cross-platform Java GUI; and e-commerce reporting via CSV/CLF and IP Address manipulation, plus ELF input and HTML output.
  • CoSORT also has plug-n-play replacements or parameter converters for sorting in: ACUCOBOL-GT; Amdocs Ensemble (telecom billing); Ascential DataStage; Informatica PowerCenter and PowerMart; Cincom Supra; IBM's DB2 loader and MVS/VSE sorts; MF COBOL Workbench, Net and Server Express; SAS System; Software AG Natural; Sun MRP; UNIX SVR4 (/bin/sort); and SyncSort UNIX.
  • IRI has begun to offer other data manipulation and management solutions like: FACT for fast unloads from Oracle; RowGen for custom data generation and format simulations; netCONVERT for mainframe tape data conversion and reformatting; x-PRESS for fast and secure data compression and decompression; Logon for controlling and auditing access to UNIX systems; and Permitas for licensing and activating software applications.
  • The CoSORT Journal is a quarterly Email newsletter designed to keep subscribers updated on salient news and events at IRI, Inc. Past editions are archived here.

To remove or add an Email address in future CoSORT Journal mailings, please email news@iri.com. To contact an IRI agent, click here, call 1-800-333-SORT, or email info@iri.com.

CoSORT®, SortCL, RowGen and Permitas are trademarks of IRI. FACT is a trademark of IDS Ltd. (CoSORT Korea). All other product or brand names mentioned herein may be (registered) trademarks or copyrights of their respective owners.

New CoSORT Agency for Switzerland

Outside Cantone Ticino, Data Processing Consulting Services (DPCS) has become the newest agency for CoSORT, FACT and RowGen in Europe. DPCS provides software and consulting services for those rightsizing from mainframes to open systems. CoSORT's legacy sort and data migration offerings will be the first order of business. 

Click here to find -- or inquire about becoming -- an IRI software agent for your application or location.

Tech Tip: Change Data Capture (CDC)

Fast, Easy Delta Reporting with SortCL 

Because of CoSORT's fast, low-cost file-system approach to data warehouse integration and staging -- its ability to simultaneously filter, sort, report, and speed DB loads -- SortCL can be a very simple and cost-effective way to capture and report on changed data in either a standalone or integrated framework.

The following SortCL application is an example of Insert, Update, Delete (i.e. Delta) Processing: 

Based on customer data that changes on a daily basis, a full outer join is used to produce a report that shows differences between the old and new sets of data. The "old" and "new" files are both pre-CoSORTed by Account (Acct). The final report puts a change tag in the last column of each record: 

  • Insert -- where a new record exists for a given Acct value and an old record does not (that is, a non-matched record).
  • Update -- where the Amount (Amnt) values of the matching old and new records differ.
  • Delete -- where an old record exists for a given Acct value and a new record does not (that is, a non-matched record).
  • No change -- where the Amnt values of the old and new records are the same (requires a matching record).

Shown side-by-side below are small subsets of the pre-sorted input files old (left) and new (right):

AC450 271        AC450 238
AC451 269        AC451 954
AC452 270        AC454 269
AC453 298        AC455 281
AC454 269        BNZ21 322
AC455 281        BNZ22 150


For this example, assume that the pre-sorted file new has the same format as old, but that its data values differ: some match the old file values and some do not.

The following SortCL script includes a full outer join and a conditional field, delta_flag, that is appended as the last column in the output:

/INFILE=old
   /FIELD=(Acct,POS=1,SIZE=5)
   /FIELD=(Amnt,POS=7,SIZE=3)
   /CONDITION=(old_exist, \
       TEST=(old.Amnt NE " "))
   /CONDITION=(old_nonexist, \
       TEST=(old.Amnt EQ " "))

/INFILE=new
   /FIELD=(Acct,POS=1,SIZE=5)
   /FIELD=(Amnt,POS=7,SIZE=3)
   /CONDITION=(new_exist, \
       TEST=(new.Amnt NE " "))
   /CONDITION=(new_nonexist, \
       TEST=(new.Amnt EQ " "))
   /CONDITION=(update, \
       TEST=(old.Amnt NE new.Amnt))
   /CONDITION=(no_update, \
       TEST=(old.Amnt EQ new.Amnt))

/JOIN FULL_OUTER old new WHERE \
       old.Acct EQ new.Acct

/OUTFILE=delta.out
   /FIELD=(old.Acct,POS=1, SIZE=5)
   /FIELD=(old.Amnt,POS=7, SIZE=3)
   /FIELD=(new.Acct,POS=11,SIZE=5)
   /FIELD=(new.Amnt,POS=17,SIZE=3)
   /DATA="   "
   /FIELD=(delta_flag,POS=21,  \
       IF update AND old_exist \
       AND new_exist THEN "Update"\
       ELSE IF old_nonexist AND\
       new_exist THEN "Insert" \
       ELSE IF old_exist AND   \
       new_nonexist THEN "Delete" \
       ELSE "")   # No change



Here is a subset of the output file, delta.out:

AC450 271 AC450 238 Update
AC451 269 AC451 954 Update
          BNZ21 322 Insert
          BNZ22 150 Insert
AC452 270           Delete
AC453 298           Delete
AC454 269 AC454 269
AC455 281 AC455 281


The full outer join produces the results of the right outer join (Insert) and the left outer join (Delete). The full outer join also returns the matches, where the Update and " " (empty) cases are determined by comparing the Amnt values of the matching records.
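
For readers who want to check the classification rules outside of SortCL, the same full outer join logic can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not IRI code: the names parse, delta_report, and format_row are hypothetical, the inputs are the sample subsets shown above, and records are keyed on Acct with Amnt compared as text.

```python
from typing import Dict, List, Optional, Tuple

# Sample subsets of the pre-sorted "old" and "new" files from the tech tip.
OLD_LINES = ["AC450 271", "AC451 269", "AC452 270",
             "AC453 298", "AC454 269", "AC455 281"]
NEW_LINES = ["AC450 238", "AC451 954", "AC454 269",
             "AC455 281", "BNZ21 322", "BNZ22 150"]

# (Acct, old Amnt or None, new Amnt or None, delta flag)
DeltaRow = Tuple[str, Optional[str], Optional[str], str]


def parse(lines: List[str]) -> Dict[str, str]:
    """Turn 'Acct Amnt' lines into an {Acct: Amnt} mapping."""
    return dict(line.split() for line in lines if line.strip())


def delta_report(old: Dict[str, str], new: Dict[str, str]) -> List[DeltaRow]:
    """Full outer join on Acct, tagging each row with its change type."""
    rows: List[DeltaRow] = []
    for acct in sorted(old.keys() | new.keys()):  # union of keys = full outer join
        old_amnt = old.get(acct)
        new_amnt = new.get(acct)
        if old_amnt is None:
            flag = "Insert"   # only in new (right outer join part)
        elif new_amnt is None:
            flag = "Delete"   # only in old (left outer join part)
        elif old_amnt != new_amnt:
            flag = "Update"   # matched, but Amnt differs
        else:
            flag = ""         # matched, no change
        rows.append((acct, old_amnt, new_amnt, flag))
    return rows


def format_row(acct: str, old_amnt: Optional[str],
               new_amnt: Optional[str], flag: str) -> str:
    """Lay out a row in the same fixed columns as delta.out."""
    left = f"{acct} {old_amnt}" if old_amnt is not None else " " * 9
    right = f"{acct} {new_amnt}" if new_amnt is not None else " " * 9
    return f"{left} {right} {flag}".rstrip()


if __name__ == "__main__":
    for row in delta_report(parse(OLD_LINES), parse(NEW_LINES)):
        print(format_row(*row))
```

This sketch emits rows in Acct order, so the Delete rows for AC452 and AC453 appear before the unchanged AC454 and AC455 rows and the BNZ Insert rows come last; the flags themselves follow the same rules as the delta_flag field in the script above.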

Of course, this example is greatly simplified in terms of data content and file formatting, but larger sources and larger, more detailed (and aggregated) outputs are always possible. Please email your questions to support@iri.com.

© 2006 Innovative Routines International (IRI), Inc.
2194 Highway A1A, Suite 303, Melbourne, FL 32937
All rights reserved.