DataStage

DataStage integrates data on demand across many systems through a high-performance parallel framework, extended metadata management and enterprise connectivity. It also provides massively scalable capabilities that enable companies to solve large-scale business problems, integrating with all types of data – 'big' or 'small' – to help you adapt quickly to your changing business environment.

IBM InfoSphere DataStage for Linux on System z provides these unique capabilities:

  • The powerful ETL solution supports the collection, integration and transformation of large volumes of data, with data structures ranging from simple to highly complex. IBM InfoSphere DataStage manages data arriving in real-time as well as data received on a periodic or scheduled basis.
  • The scalable platform enables companies to solve large-scale business problems through high-performance processing of massive data volumes. By leveraging the parallel processing capabilities of multiprocessor hardware platforms, IBM InfoSphere DataStage Enterprise Edition can scale to satisfy the demands of ever-growing data volumes, stringent real-time requirements, and ever-shrinking batch windows.
  • Unique support for big data makes it easier and more efficient to explore and integrate with big data and to get quickly to the next level of analysis. InfoSphere DataStage provides support for InfoSphere BigInsights connectivity options for Cassandra, HDFS, Hive, HBase, MongoDB, and other NoSQL databases. It also offers Balanced Optimization for Hadoop (to push processing to the data), IBM InfoSphere Streams integration (to provide direct data flow integration to gather and pass information to real-time analytical processes), big data job sequencing, and features to support big data governance (such as impact analysis and data lineage on any big data integration points).
  • Workload management capabilities enable policy-driven control of system resources and prioritization of different classes of workloads. Customers can use the new workload management capabilities to optimize hardware utilization and prioritize mission-critical tasks, throttle job activities where resource usage exceeds specified thresholds, and assess, assign and reassign the priority of jobs as new jobs are submitted into the queue.
  • Harnesses the power of business rules management to adapt more quickly to changing business requirements. InfoSphere DataStage can integrate directly with IBM Operational Decision Management (formerly ILOG JRules), allowing organizations to make a giant leap forward in bridging the gap between business people and IT by implementing decision logic using IBM Operational Decision Management within InfoSphere Information Server.
  • Comprehensive source and target support for a virtually unlimited number of heterogeneous data sources and targets in a single job, including text files; complex data structures in XML; ERP systems such as SAP and PeopleSoft; almost any database (including partitioned databases); web services; and business intelligence tools like SAS.
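
The workload management behavior described above (priority ordering, throttling past a resource threshold, and reprioritizing queued jobs) can be illustrated with a small, generic sketch. This is not DataStage code and does not use any IBM API; the `WorkloadQueue` class, its method names, the CPU threshold and the job names are all hypothetical, chosen only to show the idea.

```python
import heapq
from dataclasses import dataclass, field
from itertools import count


@dataclass(order=True)
class QueuedJob:
    priority: int                     # lower number is dispatched first
    seq: int                          # tie-breaker: submission order
    name: str = field(compare=False)  # job name is not used for ordering


class WorkloadQueue:
    """Toy policy-driven job queue (hypothetical; not a DataStage API)."""

    def __init__(self, cpu_threshold: float):
        # Assumed policy: hold all jobs while CPU usage is above this value.
        self.cpu_threshold = cpu_threshold
        self._heap = []
        self._seq = count()

    def submit(self, name: str, priority: int) -> None:
        heapq.heappush(self._heap, QueuedJob(priority, next(self._seq), name))

    def reprioritize(self, name: str, priority: int) -> None:
        # Rebuild the heap with a new priority for the named job.
        self._heap = [
            QueuedJob(priority if job.name == name else job.priority,
                      job.seq, job.name)
            for job in self._heap
        ]
        heapq.heapify(self._heap)

    def next_job(self, cpu_usage: float):
        # Throttle: dispatch nothing while usage exceeds the threshold.
        if cpu_usage > self.cpu_threshold or not self._heap:
            return None
        return heapq.heappop(self._heap).name


queue = WorkloadQueue(cpu_threshold=0.8)
queue.submit("nightly_load", priority=5)
queue.submit("fraud_feed", priority=1)    # mission-critical job
queue.submit("report", priority=5)

held = queue.next_job(cpu_usage=0.95)     # above threshold, so nothing runs
first = queue.next_job(cpu_usage=0.50)    # highest-priority job is dispatched
queue.reprioritize("report", priority=0)  # promote a job already in the queue
second = queue.next_job(cpu_usage=0.50)
```

In this sketch a lower number means a higher priority, and jobs with equal priority are dispatched in submission order; a real workload manager would track several resources and policies rather than a single CPU figure.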