Tag: Pentaho

  • Reading large XML file using Pentaho and Apache Hop

    Refer to my pervious article – Processing GLIEF data in JSON format . I wanted refresh my knowledge of Pentaho Data Integration tool and see if I could process huge XML file without running into Java OOM (Java Out of Memory) problem. Pentaho offers Input step called – XML Input Stream (StAX – Streaming API…