Resource type
Date created
2013-08-23
Authors/Contributors
Author: Medforth, Nigel Woodland
Abstract
Traditional XML parsers process XML documents sequentially, one byte-at-a-time. Parabix-XML, however, parses documents 128-bytes-at-a-time, through the use of Pablo-generated parallel bit stream operations. Prior research on accelerating XML processing using the Parabix Framework lead to a number of interesting yet feature-light research prototypes. This project investigates the integration of Parabix into an existing widely-used XML parser, Xerces-C 3.1.1 of the Apache Software Foundation. Xerces was systematically restructured into nine independent layers that leverage parallel transcoding, deletion and bit stream operations yet still adhere to the existing programmer API. icXML supports all features provided by Xerces with the exception of object serialization and its layered structure supports future multicore acceleration using pipeline parallelism. Evaluation of icXML in a single-core setting demonstrates a speedup of 50% to 100% in a wide range of workloads.
Document
Identifier
etd8022
Copyright statement
Copyright is held by the author.
Scholarly level
Member of collection
Download file | Size |
---|---|
etd8022_NMedforth.pdf | 1.15 MB |