Skip to main content

icXML: Accelerating Xerces-C 3.1.1 using the Parabix Framework

Date created
2013-08-23
Authors/Contributors
Abstract
Traditional XML parsers process XML documents sequentially, one byte-at-a-time. Parabix-XML, however, parses documents 128-bytes-at-a-time, through the use of Pablo-generated parallel bit stream operations. Prior research on accelerating XML processing using the Parabix Framework lead to a number of interesting yet feature-light research prototypes. This project investigates the integration of Parabix into an existing widely-used XML parser, Xerces-C 3.1.1 of the Apache Software Foundation. Xerces was systematically restructured into nine independent layers that leverage parallel transcoding, deletion and bit stream operations yet still adhere to the existing programmer API. icXML supports all features provided by Xerces with the exception of object serialization and its layered structure supports future multicore acceleration using pipeline parallelism. Evaluation of icXML in a single-core setting demonstrates a speedup of 50% to 100% in a wide range of workloads.
Document
Identifier
etd8022
Copyright statement
Copyright is held by the author.
Permissions
The author granted permission for the file to be printed, but not for the text to be copied and pasted.
Scholarly level
Member of collection
Download file Size
etd8022_NMedforth.pdf 1.15 MB

Views & downloads - as of June 2023

Views: 12
Downloads: 0