Skip to main content

Parsing gigabytes of JSON per second with parallel bit streams

Resource type
Thesis type
(Thesis) M.Sc.
Date created
2023-03-14
Authors/Contributors
Abstract
Studies have shown that it is possible to boost the efficiency of text processing by carefully eliminating branches as well as reducing branch mispredictions and cache misses, which can be achieved with a few techniques, such as the use of Boolean algebra to reduce pointer-chasing in data structures and to abstract branching. With current advances in technology, vector extensions (SIMD) have been added to commodity processors and have allowed the creation of new algorithms that are able to accomplish the non-trivial task of parallelly processing streams in Gigabytes per second. The Parabix framework exploits the concept of parallel bit streams to take even more advantage of SIMD instructions by transposing and processing streams in batches. This study focuses on using Parabix to boost the efficiency of JSON parsing for Big Data.
Document
Extent
58 pages.
Identifier
etd22402
Copyright statement
Copyright is held by the author(s).
Permissions
This thesis may be printed or downloaded for non-commercial research and scholarly purposes.
Supervisor or Senior Supervisor
Thesis advisor: D., Cameron, Robert
Language
English
Member of collection
Download file Size
etd22402.pdf 1.17 MB

Views & downloads - as of June 2023

Views: 0
Downloads: 4