Resource type
Thesis type
(Thesis) M.A.Sc.
Date created
2022-12-15
Authors/Contributors
Author: Azizian, Bardia
Abstract
In this thesis, we study different approaches to visual data coding for machines and develop new methods to address some issues in this area. We mainly focus on the Image and Video signals processed by a Deep Neural Network (DNN)-based computer vision model. Our proposed methods are designed to be utilized in DNN-based machines deployed collaboratively on the edge and cloud. This framework is called Collaborative Intelligence (CI), in which a DNN model is split into two parts such that the edge device runs the first few layers, and the remaining layers are executed on the cloud. To that end, the intermediate feature tensors need to be coded and transferred to the cloud. This research explicitly attempts to provide solutions for efficient coding of these tensors, considering challenges such as motion estimation and compensation for videos in the latent space, and privacy for images.
Document
Extent
66 pages.
Identifier
etd22272
Copyright statement
Copyright is held by the author(s).
Supervisor or Senior Supervisor
Thesis advisor: bajic, ivan
Language
English
Member of collection
Download file | Size |
---|---|
etd22272.pdf | 16.42 MB |