Presentation + Paper
7 June 2024 A design space exploration framework for deployment of resource-constrained deep neural networks
Yan Zhang, Lei Pan, Phillip Berkowitz, Mun Wai Lee, Benjamin Riggan, Shuvra S. Bhattacharyya
Author Affiliations +
Abstract
Recent years have witnessed great progress in the development of deep neural networks (DNNs), which has led to growing interest in deploying DNNs in resource-constrained environments such as network-edge and edge-cloud environments. To address objectives of efficient DNN inference, numerous approaches as well as specialized platforms have been designed for inference acceleration. The flexibility and diverse capabilities offered by these approaches and platforms result in large design spaces with complex trade-offs for DNN deployment. Relevant objectives involved in these trade-offs include inference accuracy, latency, throughput, memory requirements, and energy consumption. Tools that can effectively assist designers in deriving efficient DNN configurations for specific deployment scenarios are therefore needed. In this work, we present a design space exploration framework for this purpose. In the proposed framework, DNNs are represented as dataflow graphs using a lightweight-dataflowbased modeling tool, and schedules (strategies for managing processing resources across different DNN tasks) are modeled in a formal, abstract form using dataflow methods as well. The dataflow-based application and schedule representations are integrated systematically with a multiobjective particle swarm optimization (PSO) strategy, which enables efficient evaluation of implementation trade-offs and derivation of Pareto fronts involving alternative deployment configurations. Experimental results using different DNN architectures demonstrate the effectiveness of our proposed framework in exploring design spaces for DNN deployment.
Conference Presentation
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Yan Zhang, Lei Pan, Phillip Berkowitz, Mun Wai Lee, Benjamin Riggan, and Shuvra S. Bhattacharyya "A design space exploration framework for deployment of resource-constrained deep neural networks", Proc. SPIE 13034, Real-Time Image Processing and Deep Learning 2024, 130340B (7 June 2024); https://doi.org/10.1117/12.3014043
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Design

Particle swarm optimization

Neural networks

Performance modeling

Machine learning

RELATED CONTENT


Back to Top