Exploring energy efficient object classification on reconfigurable logic

Oliveira, Bruno Gonçalves

Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/88114

Title:	Exploring energy efficient object classification on reconfigurable logic
Other Titles:	Exploração de Classificadores de Objetos Energéticamente Eficientes em Lógica Reconfigurável
Authors:	Oliveira, Bruno Gonçalves
Orientador:	Lobo, Jorge Nuno de Almeida e Sousa Almada
Keywords:	Computação Heterogénea; Sistemas Embebidos; Circuitos Personalizados; Baixo Consumo; Classificação de Objectos; Heterogeneous Computing; Embedded Systems; Custom Circuits; Low Power; Object Classification
Issue Date:	22-Jul-2019
Serial title, monograph or event:	EXPLORING ENERGY EFFICIENT OBJECT CLASSIFICATION ON RECONFIGURABLE LOGIC
Place of publication or event:	DEEC
Abstract:	A classificação de objetos é um problema com grande relevância em visão por computador, uma vez que pode ser integrada em um enorme conjunto de aplicações alvo, taiscomo agricultura e segurança. No presente, existe um conjunto de soluções que resolvemeste problema, sendo que, as que possuem maior sucesso, dependem de redes neuronais.As mais conhecidas são o GoogleNet, a AlexNet e o YOLO. No entanto, o seu processamento subjacente requer plataformas de alta performance tais como GPUs, clusters deCPUs ou ASICs customizados. Excluindo os ASICs, que tem um elevado custo, massão menos genéricos, elas tipicamente têm um elevado consumo energético e não sãoadequadas a sistemas embebidos. No entanto, tem havido progresso em abordagens debaixo consumo devido em parte ao mercado dos smartphones e tablets, estando disponíveismixes de arquiteturas (CPUs e GPUs) com lógica reconfigurável (FPGAs). Neste trabalho,propomos uma série de implementações de redes neuronais quantizadas em plataformashíbridas, explorando completamente o espaço de design, a performance de classificaçãoe a eficiência energética. O algoritmo subjacente é analisado, e os componentes chavepara computação concorrente e paralela identificados. O mapeamento na plataforma foiexplorado, desde a implementação CPU base até uma completamente customizada quemaximiza o uso dos recursos disponíveis. Um conjunto de métricas é considerado para aavaliação das diferentes configurações. No final, conseguimos classificadores de objetoscom diferentes caracteristicas a correr em dois dispositivos de baixo consumo. As análisesrealizadas às implementações suportaram a fiabilidade da compressão de redes neuronaisde convolução para caber nos dispostivos alvo, através da redução da precisão dos seusparâmetros. Object classification is a problem with great relevance in computer vision since itcan integrate a wide range of target applications, such as agriculture and security. Atpresent, there are a set of solutions that solve this problem, the most successful relyingon neural networks. Among the best known are GoogleNet, AlexNet and YOLO. However, the underlying processing requires high performing computational platforms suchas GPUs, CPU clusters, or custom ASICs. Apart from ASICs, that have a high cost butare less generic, they are typically high power and not well suited for embedded systems.However, there has been some progress in low power approaches, driven in part by thesmartphone and tablet market, and heterogeneous platforms are now available that explore a mix of architectures (CPUs and GPUs) with reconfigurable logic (FPGAs). In thiswork, we propose implementations of lightweight convolutions neural networks in hybridplatforms, thoroughly exploring the design space, the classification performance and thepower efficiency. The underlying algorithm is analysed, and key components for concurrent and parallel computation identified. Mappings of this to the heterogeneous platformwill be explored, ranging from a baseline CPU implementation to a full custom implementation maximising the use of the available resources. A set of metrics is considered for theevaluation of the different configurations. In the end, we achieved object classifiers withdifferent characteristics running in two low-power devices. Analyses performed on theimplementations supported the reliability of compression a convolution neural network tofit on the target device, through the reduction of the precision of its parameters.
Description:	Dissertação de Mestrado Integrado em Engenharia Electrotécnica e de Computadores apresentada à Faculdade de Ciências e Tecnologia
URI:	https://hdl.handle.net/10316/88114
Rights:	openAccess
Appears in Collections:	UC - Dissertações de Mestrado

Files in This Item:

File	Description	Size	Format
Dissertação_Bruno Goncalves Oliveira.pdf		17.08 MB	Adobe PDF	View/Open

Show full item record

Page view(s)

119

checked on Jul 23, 2024

Download(s)

173

checked on Jul 23, 2024

Google Scholar^TM

Check

This item is licensed under a Creative Commons License

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM