Please use this identifier to cite or link to this item:
Title: Use of convolutional neural networks for semantic image segmentation across different computing systems
Authors: Fisher, Andmorgan R.
Middleton, Timothy A.
Cotugno, Jonathan E.
Sava, Elena.
Clemente-Harding, Laura.
Berger, Joseph.
Smith, Allistar J.
Li, Teresa C.
Keywords: Aerial photography
Drone aircraft
Geospatial data--Computer processing
Remote-sensing images
Image processing
Neural networks (Computer science)
Computer vision
Publisher: Geospatial Research Laboratory (U.S.)
Engineer Research and Development Center (U.S.)
Series/Report no.: Technical Report (Engineer Research and Development Center (U.S.)) ; no. ERDC/GRL TR-20-7
Abstract: The advent of powerful computing platforms coupled with deep learning architectures have resulted in novel approaches to tackle many traditional computer vision problems in order to automate the interpretation of large and complex geospatial data. Such tasks are particularly important as data are widely available and UAS are increasingly being used. This document presents a workflow that leverages the use of CNNs and GPUs to automate pixel-wise segmentation of UAS imagery for faster image processing. GPU-based computing and parallelization is explored on multi-core GPUs to reduce development time, mitigate the need for extensive model training, and facilitate exploitation of mission critical information. VGG-16 model training times are compared among different systems (single, virtual, multi-GPUs) to investigate each platform’s capabilities. CNN results show a precision accuracy of 88% when applied to ground truth data. Coupling the VGG-16 model with GPU-accelerated processing and parallelizing across multiple GPUs decreases model training time while preserving accuracy. This signifies that GPU memory and cores available within a system are critical components in terms of preprocessing and processing speed. This workflow can be leveraged for future segmentation efforts, serve as a baseline to benchmark future CNN, and efficiently support critical image processing tasks for the Military.
Description: Technical Report
Gov't Doc #: ERDC/GRL TR-20-7
Rights: Approved for Public Release; Distribution is Unlimited
Size: 43 pages / 3.31 Mb
Types of Materials: PDF
Appears in Collections:Technical Report

Files in This Item:
File Description SizeFormat 
ERDC-GRL TR-20-7.pdf3.31 MBAdobe PDFThumbnail