
© 2018 The MathWorks, Inc.

Embed your Artificial Intelligence (AI) on CPU, GPU and FPGA

Pierre Nowodzienski – Application Engineer
pierre.nowodzienski@mathworks.fr

From Data to Business value

Generate raw data (end devices) → extract information (data analysis) → get valuable knowledge → make decisions, with Artificial Intelligence spanning the chain.

Artificial Intelligence opportunities in the "Internet of Everything" world

Sending all raw data to the CLOUD faces several constraints: amount of data, transport cost, high latency, availability, and energy cost.

Do the right thing at the right place

           End device             Local control center        Global control center (CLOUD)
Mission    Real-time analytics    Operational Intelligence    Business intelligence
SWaP-C     High                   Medium                      Low
Latency    Very Low               Low - Medium                High

Today's webinar focus: how can we design and deploy Neural Networks on embedded targets?

Embedded targets & mitigations

Embedded targets trade efficiency (performance/watt) against development productivity; code generation is the mitigation in each case:

• C/C++ programming language, sequential processing (CPU)
• CUDA/OpenCL programming language, partly parallel processing (GPU)
• VHDL/Verilog programming language, partly parallel processing (FPGA)

MathWorks workflows: Neural Network to embedded targets

Artificial Neural Network Design & Training: a dataset is used to train the network, yielding either a trained Convolutional or DAG Network, or a trained Shallow Neural Network.

Application design: the trained network is combined with application logic and deployed through GPU Coder, Embedded Coder (ANSI/ISO-compliant C/C++) or HDL Coder (FPGA/ASIC).

First part: Deploying a Deep Neural Network
Second part: Deploying a Shallow Neural Network

Deep Learning is a Subset of Machine Learning

Algorithm Design to Embedded Deployment Workflow

Starting from a MATLAB algorithm (the functional reference) plus the application logic:

1. Functional test – the MATLAB algorithm as functional reference
2. Deployment unit-test – desktop GPU, C++; build type: .mex; call CUDA from MATLAB directly
3. Deployment integration-test – desktop GPU, C++; build type: .lib; call CUDA from a (C++) hand-coded main()
4. Real-time test – embedded GPU; build type: cross-compiled .lib; call CUDA from a (C++) hand-coded main()

Demo: AlexNet Deployment with 'mex' Code Generation
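A minimal sketch of what this kind of 'mex' demo typically looks like with GPU Coder, assuming Neural Network Toolbox, GPU Coder and the AlexNet support package are installed; the entry-point name alexnet_predict is illustrative:

    % alexnet_predict.m -- entry-point function for code generation (name is illustrative)
    function out = alexnet_predict(in) %#codegen
    persistent net;
    if isempty(net)
        net = coder.loadDeepLearningNetwork('alexnet');   % load the pretrained network once
    end
    out = predict(net, in);                               % run inference on a 227x227x3 image
    end

    % At the MATLAB prompt: generate a CUDA MEX and call it in place of the MATLAB code
    cfg = coder.gpuConfig('mex');                         % 'mex' build type: test from MATLAB on the host GPU
    cfg.TargetLang = 'C++';
    cfg.DeepLearningConfig = coder.DeepLearningConfig('cudnn');
    codegen -config cfg alexnet_predict -args {ones(227,227,3,'single')} -report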

Algorithm Design to Embedded Deployment on Tegra GPU

The starting point is again the MATLAB algorithm (functional reference) plus application logic:

1. Functional test – test in MATLAB on the host
2. Deployment unit-test – test the generated code in MATLAB on the host + Tesla GPU; build type: .mex; call CUDA from MATLAB directly
3. Deployment integration-test – test the generated code within a C/C++ app on the host + Tesla GPU; build type: .lib; call CUDA from a (C++) hand-coded main()
4. Real-time test – test the generated code within a C/C++ app on the Tegra target; build type: cross-compiled .lib (cross-compiled on the host with the Linaro toolchain); call CUDA from a (C++) hand-coded main()

AlexNet Deployment to Tegra: Cross-Compiled with 'lib'

Two small changes (sketched below):
1. Change the build type to 'lib'
2. Select the cross-compile toolchain
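In code, the two changes amount to a different coder.gpuConfig build type and a toolchain selection. A hedged sketch, reusing the illustrative alexnet_predict entry point; the toolchain name is an example and depends on what is installed on the host:

    cfg = coder.gpuConfig('lib');                                % 1. change the build type from 'mex' to 'lib'
    cfg.TargetLang = 'C++';
    cfg.DeepLearningConfig = coder.DeepLearningConfig('cudnn');
    cfg.Toolchain = 'NVIDIA CUDA for Jetson Tegra X2 (Linux)';   % 2. cross-compile toolchain (name is illustrative;
                                                                 %    pick one of the toolchains listed in your setup)
    codegen -config cfg alexnet_predict -args {ones(227,227,3,'single')} -report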

Deploying to CPUs

The same flow (shown above for NVIDIA GPUs through GPU Coder and the TensorRT & cuDNN libraries) also deploys the trained network and application logic to CPU targets such as a desktop CPU or a Raspberry Pi board. A hedged configuration sketch follows.
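For CPU targets, the deep learning configuration points at a CPU library instead of cuDNN. A sketch under the same assumptions as before (illustrative alexnet_predict entry point; the ARM architecture and library version values are placeholders to adapt to the board):

    % Desktop Intel CPU: generate C++ that calls the Intel MKL-DNN library
    cfg = coder.config('lib');
    cfg.TargetLang = 'C++';
    cfg.DeepLearningConfig = coder.DeepLearningConfig('mkldnn');
    codegen -config cfg alexnet_predict -args {ones(227,227,3,'single')} -report

    % ARM target such as a Raspberry Pi: generate C++ that calls the ARM Compute Library
    cfg = coder.config('lib');
    cfg.TargetLang = 'C++';
    dlcfg = coder.DeepLearningConfig('arm-compute');
    dlcfg.ArmArchitecture = 'armv7';        % placeholder: match the target architecture
    dlcfg.ArmComputeVersion = '19.05';      % placeholder: match the library version installed on the board
    cfg.DeepLearningConfig = dlcfg;
    codegen -config cfg alexnet_predict -args {ones(227,227,3,'single')} -report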

GPU Coder for Deployment

Accelerated application areas:
– Deep Neural Networks: deep learning, machine learning
– Image Processing and Computer Vision: image filtering, feature detection/extraction
– Signal Processing and Communications: FFT, filtering, cross correlation

Reported performance: 5x faster than TensorFlow, 2x faster than MXNet; 60x faster than CPUs for stereo disparity; 20x faster than CPUs for FFTs.

Deployment libraries: NVIDIA TensorRT & cuDNN, ARM Compute Library, Intel MKL-DNN.

MathWorks workflows: Neural Network to embedded targets

Second part: Deploying a Shallow Neural Network (same workflow diagram as above).

Demo: Shallow network deployment on a Zynq platform

A neural network as a (sensorless) gas emission estimator: the speed command and fuel rate drive the engine, which produces torque and gas emissions; a shallow neural network fed with the same two inputs estimates the torque and gas emission.

Demo workflow

Create the network structure → train the network → test the network (iterate) → export to Simulink → fine-tune & optimize for the target → generate code.

A MATLAB sketch of the first steps is shown below.
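A sketch of the create/train/test/export steps, assuming the engine_dataset that ships with Neural Network Toolbox stands in for the demo data (fuel rate and speed in, torque and emission out); the split ratios and hidden-layer size are illustrative:

    [x, t] = engine_dataset;              % 2 inputs (fuel rate, speed) -> 2 targets (torque, emission)
    net = fitnet(10);                     % create the network structure: shallow fitting net, 10 hidden neurons
    net.divideParam.trainRatio = 0.7;     % training / validation / test split
    net.divideParam.valRatio   = 0.15;
    net.divideParam.testRatio  = 0.15;
    [net, tr] = train(net, x, t, 'useParallel', 'yes');       % train (Parallel Computing Toolbox, optional)
    y = net(x);                                               % test the network
    perf = perform(net, t(:, tr.testInd), y(:, tr.testInd))   % error on the held-out test set
    gensim(net)                           % export the trained network as a Simulink block for the next steps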

Demo summary

The same steps mapped to the products used:
– Create, train, test and export the network to Simulink: Neural Network Toolbox (with Parallel Computing Toolbox to speed up training)
– Fine-tune & optimize for the target: Fixed-Point Designer
– Generate code: HDL Coder and Embedded Coder

HDL Optimization options

Area Optimizations
▪ HDL Coder with Simulink
  – Streaming
  – Sharing
  – Line buffers as RAMs
  – RAM Fusion
  – Architecture Flattening
  – Efficient resource mapping
▪ HDL Coder with MATLAB
  – RAM Mapping
  – Loop Streaming
  – Resource Sharing
  – CSD/FCSD

Speed Optimizations
▪ HDL Coder with Simulink
  – Input/Output pipelining
  – Distributed Pipelining
  – Hierarchical Distributed Pipelining
  – Constrained Pipelining
  – Clock-Rate Pipelining
  – Back-Annotation
  – Adaptive Pipelining
▪ HDL Coder with MATLAB
  – Input/Output pipelining
  – Distributed pipelining
  – Loop Unrolling

Workflow and Verification
▪ HDL Workflow Advisor
▪ Automatic Delay Balancing
▪ Validation model generation
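Most of these options are HDL parameters set on a block or subsystem. A small sketch of how a few of them might be applied to a Simulink subsystem; the model and subsystem names are placeholders:

    % Area: share multipliers inside the subsystem and stream wide vector paths
    hdlset_param('my_model/NN_Subsystem', 'SharingFactor', 4);
    hdlset_param('my_model/NN_Subsystem', 'StreamingFactor', 8);

    % Speed: add I/O pipeline registers and let HDL Coder redistribute them
    hdlset_param('my_model/NN_Subsystem', 'InputPipeline', 2);
    hdlset_param('my_model/NN_Subsystem', 'OutputPipeline', 2);
    hdlset_param('my_model/NN_Subsystem', 'DistributedPipelining', 'on');

    % Generate HDL plus the validation model used to check the optimized architecture
    makehdl('my_model/NN_Subsystem', 'GenerateValidationModel', 'on');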

Key takeaways

▪ Comprehensive & integrated development environment, from dataset to target
▪ Fast design space exploration and trade-off analysis
▪ Target-independent functional reference for a target-optimized implementation model
▪ Deploy a "Smart application", not just the Neural Network

MathWorks workflows: Neural Network to embedded targets (recap of the workflow shown above).

Next steps

▪ Website technical resources
  – Lookup Table Optimization
  – Data Type Optimization (documentation)
  – Efficient Implementation on FPGAs (documentation)
  – Deep Learning Inference for Object Detection on Raspberry Pi
  – Pedestrian Detection on an NVIDIA GPU with TensorRT
▪ Contact us
  – pierre.nowodzienski@mathworks.fr
  – +33-1-41-14-88-45

Special thanks to Vaidehi Venkatesan (Fixed-Point Designer development team) for her great work creating this demo material!