MIVisionX

MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. Khronos OpenVX is also delivered with MIVisionX.

MIT licensed

MIVisionX Python Inference Analyzer

The MIVisionX Inference Analyzer application uses pre-trained ONNX, NNEF, or Caffe models to analyze and summarize images.

MIVisionX supports pre-trained models in ONNX, NNEF, and Caffe formats. The app first converts the pre-trained model to AMD Neural Net Intermediate Representation (NNIR), AMD's internal open format. The Optimizer then goes through the NNIR and applies optimizations that allow the model to be deployed onto the target hardware efficiently. Finally, the AMD NNIR is converted into OpenVX C code, which is compiled and wrapped with a Python API to run on any targeted hardware.
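The flow described above can be sketched as an ordered list of stages. This is only an illustration of the sequence, not the app's actual implementation: `SUPPORTED_FORMATS` and `plan_stages` are hypothetical names introduced here, and the stage labels simply restate the paragraph.

```python
# Illustrative only: these names are NOT part of MIVisionX.
SUPPORTED_FORMATS = ("caffe", "onnx", "nnef")


def plan_stages(model_format):
    """Return the ordered stage labels for one pre-trained model."""
    fmt = model_format.lower()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError("unsupported model format: %r" % model_format)
    return [
        "convert %s model to AMD NNIR" % fmt,
        "optimize NNIR for the target hardware",
        "generate OpenVX C code from NNIR",
        "compile the generated OpenVX C code",
        "wrap the compiled library with a Python API",
    ]


if __name__ == "__main__":
    # Print the stages for an ONNX model, numbered in execution order.
    for step, label in enumerate(plan_stages("onnx"), start=1):
        print("%d. %s" % (step, label))
```

Only the first stage depends on the input format; once the model is in NNIR, the remaining stages are the same for ONNX, NNEF, and Caffe models.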

Analyzer Index

Prerequisites

Use MIVisionX Docker

MIVisionX provides developers with Docker images for Ubuntu 16.04, Ubuntu 18.04, CentOS 7.5, and CentOS 7.6. With these images, developers can quickly prototype and build applications without being locked into a single system setup or losing time working out the dependencies of the underlying software.

Docker with display option

Usage

Command Line Interface (CLI)

usage: python mivisionx_inference_analyzer.py [-h]
                                              --model_format MODEL_FORMAT
                                              --model_name MODEL_NAME
                                              --model MODEL
                                              --model_input_dims MODEL_INPUT_DIMS
                                              --model_output_dims MODEL_OUTPUT_DIMS
                                              --label LABEL
                                              --output_dir OUTPUT_DIR
                                              --image_dir IMAGE_DIR
                                              [--image_val IMAGE_VAL]
                                              [--hierarchy HIERARCHY]
                                              [--add ADD]
                                              [--multiply MULTIPLY]
                                              [--fp16 FP16]
                                              [--replace REPLACE]
                                              [--verbose VERBOSE]

Usage help

  -h, --help            show this help message and exit
  --model_format        pre-trained model format, options:caffe/onnx/nnef [required]
  --model_name          model name                                        [required]
  --model               pre_trained model file/folder                     [required]
  --model_input_dims    c,h,w - channel,height,width                      [required]
  --model_output_dims   c,h,w - channel,height,width                      [required]
  --label               labels text file                                  [required]
  --output_dir          output dir to store ADAT results                  [required]
  --image_dir           image directory for analysis                      [required]
  --image_val           image list with ground truth                      [optional]
  --hierarchy           AMD proprietary hierarchical file                 [optional]
  --add                 input preprocessing factor      [optional - default:[0,0,0]]
  --multiply            input preprocessing factor      [optional - default:[1,1,1]]
  --fp16                quantize model to FP16               [optional - default:no]
  --replace             replace/overwrite model              [optional - default:no]
  --verbose             verbose                              [optional - default:no]
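
When the analyzer is driven from another script, the options listed above can be assembled into an argument list before launching it. The following is a minimal sketch, assuming the flag names exactly as shown in the usage help; `build_analyzer_command` is a hypothetical helper, and every path and dimension in the example is a placeholder rather than a value from the MIVisionX samples.

```python
# Flag names are copied from the usage help; the helper itself is hypothetical.
REQUIRED = ("model_format", "model_name", "model", "model_input_dims",
            "model_output_dims", "label", "output_dir", "image_dir")
OPTIONAL = ("image_val", "hierarchy", "add", "multiply",
            "fp16", "replace", "verbose")


def build_analyzer_command(script="mivisionx_inference_analyzer.py", **options):
    """Build an argv list for the analyzer CLI, checking required options."""
    missing = [name for name in REQUIRED if name not in options]
    if missing:
        raise ValueError("missing required options: " + ", ".join(missing))
    unknown = [name for name in options if name not in REQUIRED + OPTIONAL]
    if unknown:
        raise ValueError("unknown options: " + ", ".join(unknown))
    command = ["python", script]
    # Emit flags in the same order as the usage help.
    for name in REQUIRED + OPTIONAL:
        if name in options:
            command += ["--" + name, str(options[name])]
    return command


if __name__ == "__main__":
    # Placeholder values; substitute your own model, labels, and directories.
    argv = build_analyzer_command(
        model_format="onnx",
        model_name="my_model",
        model="model.onnx",
        model_input_dims="3,224,224",
        model_output_dims="1000,1,1",
        label="labels.txt",
        output_dir="results",
        image_dir="images",
    )
    print(" ".join(argv))
```

The resulting list can be passed to `subprocess.run(argv, check=True)`. Building a list rather than a single shell string avoids quoting problems with values such as `[0,0,0]`.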

Graphical User Interface (GUI)

usage: python mivisionx_inference_analyzer.py

Supported Pre-Trained Model Formats

Samples

Sample 1 - Using Pre-Trained ONNX Model

Run SqueezeNet on sample images

Sample 2 - Using Pre-Trained Caffe Model

Run VGG 16 on sample images

Sample 3 - Using Pre-Trained NNEF Model

Run VGG 16 on sample images