Adding Object Detection solution
Signed-off-by: Ravi Kumar Neti <[email protected]>
quic-rneti committed Jan 22, 2024
1 parent f232f88 commit 6666e0c
Showing 136 changed files with 16,582 additions and 0 deletions.
376 changes: 376 additions & 0 deletions ai-solutions/android/03-ObjectDetection/GenerateDLC.ipynb

Large diffs are not rendered by default.

215 changes: 215 additions & 0 deletions ai-solutions/android/03-ObjectDetection/README.md
@@ -0,0 +1,215 @@
## Object Detection with YoloNAS / SSDMobilenetV2 / YoloX
The project is designed to utilize the [Qualcomm® Neural Processing SDK for AI](https://developer.qualcomm.com/sites/default/files/docs/snpe/index.html), deep learning software for Snapdragon platforms, for Object Detection on Android. The Android application can use any built-in/connected camera to capture frames and runs a machine-learning model to get the prediction/inference and location of the detected objects.

# Pre-requisites

* Before starting the Android application, please follow the instructions for setting up the Qualcomm Neural Processing SDK using this link: https://developer.qualcomm.com/sites/default/files/docs/snpe/setup.html
* An Android device running Android 6.0 or above with one of the Snapdragon processors listed below (or a Snapdragon HDK with a display) can be used to test the application.
* Download the COCO 2014 dataset and give its path to GenerateDLC.ipynb: change the variable "dataset_path" in the Quantization section of the notebook.


## List of Supported Devices

- Snapdragon® SM8550

The above target supports the application with the CPU, GPU and DSP runtimes. For more information on the supported devices, please follow this link: https://developer.qualcomm.com/docs/snpe/overview.html

# Source Overview

## Source Organization

demo : Contains the demo GIF

app : Contains source files in standard Android app format

app\src\main\assets : Contains the model binary (DLC)

app\src\main\java\com\qc\objectdetectionYoloNas : Application Java source code

app\src\main\cpp : Native (C++) source code

sdk : Contains the OpenCV SDK

## DLC Generation

Run the Jupyter notebook GenerateDLC.ipynb. This notebook generates the quantized YoloNAS DLC.

The YoloNAS model is trained on the COCO dataset for 80 classes of everyday objects.
The list of classes can be found in the dataset at: https://cocodataset.org/#explore

## Code Implementation

This application opens a camera preview, collects the frames, and converts them to bitmaps. The network is built via the neural network builder by passing the model DLC name and runtime as input. Each bitmap is then given to the model for inference, which returns the object predictions and the localization of the respective objects.


### Prerequisite for Camera Preview

Permission to obtain camera preview frames is granted in the following file:
```xml
<!-- /app/src/main/AndroidManifest.xml -->
<uses-permission android:name="android.permission.CAMERA" />
```
In order to use the camera2 APIs, add the feature below:
```xml
<uses-feature android:name="android.hardware.camera2" />
```
### Loading Model
Code snippet for the neural network connection and loading the model:
```cpp
snpe = snpeBuilder.setOutputLayers({})
.setPerformanceProfile(zdl::DlSystem::PerformanceProfile_t::BURST)
.setExecutionPriorityHint(
zdl::DlSystem::ExecutionPriorityHint_t::HIGH)
.setRuntimeProcessorOrder(runtimeList)
.setUseUserSuppliedBuffers(useUserSuppliedBuffers)
.setPlatformConfig(platformConfig)
.setInitCacheMode(useCaching)
.setCPUFallbackMode(true)
.setUnconsumedTensorsAsOutputs(true)
.build();
```
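
The `runtimeList` and the model container handed to `snpeBuilder` are not shown in this excerpt. Below is a minimal sketch of how they may be assembled with the standard SNPE C++ API; the helpers `buildRuntimeList` and `loadNetwork` and the file path passed in are illustrative, not taken from the repository:

```cpp
#include <memory>
#include <string>

#include "SNPE/SNPE.hpp"
#include "SNPE/SNPEBuilder.hpp"
#include "SNPE/SNPEFactory.hpp"
#include "DlSystem/DlEnums.hpp"
#include "DlSystem/RuntimeList.hpp"
#include "DlSystem/String.hpp"
#include "DlContainer/IDlContainer.hpp"

// Prefer the runtime selected in the UI, but only if it is actually
// available on this device; always keep CPU as the last resort.
static zdl::DlSystem::RuntimeList buildRuntimeList(zdl::DlSystem::Runtime_t preferred) {
    zdl::DlSystem::RuntimeList runtimeList;
    if (preferred != zdl::DlSystem::Runtime_t::CPU &&
        zdl::SNPE::SNPEFactory::isRuntimeAvailable(preferred)) {
        runtimeList.add(preferred);                  // e.g. DSP or GPU
    }
    runtimeList.add(zdl::DlSystem::Runtime_t::CPU);  // CPU fallback
    return runtimeList;
}

// Open the DLC from a file path and build the network with the same
// chained-builder pattern as the snippet above.
static std::unique_ptr<zdl::SNPE::SNPE> loadNetwork(const std::string &dlcPath,
                                                    zdl::DlSystem::Runtime_t preferred) {
    auto container = zdl::DlContainer::IDlContainer::open(zdl::DlSystem::String(dlcPath.c_str()));
    zdl::SNPE::SNPEBuilder snpeBuilder(container.get());
    return snpeBuilder.setRuntimeProcessorOrder(buildRuntimeList(preferred))
                      .setCPUFallbackMode(true)
                      .build();
}
```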
### Preprocessing
The bitmap image is passed as an OpenCV Mat to the native layer and converted to a BGR Mat. DLC models work with specific image sizes, so the input image must be resized to the size accepted by the selected model DLC before it is passed in.
Below is the code reference for YoloNAS preprocessing. For other models, the preprocessing may change based on the model's requirements.
```cpp
cv::Mat img320;
// Resize to the model's input size, taken from the model itself
// (320x320 for YoloNAS); dims holds the input tensor shape (NHWC).
cv::resize(img, img320, cv::Size(dims[2], dims[1]), 0, 0, cv::INTER_LINEAR);

float inputScale = 0.00392156862745f;   // 1/255, scales pixels to [0,1]

float *accumulator = reinterpret_cast<float *>(&dest_buffer[0]);

// The camera frame arrives as a 4-channel BGRA Mat; drop the alpha channel
cv::cvtColor(img320, img320, cv::COLOR_BGRA2BGR);
int lim = img320.rows * img320.cols * 3;
for (int idx = 0; idx < lim; idx++)
    accumulator[idx] = img320.data[idx] * inputScale;
```
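
`dims` is not defined in this excerpt; presumably it holds the model's input tensor shape, queried from the built SNPE object so that the resize always matches what the DLC expects. A short sketch under that assumption, using the standard SNPE C++ API:

```cpp
// Input shape of the loaded model in NHWC order, e.g. {1, 320, 320, 3}
// for YoloNAS: dims[1] is the height and dims[2] the width used above.
const auto inputShapeOpt = snpe->getInputDimensions();
const zdl::DlSystem::TensorShape inputShape = *inputShapeOpt;
const size_t *dims = inputShape.getDimensions();
```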

### Postprocessing
This includes selecting the class with the highest confidence for each box and applying Non-Max Suppression (NMS) to remove overlapping boxes.
Below is the code reference for YoloNAS postprocessing. For other models, the postprocessing may change based on the model's requirements.

```cpp
// YoloNAS emits 2100 candidate boxes, each with 80 class scores
for (int i = 0; i < 2100; i++)
{
    int start = i * 80;
    int end = (i + 1) * 80;

    // Highest-scoring class for this candidate box
    auto it = max_element(BBout_class.begin() + start, BBout_class.begin() + end);
    int index = distance(BBout_class.begin() + start, it);

    std::string classname = classnamemapping[index];
    if (*it >= 0.5)   // confidence threshold
    {
        int x1 = BBout_boxcoords[i * 4 + 0];
        int y1 = BBout_boxcoords[i * 4 + 1];
        int x2 = BBout_boxcoords[i * 4 + 2];
        int y2 = BBout_boxcoords[i * 4 + 3];
        Boxlist.push_back(BoxCornerEncoding(x1, y1, x2, y2, *it, classname));
    }
}

std::vector<BoxCornerEncoding> reslist = NonMaxSuppression(Boxlist, 0.20);
```
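
The repository's `NonMaxSuppression` implementation is not part of this diff. The sketch below is a minimal greedy IoU-based version, assuming `BoxCornerEncoding` exposes the corner fields and a `score` member matching the constructor call above:

```cpp
#include <algorithm>
#include <vector>

// Intersection-over-Union of two corner-encoded boxes.
static float IOU(const BoxCornerEncoding &a, const BoxCornerEncoding &b) {
    float ix = std::max(0.0f, std::min(a.x2, b.x2) - std::max(a.x1, b.x1));
    float iy = std::max(0.0f, std::min(a.y2, b.y2) - std::max(a.y1, b.y1));
    float inter = ix * iy;
    float areaA = (a.x2 - a.x1) * (a.y2 - a.y1);
    float areaB = (b.x2 - b.x1) * (b.y2 - b.y1);
    return inter / (areaA + areaB - inter);
}

// Greedy NMS: keep the highest-scoring box, drop any box that overlaps an
// already-kept box by more than iouThreshold, and continue down the list.
std::vector<BoxCornerEncoding> NonMaxSuppression(std::vector<BoxCornerEncoding> boxes,
                                                 float iouThreshold) {
    std::sort(boxes.begin(), boxes.end(),
              [](const BoxCornerEncoding &a, const BoxCornerEncoding &b) {
                  return a.score > b.score;   // 'score' member is assumed
              });
    std::vector<BoxCornerEncoding> kept;
    for (const auto &candidate : boxes) {
        bool suppressed = false;
        for (const auto &k : kept) {
            if (IOU(candidate, k) > iouThreshold) { suppressed = true; break; }
        }
        if (!suppressed) kept.push_back(candidate);
    }
    return kept;
}
```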
Then we scale the coordinates back to the original image:

```cpp
float top,bottom,left,right;
left = reslist[k].y1 * ratio_1; //y1
right = reslist[k].y2 * ratio_1; //y2

bottom = reslist[k].x1 * ratio_2; //x1
top = reslist[k].x2 * ratio_2; //x2
```
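
`ratio_1` and `ratio_2` are not defined in this excerpt. A plausible definition, assuming they map coordinates from the 320x320 model space back to the camera preview size (`previewWidth` and `previewHeight` are hypothetical names), would be:

```cpp
// Hypothetical scale factors from the model input size (320x320 for YoloNAS)
// back to the preview frame; the x/y swap in the snippet above reflects the
// portrait rotation of the camera frames.
float ratio_1 = previewWidth  / 320.0f;
float ratio_2 = previewHeight / 320.0f;
```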

### Drawing bounding boxes

```java
RectangleBox rbox = boxlist.get(j);
float y = rbox.left;
float y1 = rbox.right;
float x = rbox.top;
float x1 = rbox.bottom;

String fps_textLabel = "FPS: "+String.valueOf(rbox.fps);
canvas.drawText(fps_textLabel,10,70,mTextColor);

String processingTimeTextLabel= rbox.processing_time+"ms";

canvas.drawRect(x1, y, x, y1, mBorderColor);
canvas.drawText(rbox.label,x1+10, y+40, mTextColor);
canvas.drawText(processingTimeTextLabel,x1+10, y+90, mTextColor);
```

# Build and run with Android Studio

## Build APK file with Android Studio

1. Clone QIDK repo.

2. Run the script below, from the directory where it is present, to resolve the dependencies of this project.

* This will copy the snpe-release.aar file from $SNPE_ROOT to the "snpe-release" directory in the Android project.

**NOTE - If you are using SNPE version 2.11 or greater, please change the following line in resolveDependencies.sh.**
```
From: cp $SNPE_ROOT/android/snpe-release.aar snpe-release
To : cp $SNPE_ROOT/lib/android/snpe-release.aar snpe-release
```
* Download OpenCV and paste it into the sdk directory to enable OpenCV for Android Java.

```bash
bash resolveDependencies.sh
```


3. Run the Jupyter notebook GenerateDLC.ipynb to generate the quantized YoloNAS DLC. Also, **change dataset_path to the COCO dataset path**.
* This notebook generates the required DLC(s) and pastes them to the appropriate location.


4. Do a Gradle sync.
5. Compile the project.
6. The output APK file should be generated: app-debug.apk
7. Prepare the Qualcomm Innovators Development Kit to install the application (do not run the APK on an emulator).

8. If the Unsigned or Signed DSP runtime is not detected, please check the logcat logs for a FastRPC error. The DSP runtime may not be detected due to the SELinux security policy. Please try the following commands to set a permissive SELinux policy.

It is recommended to run the commands below.
```bash
adb disable-verity
adb reboot
adb root
adb remount
adb shell setenforce 0
```

9. Install and test the application: app-debug.apk
```bash
adb install -r -t app-debug.apk
```

10. Launch the application.

Following is the basic "Object Detection" Android App workflow:

1. On launching the application, the user can select the model and runtime from the home screen and then press the Start Camera button.
2. On the first launch of the camera, the user needs to grant camera permissions.
3. After the camera launches, the selected model with the chosen runtime starts loading in the background. The user will see a dialog box while the model is loading.
4. Once the model is loaded, it starts detecting objects, and a box is drawn around each object detected on the screen.
5. The user can go back to the home screen by pressing the back button, select a different model and runtime, and observe the performance difference.

Sample results of the application:

## Demo of the application
![Screenshot](.//demo/ObjectDetectYoloNAS.gif)

# References
1. SSD: Single Shot MultiBox Detector - https://arxiv.org/pdf/1512.02325.pdf
2. https://github.com/Deci-AI/super-gradients
3. https://zenodo.org/record/7789328


###### *Snapdragon and Qualcomm Neural Processing SDK are products of Qualcomm Technologies, Inc. and/or its subsidiaries.*
68 changes: 68 additions & 0 deletions ai-solutions/android/03-ObjectDetection/app/build.gradle
@@ -0,0 +1,68 @@
apply plugin: 'com.android.application'

android {
compileSdkVersion 30
buildToolsVersion "30.0.3"

defaultConfig {
applicationId "com.qcom.aistack_objdetect"
minSdkVersion 24
targetSdkVersion 30
versionCode 1
versionName "1.0"

testInstrumentationRunner "android.support.test.runner.AndroidJUnitRunner"
externalNativeBuild {
cmake {
// cppFlags ''
cppFlags "-std=c++11 -frtti -fexceptions"
arguments "-DOpenCV_DIR=" + project(':sdk').projectDir + "/native/jni",
"-DANDROID_TOOLCHAIN=clang"
// "-DANDROID_STL=c++_shared",
// "-DANDROID_ARM_NEON=TRUE"
targets "objectdetectionYoloNas"
}
ndk {
abiFilters 'arm64-v8a'
}
}
}

packagingOptions {
pickFirst 'lib/x86/libc++_shared.so'
pickFirst 'lib/x86_64/libc++_shared.so'
pickFirst 'lib/arm64-v8a/libc++_shared.so'
pickFirst 'lib/armeabi-v7a/libc++_shared.so'
}

buildTypes {
release {
minifyEnabled false
proguardFiles getDefaultProguardFile('proguard-android.txt'), 'proguard-rules.pro'
}
}

compileOptions {
sourceCompatibility JavaVersion.VERSION_1_8
targetCompatibility JavaVersion.VERSION_1_8
}
ndkVersion '21.4.7075529'
externalNativeBuild {
cmake {
path file('src/main/cpp/CMakeLists.txt')
}
}
}

dependencies {
implementation fileTree(dir: 'libs', include: ['*.jar'])
implementation project(path: ':sdk')
testImplementation 'junit:junit:4.12'
androidTestImplementation 'com.android.support.test.espresso:espresso-core:3.0.1'
androidTestImplementation 'com.android.support.test.espresso:espresso-contrib:3.0.1'
implementation 'com.android.support:design:26.0.0'
implementation 'com.android.support:support-v4:26.0.0'



}
8 changes: 8 additions & 0 deletions ai-solutions/android/03-ObjectDetection/app/local.properties
@@ -0,0 +1,8 @@
## This file must *NOT* be checked into Version Control Systems,
# as it contains information specific to your local configuration.
#
# Location of the SDK. This is only used by Gradle.
# For customization when using a Version Control System, please read the
# header note.
#Sat Jan 07 01:53:02 IST 2023
sdk.dir=C\:\\Users\\shubgoya\\AppData\\Local\\Android\\Sdk
21 changes: 21 additions & 0 deletions ai-solutions/android/03-ObjectDetection/app/proguard-rules.pro
@@ -0,0 +1,21 @@
# Add project specific ProGuard rules here.
# You can control the set of applied configuration files using the
# proguardFiles setting in build.gradle.
#
# For more details, see
# http://developer.android.com/guide/developing/tools/proguard.html

# If your project uses WebView with JS, uncomment the following
# and specify the fully qualified class name to the JavaScript interface
# class:
#-keepclassmembers class fqcn.of.javascript.interface.for.webview {
# public *;
#}

# Uncomment this to preserve the line number information for
# debugging stack traces.
#-keepattributes SourceFile,LineNumberTable

# If you keep the line number information, uncomment this to
# hide the original source file name.
#-renamesourcefileattribute SourceFile
38 changes: 38 additions & 0 deletions ai-solutions/android/03-ObjectDetection/app/src/main/AndroidManifest.xml
@@ -0,0 +1,38 @@
<?xml version="1.0" encoding="utf-8"?>
<manifest xmlns:android="http://schemas.android.com/apk/res/android"
package="com.qcom.aistack_objdetect">

<uses-permission android:name="android.permission.CAMERA" />
<uses-feature android:name="android.hardware.camera2" />
<!--<uses-permission android:name="android.permission.MANAGE_EXTERNAL_STORAGE" />-->
<application
android:configChanges="orientation|screenSize"
android:extractNativeLibs="true"
android:allowBackup="true"
android:hardwareAccelerated="true"
android:icon="@mipmap/ic_launcher"
android:label="@string/app_name"
android:roundIcon="@mipmap/ic_launcher_round"
android:supportsRtl="true"
android:theme="@style/AppTheme">

<uses-native-library
android:name="libcdsprpc.so"
android:required="true"/>


<activity android:name="com.qcom.aistack_objdetect.HomeScreenActivity"
android:screenOrientation="portrait"
android:exported="true">
<intent-filter>
<action android:name="android.intent.action.MAIN" />

<category android:name="android.intent.category.LAUNCHER" />
</intent-filter>
</activity>
<activity android:name="com.qcom.aistack_objdetect.MainActivity"
android:screenOrientation="portrait">
</activity>
</application>

</manifest>
@@ -0,0 +1 @@
Generate model DLC and place here