-
Notifications
You must be signed in to change notification settings - Fork 0
Leverage OpenCV (cv2) to build an object detection system. This system would combine a pre-trained MobileNet model, trained on the COCO dataset, with a frozen inference graph for real-time object identification within images or videos.
License
abeed04/Object-detection-model-using-Cv2
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
🔭 This project demonstrates real-time object detection using OpenCV (cv2), the MobileNet deep learning model, and the COCO dataset. It leverages a pre-trained frozen inference graph for efficient object identification within images or video streams. -Python 3.x
-OpenCV (cv2) library
-Pre-trained MobileNet, COCO model files and Frozen model (download them to local pc)
-The program automatically attempts to open the video file Streets.mp4(You can upload your own videos). If the file is unavailable, it defaults to your webcam stream.
-A window titled "Object Detection Program" will display the video feed with detected objects highlighted.
-Bounding boxes will be drawn around detected objects, and their corresponding class labels will be displayed.(Note:Only Objects that are labeled in labels.txt file are displayed)
-You can adjust the confidence threshold (confThreshold=0.55) in the code to control the minimum confidence required for a detection to be displayed.
-Press the 'q' key to exit the program. -The script imports necessary modules, including OpenCV (cv2) -The model configuration file (mobilenet_and_coco.txt) and the frozen inference graph (frozen_model.pb) are loaded using cv2.dnn_DetectionModel.
-Class labels are read from labels.txt and stored in a list class_labels. -The model's input size is set to 200x200 pixels.
-The input scale and mean values are adjusted for normalization.
-The input color channel order is swapped to BGR (OpenCV's default) if necessary. -A video capture object cap is created, attempting to open the video file Streets.mp4 first. It then falls back to the webcam if the file is not found.
-Error handling is included to raise an exception if video capture fails. -The loop continuously reads frames from the video stream using cap.read().
-The frame is resized to match the model's expected input size using OpenCV's resizing functions.
-model.detect is used for object detection in the frame. It returns class indices, confidence scores, and bounding boxes for detected objects that meet the specified confidence threshold.
-For each detected object:
1)A bounding box is drawn around the object using cv2.rectangle.
2)The corresponding class label from class_labels is retrieved and displayed on the frame using cv2.putText.
-The processed frame is displayed in the "Object Detection Program" window.
-The program exits when the 'q' key is pressed. You can modify the script to: -The video capture object is released using cap.release().
-OpenCV windows are destroyed using cv2.destroyAllWindows(). -Use a different video file or image set for object detection.
-Experiment with different confidence thresholds (confThreshold) to adjust the detection sensitivity.
-Explore ways to integrate this object detection functionality into a larger application. -The pre-trained MobileNet model may not be ideal for all object detection tasks. Consider exploring other models or fine-tuning the provided model on your specific dataset for improved accuracy.
-The script currently processes frames one by one. For more efficient real-time performance, you might investigate techniques like multithreading or GPU acceleration.
About
Leverage OpenCV (cv2) to build an object detection system. This system would combine a pre-trained MobileNet model, trained on the COCO dataset, with a frozen inference graph for real-time object identification within images or videos.
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published