SAHI with Detectron2 for Sliced Inference

0. Preparation

Install the latest versions of SAHI and Detectron2:

In [ ]:

# flake8: noqa: E501
!pip install -U sahi
!pip install detectron2 -f https://dl.fbaipublicfiles.com/detectron2/wheels/cpu/torch1.10/index.html # for Detectron2-cpu
#!pip install detectron2 -f https://dl.fbaipublicfiles.com/detectron2/wheels/cu111/torch1.10/index.html # for Detectron2-cuda11.1

In [1]:

import os

os.getcwd()

Out[1]:

'/home/fatihakyon/dev/obss/sahi/demo'

Import required modules:

In [2]:

# used to display the exported visualizations inline
from IPython.display import Image

# import required functions, classes
from sahi import AutoDetectionModel
from sahi.predict import get_prediction, get_sliced_prediction, predict
from sahi.utils.cv import read_image
from sahi.utils.detectron2 import Detectron2TestConstants
from sahi.utils.file import download_from_url

In [3]:

# set detectron2 fasterrcnn model zoo name
model_path = Detectron2TestConstants.FASTERCNN_MODEL_ZOO_NAME

# download test images into demo_data folder
download_from_url(
    "https://raw.githubusercontent.com/obss/sahi/main/demo/demo_data/small-vehicles1.jpeg",
    "demo_data/small-vehicles1.jpeg",
)
download_from_url(
    "https://raw.githubusercontent.com/obss/sahi/main/demo/demo_data/terrain2.png", "demo_data/terrain2.png"
)

1. Standard Inference with a Detectron2 Model

Instantiate a detection model by specifying the model weight path, the config path, and other parameters:

In [4]:

detection_model = AutoDetectionModel.from_pretrained(
    model_type="detectron2",
    model_path=model_path,
    config_path=model_path,
    confidence_threshold=0.5,
    image_size=640,
    device="cpu",  # or 'cuda:0'
)

09/27/2022 17:44:09 - INFO - fvcore.common.checkpoint -   [Checkpointer] Loading from https://dl.fbaipublicfiles.com/detectron2/COCO-Detection/faster_rcnn_R_50_FPN_3x/137849458/model_final_280758.pkl ...
09/27/2022 17:44:09 - INFO - iopath.common.file_io -   URL https://dl.fbaipublicfiles.com/detectron2/COCO-Detection/faster_rcnn_R_50_FPN_3x/137849458/model_final_280758.pkl cached in /home/fatihakyon/.torch/iopath_cache/detectron2/COCO-Detection/faster_rcnn_R_50_FPN_3x/137849458/model_final_280758.pkl
09/27/2022 17:44:09 - INFO - fvcore.common.checkpoint -   Reading a file from 'Detectron2 Model Zoo'
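
The device argument above is hard-coded to "cpu". As a minimal sketch, a small helper can pick the device at runtime instead. The name pick_device is hypothetical and not part of SAHI; the only library call it relies on is torch.cuda.is_available(), and it falls back to "cpu" when PyTorch cannot be imported:

```python
def pick_device(preferred: str = "cuda:0") -> str:
    """Return `preferred` if PyTorch reports a usable CUDA device, else "cpu".

    Hypothetical helper for illustration; not part of SAHI or Detectron2.
    """
    try:
        import torch  # Detectron2 requires PyTorch, so this is normally present
    except ImportError:
        return "cpu"
    return preferred if torch.cuda.is_available() else "cpu"


device = pick_device()
print(device)
```

The returned string could then be passed as device=device when instantiating the model.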

Run prediction by passing an image path and a DetectionModel instance to get_prediction:

In [ ]:

result = get_prediction("demo_data/small-vehicles1.jpeg", detection_model)

Alternatively, pass a numpy image and a DetectionModel instance to get_prediction:

In [6]:

result = get_prediction(read_image("demo_data/small-vehicles1.jpeg"), detection_model)

Visualize predicted bounding boxes and masks over the original image:

In [7]:

result.export_visuals(export_dir="demo_data/")

Image("demo_data/prediction_visual.png")

Out[7]:

[Image: demo_data/prediction_visual.png, predictions drawn over the original image]

2. Sliced Inference with a Detectron2 Model

To perform sliced prediction, we need to specify the slice parameters. In this example, we run prediction over 256x256-pixel slices with an overlap ratio of 0.2 in both directions:

In [8]:

result = get_sliced_prediction(
    "demo_data/small-vehicles1.jpeg",
    detection_model,
    slice_height=256,
    slice_width=256,
    overlap_height_ratio=0.2,
    overlap_width_ratio=0.2,
)

Performing prediction on 15 number of slices.
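
To see how the slice size and overlap ratio turn into a grid of windows, the following is a rough sketch of one way to compute overlapping slice boxes. It is an illustration only and not necessarily SAHI's exact implementation; the function name slice_boxes is hypothetical, and the 1068x580 image size is an assumption about small-vehicles1.jpeg, so substitute the real dimensions of your image:

```python
from typing import List, Tuple


def slice_boxes(
    image_width: int,
    image_height: int,
    slice_width: int,
    slice_height: int,
    overlap_width_ratio: float,
    overlap_height_ratio: float,
) -> List[Tuple[int, int, int, int]]:
    """Return (x_min, y_min, x_max, y_max) windows covering the image.

    Hypothetical helper: consecutive windows overlap by the given ratio, and
    windows that would cross the border are shifted back inside the image.
    """
    x_step = slice_width - int(slice_width * overlap_width_ratio)
    y_step = slice_height - int(slice_height * overlap_height_ratio)
    if x_step <= 0 or y_step <= 0:
        raise ValueError("overlap ratios must be smaller than 1")

    boxes = []
    y_start = 0
    while True:
        y_max = min(y_start + slice_height, image_height)
        y_min = max(0, y_max - slice_height)
        x_start = 0
        while True:
            x_max = min(x_start + slice_width, image_width)
            x_min = max(0, x_max - slice_width)
            boxes.append((x_min, y_min, x_max, y_max))
            if x_max >= image_width:
                break
            x_start += x_step
        if y_max >= image_height:
            break
        y_start += y_step
    return boxes


# Assumed image size of 1068x580 pixels (width x height).
boxes = slice_boxes(1068, 580, 256, 256, 0.2, 0.2)
print(len(boxes))  # 15
```

Under the assumed image size, this grid has 5 columns and 3 rows, which gives the same slice count as the log line above.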

Visualize predicted bounding boxes and masks over the original image:

In [9]:

result.export_visuals(export_dir="demo_data/")

Image("demo_data/prediction_visual.png")

Out[9]:

3. Prediction Result

Predictions are returned as a sahi.prediction.PredictionResult. You can access the list of object predictions as follows:

In [10]:

object_prediction_list = result.object_prediction_list

In [11]:

object_prediction_list[0]

Out[11]:

ObjectPrediction<
    bbox: BoundingBox: <(656, 197, 671, 215), w: 15, h: 18>,
    mask: None,
    score: PredictionScore: <value: 0.9950496554374695>,
    category: Category: <id: 2, name: car>>

ObjectPredictions can be converted to COCO annotation format:

In [9]:

result.to_coco_annotations()[:3]

Out[9]:

[{'image_id': None,
  'bbox': [656, 197, 15, 18],
  'score': 0.9950494170188904,
  'category_id': 2,
  'category_name': 'car',
  'segmentation': [],
  'iscrowd': 0,
  'area': 270},
 {'image_id': None,
  'bbox': [446, 308, 49, 34],
  'score': 0.9942395687103271,
  'category_id': 2,
  'category_name': 'car',
  'segmentation': [],
  'iscrowd': 0,
  'area': 1666},
 {'image_id': None,
  'bbox': [759, 231, 22, 18],
  'score': 0.9921348094940186,
  'category_id': 2,
  'category_name': 'car',
  'segmentation': [],
  'iscrowd': 0,
  'area': 396}]
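
Note that the bounding box printed for the ObjectPrediction above is in (x_min, y_min, x_max, y_max) form, while the COCO 'bbox' field is [x_min, y_min, width, height]. A minimal sketch of that conversion, using a hypothetical helper name (xyxy_to_coco is not a SAHI function) and the first detection from the output above:

```python
from typing import List, Sequence, Tuple


def xyxy_to_coco(box: Sequence[int]) -> Tuple[List[int], int]:
    """Convert (x_min, y_min, x_max, y_max) to a COCO [x, y, w, h] box and its area."""
    x_min, y_min, x_max, y_max = box
    width = x_max - x_min
    height = y_max - y_min
    return [x_min, y_min, width, height], width * height


# First detection from the ObjectPrediction output above.
bbox, area = xyxy_to_coco((656, 197, 671, 215))
print(bbox, area)  # [656, 197, 15, 18] 270
```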

ObjectPredictions can be converted to COCO prediction format:

In [12]:

result.to_coco_predictions(image_id=1)[:3]

Out[12]:

[{'image_id': 1,
  'bbox': [656, 197, 15, 18],
  'score': 0.9950496554374695,
  'category_id': 2,
  'category_name': 'car',
  'segmentation': [],
  'iscrowd': 0,
  'area': 270},
 {'image_id': 1,
  'bbox': [446, 308, 49, 34],
  'score': 0.9942396879196167,
  'category_id': 2,
  'category_name': 'car',
  'segmentation': [],
  'iscrowd': 0,
  'area': 1666},
 {'image_id': 1,
  'bbox': [759, 231, 22, 18],
  'score': 0.9921349287033081,
  'category_id': 2,
  'category_name': 'car',
  'segmentation': [],
  'iscrowd': 0,
  'area': 396}]
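
Because the COCO predictions are plain dictionaries, they can be post-processed with ordinary Python. The sketch below filters by score and category and serializes the result to JSON. The helper name filter_predictions and the 0.993 threshold are illustrative assumptions; the sample records are copied from the output above with some keys omitted:

```python
import json
from typing import Dict, List, Optional

# Sample records copied from the output above (some keys omitted for brevity).
predictions = [
    {"image_id": 1, "bbox": [656, 197, 15, 18], "score": 0.9950496554374695, "category_name": "car"},
    {"image_id": 1, "bbox": [446, 308, 49, 34], "score": 0.9942396879196167, "category_name": "car"},
    {"image_id": 1, "bbox": [759, 231, 22, 18], "score": 0.9921349287033081, "category_name": "car"},
]


def filter_predictions(
    items: List[Dict], min_score: float = 0.0, category_name: Optional[str] = None
) -> List[Dict]:
    """Keep records with score >= min_score and, if given, a matching category name."""
    return [
        item
        for item in items
        if item["score"] >= min_score
        and (category_name is None or item["category_name"] == category_name)
    ]


confident_cars = filter_predictions(predictions, min_score=0.993, category_name="car")
results_json = json.dumps(confident_cars, indent=2)  # JSON text, ready to write to a file
print(len(confident_cars))  # 2
```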

ObjectPredictions can be converted to imantics annotation format:

In [ ]:

!pip install -U imantics

In [13]:

result.to_imantics_annotations()[:3]

Out[13]:

[<imantics.annotation.Annotation at 0x7fce3bab95e0>,
 <imantics.annotation.Annotation at 0x7fce30371880>,
 <imantics.annotation.Annotation at 0x7fce30371f40>]

4. Batch Prediction

Set model and directory parameters:

In [14]:

model_type = "detectron2"
model_path = Detectron2TestConstants.FASTERCNN_MODEL_ZOO_NAME  # same model zoo name as above
model_config_path = model_path
model_device = "cpu"  # or 'cuda:0'
model_confidence_threshold = 0.5

slice_height = 480
slice_width = 480
overlap_height_ratio = 0.2
overlap_width_ratio = 0.2

source_image_dir = "demo_data/"

Perform sliced inference on all images in the given folder:

In [15]:

predict(
    model_type=model_type,
    model_path=model_path,
    model_config_path=model_config_path,
    model_device=model_device,
    model_confidence_threshold=model_confidence_threshold,
    source=source_image_dir,
    slice_height=slice_height,
    slice_width=slice_width,
    overlap_height_ratio=overlap_height_ratio,
    overlap_width_ratio=overlap_width_ratio,
)

There are 3 listed files in folder: demo_data/

09/27/2022 17:45:01 - INFO - fvcore.common.checkpoint -   [Checkpointer] Loading from https://dl.fbaipublicfiles.com/detectron2/COCO-Detection/faster_rcnn_R_50_FPN_3x/137849458/model_final_280758.pkl ...
09/27/2022 17:45:01 - INFO - fvcore.common.checkpoint -   Reading a file from 'Detectron2 Model Zoo'
Performing inference on images:   0%|          | 0/3 [00:00<?, ?it/s]

Performing prediction on 6 number of slices.

Performing inference on images:  33%|███▎      | 1/3 [00:01<00:03,  1.66s/it]

Prediction time is: 1628.41 ms
Performing prediction on 6 number of slices.

Performing inference on images:  67%|██████▋   | 2/3 [00:02<00:01,  1.14s/it]

Prediction time is: 728.87 ms
Performing prediction on 6 number of slices.

Performing inference on images: 100%|██████████| 3/3 [00:03<00:00,  1.09s/it]

Prediction time is: 762.43 ms
Prediction results are successfully exported to runs/predict/exp18
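
The export directory in the last log line carries a numeric suffix (exp18 here). As a rough sketch, the newest run directory can be located programmatically. This assumes runs are named "exp", "exp2", "exp3", and so on under runs/predict, which is inferred from the log line rather than confirmed; latest_run_dir is a hypothetical helper, not a SAHI function:

```python
import re
import tempfile
from pathlib import Path
from typing import Optional


def latest_run_dir(root: str = "runs/predict", prefix: str = "exp") -> Optional[Path]:
    """Return the run directory with the highest numeric suffix, or None if there is none.

    Assumes directory names of the form "exp", "exp2", "exp3", ... (an assumption).
    """
    root_path = Path(root)
    if not root_path.is_dir():
        return None
    pattern = re.compile(rf"^{re.escape(prefix)}(\d*)$")
    best, best_index = None, -1
    for child in root_path.iterdir():
        match = pattern.match(child.name)
        if child.is_dir() and match:
            index = int(match.group(1) or 0)  # bare "exp" counts as index 0
            if index > best_index:
                best, best_index = child, index
    return best


# Demonstration on a temporary directory instead of a real runs/ folder.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("exp", "exp2", "exp18"):
        (Path(tmp) / name).mkdir()
    newest_name = latest_run_dir(tmp).name
print(newest_name)  # exp18
```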