Annotation¶

`sahi.annotation` ¶

Annotation classes for object detection.

Contains classes for handling bounding boxes, categories, masks, and annotations.

Classes¶

`BoundingBox` `dataclass` ¶

BoundingBox represents a rectangular region in 2D space, typically used for object detection annotations.

Attributes:

Name	Type	Description
`box`	`Tuple[float, float, float, float]`	The bounding box coordinates in the format (minx, miny, maxx, maxy). - minx (float): Minimum x-coordinate (left). - miny (float): Minimum y-coordinate (top). - maxx (float): Maximum x-coordinate (right). - maxy (float): Maximum y-coordinate (bottom).
`shift_amount`	`Tuple[int, int]`	The amount to shift the bounding box in the x and y directions. Defaults to (0, 0).

BoundingBox Usage Example

bbox = BoundingBox((10.0, 20.0, 50.0, 80.0))
area = bbox.area
expanded_bbox = bbox.get_expanded_box(ratio=0.2)
shifted_bbox = bbox.get_shifted_box()
coco_format = bbox.to_coco_bbox()

Source code in sahi/annotation.py

@dataclass(frozen=True)
class BoundingBox:
    """BoundingBox represents a rectangular region in 2D space, typically used for object detection annotations.

    Attributes:
        box (Tuple[float, float, float, float]): The bounding box coordinates in the format (minx, miny, maxx, maxy).
            - minx (float): Minimum x-coordinate (left).
            - miny (float): Minimum y-coordinate (top).
            - maxx (float): Maximum x-coordinate (right).
            - maxy (float): Maximum y-coordinate (bottom).
        shift_amount (Tuple[int, int], optional): The amount to shift the bounding box in the x and y directions.
            Defaults to (0, 0).

    !!! example "BoundingBox Usage Example"
        ```python
        bbox = BoundingBox((10.0, 20.0, 50.0, 80.0))
        area = bbox.area
        expanded_bbox = bbox.get_expanded_box(ratio=0.2)
        shifted_bbox = bbox.get_shifted_box()
        coco_format = bbox.to_coco_bbox()
        ```
    """

    box: tuple[float, float, float, float] | list[float] | list[int]
    shift_amount: tuple[int, int] = (0, 0)

    def __post_init__(self) -> None:
        """Validate bounding box coordinates and shift amount."""
        if len(self.box) != 4 or any(coord < 0 for coord in self.box):
            raise ValueError("box must be 4 non-negative floats: [minx, miny, maxx, maxy]")
        if len(self.shift_amount) != 2:
            raise ValueError("shift_amount must be 2 integers: [shift_x, shift_y]")

    @property
    def minx(self) -> float:
        """Return minimum x-coordinate."""
        return self.box[0]

    @property
    def miny(self) -> float:
        """Return minimum y-coordinate."""
        return self.box[1]

    @property
    def maxx(self) -> float:
        """Return maximum x-coordinate."""
        return self.box[2]

    @property
    def maxy(self) -> float:
        """Return maximum y-coordinate."""
        return self.box[3]

    @property
    def shift_x(self) -> int:
        """Return x-coordinate shift."""
        return self.shift_amount[0]

    @property
    def shift_y(self) -> int:
        """Return y-coordinate shift."""
        return self.shift_amount[1]

    @property
    def area(self) -> float:
        """Return bounding box area."""
        return (self.maxx - self.minx) * (self.maxy - self.miny)

    def get_expanded_box(self, ratio: float = 0.1, max_x: int | None = None, max_y: int | None = None) -> BoundingBox:
        """Get an expanded bounding box by increasing its size by a given ratio.

        The expansion is applied equally in all directions. Optionally, the expanded box can be
        clipped to maximum x and y boundaries.

        Args:
            ratio (float, optional): The proportion by which to expand the box size.
                Default is 0.1 (10%).
            max_x (int, optional): The maximum allowed x-coordinate for the expanded box.
                If None, no maximum is applied.
            max_y (int, optional): The maximum allowed y-coordinate for the expanded box.
                If None, no maximum is applied.

        Returns:
            BoundingBox: A new BoundingBox instance representing the expanded box.
        """
        w = self.maxx - self.minx
        h = self.maxy - self.miny
        y_mar = int(h * ratio)
        x_mar = int(w * ratio)
        maxx = min(max_x, self.maxx + x_mar) if max_x else self.maxx + x_mar
        minx = max(0, self.minx - x_mar)
        maxy = min(max_y, self.maxy + y_mar) if max_y else self.maxy + y_mar
        miny = max(0, self.miny - y_mar)
        box: list[float] = [minx, miny, maxx, maxy]
        return BoundingBox(box)

    def to_xywh(self) -> list[float]:
        """Convert to [xmin, ymin, width, height] format.

        Returns:
            list[float]: A list containing the bounding box in the format [xmin, ymin, width, height].
        """
        return [self.minx, self.miny, self.maxx - self.minx, self.maxy - self.miny]

    def to_coco_bbox(self) -> list[float]:
        """Convert to COCO format: [xmin, ymin, width, height].

        Returns:
            list[float]: A list containing the bounding box in COCO format.
        """
        return self.to_xywh()

    def to_xyxy(self) -> list[float]:
        """Convert to [xmin, ymin, xmax, ymax] format.

        Returns:
            list[float]: A list containing the bounding box in the format [xmin, ymin, xmax, ymax].
        """
        return [self.minx, self.miny, self.maxx, self.maxy]

    def to_voc_bbox(self) -> list[float]:
        """Convert to VOC format: [xmin, ymin, xmax, ymax].

        Returns:
            list[float]: A list containing the bounding box in VOC format.
        """
        return self.to_xyxy()

    def get_shifted_box(self) -> BoundingBox:
        """Get shifted BoundingBox.

        Returns:
            BoundingBox: A new BoundingBox instance representing the shifted box.
        """
        box = [
            self.minx + self.shift_x,
            self.miny + self.shift_y,
            self.maxx + self.shift_x,
            self.maxy + self.shift_y,
        ]
        return BoundingBox(box)

    def __repr__(self) -> str:
        """Return string representation of bounding box."""
        return (
            f"BoundingBox: <{(self.minx, self.miny, self.maxx, self.maxy)}, "
            f"w: {self.maxx - self.minx}, h: {self.maxy - self.miny}>"
        )

Attributes¶

`area` `property` ¶

Return bounding box area.

`maxx` `property` ¶

Return maximum x-coordinate.

`maxy` `property` ¶

Return maximum y-coordinate.

`minx` `property` ¶

Return minimum x-coordinate.

`miny` `property` ¶

Return minimum y-coordinate.

`shift_x` `property` ¶

Return x-coordinate shift.

`shift_y` `property` ¶

Return y-coordinate shift.

Functions¶

`__post_init__()` ¶

Validate bounding box coordinates and shift amount.

Source code in sahi/annotation.py

def __post_init__(self) -> None:
    """Validate bounding box coordinates and shift amount."""
    if len(self.box) != 4 or any(coord < 0 for coord in self.box):
        raise ValueError("box must be 4 non-negative floats: [minx, miny, maxx, maxy]")
    if len(self.shift_amount) != 2:
        raise ValueError("shift_amount must be 2 integers: [shift_x, shift_y]")

`repr()` ¶

Return string representation of bounding box.

Source code in sahi/annotation.py

def __repr__(self) -> str:
    """Return string representation of bounding box."""
    return (
        f"BoundingBox: <{(self.minx, self.miny, self.maxx, self.maxy)}, "
        f"w: {self.maxx - self.minx}, h: {self.maxy - self.miny}>"
    )

`get_expanded_box(ratio=0.1, max_x=None, max_y=None)` ¶

Get an expanded bounding box by increasing its size by a given ratio.

The expansion is applied equally in all directions. Optionally, the expanded box can be clipped to maximum x and y boundaries.

Parameters:

Name	Type	Description	Default
`ratio` ¶	`float`	The proportion by which to expand the box size. Default is 0.1 (10%).	`0.1`
`max_x` ¶	`int`	The maximum allowed x-coordinate for the expanded box. If None, no maximum is applied.	`None`
`max_y` ¶	`int`	The maximum allowed y-coordinate for the expanded box. If None, no maximum is applied.	`None`

Returns:

Name	Type	Description
`BoundingBox`	`BoundingBox`	A new BoundingBox instance representing the expanded box.

Source code in sahi/annotation.py

def get_expanded_box(self, ratio: float = 0.1, max_x: int | None = None, max_y: int | None = None) -> BoundingBox:
    """Get an expanded bounding box by increasing its size by a given ratio.

    The expansion is applied equally in all directions. Optionally, the expanded box can be
    clipped to maximum x and y boundaries.

    Args:
        ratio (float, optional): The proportion by which to expand the box size.
            Default is 0.1 (10%).
        max_x (int, optional): The maximum allowed x-coordinate for the expanded box.
            If None, no maximum is applied.
        max_y (int, optional): The maximum allowed y-coordinate for the expanded box.
            If None, no maximum is applied.

    Returns:
        BoundingBox: A new BoundingBox instance representing the expanded box.
    """
    w = self.maxx - self.minx
    h = self.maxy - self.miny
    y_mar = int(h * ratio)
    x_mar = int(w * ratio)
    maxx = min(max_x, self.maxx + x_mar) if max_x else self.maxx + x_mar
    minx = max(0, self.minx - x_mar)
    maxy = min(max_y, self.maxy + y_mar) if max_y else self.maxy + y_mar
    miny = max(0, self.miny - y_mar)
    box: list[float] = [minx, miny, maxx, maxy]
    return BoundingBox(box)

`get_shifted_box()` ¶

Get shifted BoundingBox.

Returns:

Name	Type	Description
`BoundingBox`	`BoundingBox`	A new BoundingBox instance representing the shifted box.

Source code in sahi/annotation.py

def get_shifted_box(self) -> BoundingBox:
    """Get shifted BoundingBox.

    Returns:
        BoundingBox: A new BoundingBox instance representing the shifted box.
    """
    box = [
        self.minx + self.shift_x,
        self.miny + self.shift_y,
        self.maxx + self.shift_x,
        self.maxy + self.shift_y,
    ]
    return BoundingBox(box)

`to_coco_bbox()` ¶

Convert to COCO format: [xmin, ymin, width, height].

Returns:

Type	Description
`list[float]`	list[float]: A list containing the bounding box in COCO format.

Source code in sahi/annotation.py

def to_coco_bbox(self) -> list[float]:
    """Convert to COCO format: [xmin, ymin, width, height].

    Returns:
        list[float]: A list containing the bounding box in COCO format.
    """
    return self.to_xywh()

`to_voc_bbox()` ¶

Convert to VOC format: [xmin, ymin, xmax, ymax].

Returns:

Type	Description
`list[float]`	list[float]: A list containing the bounding box in VOC format.

Source code in sahi/annotation.py

def to_voc_bbox(self) -> list[float]:
    """Convert to VOC format: [xmin, ymin, xmax, ymax].

    Returns:
        list[float]: A list containing the bounding box in VOC format.
    """
    return self.to_xyxy()

`to_xywh()` ¶

Convert to [xmin, ymin, width, height] format.

Returns:

Type	Description
`list[float]`	list[float]: A list containing the bounding box in the format [xmin, ymin, width, height].

Source code in sahi/annotation.py

def to_xywh(self) -> list[float]:
    """Convert to [xmin, ymin, width, height] format.

    Returns:
        list[float]: A list containing the bounding box in the format [xmin, ymin, width, height].
    """
    return [self.minx, self.miny, self.maxx - self.minx, self.maxy - self.miny]

`to_xyxy()` ¶

Convert to [xmin, ymin, xmax, ymax] format.

Returns:

Type	Description
`list[float]`	list[float]: A list containing the bounding box in the format [xmin, ymin, xmax, ymax].

Source code in sahi/annotation.py

def to_xyxy(self) -> list[float]:
    """Convert to [xmin, ymin, xmax, ymax] format.

    Returns:
        list[float]: A list containing the bounding box in the format [xmin, ymin, xmax, ymax].
    """
    return [self.minx, self.miny, self.maxx, self.maxy]

`Category` `dataclass` ¶

Category of the annotation.

Attributes:

Name	Type	Description
`id`	`int`	Unique identifier for the category.
`name`	`str`	Name of the category.

Source code in sahi/annotation.py

@dataclass(frozen=True)
class Category:
    """Category of the annotation.

    Attributes:
        id (int): Unique identifier for the category.
        name (str): Name of the category.
    """

    id: int
    name: str

    def __post_init__(self) -> None:
        """Validate category id and name types."""
        if not isinstance(self.id, int):
            raise TypeError("id should be integer")
        if not isinstance(self.name, str):
            raise TypeError("name should be string")

    def __repr__(self) -> str:
        """Return string representation of category."""
        return f"Category: <id: {self.id}, name: {self.name}>"

Functions¶

`__post_init__()` ¶

Validate category id and name types.

Source code in sahi/annotation.py

def __post_init__(self) -> None:
    """Validate category id and name types."""
    if not isinstance(self.id, int):
        raise TypeError("id should be integer")
    if not isinstance(self.name, str):
        raise TypeError("name should be string")

`repr()` ¶

Return string representation of category.

Source code in sahi/annotation.py

def __repr__(self) -> str:
    """Return string representation of category."""
    return f"Category: <id: {self.id}, name: {self.name}>"

`Mask` ¶

Init Mask from coco segmentation representation.

Parameters:

Name	Type	Description	Default
`segmentation` ¶	`list[list[float]] \| ndarray`	List[List] [ [x1, y1, x2, y2, x3, y3, ...], [x1, y1, x2, y2, x3, y3, ...], ... ]	required
`full_shape` ¶	`list[int] \| list[int \| float] \| None`	List[int] Size of the full image, should be in the form of [height, width]	required
`shift_amount` ¶	`list[int] \| list[int \| float]`	List[int] To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`[0, 0]`

Source code in sahi/annotation.py

class Mask:
    """Init Mask from coco segmentation representation.

    Args:
        segmentation: List[List]
            [
                [x1, y1, x2, y2, x3, y3, ...],
                [x1, y1, x2, y2, x3, y3, ...],
                ...
            ]
        full_shape: List[int]
            Size of the full image, should be in the form of [height, width]
        shift_amount: List[int]
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y]
    """

    def __init__(
        self,
        segmentation: list[list[float]] | np.ndarray,
        full_shape: list[int] | list[int | float] | None,
        shift_amount: list[int] | list[int | float] = [0, 0],
    ) -> None:
        """Initialize Mask object."""
        if full_shape is None:
            raise ValueError("full_shape must be provided")  # pyright: ignore[reportUnreachable]

        self.shift_x = int(shift_amount[0])
        self.shift_y = int(shift_amount[1])
        self.full_shape_height = int(full_shape[0])
        self.full_shape_width = int(full_shape[1])
        # Ensure segmentation is a list
        if isinstance(segmentation, np.ndarray):
            self.segmentation: list[list[float]] = segmentation.tolist()
        else:
            self.segmentation = segmentation

    @classmethod
    def from_float_mask(
        cls,
        mask: np.ndarray,
        full_shape: list[int],
        mask_threshold: float = 0.5,
        shift_amount: list[int] | None = None,
    ) -> Mask:
        """Create Mask from float mask array.

        Args:
            mask: np.ndarray of np.float elements
                Mask values between 0 and 1 (should have a shape of height*width)
            mask_threshold: float
                Value to threshold mask pixels between 0 and 1
            shift_amount: List
                To shift the box and mask predictions from sliced image
                to full sized image, should be in the form of [shift_x, shift_y]
            full_shape: List[int]
                Size of the full image after shifting, should be in the form of [height, width].
        """
        bool_mask = mask > mask_threshold
        if shift_amount is None:
            shift_amount = [0, 0]
        return cls(
            segmentation=get_coco_segmentation_from_bool_mask(bool_mask),
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

    @classmethod
    def from_bool_mask(
        cls,
        bool_mask: np.ndarray,
        full_shape: list[int],
        shift_amount: list[int] | None = None,
    ) -> Mask:
        """Create Mask from boolean mask array.

        Args:
            bool_mask: np.ndarray with bool elements
                2D mask of object, should have a shape of height*width
            full_shape: List[int]
                Size of the full image, should be in the form of [height, width]
            shift_amount: List[int]
                To shift the box and mask predictions from sliced image to full
                sized image, should be in the form of [shift_x, shift_y].
        """
        if shift_amount is None:
            shift_amount = [0, 0]
        return cls(
            segmentation=get_coco_segmentation_from_bool_mask(bool_mask),
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

    @property
    def bool_mask(self) -> np.ndarray:
        """Return boolean mask representation."""
        # Ensure segmentation is list[list[float]] for the utility function
        seg = self.segmentation
        if isinstance(seg, np.ndarray):
            seg = seg.tolist()
        return get_bool_mask_from_coco_segmentation(seg, width=self.full_shape_width, height=self.full_shape_height)

    @property
    def shape(self) -> list[int]:
        """Returns mask shape as [height, width]."""
        return [self.bool_mask.shape[0], self.bool_mask.shape[1]]

    @property
    def full_shape(self) -> list[int]:
        """Returns full mask shape after shifting as [height, width]."""
        return [self.full_shape_height, self.full_shape_width]

    @property
    def shift_amount(self) -> list[int]:
        """Returns the shift amount of the mask slice as [shift_x, shift_y]."""
        return [self.shift_x, self.shift_y]

    def get_shifted_mask(self) -> Mask:
        """Return shifted mask."""
        # Confirm full_shape is specified
        if (self.full_shape_height is None) or (self.full_shape_width is None):
            raise ValueError("full_shape is None")
        shifted_segmentation = []
        for s in self.segmentation:
            xs = [min(self.shift_x + s[i], self.full_shape_width) for i in range(0, len(s) - 1, 2)]
            ys = [min(self.shift_y + s[i], self.full_shape_height) for i in range(1, len(s), 2)]
            shifted_segmentation.append([j for i in zip(xs, ys) for j in i])
        return Mask(
            segmentation=shifted_segmentation,
            shift_amount=[0, 0],
            full_shape=self.full_shape,
        )

Attributes¶

`bool_mask` `property` ¶

Return boolean mask representation.

`full_shape` `property` ¶

Returns full mask shape after shifting as [height, width].

`shape` `property` ¶

Returns mask shape as [height, width].

`shift_amount` `property` ¶

Returns the shift amount of the mask slice as [shift_x, shift_y].

Functions¶

`init(segmentation, full_shape, shift_amount=[0, 0])` ¶

Initialize Mask object.

Source code in sahi/annotation.py

def __init__(
    self,
    segmentation: list[list[float]] | np.ndarray,
    full_shape: list[int] | list[int | float] | None,
    shift_amount: list[int] | list[int | float] = [0, 0],
) -> None:
    """Initialize Mask object."""
    if full_shape is None:
        raise ValueError("full_shape must be provided")  # pyright: ignore[reportUnreachable]

    self.shift_x = int(shift_amount[0])
    self.shift_y = int(shift_amount[1])
    self.full_shape_height = int(full_shape[0])
    self.full_shape_width = int(full_shape[1])
    # Ensure segmentation is a list
    if isinstance(segmentation, np.ndarray):
        self.segmentation: list[list[float]] = segmentation.tolist()
    else:
        self.segmentation = segmentation

`from_bool_mask(bool_mask, full_shape, shift_amount=None)` `classmethod` ¶

Create Mask from boolean mask array.

Parameters:

Name	Type	Description	Default
`bool_mask` ¶	`ndarray`	np.ndarray with bool elements 2D mask of object, should have a shape of height*width	required
`full_shape` ¶	`list[int]`	List[int] Size of the full image, should be in the form of [height, width]	required
`shift_amount` ¶	`list[int] \| None`	List[int] To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y].	`None`

Source code in sahi/annotation.py

@classmethod
def from_bool_mask(
    cls,
    bool_mask: np.ndarray,
    full_shape: list[int],
    shift_amount: list[int] | None = None,
) -> Mask:
    """Create Mask from boolean mask array.

    Args:
        bool_mask: np.ndarray with bool elements
            2D mask of object, should have a shape of height*width
        full_shape: List[int]
            Size of the full image, should be in the form of [height, width]
        shift_amount: List[int]
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y].
    """
    if shift_amount is None:
        shift_amount = [0, 0]
    return cls(
        segmentation=get_coco_segmentation_from_bool_mask(bool_mask),
        shift_amount=shift_amount,
        full_shape=full_shape,
    )

`from_float_mask(mask, full_shape, mask_threshold=0.5, shift_amount=None)` `classmethod` ¶

Create Mask from float mask array.

Parameters:

Name	Type	Description	Default
`mask` ¶	`ndarray`	np.ndarray of np.float elements Mask values between 0 and 1 (should have a shape of height*width)	required
`mask_threshold` ¶	`float`	float Value to threshold mask pixels between 0 and 1	`0.5`
`shift_amount` ¶	`list[int] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`
`full_shape` ¶	`list[int]`	List[int] Size of the full image after shifting, should be in the form of [height, width].	required

Source code in sahi/annotation.py

@classmethod
def from_float_mask(
    cls,
    mask: np.ndarray,
    full_shape: list[int],
    mask_threshold: float = 0.5,
    shift_amount: list[int] | None = None,
) -> Mask:
    """Create Mask from float mask array.

    Args:
        mask: np.ndarray of np.float elements
            Mask values between 0 and 1 (should have a shape of height*width)
        mask_threshold: float
            Value to threshold mask pixels between 0 and 1
        shift_amount: List
            To shift the box and mask predictions from sliced image
            to full sized image, should be in the form of [shift_x, shift_y]
        full_shape: List[int]
            Size of the full image after shifting, should be in the form of [height, width].
    """
    bool_mask = mask > mask_threshold
    if shift_amount is None:
        shift_amount = [0, 0]
    return cls(
        segmentation=get_coco_segmentation_from_bool_mask(bool_mask),
        shift_amount=shift_amount,
        full_shape=full_shape,
    )

`get_shifted_mask()` ¶

Return shifted mask.

Source code in sahi/annotation.py

def get_shifted_mask(self) -> Mask:
    """Return shifted mask."""
    # Confirm full_shape is specified
    if (self.full_shape_height is None) or (self.full_shape_width is None):
        raise ValueError("full_shape is None")
    shifted_segmentation = []
    for s in self.segmentation:
        xs = [min(self.shift_x + s[i], self.full_shape_width) for i in range(0, len(s) - 1, 2)]
        ys = [min(self.shift_y + s[i], self.full_shape_height) for i in range(1, len(s), 2)]
        shifted_segmentation.append([j for i in zip(xs, ys) for j in i])
    return Mask(
        segmentation=shifted_segmentation,
        shift_amount=[0, 0],
        full_shape=self.full_shape,
    )

`ObjectAnnotation` ¶

All about an annotation such as Mask, Category, BoundingBox.

Source code in sahi/annotation.py

class ObjectAnnotation:
    """All about an annotation such as Mask, Category, BoundingBox."""

    def __init__(
        self,
        bbox: list[float] | None = None,
        segmentation: np.ndarray | list[list[float]] | None = None,
        category_id: int | None = None,
        category_name: str | None = None,
        shift_amount: list[int] | list[int | float] | None = None,
        full_shape: list[int] | list[int | float] | None = None,
    ) -> None:
        """Initialize ObjectAnnotation.

        Args:
            bbox: List
                [minx, miny, maxx, maxy]
            segmentation: List[List]
                [
                    [x1, y1, x2, y2, x3, y3, ...],
                    [x1, y1, x2, y2, x3, y3, ...],
                    ...
                ]
            category_id: int
                ID of the object category
            category_name: str
                Name of the object category
            shift_amount: List
                To shift the box and mask predictions from sliced image
                to full sized image, should be in the form of [shift_x, shift_y]
            full_shape: List
                Size of the full image after shifting, should be in
                the form of [height, width].
        """
        if not isinstance(category_id, int):
            raise ValueError("category_id must be an integer")
        if (bbox is None) and (segmentation is None):
            raise ValueError("you must provide a bbox or segmentation")

        if shift_amount is None:
            shift_amount = [0, 0]

        self.mask: Mask | None = None
        if segmentation is not None:
            self.mask = Mask(
                segmentation=segmentation,
                shift_amount=shift_amount,
                full_shape=full_shape,
            )
            # Convert to list if ndarray for get_bbox_from_coco_segmentation
            seg_for_bbox = segmentation if not isinstance(segmentation, np.ndarray) else segmentation.tolist()
            bbox_from_segmentation = get_bbox_from_coco_segmentation(seg_for_bbox)  # type: ignore[arg-type]
            # https://github.com/obss/sahi/issues/235
            if bbox_from_segmentation is not None:
                bbox = bbox_from_segmentation
            else:
                raise ValueError("Invalid segmentation mask.")

        # if bbox is a numpy object, convert it to python List[float]
        if isinstance(bbox, np.ndarray):
            bbox = copy.deepcopy(bbox).tolist()

        # bbox must not be None at this point
        assert bbox is not None

        # make sure bbox coords lie inside [0, image_size]
        xmin = max(bbox[0], 0)
        ymin = max(bbox[1], 0)
        if full_shape:
            xmax = min(bbox[2], full_shape[1])
            ymax = min(bbox[3], full_shape[0])
        else:
            xmax = bbox[2]
            ymax = bbox[3]
        bbox = [xmin, ymin, xmax, ymax]
        # set bbox - convert shift_amount to tuple for BoundingBox
        shift_amount_tuple: tuple[int, int] = (int(shift_amount[0]), int(shift_amount[1]))
        self.bbox = BoundingBox(bbox, shift_amount_tuple)

        category_name = category_name if category_name else str(category_id)
        self.category = Category(
            id=category_id,
            name=category_name,
        )

        self.merged = None

    @classmethod
    def from_bool_mask(
        cls,
        bool_mask: np.ndarray,
        category_id: int | None = None,
        category_name: str | None = None,
        shift_amount: list[int] | None = None,
        full_shape: list[int] | None = None,
    ) -> ObjectAnnotation:
        """Create ObjectAnnotation from bool_mask (2D np.ndarray).

        Args:
            bool_mask: np.ndarray with bool elements
                2D mask of object, should have a shape of height*width
            category_id: int
                ID of the object category
            category_name: str
                Name of the object category
            full_shape: List
                Size of the full image, should be in the form of [height, width]
            shift_amount: List
                To shift the box and mask predictions from sliced image to full
                sized image, should be in the form of [shift_x, shift_y]
        """
        segmentation = get_coco_segmentation_from_bool_mask(bool_mask)
        return cls(
            category_id=category_id,
            segmentation=segmentation,  # type: ignore[arg-type]
            category_name=category_name,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

    @classmethod
    def from_coco_segmentation(
        cls,
        segmentation: list[list[float]] | list[list[int]],
        full_shape: list[int],
        category_id: int | None = None,
        category_name: str | None = None,
        shift_amount: list[int] | None = None,
    ) -> ObjectAnnotation:
        """Create ObjectAnnotation from coco segmentation format.

        The segmentation format is:
        [
            [x1, y1, x2, y2, x3, y3, ...],
            [x1, y1, x2, y2, x3, y3, ...],
            ...
        ]

        Args:
            segmentation: List[List]
                [
                    [x1, y1, x2, y2, x3, y3, ...],
                    [x1, y1, x2, y2, x3, y3, ...],
                    ...
                ]
            category_id: int
                ID of the object category
            category_name: str
                Name of the object category
            full_shape: List
                Size of the full image, should be in the form of [height, width]
            shift_amount: List
                To shift the box and mask predictions from sliced image to full
                sized image, should be in the form of [shift_x, shift_y]
        """
        return cls(
            category_id=category_id,
            segmentation=segmentation,  # type: ignore[arg-type]
            category_name=category_name,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

    @classmethod
    def from_coco_bbox(
        cls,
        bbox: list[float] | list[int],
        category_id: int | None = None,
        category_name: str | None = None,
        shift_amount: list[int] | None = None,
        full_shape: list[int] | None = None,
    ) -> ObjectAnnotation:
        """Create ObjectAnnotation from coco bbox [minx, miny, width, height].

        Args:
            bbox: List
                [minx, miny, width, height]
            category_id: int
                ID of the object category
            category_name: str
                Name of the object category
            full_shape: List
                Size of the full image, should be in the form of [height, width]
            shift_amount: List
                To shift the box and mask predictions from sliced image to full
                sized image, should be in the form of [shift_x, shift_y]
        """
        xmin = bbox[0]
        ymin = bbox[1]
        xmax = bbox[0] + bbox[2]
        ymax = bbox[1] + bbox[3]
        bbox = [xmin, ymin, xmax, ymax]
        return cls(
            category_id=category_id,
            bbox=bbox,
            category_name=category_name,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

    @classmethod
    def from_coco_annotation_dict(
        cls,
        annotation_dict: dict,
        full_shape: list[int],
        category_name: str | None = None,
        shift_amount: list[int] | None = None,
    ) -> ObjectAnnotation:
        """Create ObjectAnnotation from COCO annotation dict.

        Converts a COCO formatted annotation dict (with fields "bbox",
        "segmentation", "category_id") to ObjectAnnotation.

        Args:
            annotation_dict: dict
                COCO formatted annotation dict (with fields "bbox", "segmentation", "category_id")
            category_name: str
                Category name of the annotation
            full_shape: List
                Size of the full image, should be in the form of [height, width]
            shift_amount: List
                To shift the box and mask predictions from sliced image to full
                sized image, should be in the form of [shift_x, shift_y]
        """
        if annotation_dict["segmentation"]:
            return cls.from_coco_segmentation(
                segmentation=annotation_dict["segmentation"],
                category_id=annotation_dict["category_id"],
                category_name=category_name,
                shift_amount=shift_amount,
                full_shape=full_shape,
            )
        else:
            return cls.from_coco_bbox(
                bbox=annotation_dict["bbox"],
                category_id=annotation_dict["category_id"],
                category_name=category_name,
                shift_amount=shift_amount,
                full_shape=full_shape,
            )

    @classmethod
    def from_shapely_annotation(
        cls,
        annotation: ShapelyAnnotation,
        full_shape: list[int],
        category_id: int | None = None,
        category_name: str | None = None,
        shift_amount: list[int] | None = None,
    ) -> ObjectAnnotation:
        """Create ObjectAnnotation from shapely_utils.ShapelyAnnotation.

        Args:
            annotation: shapely_utils.ShapelyAnnotation
            category_id: int
                ID of the object category
            category_name: str
                Name of the object category
            full_shape: List
                Size of the full image, should be in the form of [height, width]
            shift_amount: List
                To shift the box and mask predictions from sliced image to full
                sized image, should be in the form of [shift_x, shift_y]
        """
        return cls(
            category_id=category_id,
            segmentation=annotation.to_coco_segmentation(),  # type: ignore[arg-type]
            category_name=category_name,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

    @classmethod
    def from_imantics_annotation(
        cls,
        annotation: Any,
        shift_amount: list[int] | None = None,
        full_shape: list[int] | None = None,
    ) -> ObjectAnnotation:
        """Create ObjectAnnotation from imantics.annotation.Annotation.

        Args:
            annotation: imantics.annotation.Annotation
            shift_amount: List
                To shift the box and mask predictions from sliced image to full
                sized image, should be in the form of [shift_x, shift_y]
            full_shape: List
                Size of the full image, should be in the form of [height, width]
        """
        if shift_amount is None:
            shift_amount = [0, 0]
        return cls(
            category_id=annotation.category.id,
            segmentation=get_coco_segmentation_from_bool_mask(annotation.mask.array),
            category_name=annotation.category.name,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

    def to_coco_annotation(self) -> CocoAnnotation:
        """Convert to sahi.utils.coco.CocoAnnotation representation."""
        if self.mask:
            coco_annotation = CocoAnnotation.from_coco_segmentation(
                segmentation=self.mask.segmentation,  # type: ignore[arg-type]
                category_id=self.category.id,
                category_name=self.category.name,
            )
        else:
            coco_annotation = CocoAnnotation.from_coco_bbox(
                bbox=self.bbox.to_xywh(),  # type: ignore[arg-type]
                category_id=self.category.id,
                category_name=self.category.name,
            )
        return coco_annotation

    def to_coco_prediction(self) -> CocoPrediction:
        """Convert to sahi.utils.coco.CocoPrediction representation."""
        if self.mask:
            coco_prediction = CocoPrediction.from_coco_segmentation(
                segmentation=self.mask.segmentation,  # type: ignore[arg-type]
                category_id=self.category.id,
                category_name=self.category.name,
                score=1,
            )
        else:
            coco_prediction = CocoPrediction.from_coco_bbox(
                bbox=self.bbox.to_xywh(),  # type: ignore[arg-type]
                category_id=self.category.id,
                category_name=self.category.name,
                score=1,
            )
        return coco_prediction

    def to_shapely_annotation(self) -> ShapelyAnnotation:
        """Convert to sahi.utils.shapely.ShapelyAnnotation representation."""
        if self.mask:
            shapely_annotation = ShapelyAnnotation.from_coco_segmentation(
                segmentation=self.mask.segmentation,  # type: ignore[arg-type]
            )
        else:
            shapely_annotation = ShapelyAnnotation.from_coco_bbox(
                bbox=self.bbox.to_xywh(),  # type: ignore[arg-type]
            )
        return shapely_annotation

    def to_imantics_annotation(self) -> Any:
        """Convert to imantics.annotation.Annotation representation."""
        try:
            import imantics
        except ImportError:
            raise ImportError('Please run "pip install -U imantics" to install imantics first for imantics conversion.')

        imantics_category = imantics.Category(id=self.category.id, name=self.category.name)
        if self.mask is not None:
            imantics_mask = imantics.Mask.create(self.mask.bool_mask)
            imantics_annotation = imantics.annotation.Annotation.from_mask(
                mask=imantics_mask, category=imantics_category
            )
        else:
            imantics_bbox = imantics.BBox.create(self.bbox.to_xyxy())
            imantics_annotation = imantics.annotation.Annotation.from_bbox(
                bbox=imantics_bbox, category=imantics_category
            )
        return imantics_annotation

    def deepcopy(self) -> ObjectAnnotation:
        """Get deepcopy of current ObjectAnnotation instance.

        Returns:
            ObjectAnnotation: A deep copy of this ObjectAnnotation.
        """
        return copy.deepcopy(self)

    @classmethod
    def get_empty_mask(cls) -> Mask:
        """Return an empty mask."""
        return Mask(segmentation=[], full_shape=[0, 0])

    def get_shifted_object_annotation(self) -> ObjectAnnotation:
        """Return shifted object annotation."""
        if self.mask:
            shifted_mask = self.mask.get_shifted_mask()
            return ObjectAnnotation(
                bbox=self.bbox.get_shifted_box().to_xyxy(),
                category_id=self.category.id,
                segmentation=shifted_mask.segmentation,
                category_name=self.category.name,
                shift_amount=[0, 0],
                full_shape=shifted_mask.full_shape,
            )
        else:
            return ObjectAnnotation(
                bbox=self.bbox.get_shifted_box().to_xyxy(),
                category_id=self.category.id,
                segmentation=None,
                category_name=self.category.name,
                shift_amount=[0, 0],
                full_shape=None,
            )

    def __repr__(self) -> str:
        """Return string representation of object annotation."""
        return f"""ObjectAnnotation<
    bbox: {self.bbox},
    mask: {self.mask},
    category: {self.category}>"""

Functions¶

`init(bbox=None, segmentation=None, category_id=None, category_name=None, shift_amount=None, full_shape=None)` ¶

Initialize ObjectAnnotation.

Parameters:

Name	Type	Description	Default
`bbox` ¶	`list[float] \| None`	List [minx, miny, maxx, maxy]	`None`
`segmentation` ¶	`ndarray \| list[list[float]] \| None`	List[List] [ [x1, y1, x2, y2, x3, y3, ...], [x1, y1, x2, y2, x3, y3, ...], ... ]	`None`
`category_id` ¶	`int \| None`	int ID of the object category	`None`
`category_name` ¶	`str \| None`	str Name of the object category	`None`
`shift_amount` ¶	`list[int] \| list[int \| float] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`
`full_shape` ¶	`list[int] \| list[int \| float] \| None`	List Size of the full image after shifting, should be in the form of [height, width].	`None`

Source code in sahi/annotation.py

def __init__(
    self,
    bbox: list[float] | None = None,
    segmentation: np.ndarray | list[list[float]] | None = None,
    category_id: int | None = None,
    category_name: str | None = None,
    shift_amount: list[int] | list[int | float] | None = None,
    full_shape: list[int] | list[int | float] | None = None,
) -> None:
    """Initialize ObjectAnnotation.

    Args:
        bbox: List
            [minx, miny, maxx, maxy]
        segmentation: List[List]
            [
                [x1, y1, x2, y2, x3, y3, ...],
                [x1, y1, x2, y2, x3, y3, ...],
                ...
            ]
        category_id: int
            ID of the object category
        category_name: str
            Name of the object category
        shift_amount: List
            To shift the box and mask predictions from sliced image
            to full sized image, should be in the form of [shift_x, shift_y]
        full_shape: List
            Size of the full image after shifting, should be in
            the form of [height, width].
    """
    if not isinstance(category_id, int):
        raise ValueError("category_id must be an integer")
    if (bbox is None) and (segmentation is None):
        raise ValueError("you must provide a bbox or segmentation")

    if shift_amount is None:
        shift_amount = [0, 0]

    self.mask: Mask | None = None
    if segmentation is not None:
        self.mask = Mask(
            segmentation=segmentation,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )
        # Convert to list if ndarray for get_bbox_from_coco_segmentation
        seg_for_bbox = segmentation if not isinstance(segmentation, np.ndarray) else segmentation.tolist()
        bbox_from_segmentation = get_bbox_from_coco_segmentation(seg_for_bbox)  # type: ignore[arg-type]
        # https://github.com/obss/sahi/issues/235
        if bbox_from_segmentation is not None:
            bbox = bbox_from_segmentation
        else:
            raise ValueError("Invalid segmentation mask.")

    # if bbox is a numpy object, convert it to python List[float]
    if isinstance(bbox, np.ndarray):
        bbox = copy.deepcopy(bbox).tolist()

    # bbox must not be None at this point
    assert bbox is not None

    # make sure bbox coords lie inside [0, image_size]
    xmin = max(bbox[0], 0)
    ymin = max(bbox[1], 0)
    if full_shape:
        xmax = min(bbox[2], full_shape[1])
        ymax = min(bbox[3], full_shape[0])
    else:
        xmax = bbox[2]
        ymax = bbox[3]
    bbox = [xmin, ymin, xmax, ymax]
    # set bbox - convert shift_amount to tuple for BoundingBox
    shift_amount_tuple: tuple[int, int] = (int(shift_amount[0]), int(shift_amount[1]))
    self.bbox = BoundingBox(bbox, shift_amount_tuple)

    category_name = category_name if category_name else str(category_id)
    self.category = Category(
        id=category_id,
        name=category_name,
    )

    self.merged = None

`repr()` ¶

Return string representation of object annotation.

Source code in sahi/annotation.py

def __repr__(self) -> str:
    """Return string representation of object annotation."""
    return f"""ObjectAnnotation<
bbox: {self.bbox},
mask: {self.mask},
category: {self.category}>"""

`deepcopy()` ¶

Get deepcopy of current ObjectAnnotation instance.

Returns:

Name	Type	Description
`ObjectAnnotation`	`ObjectAnnotation`	A deep copy of this ObjectAnnotation.

Source code in sahi/annotation.py

def deepcopy(self) -> ObjectAnnotation:
    """Get deepcopy of current ObjectAnnotation instance.

    Returns:
        ObjectAnnotation: A deep copy of this ObjectAnnotation.
    """
    return copy.deepcopy(self)

`from_bool_mask(bool_mask, category_id=None, category_name=None, shift_amount=None, full_shape=None)` `classmethod` ¶

Create ObjectAnnotation from bool_mask (2D np.ndarray).

Parameters:

Name	Type	Description	Default
`bool_mask` ¶	`ndarray`	np.ndarray with bool elements 2D mask of object, should have a shape of height*width	required
`category_id` ¶	`int \| None`	int ID of the object category	`None`
`category_name` ¶	`str \| None`	str Name of the object category	`None`
`full_shape` ¶	`list[int] \| None`	List Size of the full image, should be in the form of [height, width]	`None`
`shift_amount` ¶	`list[int] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`

Source code in sahi/annotation.py

@classmethod
def from_bool_mask(
    cls,
    bool_mask: np.ndarray,
    category_id: int | None = None,
    category_name: str | None = None,
    shift_amount: list[int] | None = None,
    full_shape: list[int] | None = None,
) -> ObjectAnnotation:
    """Create ObjectAnnotation from bool_mask (2D np.ndarray).

    Args:
        bool_mask: np.ndarray with bool elements
            2D mask of object, should have a shape of height*width
        category_id: int
            ID of the object category
        category_name: str
            Name of the object category
        full_shape: List
            Size of the full image, should be in the form of [height, width]
        shift_amount: List
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y]
    """
    segmentation = get_coco_segmentation_from_bool_mask(bool_mask)
    return cls(
        category_id=category_id,
        segmentation=segmentation,  # type: ignore[arg-type]
        category_name=category_name,
        shift_amount=shift_amount,
        full_shape=full_shape,
    )

`from_coco_annotation_dict(annotation_dict, full_shape, category_name=None, shift_amount=None)` `classmethod` ¶

Create ObjectAnnotation from COCO annotation dict.

Converts a COCO formatted annotation dict (with fields "bbox", "segmentation", "category_id") to ObjectAnnotation.

Parameters:

Name	Type	Description	Default
`annotation_dict` ¶	`dict`	dict COCO formatted annotation dict (with fields "bbox", "segmentation", "category_id")	required
`category_name` ¶	`str \| None`	str Category name of the annotation	`None`
`full_shape` ¶	`list[int]`	List Size of the full image, should be in the form of [height, width]	required
`shift_amount` ¶	`list[int] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`

Source code in sahi/annotation.py

@classmethod
def from_coco_annotation_dict(
    cls,
    annotation_dict: dict,
    full_shape: list[int],
    category_name: str | None = None,
    shift_amount: list[int] | None = None,
) -> ObjectAnnotation:
    """Create ObjectAnnotation from COCO annotation dict.

    Converts a COCO formatted annotation dict (with fields "bbox",
    "segmentation", "category_id") to ObjectAnnotation.

    Args:
        annotation_dict: dict
            COCO formatted annotation dict (with fields "bbox", "segmentation", "category_id")
        category_name: str
            Category name of the annotation
        full_shape: List
            Size of the full image, should be in the form of [height, width]
        shift_amount: List
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y]
    """
    if annotation_dict["segmentation"]:
        return cls.from_coco_segmentation(
            segmentation=annotation_dict["segmentation"],
            category_id=annotation_dict["category_id"],
            category_name=category_name,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )
    else:
        return cls.from_coco_bbox(
            bbox=annotation_dict["bbox"],
            category_id=annotation_dict["category_id"],
            category_name=category_name,
            shift_amount=shift_amount,
            full_shape=full_shape,
        )

`from_coco_bbox(bbox, category_id=None, category_name=None, shift_amount=None, full_shape=None)` `classmethod` ¶

Create ObjectAnnotation from coco bbox [minx, miny, width, height].

Parameters:

Name	Type	Description	Default
`bbox` ¶	`list[float] \| list[int]`	List [minx, miny, width, height]	required
`category_id` ¶	`int \| None`	int ID of the object category	`None`
`category_name` ¶	`str \| None`	str Name of the object category	`None`
`full_shape` ¶	`list[int] \| None`	List Size of the full image, should be in the form of [height, width]	`None`
`shift_amount` ¶	`list[int] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`

Source code in sahi/annotation.py

@classmethod
def from_coco_bbox(
    cls,
    bbox: list[float] | list[int],
    category_id: int | None = None,
    category_name: str | None = None,
    shift_amount: list[int] | None = None,
    full_shape: list[int] | None = None,
) -> ObjectAnnotation:
    """Create ObjectAnnotation from coco bbox [minx, miny, width, height].

    Args:
        bbox: List
            [minx, miny, width, height]
        category_id: int
            ID of the object category
        category_name: str
            Name of the object category
        full_shape: List
            Size of the full image, should be in the form of [height, width]
        shift_amount: List
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y]
    """
    xmin = bbox[0]
    ymin = bbox[1]
    xmax = bbox[0] + bbox[2]
    ymax = bbox[1] + bbox[3]
    bbox = [xmin, ymin, xmax, ymax]
    return cls(
        category_id=category_id,
        bbox=bbox,
        category_name=category_name,
        shift_amount=shift_amount,
        full_shape=full_shape,
    )

`from_coco_segmentation(segmentation, full_shape, category_id=None, category_name=None, shift_amount=None)` `classmethod` ¶

Create ObjectAnnotation from coco segmentation format.

The segmentation format is: [ [x1, y1, x2, y2, x3, y3, ...], [x1, y1, x2, y2, x3, y3, ...], ... ]

Parameters:

Name	Type	Description	Default
`segmentation` ¶	`list[list[float]] \| list[list[int]]`	List[List] [ [x1, y1, x2, y2, x3, y3, ...], [x1, y1, x2, y2, x3, y3, ...], ... ]	required
`category_id` ¶	`int \| None`	int ID of the object category	`None`
`category_name` ¶	`str \| None`	str Name of the object category	`None`
`full_shape` ¶	`list[int]`	List Size of the full image, should be in the form of [height, width]	required
`shift_amount` ¶	`list[int] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`

Source code in sahi/annotation.py

@classmethod
def from_coco_segmentation(
    cls,
    segmentation: list[list[float]] | list[list[int]],
    full_shape: list[int],
    category_id: int | None = None,
    category_name: str | None = None,
    shift_amount: list[int] | None = None,
) -> ObjectAnnotation:
    """Create ObjectAnnotation from coco segmentation format.

    The segmentation format is:
    [
        [x1, y1, x2, y2, x3, y3, ...],
        [x1, y1, x2, y2, x3, y3, ...],
        ...
    ]

    Args:
        segmentation: List[List]
            [
                [x1, y1, x2, y2, x3, y3, ...],
                [x1, y1, x2, y2, x3, y3, ...],
                ...
            ]
        category_id: int
            ID of the object category
        category_name: str
            Name of the object category
        full_shape: List
            Size of the full image, should be in the form of [height, width]
        shift_amount: List
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y]
    """
    return cls(
        category_id=category_id,
        segmentation=segmentation,  # type: ignore[arg-type]
        category_name=category_name,
        shift_amount=shift_amount,
        full_shape=full_shape,
    )

`from_imantics_annotation(annotation, shift_amount=None, full_shape=None)` `classmethod` ¶

Create ObjectAnnotation from imantics.annotation.Annotation.

Parameters:

Name	Type	Description	Default
`annotation` ¶	`Any`	imantics.annotation.Annotation	required
`shift_amount` ¶	`list[int] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`
`full_shape` ¶	`list[int] \| None`	List Size of the full image, should be in the form of [height, width]	`None`

Source code in sahi/annotation.py

@classmethod
def from_imantics_annotation(
    cls,
    annotation: Any,
    shift_amount: list[int] | None = None,
    full_shape: list[int] | None = None,
) -> ObjectAnnotation:
    """Create ObjectAnnotation from imantics.annotation.Annotation.

    Args:
        annotation: imantics.annotation.Annotation
        shift_amount: List
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y]
        full_shape: List
            Size of the full image, should be in the form of [height, width]
    """
    if shift_amount is None:
        shift_amount = [0, 0]
    return cls(
        category_id=annotation.category.id,
        segmentation=get_coco_segmentation_from_bool_mask(annotation.mask.array),
        category_name=annotation.category.name,
        shift_amount=shift_amount,
        full_shape=full_shape,
    )

`from_shapely_annotation(annotation, full_shape, category_id=None, category_name=None, shift_amount=None)` `classmethod` ¶

Create ObjectAnnotation from shapely_utils.ShapelyAnnotation.

Parameters:

Name	Type	Description	Default
`annotation` ¶	`ShapelyAnnotation`	shapely_utils.ShapelyAnnotation	required
`category_id` ¶	`int \| None`	int ID of the object category	`None`
`category_name` ¶	`str \| None`	str Name of the object category	`None`
`full_shape` ¶	`list[int]`	List Size of the full image, should be in the form of [height, width]	required
`shift_amount` ¶	`list[int] \| None`	List To shift the box and mask predictions from sliced image to full sized image, should be in the form of [shift_x, shift_y]	`None`

Source code in sahi/annotation.py

@classmethod
def from_shapely_annotation(
    cls,
    annotation: ShapelyAnnotation,
    full_shape: list[int],
    category_id: int | None = None,
    category_name: str | None = None,
    shift_amount: list[int] | None = None,
) -> ObjectAnnotation:
    """Create ObjectAnnotation from shapely_utils.ShapelyAnnotation.

    Args:
        annotation: shapely_utils.ShapelyAnnotation
        category_id: int
            ID of the object category
        category_name: str
            Name of the object category
        full_shape: List
            Size of the full image, should be in the form of [height, width]
        shift_amount: List
            To shift the box and mask predictions from sliced image to full
            sized image, should be in the form of [shift_x, shift_y]
    """
    return cls(
        category_id=category_id,
        segmentation=annotation.to_coco_segmentation(),  # type: ignore[arg-type]
        category_name=category_name,
        shift_amount=shift_amount,
        full_shape=full_shape,
    )

`get_empty_mask()` `classmethod` ¶

Return an empty mask.

Source code in sahi/annotation.py

@classmethod
def get_empty_mask(cls) -> Mask:
    """Return an empty mask."""
    return Mask(segmentation=[], full_shape=[0, 0])

`get_shifted_object_annotation()` ¶

Return shifted object annotation.

Source code in sahi/annotation.py

def get_shifted_object_annotation(self) -> ObjectAnnotation:
    """Return shifted object annotation."""
    if self.mask:
        shifted_mask = self.mask.get_shifted_mask()
        return ObjectAnnotation(
            bbox=self.bbox.get_shifted_box().to_xyxy(),
            category_id=self.category.id,
            segmentation=shifted_mask.segmentation,
            category_name=self.category.name,
            shift_amount=[0, 0],
            full_shape=shifted_mask.full_shape,
        )
    else:
        return ObjectAnnotation(
            bbox=self.bbox.get_shifted_box().to_xyxy(),
            category_id=self.category.id,
            segmentation=None,
            category_name=self.category.name,
            shift_amount=[0, 0],
            full_shape=None,
        )

`to_coco_annotation()` ¶

Convert to sahi.utils.coco.CocoAnnotation representation.

Source code in sahi/annotation.py

def to_coco_annotation(self) -> CocoAnnotation:
    """Convert to sahi.utils.coco.CocoAnnotation representation."""
    if self.mask:
        coco_annotation = CocoAnnotation.from_coco_segmentation(
            segmentation=self.mask.segmentation,  # type: ignore[arg-type]
            category_id=self.category.id,
            category_name=self.category.name,
        )
    else:
        coco_annotation = CocoAnnotation.from_coco_bbox(
            bbox=self.bbox.to_xywh(),  # type: ignore[arg-type]
            category_id=self.category.id,
            category_name=self.category.name,
        )
    return coco_annotation

`to_coco_prediction()` ¶

Convert to sahi.utils.coco.CocoPrediction representation.

Source code in sahi/annotation.py

def to_coco_prediction(self) -> CocoPrediction:
    """Convert to sahi.utils.coco.CocoPrediction representation."""
    if self.mask:
        coco_prediction = CocoPrediction.from_coco_segmentation(
            segmentation=self.mask.segmentation,  # type: ignore[arg-type]
            category_id=self.category.id,
            category_name=self.category.name,
            score=1,
        )
    else:
        coco_prediction = CocoPrediction.from_coco_bbox(
            bbox=self.bbox.to_xywh(),  # type: ignore[arg-type]
            category_id=self.category.id,
            category_name=self.category.name,
            score=1,
        )
    return coco_prediction

`to_imantics_annotation()` ¶

Convert to imantics.annotation.Annotation representation.

Source code in sahi/annotation.py

def to_imantics_annotation(self) -> Any:
    """Convert to imantics.annotation.Annotation representation."""
    try:
        import imantics
    except ImportError:
        raise ImportError('Please run "pip install -U imantics" to install imantics first for imantics conversion.')

    imantics_category = imantics.Category(id=self.category.id, name=self.category.name)
    if self.mask is not None:
        imantics_mask = imantics.Mask.create(self.mask.bool_mask)
        imantics_annotation = imantics.annotation.Annotation.from_mask(
            mask=imantics_mask, category=imantics_category
        )
    else:
        imantics_bbox = imantics.BBox.create(self.bbox.to_xyxy())
        imantics_annotation = imantics.annotation.Annotation.from_bbox(
            bbox=imantics_bbox, category=imantics_category
        )
    return imantics_annotation

`to_shapely_annotation()` ¶

Convert to sahi.utils.shapely.ShapelyAnnotation representation.

Source code in sahi/annotation.py

def to_shapely_annotation(self) -> ShapelyAnnotation:
    """Convert to sahi.utils.shapely.ShapelyAnnotation representation."""
    if self.mask:
        shapely_annotation = ShapelyAnnotation.from_coco_segmentation(
            segmentation=self.mask.segmentation,  # type: ignore[arg-type]
        )
    else:
        shapely_annotation = ShapelyAnnotation.from_coco_bbox(
            bbox=self.bbox.to_xywh(),  # type: ignore[arg-type]
        )
    return shapely_annotation

Annotation¶

sahi.annotation ¶

Classes¶

BoundingBox dataclass ¶

Attributes¶

area property ¶

maxx property ¶

maxy property ¶

minx property ¶

miny property ¶

shift_x property ¶

shift_y property ¶

Functions¶

__post_init__() ¶

__repr__() ¶

get_expanded_box(ratio=0.1, max_x=None, max_y=None) ¶

get_shifted_box() ¶

to_coco_bbox() ¶

to_voc_bbox() ¶

to_xywh() ¶

to_xyxy() ¶

Category dataclass ¶

Functions¶

__post_init__() ¶

__repr__() ¶

Mask ¶

segmentation ¶

full_shape ¶

shift_amount ¶

Attributes¶

bool_mask property ¶

full_shape property ¶

shape property ¶

shift_amount property ¶

Functions¶

__init__(segmentation, full_shape, shift_amount=[0, 0]) ¶

from_bool_mask(bool_mask, full_shape, shift_amount=None) classmethod ¶

from_float_mask(mask, full_shape, mask_threshold=0.5, shift_amount=None) classmethod ¶

get_shifted_mask() ¶

ObjectAnnotation ¶

Functions¶

__init__(bbox=None, segmentation=None, category_id=None, category_name=None, shift_amount=None, full_shape=None) ¶

__repr__() ¶

deepcopy() ¶

from_bool_mask(bool_mask, category_id=None, category_name=None, shift_amount=None, full_shape=None) classmethod ¶

from_coco_annotation_dict(annotation_dict, full_shape, category_name=None, shift_amount=None) classmethod ¶

from_coco_bbox(bbox, category_id=None, category_name=None, shift_amount=None, full_shape=None) classmethod ¶

from_coco_segmentation(segmentation, full_shape, category_id=None, category_name=None, shift_amount=None) classmethod ¶

from_imantics_annotation(annotation, shift_amount=None, full_shape=None) classmethod ¶

from_shapely_annotation(annotation, full_shape, category_id=None, category_name=None, shift_amount=None) classmethod ¶

get_empty_mask() classmethod ¶

get_shifted_object_annotation() ¶

to_coco_annotation() ¶

to_coco_prediction() ¶

to_imantics_annotation() ¶

to_shapely_annotation() ¶

Functions¶

`sahi.annotation` ¶

`BoundingBox` `dataclass` ¶

`area` `property` ¶

`maxx` `property` ¶

`maxy` `property` ¶

`minx` `property` ¶

`miny` `property` ¶

`shift_x` `property` ¶

`shift_y` `property` ¶

`__post_init__()` ¶

`repr()` ¶

`get_expanded_box(ratio=0.1, max_x=None, max_y=None)` ¶

`get_shifted_box()` ¶

`to_coco_bbox()` ¶

`to_voc_bbox()` ¶

`to_xywh()` ¶

`to_xyxy()` ¶

`Category` `dataclass` ¶

`__post_init__()` ¶

`repr()` ¶

`Mask` ¶

`segmentation` ¶

`full_shape` ¶

`shift_amount` ¶

`bool_mask` `property` ¶

`full_shape` `property` ¶

`shape` `property` ¶

`shift_amount` `property` ¶

`init(segmentation, full_shape, shift_amount=[0, 0])` ¶

`from_bool_mask(bool_mask, full_shape, shift_amount=None)` `classmethod` ¶

`from_float_mask(mask, full_shape, mask_threshold=0.5, shift_amount=None)` `classmethod` ¶

`get_shifted_mask()` ¶

`ObjectAnnotation` ¶

`init(bbox=None, segmentation=None, category_id=None, category_name=None, shift_amount=None, full_shape=None)` ¶

`repr()` ¶

`deepcopy()` ¶

`from_bool_mask(bool_mask, category_id=None, category_name=None, shift_amount=None, full_shape=None)` `classmethod` ¶

`from_coco_annotation_dict(annotation_dict, full_shape, category_name=None, shift_amount=None)` `classmethod` ¶

`from_coco_bbox(bbox, category_id=None, category_name=None, shift_amount=None, full_shape=None)` `classmethod` ¶

`from_coco_segmentation(segmentation, full_shape, category_id=None, category_name=None, shift_amount=None)` `classmethod` ¶

`from_imantics_annotation(annotation, shift_amount=None, full_shape=None)` `classmethod` ¶

`from_shapely_annotation(annotation, full_shape, category_id=None, category_name=None, shift_amount=None)` `classmethod` ¶

`get_empty_mask()` `classmethod` ¶

`get_shifted_object_annotation()` ¶

`to_coco_annotation()` ¶

`to_coco_prediction()` ¶

`to_imantics_annotation()` ¶

`to_shapely_annotation()` ¶