Added heuristic for bounding bbox ordering #835

MLDovakin · 2023-02-23T14:32:05Z

I have been using your open-source framework for quite some time, and recently I needed it for a text detection task. The important part is that I had to keep the words in left-to-right order, but I saw in the documentation that there was no such feature.

I first tried OpenCV-style bounding box ordering, but that didn't work because there were too many overlapping bounding boxes. Then I tried sorting by the Y coordinate with a threshold: boxes on the same text line should differ only minimally in vertical position, so boxes whose Y values are within the threshold are ordered by X instead. This gave much better results than the other methods, and I would like to add this functionality here.

Here are examples of Y-threshold ordering:

1. The result of get_sliced_prediction()

2. The result with the ordered bboxes drawn

Plotting code:


from functools import cmp_to_key

import cv2
from google.colab.patches import cv2_imshow  # Colab-only display helper


def bbox_sort(a, b, thresh):
    # Boxes whose top edges differ by at most `thresh` pixels count as the same line:
    # order them left to right by x. Otherwise order them top to bottom by y.
    if abs(a[1] - b[1]) <= thresh:
        return a[0] - b[0]
    return a[1] - b[1]


# `result` is the output of get_sliced_prediction()
bboxes = []
for ann in result.to_coco_annotations():
    # COCO bbox is [x, y, w, h]; cast to int so the OpenCV drawing calls accept the coordinates
    x, y, w, h = (int(v) for v in ann["bbox"])
    bboxes.append((x, y, w, h))

thresh = 10  # maximum y difference in pixels for two boxes to share a line
cnts = sorted(bboxes, key=cmp_to_key(lambda a, b: bbox_sort(a, b, thresh)))

img = cv2.imread("/content/detect_images/output_01.jpg")
red = (0, 0, 255)  # BGR
font = cv2.FONT_HERSHEY_SIMPLEX

# Mark each box's top-left corner and label it with its index in the sorted order
for k, (x, y, w, h) in enumerate(cnts):
    cv2.circle(img, (x, y), 5, red, -1)
    cv2.putText(img, str(k), (x, y), font, 1, (120, 166, 50), 2)

cv2_imshow(img)
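To try the ordering without a detection result or an image, the comparator can be run on a few synthetic boxes. This is only a minimal sketch: the `order_bboxes` helper name and the box values are made up for illustration and are not part of the library.

```python
from functools import cmp_to_key


def bbox_sort(a, b, thresh):
    """Compare two (x, y, w, h) boxes: same line -> by x, otherwise by y."""
    if abs(a[1] - b[1]) <= thresh:
        return a[0] - b[0]
    return a[1] - b[1]


def order_bboxes(bboxes, thresh=10):
    """Return the boxes in reading order (hypothetical helper, for illustration)."""
    return sorted(bboxes, key=cmp_to_key(lambda a, b: bbox_sort(a, b, thresh)))


# Synthetic (x, y, w, h) boxes: two text lines with slight vertical jitter.
boxes = [
    (200, 103, 50, 20),  # line 1, third word
    (10, 150, 50, 20),   # line 2, first word
    (100, 98, 50, 20),   # line 1, second word
    (120, 152, 50, 20),  # line 2, second word
    (10, 100, 50, 20),   # line 1, first word
]

ordered = order_bboxes(boxes)
for index, box in enumerate(ordered):
    print(index, box)
```

A plain sort by y would put the box at y=98 ahead of the one at y=100 even though it sits further right; the threshold keeps the line together. One caveat: the comparison is not transitive when boxes chain across the threshold (for example y values of 0, 8 and 16 with a threshold of 10), so on such inputs the result can depend on the input order.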

fcakyon · 2023-02-23T22:24:59Z

Can you please reformat your code, then commit and push again, as detailed in the contributing section of the README :)


fcakyon · 2023-02-26T19:00:08Z

@MLDovakin thanks a lot for your contributions!

MLDovakin and others added 2 commits February 23, 2023 17:10

Added heuristic for bounding bbox ordering

b4c1862

Update predict.py

696999c

MLDovakin added 2 commits February 24, 2023 21:27

refactored

48abdc5

Merge remote-tracking branch 'origin/add-agg-bbox' into add-agg-bbox

28461d3

# Conflicts: # sahi/predict.py

MLDovakin closed this Feb 24, 2023

MLDovakin added 2 commits February 24, 2023 21:56

Merge remote-tracking branch 'origin/add-agg-bbox' into add-agg-bbox

c8011c7

# Conflicts: # sahi/predict.py

Merge remote-tracking branch 'origin/add-agg-bbox' into add-agg-bbox

3a60f85

# Conflicts: # sahi/predict.py

MLDovakin reopened this Feb 24, 2023

Merge branch 'main' into add-agg-bbox

7749f85

fcakyon enabled auto-merge February 26, 2023 18:56

fcakyon added this pull request to the merge queue Feb 26, 2023

fcakyon approved these changes Feb 26, 2023


Merged via the queue into obss:main with commit 761c0ff Feb 26, 2023
