Add object detection capability and python API #3472

alonfaraj · 2021-03-16T09:31:10Z

About

This PR adds the capability to detect objects with Unreal.
It support setting radius from camera to search for objects and setting object name in wildcard format.
One can control these settings for each camera, image type and vehicle combination separately.
It output relevant information as described in DetectionInfo.

Itcurrently support only ImageType::Scene, but can be extended by attaching the DetectionComponent in BP_PIPCamera to the relevant camera and add corresponding lines in APIPCamera::PostInitializeComponents

Detection APIs implementation:
- simSetDetectionFilterRadius
- simAddDetectionFilterMeshName
- simClearDetectionMeshNames
- simGetDetections
Detection struct:

class DetectionInfo(MsgpackMixin):
    name = ''
    geoPoint = GeoPoint()
    box2D = Box2D()
    box3D = Box3D()
    relative_pose = Pose()

TODO:

Implement some get API functions
Add Enable/Disable Detection capability API or from settings.json

Most of the detection and object filter code was copied from https://github.com/unrealgt/UnrealGT and changed for my own needs.

Probably much more work to do but I'm using it for a while and though other users might find it useful.

How Has This Been Tested?

Tested on Blocks and ModularNeighborhood environments by running the detection python script in this PR (Windows).

Example API Call -

camera_name = "0"
image_type = airsim.ImageType.Scene

client.simSetDetectionFilterRadius(camera_name, image_type, 80 * 100) # in [cm]
client.simAddDetectionFilterMeshName(camera_name, image_type, "Car_*") 
client.simGetDetections(camera_name, image_type)
client.simClearDetectionMeshNames(camera_name, image_type)

Example output -

Cylinder: <DetectionInfo> {   'box2D': <Box2D> {   'max': <Vector2r> {   'x_val': 617.025634765625,
    'y_val': 583.5487060546875},
    'min': <Vector2r> {   'x_val': 485.74359130859375,
    'y_val': 438.33465576171875}},
    'box3D': <Box3D> {   'max': <Vector3r> {   'x_val': 4.900000095367432,
    'y_val': 0.7999999523162842,
    'z_val': 0.5199999809265137},
    'min': <Vector3r> {   'x_val': 3.8999998569488525,
    'y_val': -0.19999998807907104,
    'z_val': 1.5199999809265137}},
    'geo_point': <GeoPoint> {   'altitude': 16.979999542236328,
    'latitude': 32.28772183970703,
    'longitude': 34.864785008379876},
    'name': 'Cylinder9_2',
    'relative_pose': <Pose> {   'orientation': <Quaternionr> {   'w_val': 0.9929741621017456,
    'x_val': 0.0038591264747083187,
    'y_val': -0.11333247274160385,
    'z_val': 0.03381215035915375},
    'position': <Vector3r> {   'x_val': 4.400000095367432,
    'y_val': 0.29999998211860657,
    'z_val': 1.0199999809265137}}}

Screenshots (if appropriate):

Blocks
Unreal

Python
ModularNeighborhood
Unreal

Python

alonfaraj · 2021-03-24T09:08:35Z

@rajat2004 any idea why checks failed for Unity build?

rajat2004 · 2021-03-24T14:55:52Z

You'll need to add some unimplemented methods in Unity WorldSimApi.cpp, .h files for compilation, just follow the format in the other methods

There's also a conflict which needs to be fixed, #3477 renamed the file and everywhere it was used. Will need to fix the spelling of RpcLibAdaptors wherever added in this PR

rajat2004

Just a brief review. I haven't yet looked at the detection part of the code, will do so later

One main point is that AirSim uses 4 spaces instead of tabs, it'll be best to convert to remain consistent. Another is that snake case is used for variables, which would also be another good thing to fix

AirLib/include/api/RpcLibClientBase.hpp

AirLib/src/api/RpcLibServerBase.cpp

PythonClient/detection/detection.py

Unreal/Plugins/AirSim/Source/AirBlueprintLib.cpp

Unreal/Plugins/AirSim/Source/DetectionComponent.h

Unreal/Plugins/AirSim/Source/ObjectFilter.cpp

Unreal/Plugins/AirSim/Source/PawnSimApi.h

evroon · 2021-03-25T15:21:36Z

I don't want to undermine your hard work, but I want to note that there is a simpler way of getting bounding boxes around objects such as cars by using the segmentation images. I explained it here. This way, you get pixel-perfect bounding boxes. By projecting a 3D box, you will always end up (except for objects that have the shape of a box of course) with bigger bounding boxes than optimal.

PPakalns · 2021-03-25T16:53:20Z

@evroon The bounding box retrieval from segmentation images works only in cases when objects do not overlap in the camera view. If there is a forest with a lot of trees or car in front of an another car, then these tightly coupled different objects can not be differentiated in segmentation images. So this bounding box feature is very welcomed in cases such as these.

evroon · 2021-03-25T17:04:37Z

@PPakalns Yes that's true, in case of occlusion "my" method does not work. But in my case I use it to train yolov4 for example and then you don't need such data AFAIK. I value correct bounding boxes more.
So I'm just curious what the use case is for having bounding boxes for (partially) occluded objects. Is it for training NNs or something else?

Btw I think you can also make my method work by taking multiple segmentation images of the same frame and changing the visibility of objects, but that is more complicated and less performant of course.

PPakalns · 2021-03-25T18:14:43Z

@evron I will be using it for prototyping object detection model where objects can be tightly located in the scene next to each other, like standard case of people passing in front of each other or, in my case, prototyping survey drone where it is important to correctly recognize each separate object, these objects can occlude each other little bit. Segmentation image approach makes it hard to annotate such objects with separate bounding boxes because their regions overlap.

@alonfaraj At least now DetectionInfo data returns only object geolocation, for generating annotated data it would be useful if object position and orientation relative to the camera could be returned additionally. At least I will try to add such information myself :)

UPDATE Looks like using name returned in DetectionInfo and AirSim api to get object, vehicle and camera poses such information can be calculated.

Tomorrow will test this code and see how it works. Thanks @alonfaraj for such implementation 🥇

MoBaT · 2021-03-26T20:54:14Z

@alonfaraj This is great! Was building the exact same thing but stumbled upon this. I have a few suggestions..

Can the simSetDetectionFilterRadius and simAddDetectionFilterMeshName api change where a camera name can be given? I would like to have different detections and radiuses done on different cameras. Looking at your implementation, it's a very easy add. So I suggest:

client.simSetDetectionFilterRadius("0", 80 * 100) # in [cm]
client.simAddDetectionFilterMeshName("0", "Car_*")

Change the simAddDetectionFilterMeshName to accept a regex string instead of just a single wildcard similar to the simListSceneObjects api.
Add a simClearDetections("regex") call or to make it easier, a simClearDetections() with no search criteria. I would like this in case I have a scene with dynamic objects and I want to poll for new objects at a certain frequency. By clearing detections and adding detections.
Modify simGetDetections to return a Detection with the 3D boundingBox info also.

Unreal/Plugins/AirSim/Source/ObjectFilter.cpp

PPakalns

Additionally, object detection results for some objects are flickering (in some frames object is visible, in some it is not even when camera position is not changed). Will look into cause of it.

Unreal/Plugins/AirSim/Source/ObjectFilter.cpp

Unreal/Plugins/AirSim/Source/PawnSimApi.cpp

AirLib/include/api/RpcLibAdapatorsBase.hpp

alonfaraj · 2021-03-29T17:05:11Z

@alonfaraj This is great! Was building the exact same thing but stumbled upon this. I have a few suggestions..

Can the simSetDetectionFilterRadius and simAddDetectionFilterMeshName api change where a camera name can be given? I would like to have different detections and radiuses done on different cameras. Looking at your implementation, it's a very easy add. So I suggest:
client.simSetDetectionFilterRadius("0", 80 * 100) # in [cm]
client.simAddDetectionFilterMeshName("0", "Car_*") 
Change the simAddDetectionFilterMeshName to accept a regex string instead of just a single wildcard similar to the simListSceneObjects api.

Add a simClearDetections("regex") call or to make it easier, a simClearDetections() with no search criteria. I would like this in case I have a scene with dynamic objects and I want to poll for new objects at a certain frequency. By clearing detections and adding detections.

Modify simGetDetections to return a Detection with the 3D boundingBox info also.

alonfaraj · 2021-03-29T17:09:11Z

@MoBaT Thanks for the suggestions!
I will probably make those changes soon.

About 4 - what is the purpose of 3D BB? Should it be in Geo as well?

alonfaraj · 2021-03-29T17:13:15Z

@PPakalns Thank you very much for testing and fix those bugs!
I will make some fixes soon.

@evroon Seems like you already discussed it but I like your approach too :)
As @evroon said, my scenario is mostly when object are partially occluded by others and in this case I think it might be easier to use a "real" detection and not a segmentation.

- Move FString ctor outside of loop

alonfaraj · 2021-05-24T08:45:04Z

@zimmy87 Seems like all set now.

Unreal/Plugins/AirSim/Source/DetectionComponent.cpp

Unreal/Plugins/AirSim/Source/PawnSimApi.cpp

…nly by API request

zimmy87

Mostly comments on naming guidelines

Unity/AirLibWrapper/AirsimWrapper/Source/PawnSimApi.cpp

Unreal/Plugins/AirSim/Source/DetectionComponent.cpp

Unreal/Plugins/AirSim/Source/DetectionComponent.h

Unreal/Plugins/AirSim/Source/ObjectFilter.cpp

AirLib/include/vehicles/car/api/CarRpcLibAdaptors.hpp

alonfaraj · 2021-06-08T08:01:33Z

@zimmy87 Thanks for the review! Hopefully I didn't miss anything

adding setup_path.py for convenience in calling detection.py

zimmy87

latest revision looks good to me; will move ahead with merge once all checks pass

jonyMarino · 2021-06-10T16:41:51Z

Hi, @alonfaraj! Congratulations on this merged Pull Request. You are in the top 5 AirSim contributors! However, this contribution would have a much greater impact if it had associated documentation. Can you create a new PR with documentation?

alonfaraj · 2021-06-10T17:12:32Z

@jonyMarino Thank you!
Sure, no problem.

rajat2004 · 2021-06-12T18:31:56Z

Should have mentioned this earlier, the API has a image_type argument, but the detection component is only present for Scene in PIPCamera.cpp, and different image types also don't really make sense since all the images will be same for the detections. Should the image_type arg be kept at all?

alonfaraj · 2021-06-13T07:37:07Z

@rajat2004 you right, this PR currently support only Scene as mentioned in the first post above and it's easy to extend it for all other types. I thought it would be mostly relevant to Scene so didn't add it for all other types.

I added the image_type argument because different image types (for the same camera) can have different parameters such as resolution, FOV etc. which can lead to different detection results.

I'm wondering if add it to all other types is necessary or remove the image_type argument.

rajat2004 · 2021-06-13T08:11:45Z

Yeah, makes sense to have different detections for each image type as well. Since the API already has the image_type arg, adding support for other types will be good

LIU-Xueming · 2021-11-05T09:15:38Z

Hi, @alonfaraj
Thank you very much for your work, it is very helpful to me！

But Could you please explain these parameters in detail? ,such as
geoPoint = GeoPoint()
relative_pose = Pose()

I am confused about these two parameters

Thanks a lot！

alonfaraj · 2021-11-07T10:12:20Z

Thank you @LIU-Xueming,

geoPoint is the geographical coordinate of the detected object. It's relevant in case you specify OriginGeopoint you can read more about it here.
relative_pose is position and orientation of the detected object, relative to the camera which generate the detection.

zohaibjan · 2022-08-08T03:04:24Z

Hi, @alonfaraj

I have a quick question. How will you accommodate for occluded objects?. Is there a way to determine whether the object is occluded or not ?. I potentially want to exclude the bounding box of an object if it is occluded.

Thank you.

alonfaraj added 8 commits March 16, 2021 10:21

- Add detections python api

4fd72e0

- Merge from AirsimHost

b89fe26

- Add GetDetection python api

b29b174

- Add simGetDetections to python api

e825439

- Update files from AirSimHost

52fe8fb

- Merge from AirSimHost

5c5e27f

- Update detection api

8acc4b2

- Add python script detection example

18083d1

alonfaraj changed the title ~~Add object detection ability and python API~~ Add object detection capability and python API Mar 16, 2021

rajat2004 reviewed Mar 24, 2021

View reviewed changes

jonyMarino closed this Mar 25, 2021

jonyMarino reopened this Mar 25, 2021

jonyMarino added the feature request label Mar 25, 2021

PPakalns reviewed Mar 29, 2021

View reviewed changes

Unreal/Plugins/AirSim/Source/ObjectFilter.cpp Outdated Show resolved Hide resolved

PPakalns reviewed Mar 29, 2021

View reviewed changes

Unreal/Plugins/AirSim/Source/ObjectFilter.cpp Outdated Show resolved Hide resolved

Unreal/Plugins/AirSim/Source/PawnSimApi.cpp Outdated Show resolved Hide resolved

evroon suggested changes Mar 29, 2021

View reviewed changes

AirLib/include/api/RpcLibAdapatorsBase.hpp Outdated Show resolved Hide resolved

alonfaraj closed this Mar 29, 2021

alonfaraj reopened this Mar 29, 2021

alonfaraj added 2 commits March 29, 2021 19:16

- Remove unused include

66c514b

- Shorten detectionIterator to itr

9e9ed86

- Move FString ctor outside of loop

- Fix clang format errors

10746e9

rajat2004 reviewed May 24, 2021

View reviewed changes

alonfaraj added 4 commits May 25, 2021 10:55

- Remove stray semicolon

442ee51

- Fix DpethPlanar name after merge

ee6ee8e

- Return detections by const ref

da45423

- Remove unused code

23439ff

evroon mentioned this pull request May 28, 2021

Get location and bounding box of objects in environment #232

Closed

- Move the compute detection logic to GetDetections() to compute it o…

995030c

…nly by API request

zimmy87 suggested changes Jun 3, 2021

View reviewed changes

rajat2004 mentioned this pull request Jun 4, 2021

Add support for fixed external cameras #3320

Merged

11 tasks

alonfaraj added 4 commits June 6, 2021 10:46

- Add missing unused() calls

7cd5229

- Modify code to follow naming convention and guidelines

8dc625b

- Fix variables name after rename

726ea60

- Fix clang errors

9425d35

alonfaraj requested a review from zimmy87 June 6, 2021 09:14

Create setup_path.py

0f463a8

adding setup_path.py for convenience in calling detection.py

zimmy87 approved these changes Jun 8, 2021

View reviewed changes

zimmy87 merged commit 83119a6 into microsoft:master Jun 8, 2021

This was referenced Jun 13, 2021

Add detection support for all image types #3788

Merged

Add object detection documentation #3789

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add object detection capability and python API #3472

Add object detection capability and python API #3472

alonfaraj commented Mar 16, 2021 •

edited

Loading

alonfaraj commented Mar 24, 2021

rajat2004 commented Mar 24, 2021

rajat2004 left a comment

evroon commented Mar 25, 2021

PPakalns commented Mar 25, 2021

evroon commented Mar 25, 2021 •

edited

Loading

PPakalns commented Mar 25, 2021 •

edited

Loading

MoBaT commented Mar 26, 2021 •

edited

Loading

PPakalns left a comment

alonfaraj commented Mar 29, 2021

alonfaraj commented Mar 29, 2021 •

edited

Loading

alonfaraj commented Mar 29, 2021

alonfaraj commented May 24, 2021

zimmy87 left a comment

alonfaraj commented Jun 8, 2021

zimmy87 left a comment

jonyMarino commented Jun 10, 2021

alonfaraj commented Jun 10, 2021

rajat2004 commented Jun 12, 2021

alonfaraj commented Jun 13, 2021 •

edited

Loading

rajat2004 commented Jun 13, 2021

LIU-Xueming commented Nov 5, 2021

alonfaraj commented Nov 7, 2021

zohaibjan commented Aug 8, 2022

Add object detection capability and python API #3472

Add object detection capability and python API #3472

Conversation

alonfaraj commented Mar 16, 2021 • edited Loading

About

How Has This Been Tested?

Screenshots (if appropriate):

alonfaraj commented Mar 24, 2021

rajat2004 commented Mar 24, 2021

rajat2004 left a comment

Choose a reason for hiding this comment

evroon commented Mar 25, 2021

PPakalns commented Mar 25, 2021

evroon commented Mar 25, 2021 • edited Loading

PPakalns commented Mar 25, 2021 • edited Loading

MoBaT commented Mar 26, 2021 • edited Loading

PPakalns left a comment

Choose a reason for hiding this comment

alonfaraj commented Mar 29, 2021

alonfaraj commented Mar 29, 2021 • edited Loading

alonfaraj commented Mar 29, 2021

alonfaraj commented May 24, 2021

zimmy87 left a comment

Choose a reason for hiding this comment

alonfaraj commented Jun 8, 2021

zimmy87 left a comment

Choose a reason for hiding this comment

jonyMarino commented Jun 10, 2021

alonfaraj commented Jun 10, 2021

rajat2004 commented Jun 12, 2021

alonfaraj commented Jun 13, 2021 • edited Loading

rajat2004 commented Jun 13, 2021

LIU-Xueming commented Nov 5, 2021

alonfaraj commented Nov 7, 2021

zohaibjan commented Aug 8, 2022

alonfaraj commented Mar 16, 2021 •

edited

Loading

evroon commented Mar 25, 2021 •

edited

Loading

PPakalns commented Mar 25, 2021 •

edited

Loading

MoBaT commented Mar 26, 2021 •

edited

Loading

alonfaraj commented Mar 29, 2021 •

edited

Loading

alonfaraj commented Jun 13, 2021 •

edited

Loading