
Get world depth instead of FOV depth [C++] [RealSense] #7279

Closed
Jethrootje opened this issue Sep 8, 2020 · 11 comments

@Jethrootje

Required Info
Camera Model: D435
Firmware Version: Latest
Operating System & Version: Windows 10
Kernel Version (Linux Only): (e.g. 4.14.13)
Platform: PC
SDK Version: latest
Language: C++ / OpenCV
Segment: ?

Issue Description

I've made a small project to test some things with rs2_deproject_pixel_to_point. I've seen some issue threads about it, but wasn't able to find any useful information. I'm working on detection within zones, and the problem is that the camera measures depth in its own field-of-view perspective instead of the actual depth. I was going to work out a Pythagorean method myself to calculate it, but then I found the deproject method. My problem now is that it returns the exact same depth, and X is sometimes a negative number, which shouldn't be happening if I'm correct.

    #include <librealsense2/rs.hpp>
    #include <librealsense2/rsutil.h>   // rs2_deproject_pixel_to_point
    #include <opencv2/opencv.hpp>
    #include <iostream>
    #include "cv-helpers.hpp"           // frame_to_mat, from the SDK's OpenCV examples

    using namespace cv;
    using namespace rs2;
    using namespace std;

    int main()
    {
        pipeline p;
        p.start();
        // Align depth to the color stream; construct the align object once,
        // outside the loop, instead of on every frame
        rs2::align align_to(RS2_STREAM_COLOR);

        float add = 0;
        while (true)
        {
            rs2::frameset frames = p.wait_for_frames();
            rs2::frameset aligned_frameset = align_to.process(frames);
            rs2::depth_frame depth = aligned_frameset.get_depth_frame();
            rs2::frame color_frame = aligned_frameset.get_color_frame();
            auto img = frame_to_mat(color_frame);

            // Intrinsics of the (color-aligned) stream, needed for deprojection
            auto frt = color_frame.get_profile()
                .as<video_stream_profile>().get_intrinsics();
            float point[3] = { 0, 0, 0 };
            // The sample pixel sweeps across the image: x = 50, 100, ..., 400
            float checkPoint[2] = { 50 + add, 300 };
            add += 50;
            if (add > 400) {
                add = 0;
            }

            // Depth ("Z") in meters at the sample pixel
            float checkDepth = depth.get_distance((int)checkPoint[0], (int)checkPoint[1]);
            cout << checkDepth << "\n";
            // Deproject pixel + Z into a 3D point [X, Y, Z] in the camera's CS
            rs2_deproject_pixel_to_point(point, &frt, checkPoint, checkDepth);
            cout << point[0] << ", " << point[1] << ", " << point[2] << "\n\n";

            imshow("Test", img);
            waitKey(1);
        }
    }

Output:

[image: console output]

As you can see, the first depth (from get_distance) stays the same as the second depth (point[2] after deprojection), and X is sometimes a negative number.

@MartyG-RealSense
Collaborator

MartyG-RealSense commented Sep 8, 2020

Hi @Jethrootje The 3D origin (0,0,0) of the camera is at the center of the physical left IR imager component. As described in the Projection documentation (see the link below), the positive x-axis points to the right, the positive y-axis points down, and the positive z-axis points forward.

https://github.com/IntelRealSense/librealsense/wiki/Projection-in-RealSense-SDK-2.0#point-coordinates

So if a point is in the top-left area of the image, it could have negative x and negative y values, because it is to the left of the imager center (minus X axis) and above the imager center (minus Y axis).

Therefore, a minus X and positive Y result would refer to a coordinate that is to the left of and below the imager's center.
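
To make the sign convention concrete, here is a minimal sketch (editorial, not from the thread) that reuses `frt` and `depth` from the code earlier in the issue and deprojects a pixel from the top-left quadrant:

    // Deproject a pixel left of and above the principal point.
    // `frt` (rs2_intrinsics) and `depth` (rs2::depth_frame) come from
    // the code posted earlier in this issue.
    float pixel[2] = { 100.0f, 100.0f };
    float z = depth.get_distance((int)pixel[0], (int)pixel[1]);
    float pt[3];
    rs2_deproject_pixel_to_point(pt, &frt, pixel, z);
    // For a principal point near the image center: pt[0] < 0 (left of center),
    // pt[1] < 0 (above center), and pt[2] == z always.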

@Jethrootje
Author

> Hi @Jethrootje The 3D origin (0,0,0) of the camera is at the center of the physical left IR imager component. As described in the Projection documentation (see the link below), the positive x-axis points to the right, the positive y-axis points down, and the positive z-axis points forward.
>
> https://github.com/IntelRealSense/librealsense/wiki/Projection-in-RealSense-SDK-2.0#point-coordinates
>
> So if a point is in the top-left area of the image, it could have negative x and negative y values, because it is to the left of the imager center (minus X axis) and above the imager center (minus Y axis).
>
> Therefore, a minus X and positive Y result would refer to a coordinate that is to the left of and below the imager's center. That is my understanding of the principles.

So basically a minus X and positive Y result is normal? Then what about the distance?
The distance doesn't seem to change if I use rs2_deproject_pixel_to_point instead of only using get_distance.

@Jethrootje
Author

Jethrootje commented Sep 9, 2020

So now I've found this, but my code looks exactly the same (in C++ instead of Python), and the depth from the FOV isn't any different from the depth I'd get if I viewed it as "world depth".

He also says:

> I translated the camera coordinates of each point found by deprojection to world coordinates, I ignored the Y coordinate and projected the X and the Z coordinates to the XZ world coordinate plane.

Now I'm kind of confused about how he translates the Z coordinate to a world coordinate, because right now it still depends on the FOV of the camera.
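
For reference, the approach quoted above (drop Y and project onto the XZ plane) would look something like this minimal sketch (editorial; `p` is assumed to be a 3D point produced by rs2_deproject_pixel_to_point):

    #include <cmath>

    // Horizontal-plane distance to a deprojected point p[3] = {X, Y, Z}:
    // ignore the height (Y) and project onto the XZ world plane.
    float planar_range(const float p[3])
    {
        return std::sqrt(p[0] * p[0] + p[2] * p[2]);
    }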

@MartyG-RealSense
Collaborator

Could you please clarify what you mean when you say "The distance doesn't seem to change"?

Do you mean that the distance does not update when you move the camera while using rs2_deproject_pixel_to_point? Or that the depth reading is the same when using rs2_deproject_pixel_to_point as when using get_distance()?

@Jethrootje
Author

> Could you please clarify what you mean when you say "The distance doesn't seem to change"?
>
> Do you mean that the distance does not update when you move the camera while using rs2_deproject_pixel_to_point? Or that the depth reading is the same when using rs2_deproject_pixel_to_point as when using get_distance()?

The second.
Basically, my problem is that I'm trying to detect things within an area:
X (starting X)
Y (starting Y)
Width (end X)
Height (end Y)
MinimumZ (minimum depth)
MaximumZ (maximum depth)

My problem is basically this:
[image: diagram of the problem]
I want it to detect everything at the same depth, but if I stack multiple areas on top of each other, the areas sometimes get mismatched because the depth is calculated from the FOV. I thought rs2_deproject_pixel_to_point would be able to fix that, but I think I'm wrong about that part. Not sure how I'd do it, though.
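
A minimal sketch of such a zone test (editorial; the `Area` struct and its field names are hypothetical, mirroring the list above) that compares against the deprojected camera-space point rather than raw pixel coordinates:

    // Hypothetical detection zone expressed in camera-space meters.
    struct Area {
        float startX, startY;  // zone origin (starting X, starting Y)
        float endX, endY;      // zone extent (end X, end Y)
        float minZ, maxZ;      // accepted depth range
    };

    // True if a deprojected point pt[3] = {X, Y, Z} falls inside the zone.
    bool inside(const Area& a, const float pt[3])
    {
        return pt[0] >= a.startX && pt[0] <= a.endX &&
               pt[1] >= a.startY && pt[1] <= a.endY &&
               pt[2] >= a.minZ   && pt[2] <= a.maxZ;
    }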

@MartyG-RealSense
Collaborator

The question above is best suited to RealSense team member @ev-mp, an expert on stereo depth. @ev-mp, could you kindly provide advice to @Jethrootje on the question above, please?

@ev-mp
Collaborator

ev-mp commented Sep 10, 2020

@Jethrootje , I added some details for clarification:
[image: annotated sketch contrasting depth ("Z") with radial range]

The terms "FOV depth" and "World depth" are subjective and can mean different things to different people. So to clarify the terms:

  1. The depth stream defines a Euclidean coordinate system (CS) with the origin set at the camera's base (0,0,0).
    The definition of the axes is according to @MartyG-RealSense's explanation given above.
    From that point on, all the depth calculations are performed in that coordinate system.

  2. The content of the depth frame is the "Z" value calculated for every pixel in the camera's frustum (or cropped FOV). The sketch makes clear that while the range (or radial distance) may coincide with the depth ("Z"), in 99.99% of cases they will be different.

  3. This should answer your question:

> The distance doesn't seem to change if i use rs2_deproject_pixel_to_point instead of only using get_distance.

The call to frame.get_distance(x,y) provides the "Z" value.
The function rs2_deproject_pixel_to_point takes the "Z" component of the 3D coordinate and calculates the missing X and Y components so that it produces a coherent [X,Y,Z] location within the mentioned CS. So the "Z" component will be identical to the result obtained with frame.get_distance(x,y).
If you need to find the (radial) range from the camera to the object, then you need to calculate the Euclidean distance: sqrt(x^2 + y^2 + z^2).

  4. In case you need to translate the location of the pixel from the camera CS to an arbitrary ("World") CS (for example, relative to a person standing 2 meters behind the camera), you need to find the transformation matrix between the origin of the depth CS and the "World" CS and apply it.
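
Putting points 3 and 4 into code, a minimal sketch (editorial, not from the thread; the rotation R and translation t are placeholders you would have to determine for your own setup):

    #include <cmath>

    // p[3] = {X, Y, Z} from rs2_deproject_pixel_to_point;
    // p[2] is identical to depth.get_distance(x, y).

    // Point 3: radial range from the sensor origin (Euclidean norm).
    float radial_range(const float p[3])
    {
        return std::sqrt(p[0]*p[0] + p[1]*p[1] + p[2]*p[2]);
    }

    // Point 4: camera CS -> "World" CS via a rigid transform world = R*p + t.
    // R and t below are placeholders; measure or calibrate them yourself.
    void camera_to_world(const float p[3], float world[3])
    {
        const float R[3][3] = { {1,0,0}, {0,1,0}, {0,0,1} };  // placeholder: identity
        const float t[3]    = { 0.0f, 0.0f, 0.0f };           // placeholder: zero offset
        for (int i = 0; i < 3; ++i)
            world[i] = R[i][0]*p[0] + R[i][1]*p[1] + R[i][2]*p[2] + t[i];
    }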

@Jethrootje
Author

Jethrootje commented Sep 17, 2020

> @Jethrootje , I added some details for clarification:
> [image: annotated sketch contrasting depth ("Z") with radial range]
>
> The terms "FOV depth" and "World depth" are subjective and can mean different things to different people. So to clarify the terms:
>
> 1. The depth stream defines a Euclidean coordinate system (CS) with the origin set at the camera's base (0,0,0). The definition of the axes is according to @MartyG-RealSense's explanation given above. From that point on, all the depth calculations are performed in that coordinate system.
> 2. The content of the depth frame is the "Z" value calculated for every pixel in the camera's frustum (or cropped FOV). The sketch makes clear that while the range (or radial distance) may coincide with the depth ("Z"), in 99.99% of cases they will be different.
> 3. This should answer your question:
>
> > The distance doesn't seem to change if i use rs2_deproject_pixel_to_point instead of only using get_distance.
>
> The call to frame.get_distance(x,y) provides the "Z" value.
> The function rs2_deproject_pixel_to_point takes the "Z" component of the 3D coordinate and calculates the missing X and Y components so that it produces a coherent [X,Y,Z] location within the mentioned CS. So the "Z" component will be identical to the result obtained with frame.get_distance(x,y).
> If you need to find the (radial) range from the camera to the object, then you need to calculate the Euclidean distance: sqrt(x^2 + y^2 + z^2).
> 4. In case you need to translate the location of the pixel from the camera CS to an arbitrary ("World") CS (for example, relative to a person standing 2 meters behind the camera), you need to find the transformation matrix between the origin of the depth CS and the "World" CS and apply it.

I've been trying a lot of things, but I really can't get it to work. Basically, I want the blue arrows that you've drawn, in meters. In the picture you say that's 1.30 m, but that only holds within the FOV perspective; if you measured it straight on, it would be different. How would I calculate that distance? In other words, how can I pretend the camera is facing straight on instead of at an FOV angle? Because if you put the camera more to the side, it would give the same distance in the middle, but if you check the X and Y of the spot that was in the middle before, they would be different.

[image: diagram of the 1.10 m distances at the sides]
Basically, I want to calculate the 1.10 m at the sides. Sorry if I'm being a little confusing.

Edit:
Now I know that this would work:
[image: diagram of the working case]

But if the depth isn't at the same height, this would happen:
[image: diagram of the failing case]
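
One editorial reading of this: if the camera is tilted by a known pitch angle, rotating every deprojected point by that angle about the X axis levels the coordinate system, so Z becomes the forward distance and Y the height regardless of where the point sits in the FOV. A minimal sketch under that assumption (`theta` is a tilt angle you would have to measure yourself; its sign depends on your convention):

    #include <cmath>

    // Rotate a deprojected point p[3] = {X, Y, Z} about the X axis by
    // theta (radians) to compensate for a known camera pitch.
    void level_point(const float p[3], float theta, float out[3])
    {
        const float c = std::cos(theta), s = std::sin(theta);
        out[0] = p[0];                 // X is unchanged by a pitch rotation
        out[1] = c * p[1] - s * p[2];  // levelled height
        out[2] = s * p[1] + c * p[2];  // levelled forward distance
    }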

@MartyG-RealSense
Collaborator

Hi @Jethrootje, do you still require assistance with this case? Thanks!

@Jethrootje
Author

No, not really. I tried a few new methods. Thanks to both of you for your help!

@MartyG-RealSense
Collaborator

You're very welcome @Jethrootje - thanks for the update!
