Support 3D pose estimation #35

Open
tucan9389 opened this issue Aug 1, 2020 · 5 comments
tucan9389 commented Aug 1, 2020

@tucan9389 tucan9389 self-assigned this Aug 1, 2020

sebo361 commented Oct 6, 2020

Hi @tucan9389 how about using ARKit's 3D motion tracking: https://developer.apple.com/documentation/arkit/capturing_body_motion_in_3d ? Or do you think your proposed 3D pose estimation is more accurate?

tucan9389 commented

@sebo361
Thanks for your suggestion. I agree that ARKit is a good fit in that case.
But I think Core ML has some additional benefits over ARKit:

  1. When you want to infer a person's 3D keypoints from a single image
  2. When you want to infer 3D keypoints not only for a person but also for an object, using a model you trained yourself


sebo361 commented Oct 6, 2020

@tucan9389
True, these are interesting benefits to explore! However, I am more interested in comparing the two approaches in terms of inference speed and accuracy of 3D human pose keypoints. Now that the latest iPad Pro (and iPhone 12) have a LiDAR sensor, I expect more precision in 3D human motion tracking, but I am not sure whether ARKit's 3D motion tracking uses the LiDAR Depth API by default - do you have any insight into that?

Maybe I should set up a project to compare ARKit's 3D motion tracking with your proposed Core ML 3D pose estimation.


jookovjook commented Nov 13, 2020

@sebo361
Also, Apple's 3D body tracking (ARBodyTrackingConfiguration) can't run simultaneously with world tracking (ARWorldTrackingConfiguration). So, unfortunately, you can't use Apple's 3D body tracking if you also need world tracking.

tucan9389 commented

https://github.com/tucan9389/PoseEstimation-TFLiteSwift

Here is a 3D pose estimation demo built on TFLiteSwift. I implemented soft-argmax in pure Swift with the Accelerate framework. NumPy, PyTorch, and TensorFlow make it easy to sum over a chosen dimension, but Swift has no soft-argmax for multi-dimensional matrices (tensors), so the implementation was a bit tricky.
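For reference, the soft-argmax idea can be sketched in a few lines of NumPy (the Swift/Accelerate version in the repo follows the same principle; the function name and the `beta` sharpening parameter here are my own illustration, not taken from the repo):

```python
import numpy as np

def soft_argmax_2d(heatmap, beta=100.0):
    """Differentiable argmax over a 2D heatmap.

    Returns sub-pixel (x, y) as the softmax-weighted average
    of the pixel coordinate grid.
    """
    h, w = heatmap.shape
    # Softmax over all pixels; beta sharpens the distribution
    # toward the hard argmax.
    logits = beta * heatmap.reshape(-1)
    logits -= logits.max()          # numerical stability
    probs = np.exp(logits)
    probs /= probs.sum()
    probs = probs.reshape(h, w)
    # Expected coordinate: sum of probability * coordinate.
    ys, xs = np.mgrid[0:h, 0:w]
    return float((probs * xs).sum()), float((probs * ys).sum())

# A peak near column 3, row 1 yields coordinates close to (3.0, 1.0).
hm = np.zeros((5, 5))
hm[1, 3] = 1.0
x, y = soft_argmax_2d(hm)
```

In a framework with tensor reductions this is two `sum` calls over the coordinate axes; in pure Swift the same reductions have to be written out manually (e.g. with `vDSP` dot products), which is what makes the port awkward.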
