This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

bad performance on the same wild video #6

Open
bucktoothsir opened this issue Dec 12, 2018 · 14 comments

Comments

@bucktoothsir

bucktoothsir commented Dec 12, 2018

Hello,

  1. I downloaded the same skating video, at 1920x1080 resolution, from YouTube.
  2. I predicted 2D COCO joints for this video with the model you provided in Test in the wild #2.
  3. I made a dataset file and replaced res_w and res_h in h36m_dataset.py.
  4. Then I got the following result with d-pt-243.bin:

[image: my 3D reconstruction]

It is clearly worse than your result:

[image: the author's 3D reconstruction]

I noticed that your video has a higher resolution and much more accurate 2D joints. Could you please release the original skating video and the test-in-the-wild code?

@Godatplay

Godatplay commented Dec 12, 2018

In terms of the output resolution, you set that with --viz-size. I chose 10 and it seems close, the default is 5.

I'm not sure how much difference it'll make, but consider also changing center, since all three are used to renormalize the camera.

How did you build your dataset file?
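For reference, a full render invocation using those flags might look like the following; the checkpoint and dataset names are taken from this thread or are illustrative, so adjust them to your setup:

```shell
# Render a 3D reconstruction over the input video at a larger
# visualization size (--viz-size 10 instead of the default 5).
# Subject/action names follow the custom dataset discussed below.
python run.py -d h36m -k detectron_wild -arc 3,3,3,3,3 -c checkpoint \
    --evaluate d-pt-243.bin --render \
    --viz-subject S0 --viz-action skating --viz-camera 0 \
    --viz-video skating.mp4 --viz-output output.mp4 --viz-size 10
```
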

@dariopavllo
Contributor

Did you follow the instructions mentioned in my last post here?

Also, in this comment I mentioned that we used CPN to extract the 2D keypoints for the videos in the wild, which produces slightly better results. Anyway, if you followed the steps correctly, Detectron poses should be very similar.

We took the video from YouTube as well, in 1080p resolution.

@wishvivek

@bucktoothsir Regarding getting visualizations of in-the-wild videos, in the second step, where you converted the input video to individual frames, how did you preprocess this incoming frame (scale, crop, center, etc.?) before getting the output from the Detectron?

@bucktoothsir bucktoothsir changed the title bad performance in the same wild video bad performance on the same wild video Dec 13, 2018
@bucktoothsir
Author

@Godatplay

> In terms of the output resolution, you set that with --viz-size. I chose 10 and it seems close, the default is 5.
>
> I'm not sure how much difference it'll make, but consider also changing center as well since all 3 are used to renormalize the camera.
>
> How did you build your dataset file?

Your advice worked, thanks. I now get a high-resolution output, but the performance is still bad.

I built a dataset file with the same structure as the original one. Specifically, I built a fake 3D dataset file and a 2D dataset file. The structure is 'S0/skating'; you can rename the subjects and actions, then change the corresponding names in your test scripts.
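A minimal sketch of such a fake 2D dataset file, assuming the .npz layout I believe the repository's Human3.6M loaders expect (the key names, shapes, and metadata fields here are assumptions, not verbatim repository code):

```python
import numpy as np

# Hypothetical example: package per-frame 2D keypoints into an .npz file
# mirroring the repository's Human3.6M 2D dataset layout, using a fake
# subject "S0" and a single action "skating" as described above.
# Keypoint array shape: (num_frames, num_joints, 2) in pixel coordinates.
num_frames, num_joints = 100, 17
keypoints = np.random.rand(num_frames, num_joints, 2).astype('float32')

positions_2d = {
    'S0': {
        'skating': [keypoints],  # one array per camera view
    }
}
# Metadata fields are illustrative; match whatever your loader reads.
metadata = {'num_joints': num_joints, 'layout_name': 'coco'}

np.savez_compressed('data_2d_h36m_detectron_wild.npz',
                    positions_2d=positions_2d, metadata=metadata)
```

Because the top-level values are Python dicts, loading this file back requires `np.load(..., allow_pickle=True)`.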

@bucktoothsir
Author

> @bucktoothsir Regarding getting visualizations of in-the-wild videos, in the second step, where you converted the input video to individual frames, how did you preprocess this incoming frame (scale, crop, center, etc.) before getting the output from Detectron?

I didn't take any preprocessing steps.
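For anyone wanting to reproduce this, a plain frame extraction with no scaling or cropping can be done with ffmpeg; the file and directory names here are illustrative:

```shell
mkdir -p videoframes
# Extract every frame as a high-quality JPEG at the original resolution
ffmpeg -i skating.mp4 -qscale:v 2 videoframes/frame_%06d.jpg
```
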

@bucktoothsir
Author

> In terms of the output resolution, you set that with --viz-size. I chose 10 and it seems close, the default is 5.
>
> I'm not sure how much difference it'll make, but consider also changing center as well since all 3 are used to renormalize the camera.
>
> How did you build your dataset file?

I also wrote the dataset file myself.

@wishvivek

wishvivek commented Dec 18, 2018

@bucktoothsir Thanks for the response. Also, I'm trying to get keypoints on my images using the Detectron model (the R-50-FPN end-to-end keypoint-only Mask R-CNN baseline on this page), with the command:

python Detectron.pytorch/tools/infer_simple.py --dataset coco --cfg Detectron.pytorch/configs/baselines/e2e_keypoint_rcnn_R-50-FPN_1x.yaml --load_detectron Detectron.pytorch/data/pretrained_model/e2e_keypoint_rcnn_R-50-FPN_1x.pkl --image_dir videoframes --output_dir Detectron.pytorch/keypoints

but getting this error:

RuntimeError: The expanded size of the tensor (81) must match the existing size (2) at non-singleton dimension 0

So, it'll be great if you (or anyone else reading this) could provide any hints on how you're obtaining keypoints through this process. Thanks!

@bucktoothsir
Author

@wishvivek Which version of Python are you using?

@wishvivek

wishvivek commented Dec 20, 2018

@dariopavllo I have the 3D predictions from the model for my in-the-wild video, but they're all normalized (i.e., [-1,1]). So,

  1. How do I unnormalize these 3D predictions? (My objective is to visualize the 3D reconstruction, just like the results at the top of this page.)
  2. Usually, we use the mean and std of the dataset to normalize and unnormalize our data (e.g., as is done here). To my understanding, this is done w.r.t. the root joint. So, what is the normalization/unnormalization scheme used here?

Any help will be great, thanks!
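For the 2D inputs, I believe the repository maps pixel coordinates to roughly [-1, 1] as sketched below (based on my reading of common/camera.py; treat this as an assumption rather than verbatim library code). As far as I understand, the 3D outputs are root-relative camera-space coordinates in meters, so there is no dataset mean/std to add back, only the missing root trajectory.

```python
import numpy as np

def normalize_screen_coordinates(X, w, h):
    # Map pixel coordinates so that x spans [-1, 1]; y is scaled by the
    # same factor (2/w) to preserve the aspect ratio.
    return X / w * 2 - np.array([1, h / w])

def image_coordinates(X, w, h):
    # Inverse mapping, back to pixel coordinates.
    return (X + np.array([1, h / w])) * w / 2

pts = np.array([[960.0, 540.0]])   # center of a 1920x1080 frame
norm = normalize_screen_coordinates(pts, 1920, 1080)
back = image_coordinates(norm, 1920, 1080)
```

With these definitions the image center maps to (0, 0) and round-trips exactly back to (960, 540).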

@lxy5513

lxy5513 commented Jan 12, 2019

How do I get keypoints and bounding boxes for 12_2017_baselines/e2e_keypoint_rcnn_R-101-FPN_s1x.yaml?

Is there an already-trained model for getting 2D keypoints and bboxes, like /path/to/e2e_keypoint_rcnn_R-101-FPN_s1x.pkl, or do I need to train one on Detectron myself? Can anyone help me? Thanks a lot.

@bucktoothsir
Author

> How do I get keypoints and bounding boxes for 12_2017_baselines/e2e_keypoint_rcnn_R-101-FPN_s1x.yaml?
>
> Is there an already-trained model for getting 2D keypoints and bboxes, like /path/to/e2e_keypoint_rcnn_R-101-FPN_s1x.pkl, or do I need to train one on Detectron myself?

I used Detectron, as the author advised.

@tobiascz

tobiascz commented Jan 28, 2019

Thanks @bucktoothsir for pointing me to this issue!

As I already mentioned in #2, I was also able to run the code on an in-the-wild example with my own fork of this repository. I also have some notes on Detectron in there for people having difficulties. My 3D results are also much worse than the results created by @dariopavllo. I think my 2D poses are not accurate enough; thanks also to @lxy5513, who suggested that.

So my next step is to run the Detectron poses through CPN to get better 2D results. If someone has another opinion, please share; maybe I did something wrong in my code?

My output:

[image: my 3D reconstruction output]

Author's output:

[image: the author's 3D reconstruction output]

@YCyuchen

@Godatplay @tobiascz I used the inference code to run my own video, taking Detectron's 2D keypoints as input. The buttocks in my output seem fixed, while I think they should move. Have you met a similar problem? Is there any potential solution I can try to improve the result?

My output:

[image: my reconstruction, crop-sport-lift2]

@tobiascz

tobiascz commented Feb 15, 2020

Hey @YCyuchen,

The reason for that is that the 3D skeleton is always visualized relative to the central hip joint (what you called the buttocks). To avoid this, you could use the ankles as the relative center of the visualization.
In your test video you can see that, while the person is crouching, the legs actually go up in the reconstruction.

@dariopavllo already discussed this in #51.
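The ankle-centered visualization described above can be sketched as follows; the function name and joint indices are hypothetical, so map them to your actual skeleton layout:

```python
import numpy as np

def recenter_on_ankles(poses, left_ankle=3, right_ankle=6):
    """Re-center a root-relative pose sequence for visualization.

    poses: (frames, joints, 3) array of 3D joint positions.
    Subtracts the per-frame mean ankle position so the feet stay fixed
    instead of the hip, making crouching motions visible.
    """
    anchor = (poses[:, left_ankle] + poses[:, right_ankle]) / 2
    return poses - anchor[:, None, :]

# Toy check: a skeleton uniformly shifted up in frame 1 should be
# pulled back to the same place once re-centered on the ankles.
poses = np.zeros((2, 17, 3))
poses[1, :, 2] = 1.0
out = recenter_on_ankles(poses)
```
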
