Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chinese verison of paper #9

Open
yamiefun opened this issue Feb 24, 2021 · 8 comments
Open

Chinese verison of paper #9

yamiefun opened this issue Feb 24, 2021 · 8 comments

Comments

@yamiefun
Copy link

Hi,
Have you published the paper in Chinese?

Thanks very much.

@weixk2015
Copy link
Owner

Hi, sorry but no Chinese version is published yet and actually we don't have any plan to publish a Chinese version now.

@yamiefun
Copy link
Author

I have some question about the cost volume described in the paper.
In the supplemental materials, in the cost volume P-CV paragraph, you said "And the initial reference view depth map is projected to align numeric values with the warped source view depth map." But in the next paragraph about D-CV, you said "And the depth plane is projected to align with the source view depth map".
I wonder which one is correct? Align with "warped source view depth map" or "source view depth map"?

@weixk2015
Copy link
Owner

Sorry for the ambiguity in the paper. The initial reference view depth map/The depth plane is projected to align numeric values with the warped source view depth map. It is a typo in the D-CV paragraph, which should be "the depth plane is projected to align with the warped source view depth map"

@yamiefun
Copy link
Author

yamiefun commented Feb 25, 2021

Thanks very much!
In the Architecture paragraph talking about depth consistency, you said that "The second channels contains the depth values of the virtual planes in the target view by seeing them from the source view". I don't understand this. The depth value of a specific virtual plane should be a constant right? How do I make a constant number to a W*H size slice and concatenate it to the cost volume? Do I misunderstand anything?

@weixk2015
Copy link
Owner

weixk2015 commented Feb 25, 2021

Yes, a plane is a tensor of size W*H and all pixels are set as the same virtual depth value in the camera coordinate system of the target view. The plane(z coordinate) is transformed to the camera coordinate system of the source view following the camera projection equation.

@yamiefun
Copy link
Author

Thanks! I got it.

@yamiefun
Copy link
Author

yamiefun commented Feb 25, 2021

Hi, I have another question about the loss function in paragraph 3.5. I don't know what is "predicted coarse depth map and refined depth map". I don't understand where can you get the coarse depth map in your architecture? In paragraph 3.4, you will get depth maps after cost aggregation and soft-argmax, which is correspond to the "Refinement" part in Fig2, right?

Or the coarse depth map means the initial depth map, which is provided by DeMoN dataset?

@weixk2015
Copy link
Owner

The coarse depth map in 3.5 refers to the depth maps after cost aggregation and soft-argmax

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants