Question on OPENCV and OPENGL data convention #1286

yimingzhou1 · 2023-01-25T01:28:51Z

You mentioned you are using OPENGL data convention in this codebase. However I saw the following code in colmap_to_json() in nerfstudio/process_data/colmap_utils.py. Assume the output of COLMAP is in OPENCV data convention, can you please explain the following transformation code? Why are you switching the rows and why the third row is multiplied with -1? Thank you!

# Convert from COLMAP's camera coordinate system to ours
c2w[0:3, 1:3] *= -1
c2w = c2w[np.array([1, 0, 2, 3]), :]
c2w[2, :] *= -1

The text was updated successfully, but these errors were encountered:

rockywind · 2023-02-02T09:35:31Z

I also confused that.

tancik · 2023-02-02T17:25:08Z

Here is some info on the coordinate system - https://docs.nerf.studio/en/latest/quickstart/data_conventions.html#camera-view-space

rockywind · 2023-02-03T02:11:12Z

Thank you!

duonglt19 · 2023-02-09T04:31:57Z

I can get the meaning of the line c2w[0:3, 1:3] *= -1 that change camera coordinate system of orientation from [x,y,z] in OpenCV to [x,-y,-z] in OpenGL, but can please you explain why do you swap rows and flip z in the following code? Thank you.

c2w = c2w[np.array([1, 0, 2, 3]), :]
c2w[2, :] *= -1

Kai-46 · 2023-02-13T01:14:43Z

@duonglt19 @tancik I also got confused by these row swapping and z flipping operation, and did a quick investigation. It seems that these two operations have the effect of swapping x and y axes, and flipping z axis in the world space.

I'm not sure why this is needed though. (If it's not important, I could submit a PR removing these two lines to avoid future confusions.)

Here's the brief proof. (Please feel free to point out any errors)
First, note that these two lines can be summarized into matrix form:

A = np.array([[0, 1, 0, 0],
       [1, 0, 0, 0],
       [0, 0, -1, 0],
       [0, 0, 0, 1])).astype(float)
C2W = A @ C2W

The line c2w[0:3, 1:3] *= -1 can also be written in matrix form:

B = np.diag([1, -1, -1, 1]).astype(float)
C2W = C2W @ B

Putting together, we have the following matrix formulation of the function colmap2nerfstudio:

C2W = A @ C2W @ B

(Btw, the above A, B happen to satisfy A^{-1}=A, B^{-1}=B.)

Suppose we have a 3d point p=[x, y, z, 1]^T in camera space and denote C2W @ B @ p as [X, Y, Z, 1]^T. The additional A basically changes the world-space coordinate of this 3D point from [X, Y, Z, 1]^T to [Y, X, -Z, 1]^T. In other words, it swaps x and y axes, and flips z axis of the world coordinate frame.

wuzirui · 2023-05-29T08:24:09Z

It seems like the code first convert OPENCV camera coordinate (right down forward, RDF) to OPENGL coordinate (right up backward, RUB), and then change the world coordinate from RUB to (down right back, DRB), which can also be seen in the original nerf documentation .

But why do we need to convert RUB to DRB exactly?
this issue is also mentioned in #1504

nnop · 2024-02-28T13:31:34Z

It seems this logic origins from instant-ngp (code location).

Could @Tom94 @mmalex explain a bit for this?

maybeLx · 2024-11-28T07:54:25Z

It seems like the code first convert OPENCV camera coordinate (right down forward, RDF) to OPENGL coordinate (right up backward, RUB), and then change the world coordinate from RUB to (down right back, DRB), which can also be seen in the original nerf documentation .

But why do we need to convert RUB to DRB exactly? this issue is also mentioned in #1504

Maybe Nerfstuido not only want to use OpenGL camera coordinate, but aslo they want rotate the whole world system. multipling from right changes the camera coordinate, multipling from left changes the whole world system (it just like you want to roate the whole point cloud.) #2793

yimingzhou1 closed this as completed Feb 8, 2023

Ir1d mentioned this issue Mar 26, 2024

"applied_transform" in camera json file DL3DV-10K/Dataset#4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on OPENCV and OPENGL data convention #1286

Question on OPENCV and OPENGL data convention #1286

yimingzhou1 commented Jan 25, 2023

rockywind commented Feb 2, 2023

tancik commented Feb 2, 2023

rockywind commented Feb 3, 2023

duonglt19 commented Feb 9, 2023

Kai-46 commented Feb 13, 2023 •

edited

Loading

wuzirui commented May 29, 2023

nnop commented Feb 28, 2024 •

edited

Loading

maybeLx commented Nov 28, 2024

Question on OPENCV and OPENGL data convention #1286

Question on OPENCV and OPENGL data convention #1286

Comments

yimingzhou1 commented Jan 25, 2023

rockywind commented Feb 2, 2023

tancik commented Feb 2, 2023

rockywind commented Feb 3, 2023

duonglt19 commented Feb 9, 2023

Kai-46 commented Feb 13, 2023 • edited Loading

wuzirui commented May 29, 2023

nnop commented Feb 28, 2024 • edited Loading

maybeLx commented Nov 28, 2024

Kai-46 commented Feb 13, 2023 •

edited

Loading

nnop commented Feb 28, 2024 •

edited

Loading