
WebVMT Video Pose Use Case #2

Closed
rjksmith opened this issue Feb 16, 2021 · 5 comments

Comments

@rjksmith

Question: How can GeoPose be integrated with timed video metadata to record camera location and orientation for moving images on the web?

Background

Moving object trajectories can be represented as WebVMT paths by recording location periodically and using interpolation to calculate intermediate values at any instant during the media timeline - a design aligned with OGC Moving Features. A camera pose feature has been proposed to extend this process to calculate GeoPose by recording camera orientation details based on discussion in the Spatial Data on the Web meeting on 25 June 2019 in Leuven.
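For illustration, a minimal sketch of the interpolation step described above, assuming two timed location samples that bracket the requested media time; the field names (`time`, `lat`, `lng`) are illustrative rather than WebVMT syntax:

```python
def interpolate_location(sample_a, sample_b, t):
    """Linearly interpolate latitude/longitude between two timed samples.

    sample_a, sample_b: dicts with 'time' (seconds), 'lat' and 'lng' (degrees).
    t: media time in seconds, with sample_a['time'] <= t <= sample_b['time'].
    """
    span = sample_b["time"] - sample_a["time"]
    f = (t - sample_a["time"]) / span if span else 0.0
    return {
        "lat": sample_a["lat"] + f * (sample_b["lat"] - sample_a["lat"]),
        "lng": sample_a["lng"] + f * (sample_b["lng"] - sample_a["lng"]),
    }

# Camera position halfway between two samples one second apart
print(interpolate_location({"time": 10.0, "lat": 51.0, "lng": -1.0},
                           {"time": 11.0, "lat": 51.001, "lng": -1.002}, 10.5))
```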

Use Case

Consider the interval between two consecutive sample times A and B. A video camera moves from location A with a known orientation/pose to location B with another known pose. How can this be represented using GeoPose in a way that allows intermediate values to be determined during the interval?

There are (at least) two possible approaches.

  1. Calculate instantaneous GeoPose in real time at times A and B and embed these values in JSON format within WebVMT - which can handle encapsulated JSON objects and their interpolation.
  2. Record camera orientation in WebVMT as, say, heading, pitch and roll at times A and B, which can be interpolated by WebVMT (along with location), and then export GeoPose values in a post-processing step.

Both approaches have pros and cons depending on the specific details of the use case such as whether real-time streaming is required.
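As a rough sketch of approach 2 (not normative for either WebVMT or GeoPose), the orientation angles recorded at A and B could be interpolated alongside location and only converted to a GeoPose-like structure on export. The output field names below mirror a Basic-YPR-style layout but are illustrative; note that heading should be interpolated along the shorter arc so a 350°→10° change passes through north rather than sweeping 340°:

```python
def lerp_angle(a, b, f):
    """Interpolate between two angles in degrees along the shorter arc."""
    delta = (b - a + 180.0) % 360.0 - 180.0
    return (a + f * delta) % 360.0

def export_geopose(loc, ori_a, ori_b, f):
    """Combine an interpolated location with two orientation samples
    (heading/pitch/roll in degrees) at interpolation fraction f.
    The returned structure is only illustrative of a Basic-YPR-style pose."""
    return {
        "position": {"lat": loc["lat"], "lon": loc["lng"], "h": loc.get("h", 0.0)},
        "angles": {
            "yaw": lerp_angle(ori_a["heading"], ori_b["heading"], f),
            "pitch": ori_a["pitch"] + f * (ori_b["pitch"] - ori_a["pitch"]),
            "roll": ori_a["roll"] + f * (ori_b["roll"] - ori_a["roll"]),
        },
    }

# Heading crossing north: 350° -> 10° interpolates through 0°, not 180°
print(export_geopose({"lat": 51.0, "lng": -1.0},
                     {"heading": 350.0, "pitch": 0.0, "roll": 0.0},
                     {"heading": 10.0, "pitch": 0.0, "roll": 0.0}, 0.5))
```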

Examples

  1. Front-facing dashcam: The dashcam calculates location from GNSS (global navigation satellite system) and heading from a compass, and these data can be captured in timed video metadata. Pose only needs to be calculated in 2D as the vehicle is on the ground, and low precision is sufficient as the camera has a wide field of view.

  2. Drone with gimballed camera: The drone (unmanned aerial vehicle) calculates location from GNSS, height from an altimeter, orientation from a compass and gyro, and camera orientation from the gimbal controller. 3D pose is required as the camera is airborne, and more precision is needed due to its zoom capability, which can reduce the field of view.

Related Issues

  1. Resource limitations of video capture devices may include battery capacity, processing speed and data storage so capturing input parameters for GeoPose may be preferable to on-board calculation in real time.
  2. Sampling rates for location and orientation may differ and be asynchronous, so interpolated values may be required to calculate GeoPose (see the sketch after this list).
    i. Location may be sampled regularly every few seconds (<1 Hz);
    ii. Camera gimbals may move quickly and sporadically, so their pose remains unchanged for many minutes and then rapidly changes in a few tens of milliseconds (~10-100 Hz);
    iii. Image stabilisation systems may produce pose data at millisecond rates or faster (>1000 Hz).
  3. Intermediate accuracy may be improved by post-processing data as future values are not known during real-time processing.
  4. Multiple poses may be captured concurrently such as for forward- and rear-facing dashcams and CCTV systems.
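As a sketch of the resampling mentioned in item 2, asynchronous location and orientation series could each be interpolated to a common timestamp before a GeoPose value is derived; the data values and rates below are made up for illustration:

```python
from bisect import bisect_right

def sample_at(series, t):
    """Interpolate a list of (time, value) pairs, sorted by time, at time t.
    Values are held constant outside the sampled range."""
    times = [s[0] for s in series]
    i = bisect_right(times, t)
    if i == 0:
        return series[0][1]
    if i == len(series):
        return series[-1][1]
    (t0, v0), (t1, v1) = series[i - 1], series[i]
    return v0 + (t - t0) / (t1 - t0) * (v1 - v0)

# Location sampled at ~1 Hz, gimbal pitch sampled sporadically;
# both are resampled to the same instant before deriving a pose
location_lat = [(0.0, 51.000), (1.0, 51.001), (2.0, 51.002)]
gimbal_pitch = [(0.00, -10.0), (0.01, -10.2), (1.50, -45.0)]
t = 1.2
print(sample_at(location_lat, t), sample_at(gimbal_pitch, t))
```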
@3DXScape
Collaborator

One way to do the right thing for any relevant application domain or standard would be to follow the same path as we did for defining the coordinate systems:

  • use a reference with the same triple-form authority/ID/parameters;
  • specify ones that make sense to our SWG, using https://ogc.geopose/v1 as the authority and perhaps "do not interpolate" and "may interpolate" as two ID values;
  • give suggestions on the three fields for matching external standards such as WebVMT.

Another possibility: Recognized values could be in a codelist maintained on the OGC definition service.

In GeoPose sequences, the extra verbosity is not a factor because the information only appears once in the series or stream header.
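For illustration, such a reference might be carried once in a series or stream header along the lines below; the authority and ID values are just the suggestions above, not registered identifiers, and the `parameters` field is a hypothetical placeholder:

```python
# Hypothetical interpolation reference using the same triple form as the
# coordinate-system references: authority / ID / parameters.
interpolation_ref = {
    "authority": "https://ogc.geopose/v1",    # authority suggested above
    "id": "may interpolate",                  # or "do not interpolate"
    "parameters": {"target": "WebVMT"},       # hypothetical hint for an external standard
}
```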

@rjksmith
Author

Many thanks for your feedback here and in the GeoPose SWG meeting on Friday (19/2/21).

Both implementation options are feasible, though the latter has advantages in terms of accuracy, modularity for live streaming, and brevity.

@3DXScape
Collaborator

Consensus seems to be

  • use a reference with the same triple-form authority/ID/parameters;
  • specify ones that make sense to our SWG, using https://ogc.geopose/v1 as the authority and perhaps "do not interpolate" and "may interpolate" as two ID values;
  • give suggestions on the three fields for matching external standards such as WebVMT.

@rjksmith
Author

The proposal for pose in WebVMT is:

  • heading;
  • pitch;
  • roll.
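For illustration only (this is not proposed WebVMT syntax), a timed pose record combining the existing location fields with the proposed orientation fields might carry something like:

```python
# Hypothetical shape of a timed pose sample; field names are illustrative.
pose_sample = {
    "time": 12.345,                                            # media time in seconds
    "location": {"lat": 51.0, "lng": -1.0, "alt": 120.0},      # existing location data
    "pose": {"heading": 350.0, "pitch": -10.0, "roll": 0.5},   # proposed fields (degrees)
}
```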

@3DXScape
Collaborator

3DXScape commented Jun 4, 2021

No further discussion so closing.
