g3dinference#

Runs PointPillars 3D object detection on LiDAR point clouds. The element consumes application/x-lidar buffers produced by g3dlidarparse, executes a PointPillars inference pipeline with OpenVINO, and attaches tensor metadata for downstream consumers.

Overview#

The g3dinference element is intended for LiDAR-only 3D detection pipelines where raw point clouds have already been parsed into a dense float buffer and annotated with LidarMeta.

Key operations:

LiDAR metadata validation: Requires LidarMeta on each input buffer and validates payload size against lidar_point_count
PointPillars inference: Loads the extension_lib, voxel_model, nn_model, and postproc_model entries from a JSON config and executes them in sequence. voxel_params document the voxelization settings used when exporting the PointPillars models and are not applied separately by g3dinference at runtime.
Tensor metadata attachment: Attaches detections as GstGVATensorMeta with pointpillars_3d format
Pipeline integration: Preserves the LiDAR payload and metadata so downstream elements can combine point clouds, detections, and converted JSON output

Properties#

Property	Type	Description	Default
config	String	Path to the PointPillars JSON configuration file. Required.	null
device	String	OpenVINO device used for the neural network stage. Currently `CPU`, `GPU`, and `GPU.<id>` are supported.	CPU
model-type	String	3D detector model type. Currently only `pointpillars` is supported.	pointpillars
score-threshold	Float	Drops detections below this score. `0.0` keeps all post-processing output unchanged.	0.0

Configuration#

The config property should point to a PointPillars JSON configuration file. In practice, the file contains the paths to the OpenVINO extension library and the three models used by the runtime. It may also include voxel_params for compatibility with the exported PointPillars config format.

Warning: voxel_params is kept for compatibility with the exported PointPillars config format. These values are used when exporting the PointPillars models and are already encoded in the generated voxelization model. g3dinference does not currently apply voxel_params as runtime-tunable settings.

Expected top-level entries:

voxel_params: PointPillars voxelization settings such as voxel_size, point_cloud_range, max_num_points, and max_voxels. These values are used when exporting the PointPillars models and are already encoded in the generated voxelization model; g3dinference does not currently apply them as runtime-tunable parameters.
extension_lib: Path to the custom OpenVINO extension library
voxel_model: Path to the voxelization model
nn_model: Path to the main neural network model
postproc_model: Path to the post-processing model

Example configuration:

{
  "voxel_params": {
    "voxel_size": [0.16, 0.16, 4],
    "point_cloud_range": [0, -39.68, -3, 69.12, 39.68, 1],
    "max_num_points": 32,
    "max_voxels": 16000
  },
  "extension_lib": "/path/to/pointPillars/ov_extensions/build/libov_pointpillars_extensions.so",
  "voxel_model": "/path/to/pointPillars/pretrained/pointpillars_ov_pillar_layer.xml",
  "nn_model": "/path/to/pointPillars/pretrained/pointpillars_ov_nn.xml",
  "postproc_model": "/path/to/pointPillars/pretrained/pointpillars_ov_postproc.xml"
}

Pipeline Examples#

Use multifilesrc for LiDAR input, including single-frame runs, so each frame is delivered as one buffer instead of filesrc block fragments.

Basic LiDAR inference pipeline#

gst-launch-1.0 multifilesrc location="lidar/%06d.bin" start-index=0 caps=application/octet-stream ! \
  g3dlidarparse stride=1 frame-rate=5 ! \
  g3dinference config=pointpillars_ov_config.json device=CPU ! \
  fakesink

Inference with JSON export#

gst-launch-1.0 multifilesrc location="lidar/%06d.bin" caps=application/octet-stream ! \
  g3dlidarparse ! \
  g3dinference config=pointpillars_ov_config.json device=GPU score-threshold=0.5 ! \
  gvametaconvert add-tensor-data=true format=json json-indent=2 ! \  
  gvametapublish file-format=2 file-path=pointpillars.json ! \
  fakesink

Input/Output#

Input Capability: application/x-lidar
Output Capability: application/x-lidar

The element operates in-place. It keeps the point cloud payload intact and appends inference results as metadata.

Metadata#

Required input metadata#

g3dinference expects LidarMeta attached by g3dlidarparse. It uses:

lidar_point_count
frame_id
stream_id
exit_source_timestamp

Output metadata#

The element adds GstGVATensorMeta with:

element_id = g3dinference
model_name = pointpillars or the configured model type
layer_name = pointpillars_3d_detection
format = pointpillars_3d
precision = FP32
dims = [N, 9]

Each detection row contains 9 FP32 values. The first dimension N is the number of detections in the frame, and the second dimension is fixed at 9 values per detection.

x: 3D bounding box center X coordinate
y: 3D bounding box center Y coordinate
z: 3D bounding box center Z coordinate
w: bounding box width
l: bounding box length
h: bounding box height
theta: bounding box rotation angle
confidence: detection score produced by the model post-processing stage
label_id: predicted class identifier

When gvametaconvert converts this tensor to JSON, these values are mapped into a bbox_3d object together with confidence and label_id fields.

Processing Pipeline#

Validates that runtime initialization succeeded and config is present
Retrieves LidarMeta from the input buffer
Maps the LiDAR payload and verifies its size matches lidar_point_count * 4 * sizeof(float)
Runs voxelization, network inference, and post-processing
Flattens detections into [N, 9] tensor data and attaches GstGVATensorMeta
Pushes the enriched LiDAR buffer downstream for metadata conversion, publishing, or further analytics

Element Details (gst-inspect-1.0)#

Pad Templates:
  SINK template: 'sink'
    Availability: Always
    Capabilities:
      application/x-lidar

  SRC template: 'src'
    Availability: Always
    Capabilities:
      application/x-lidar

Element has no clocking capabilities.
Element has no URI handling capabilities.

Pads:
  SINK: 'sink'
    Pad Template: 'sink'
  SRC: 'src'
    Pad Template: 'src'

Element Properties:

  config              : Path to PointPillars OpenVINO JSON configuration
                        flags: readable, writable
                        String. Default: null

  device              : OpenVINO device for NN model. Supported values: CPU, GPU, GPU.<id>
                        flags: readable, writable
                        String. Default: "CPU"

  model-type          : 3D detector model type
                        flags: readable, writable
                        String. Default: "pointpillars"

  name                : The name of the object
                        flags: readable, writable
                        String. Default: "g3dinference0"

  parent              : The parent of the object
                        flags: readable, writable
                        Object of type "GstObject"

  qos                 : Handle Quality-of-Service events
                        flags: readable, writable
                        Boolean. Default: false

  score-threshold     : Drop detections below this score (0 keeps all postproc output)
                        flags: readable, writable
                        Float. Range:               0 - 1 
                        Default:               0