Generate Grasp Pose from Point Cloud Detection Results
Functional Description
This operator is used to generate pick poses for detected objects by combining 3D point cloud information and 2D image detection results. It first matches and aligns the input list of point clouds with the input list of 2D detection results. Then, it calculates the depth (Z-coordinate) of the pick point based on the aligned point cloud and determines the XY coordinates and rotation angle around the Z-axis of the pick point using the geometric information (such as the minimum bounding rectangle) of the 2D detection result, finally outputting poses based on the robot’s base coordinate system.
Use Cases
-
Robot Picking: In vision-guided robot picking tasks, it’s necessary to calculate precise 6D pick poses for identified objects so that the robot can pick them accurately.
-
Pose Estimation: Estimate the complete pose of an object in space by combining the category and position information from 2D detection with the depth and shape information from the 3D point cloud.
Inputs and Outputs
Input Items |
Camera coordinate system point cloud: Input list of point clouds, typically point cloud clusters corresponding to the objects to be picked, obtained after filtering or clustering the point clouds extracted from detection results. Must be point clouds in the camera coordinate system and contain no NaN values. Image: The original image (color or grayscale) corresponding to the point cloud, used to project the point cloud back to pixel coordinates. Detection results: Output result list from a 2D detection operator, where each result contains information such as the object’s bounding box/polygon, score, category, angle, etc. Camera internal reference: 3x3 camera intrinsic matrix. Camera distortion: The camera’s distortion coefficient vector. Hand-eye calibration matrix: 4x4 transformation matrix from the camera coordinate system to the robot’s base coordinate system. |
Output Items |
Grab Positional Information: A list of calculated pick poses. Each element is a dictionary containing:
|
Parameter Descriptions
Use detection result angle
Parameter Description |
Determines whether the rotation angle (around the Z-axis) of the final output pose directly uses the angle provided by the 2D detection result or is recalculated based on the point cloud and the minimum bounding rectangle of the detection result. |
Tuning Description |
|
Alignment strategy
Parameter Description |
Selects the method for matching and aligning the input point cloud clusters with the 2D detection results. |
Tuning Description |
|
Calculation method of the z value of the capture point
Parameter Description |
Selects how to calculate the Z-coordinate value of the final pick pose based on the aligned point cloud cluster. |
Tuning Description |
|
Circle radius scale
Parameter Description |
Effective when "Calculation method of the z value of the capture point" is set to "Mask Center Circular Area Mean". Defines the radius of the circular area used to calculate the Z-value, which is a ratio relative to half the length of the short side of the detection result’s minimum bounding rectangle. |
Tuning Description |
This parameter controls the size of the object surface area referenced when calculating the depth Z-value.
|
Parameter Range |
[0, 1], Default: 0.3 |