4.6 Article

An Affordance Keypoint Detection Network for Robot Manipulation

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS
Volume 6, Issue 2, Pages 2870-2877

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LRA.2021.3062560

Keywords

Deep learning in grasping and manipulation; perception for grasping and manipulation; RGB-D perception

Categories

Funding

  1. National Science Foundation Award [1605228]
  2. Div Of Chem, Bioeng, Env, & Transp Sys
  3. Directorate For Engineering [1605228] Funding Source: National Science Foundation

Ask authors/readers for more resources

This study explores the integration of keypoint detections into a deep network affordance segmentation pipeline to better interpret the functionality of object parts. By creating a new dataset and conducting joint training, the trained network AffKp shows promising performance in both affordance segmentation and keypoint detection.
This letter investigates the addition of keypoint detections to a deep network affordance segmentation pipeline. The intent is to better interpret the functionality of object parts from a manipulation perspective. While affordance segmentation does provide label information about the potential use of object parts, it lacks predictions on the physical geometry that would support such use. The keypoints remedy the situation by providing structured predictions regarding position, direction, and extent. To support joint training of affordances and keypoints, a new dataset is created based on the UMD dataset. Called the UMD+GT affordance dataset, it emphasizes household objects and affordances. The dataset has a uniform representation for five keypoints that encodes information about where and how to manipulate the associated affordance. Visual processing benchmarking shows that the trained network, called AffKp, achieves the state-of-the-art performance on affordance segmentation and satisfactory result on keypoint detection. Manipulation experiments show more stable detection of the operating position for AffKp versus segmentation-only methods and the ability to infer object part pose and operating direction for task execution.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available