nuscenes.md

July 2020

tl;dr: nuScenes is the first large-scale autonomous driving dataset with camera, lidar and radar data.

Overall impression

The dataset is quite challenging in many aspects:

  • multiple countries and cities (Boston, Singapore)
  • multiple weather conditions
  • low-framerate annotation (2 FPS for camera and lidar, although the camera captures at 12 FPS and the lidar at 20 FPS). Objects can move a long way between annotated frames, which makes tracking harder.
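To get a feel for why 2 FPS annotation makes tracking harder, here is a small back-of-the-envelope sketch. The function names and the example speed are illustrative assumptions, not part of the dataset or its devkit; only the 2 FPS annotation rate and the 20 FPS lidar rate come from the notes above.

```python
ANNOTATION_HZ = 2.0  # annotated keyframe rate, from the notes
LIDAR_HZ = 20.0      # lidar capture rate, from the notes


def keyframe_gap_s(annotation_hz: float = ANNOTATION_HZ) -> float:
    """Time in seconds between two consecutive annotated frames."""
    return 1.0 / annotation_hz


def displacement_between_keyframes(speed_mps: float,
                                   annotation_hz: float = ANNOTATION_HZ) -> float:
    """Distance in meters an object at constant speed covers between keyframes."""
    return speed_mps * keyframe_gap_s(annotation_hz)


def sweeps_per_keyframe(sensor_hz: float,
                        annotation_hz: float = ANNOTATION_HZ) -> int:
    """Number of raw sensor frames captured per annotated frame."""
    return round(sensor_hz / annotation_hz)


if __name__ == "__main__":
    # Hypothetical vehicle at 10 m/s (36 km/h).
    print(displacement_between_keyframes(10.0))  # meters moved between labels
    print(sweeps_per_keyframe(LIDAR_HZ))         # lidar sweeps per annotation
```

A tracker working on annotated frames alone has to associate objects across this half-second gap, so the unannotated sweeps in between are the natural place to look for extra motion cues.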

Key ideas

  • Revised mAP for 3D object detection (different from KITTI): a match is defined by center distance on the ground plane rather than by IoU.
    • From the paper: "We use the Average Precision (AP) metric [32, 26], but define a match by thresholding the 2D center distance d on the ground plane instead of intersection over union (IOU). This is done in order to decouple detection from object size and orientation but also because objects with small footprints, like pedestrians and bikes, if detected with a small translation error, give 0 IOU (Figure 7). This makes it hard to compare the performance of vision-only methods which tend to have large localization errors [69]."
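The center-distance matching described above can be sketched roughly as follows. This is a simplified illustration, not the official nuScenes devkit: the function names are made up for this note, the AP is a plain non-interpolated average precision, and the thresholds of 0.5, 1, 2 and 4 m are taken from the nuScenes paper rather than from the text above.

```python
import numpy as np

# Assumed distance thresholds in meters (from the nuScenes paper).
DIST_THRESHOLDS_M = (0.5, 1.0, 2.0, 4.0)


def match_by_center_distance(pred_xy, pred_scores, gt_xy, dist_th):
    """Greedily match predictions (highest score first) to ground truth.

    Returns one boolean per prediction, in descending score order:
    True if it claimed an unmatched ground-truth center closer than dist_th.
    """
    pred_xy = np.asarray(pred_xy, dtype=float).reshape(-1, 2)
    gt_xy = np.asarray(gt_xy, dtype=float).reshape(-1, 2)
    order = np.argsort(-np.asarray(pred_scores, dtype=float))
    taken = np.zeros(len(gt_xy), dtype=bool)
    tp = np.zeros(len(order), dtype=bool)
    for rank, i in enumerate(order):
        if len(gt_xy) == 0:
            break
        dist = np.linalg.norm(gt_xy - pred_xy[i], axis=1)
        dist[taken] = np.inf  # each ground-truth box is matched at most once
        j = int(np.argmin(dist))
        if dist[j] < dist_th:
            taken[j] = True
            tp[rank] = True
    return tp


def average_precision(tp, num_gt):
    """Non-interpolated AP: mean of the precision at each true positive."""
    if num_gt == 0 or len(tp) == 0:
        return 0.0
    precision = np.cumsum(tp) / np.arange(1, len(tp) + 1)
    return float(precision[tp].sum() / num_gt)


def mean_ap_over_thresholds(pred_xy, pred_scores, gt_xy,
                            thresholds=DIST_THRESHOLDS_M):
    """Average the AP over all distance thresholds for a single class."""
    aps = [average_precision(
               match_by_center_distance(pred_xy, pred_scores, gt_xy, th),
               len(gt_xy))
           for th in thresholds]
    return float(np.mean(aps))


def axis_aligned_iou(center_a, center_b, size):
    """IoU of two equal-sized, axis-aligned square footprints (for contrast)."""
    overlap = [max(0.0, size - abs(a - b)) for a, b in zip(center_a, center_b)]
    inter = overlap[0] * overlap[1]
    return inter / (2 * size * size - inter)
```

As a usage example, consider a hypothetical pedestrian with a 0.6 m footprint detected 0.7 m away from its true position, plus one unrelated false positive:

```python
gt = [[0.0, 0.0]]
preds = [[0.7, 0.0], [10.0, 10.0]]
scores = [0.9, 0.8]

print(axis_aligned_iou(gt[0], preds[0], size=0.6))  # no overlap under IoU
print(match_by_center_distance(preds, scores, gt, dist_th=1.0))
print(mean_ap_over_thresholds(preds, scores, gt))
```

The detection gets zero IoU, so an IoU-based metric would count it as a miss, whereas center-distance matching accepts it at every threshold except 0.5 m.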

Technical details

  • Summary of technical details

Notes

  • Questions and notes on how to improve/revise the current work