ScanNet
ScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and instance-level semantic segmentations.
ScanNet Data
If you would like to download the ScanNet data, please fill out an agreement to the ScanNet Terms of Use and send it to us at scannet@googlegroups.com.
If you have not received a response within a week, it is likely that your email is bouncing - please check this before sending repeat requests.
Please check the changelog for updates to the data release.
Data Organization
The data in ScanNet is organized by RGB-D sequence. Each sequence is stored under a directory with named scene_, or scene%04d_%02d, where each space corresponds to a unique location (0-indexed). The raw data captured during scanning, camera poses and surface mesh reconstructions, and annotation metadata are all stored together for the given sequence. The directory has the following structure:
|-- .sens
RGB-D sensor stream containing color frames, depth frames, camera poses and other data
|-- _vh_clean.ply
High quality reconstructed mesh
|-- _vh_clean_2.ply
Cleaned and decimated mesh for semantic annotations
|-- _vh_clean_2.0.010000.segs.json
Over-segmentation of annotation mesh
|-- .aggregation.json, _vh_clean.aggregation.json
Aggregated instance-level semantic annotations on lo-res, hi-res meshes, respectively
|-- _vh_clean_2.0.010000.segs.json, _vh_