Contains 51,583 descriptions of 11,046 objects from 800 ScanNet scenes. ScanRefer is the first large-scale effort to perform object localization via natural language expression directly in 3D.
Source: ScanRefer: 3D Object Localization in RGB-D Scans using Natural LanguagePaper | Code | Results | Date | Stars |
---|