Boombox is a multi-modal dataset for visual reconstruction from acoustic vibrations. Involves dropping objects into a box and capturing resulting images and vibrations. Used for training ML systems that predict images from vibration.
Potential application domain: Computer Vision, Multimodal Perception, Vision and Sound, Sight from Sound, Robotics, Deep Learning, and Machine Learning.
Paper | Code | Results | Date | Stars |
---|