InterHuman is a multimodal dataset, named InterHuman. It consists of about 107M frames for diverse two-person interactions, with accurate skeletal motions and 16,756 natural language descriptions.
Source: InterGen: Diffusion-based Multi-human Motion Generation under Complex InteractionsPaper | Code | Results | Date | Stars |
---|