1 code implementation • 27 Mar 2024 • De-An Huang, Shijia Liao, Subhashree Radhakrishnan, Hongxu Yin, Pavlo Molchanov, Zhiding Yu, Jan Kautz
In addition to leveraging existing video datasets with timestamps, we propose a new task, Reasoning Temporal Localization (RTL), along with the dataset, ActivityNet-RTL, for learning and evaluating this task.
no code implementations • 31 Jan 2024 • Shijia Liao, Shiyi Lan, Arun George Zachariah
Large models mark a new era in machine learning, significantly outperforming smaller models by leveraging vast datasets to capture and synthesize complex patterns.
Ranked #1 on Speech Synthesis on LibriTTS
no code implementations • 30 Dec 2023 • Shreelekha Revankar, Shijia Liao, Yu Shen, Junbang Liang, Huaishu Peng, Ming Lin
We perform a comprehensive analysis of the impact of camera poses on HPS reconstruction outcomes.