1 code implementation • 3 Sep 2023 • Mohsen Zand, Ali Etemad, Michael Greenspan
We use normalizing flows to parameterize the noisy data at any arbitrary step of the diffusion process and utilize it as the prior in the reverse diffusion process.
1 code implementation • 31 Aug 2023 • Mohsen Zand, Ali Etemad, Michael Greenspan
Our experiments on two challenging benchmark datasets, CMU Mocap and Human3. 6M, demonstrate that our proposed method is able to effectively model the sequence information for motion prediction and outperform other techniques to set a new state-of-the-art.
no code implementations • 16 May 2023 • Andrew Farley, Mohsen Zand, Michael Greenspan
We propose a method that augments a simulated dataset using diffusion models to improve the performance of pedestrian detection in real-world data.
1 code implementation • 14 Oct 2022 • Yangzheng Wu, Alireza Javaheri, Mohsen Zand, Michael Greenspan
We propose a novel keypoint voting 6DoF object pose estimation method, which takes pure unordered point cloud geometry as input without RGB information.
1 code implementation • 14 Jul 2022 • Mohsen Zand, Ali Etemad, Michael Greenspan
We present ObjectBox, a novel single-stage anchor-free and highly generalizable object detection approach.
1 code implementation • 21 Feb 2022 • Mohsen Zand, Haleh Damirchi, Andrew Farley, Mahdiyar Molahasani, Michael Greenspan, Ali Etemad
As the detection and localization tasks are well-correlated and can be jointly tackled, our model benefits from a multitask solution by learning multiscale representations of encoded crowd images, and subsequently fusing them.
no code implementations • 12 Jun 2021 • Joy Mazumder, Mohsen Zand, Michael Greenspan
Applying our method can also improve the pose estimation average precision results of Op-Net by 6. 06% on average.
no code implementations • 24 Apr 2021 • Mohsen Zand, Ali Etemad, Michael Greenspan
A novel object detection method is presented that handles freely rotated objects of arbitrary sizes, including tiny objects as small as $2\times 2$ pixels.
1 code implementation • 9 Apr 2021 • Mohsen Zand, Ali Etemad, Michael Greenspan
We specifically propose to use conditional priors to factorize the latent space for the time dependent modeling.
1 code implementation • 6 Apr 2021 • Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael Greenspan
We propose a novel keypoint voting scheme based on intersecting spheres, that is more accurate than existing schemes and allows for fewer, more disperse keypoints.
Ranked #1 on 6D Pose Estimation using RGBD on YCB-Video (ADDS AUC metric)