Abstract: Foundation models have significantly reduced the need for task-specific training, while also enhancing generalizability. However, state-of-the-art 6D pose estimators either require further ...
This is the official PyTorch implementation of PersPose, which estimates the 3D positions of joints from individual images. Below is the overall framework. We design Perspective Encoding (PE) to encode ...