My full publication list is available on Google Scholar.
We tackle the problem of obtaining dense 3D human reconstructions from single and partially occluded views. In such cases, the visual evidence is usually insufficient to identify a 3D reconstruction uniquely, so we recover a set of plausible and reconstructions, consistent with the input image. We train using a best-of-M loss, to which we add flexibility with a novel quantization scheme based on normalizing flows.
We introduce a fully automatic, end-to-end system for 3D dog reconstruction trained using only weak 2D supervision. We use SMBLD, a new 3D deformable template model which includes a detailed shape prior learnt training using expectation maximization. We also release StanfordExtra: the largest dataset of 2D keypoints and segmentations for an animal category.
A system to recover the 3D shape and motion of a wide variety of quadruped animals from video. We overcome the limited availability of animal motion capture data and ensure generalizability to real-world sequences by training on synthetic silhouettes. We apply our method on manually-segmented and automatically-segmented monocular animal videos and require no other form of user intervention.
We improve 3D body shape estimation for diverse body types. While existing methods successfully estimate 3D pose, reliably estimating precise shape remains challenging. To address this gap, we propose new loss functions and a test-time optimization routine that can be readily integrated into parametric 3D human reconstruction pipelines.
We propose deep implicit functions to reconstruct large-scale driving scenes. To avoid requiring watertight meshes for training, we instead use LiDAR to approximate ground truth occupancy labels. We evaluate on a real-world autonomous driving dataset and show incorporating geometric priors improves reconstruction quality.
Computer vision application for verifying regulatory gowning procedures in collaboration with GlaxoSmithKline. Won departmental award for best third year dissertation at the University of Warwick.
This thesis focuses on designing methods for animal reconstruction, making use of 3D morphable models. Topics covered include: training 3D animal reconstruction algorithms with synthetic silhouette data, refining 3D shape priors in-the-loop and handling ambiguous input images.
Behaviour and key point predictions at ~15fps by a deep learning architecture we refer to as RodentNet. Results shown on validation sequences from the SCORHE dataset.