I am a PhD student in the Department of Electrical Engineering and Computer Science (EECS) at the University of Michigan, Ann Arbor. I am advised by Prof. Walter S. Lasecki and am a member of the CRO+MA Lab. I am currently a visiting student at KAIST, working with Prof. Juho Kim in KIXLAB.

My research focuses on developing crowd-powered systems for intelligent computer vision that allow machines to recognize objects, understand scenes, and thereby interact with the world.

Keywords: #Human-Computer Interaction; #Crowdsourcing; #Computer Vision; #Artificial Intelligence



Robust 4D Simulation of Rare Events enabled by Human-Augmented Computer Vision

Research in robotics and autonomous vehicles suffers from a lack of realistic training data and environments in which to test new approaches. Rare and unusual events such as traffic accidents occur several orders of magnitude less frequently than is needed to collect large enough training and testing sets, presenting a fundamental bottleneck in the research and deployment of such systems. Thus, we propose to use a crowdsourced human-in-the-loop approach to guide computer vision algorithms to extract measurement information from large video corpora, allowing us to create simulations of scene dynamics for training and testing.

Crowdsourcing Emotion, Intention, and Context Annotations from Dialog Videos

Dialog videos contain rich contextual, emotional, and intentional cues about the characters and their surroundings. In this project, we aim to build a crowdsourcing platform that collects this information from a large dialog video dataset. The collection and aggregation process can be challenging because the temporal dimension of the dataset has to be considered, and the labels can be highly subjective. We address these challenges by exploring crowdsourcing techniques to design workflows and answer aggregation methods that efficiently collect multi-dimensional labels and overcome the subjective nature of the collected annotations.

Improving Aggregate Crowd Performance on Crowd-Assisted Image Segmentation

In designing crowdsourcing tasks, we want to achieve the highest possible accuracy with the given resources. In this work, we introduce an approach that leverages tool diversity as a means of improving aggregate crowd performance. We define tool diversity as a property of a system (or a task) that enables the use of different tools for the same task. In semantic image segmentation tasks, we show that our approach significantly improves aggregate accuracy compared to using the single best tool alone.

Crowd-Assisted Robotics

We are building crowdsourcing tools to help autonomous robots recognize new contexts or problems in real time. Our system uses a hybrid intelligent workflow that combines human intelligence from the crowd with automated support in the form of focused tasks (ones that the system is not able to complete on its own) and smart tools for aiding object segmentation.

Intermodal Non-rigid Image Registration

I've also collaborated with Prof. (Emeritus) Charles R. Meyer in the Department of Biomedical Engineering and Prof. Jeffrey A. Fessler in EECS. I worked on image registration problems, including intermodal non-rigid image registration based on mutual information and 2D-3D projection image registration.