Hui Wu's website


Principal Research Scientist
IBM Research AI

Hui Wu's website


Profile Photo

About Hui Wu

I am a Principal Research Scientist and Manager at IBM Research, leading the Model Feedback and Adoption initiative for IBM’s Granite AI Models, with a focus on enterprise AI applications. Previously, I led the Edge AI research team, advancing distributed and scalable AI model lifecycles. My research has been published in machine learning and computer vision venues, including NeurIPS, CVPR, and AAAI. My work in multimodal AI led to the creation of the Fashion IQ Challenge, an open-dataset competition featured at ICCV 2019 and CVPR 2020. I co-founded the Workshop on Computer Vision for Fashion, Art, and Design, which was hosted at ECCV 2018, ICCV 2019, and CVPR 2020. Before joining IBM Research in 2015, I earned my Ph.D. in Computer Science from the University of North Carolina at Charlotte, where my dissertation focused on machine learning and medical image analysis.

Contact me: wuhu AT us.ibm.com

This site is lightly maintained to reflect my current professional status. Earlier content remains available for reference and background.


Past Research Activities

Early Research Work

Dialog-based Interactive Image Retrieval

Dialog-based Interactive Image Retrieval
Xiaoxiao Guo*, Hui Wu*, Yu Cheng, Steven J. Rennie, Gerald Tesauro and Rogério S. Feris (* equal contribution)
NeurIPS 2018 [PDF] [CODE] [DEMO]

Overview

We proposed a novel type of dialog agent for the task of interactive image retrieval. Recently, there has been a rapid rise of research interest in visually grounded conversational agents, driven by the progress of deep learning techniques for both image and natural language understanding. A few interesting application scenarios have been explored by recent work, such as collaborative drawing, visual dialog and object guessing game. In this work, we tested the value of visually grounded dialog agents in a practical and yet challenging context. Specifially, we proposed a novel framework of image retrieval system which learns to seek natural and expressive dialog feedbacks from the user and iteratively refine the retrieval result.

Pooling with Stochastic Spatial Sampling

S3pool - Pooling with stochastic spatial sampling
Zhai, Shuangfei, Hui Wu, Abhishek Kumar, Yu Cheng, Yongxi Lu, Zhongfei Zhang, and Rogerio Feris
CVPR 2017 [PDF] [CODE]

Semantic-aware Food Visual Recognition

Learning to make better mistakes - Semantics-aware visual food recognition
Hui Wu, Michele Merler, Rosario Uceda-Sosa and John R. Smith
ACM Multimedia, 2016 [PDF] [Watson API]

The growing popularity of fitness applications and people’s need for easy logging of calorie consumption on mobile devices has made accurate food visual recognition increasingly desireable. In this project, we proposed a visual food recognition framework that integrates the semantic relationships among fine-grained food classes.