Learning and Inference from Complex, Imperfect, and Uncertain Data
Ph.D. Candidate · Department of Computer Science · Western University
About
I am a fourth-year Ph.D. candidate in the Department of Computer Science at Western University, advised by Prof. Grace Y. Yi and Prof. Boyu Wang.
My research broadly spans statistics and machine learning. I am interested in developing principled methodologies for learning, inference, and decision-making from complex, imperfect, and uncertain data. My work combines ideas from statistical modeling, machine learning, uncertainty quantification, and robust learning.
Broadly, I am interested in probabilistic and latent variable modeling, missing data, causal inference, trustworthy AI, foundation models, and other emerging areas of data-driven discovery. I am particularly interested in applications to healthcare and biomedical research, including multimodal electronic health records, medical imaging, and large language models.
Available for postdoctoral positions starting August 2026.
Research interests
Working papers
Instance-Dependent Bayesian Modeling and Robust Training with Crowdsourced Noisy Data. In preparation
Reliability-Aware Longitudinal Clinical Prediction from Multimodal Electronic Health Records. In preparation
Selected publications
Addressing both Variable Selection and Misclassified Responses with Parametric and Semiparametric methods.
Bernoulli 32(2): 1303-1327.
Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control.
International Conference on Learning Representations (ICLR).
* Equal contribution
Physician Effects in Critical Care: A Causal Inference Approach Through Propensity Weighting with Parametric and Super Learning Methods.
Journal of Data Science 23(1): 130-148.
Learning from Noisy Labels via Conditional Distributionally Robust Optimization.
Neural Information Processing Systems (NeurIPS).
Label Correction of Crowdsourced Noisy Annotations with an Instance-Dependent Noise Transition Model
Neural Information Processing Systems (NeurIPS).
Selected Talks
Learning with Noisy Labels: Variable Selection and Misclassification Probability Modeling.
Case study: Personalized Physician Recommendation for Critical Care Using the TreeSHAP Method.
Beyond research
Outside of work, I enjoy photography (both digital 📷 and film 🎞️!).
Contact
Email: hguo288@uwo.ca
Office: Room 222, Middlesex College, Western University
London, Ontario, Canada