PhD supervision – When Statistics Meets Machine Learning…

Current PhD students

Chenxi (Maxine) Hua (September 2023 – , EPSRC studentship)

PhD opportunities

I am currently interested in supervising the following two projects; please feel free to contact me.

(Applicants should have completed a Master’s degree with a high GPA, be hard-working, and have good mathematical and programming skills.)

Project 1: Online change-point detection
The detection of abrupt changes is a crucial open problem in almost every branch of science. In statistics, there is a well-developed research direction known as the ‘change-point problem’. However, traditional statistical methods rely on a number of assumptions that are hard to validate in practice, for example, smoothness or distributional assumptions. This project aims to detect change-points using machine learning techniques such as matrix factorization. The problem will be solved under an online machine learning framework, where data become available in sequential order; in other words, we aim to address the practical need for detecting change-points prospectively. In contrast, the majority of approaches in the existing literature focus on the offline setting.

1. Aminikhanghahi, S., & Cook, D. J. (2017). A survey of methods for time series change point detection. Knowledge and Information Systems, 51(2), 339-367.
2. Chi, Y., Lu, Y. M., & Chen, Y. (2019). Nonconvex optimization meets low-rank matrix factorization: An overview. IEEE Transactions on Signal Processing, 67(20), 5239-5269.

Project 2: Dirty statistical models
This project aims to develop a methodology for integrating data sets from several studies. The sample size of each data set is small, so analysing each data set alone will lead to low statistical power. At the same time, each data set may be biased, so a naive pooling procedure will not work. We want to develop new methods under survival analysis models (for example, the Cox model, the proportional odds model, the linear transformation model, and the accelerated failure time model). The objectives of this project also include studying the non-asymptotic properties of the estimators.

1. Yang, E., & Ravikumar, P. (2013, December). Dirty statistical models. In Proceedings of the 26th International Conference on Neural Information Processing Systems-Volume 1 (pp. 611-619).
2. Chen, A., Owen, A. B., & Shi, M. (2015). Data enriched linear regression. Electronic Journal of Statistics, 9(1), 1078-1112.
3. Asiaee, A., Oymak, S., Coombes, K. R., & Banerjee, A. (2018). High Dimensional Data Enrichment: Interpretable, Fast, and Data-Efficient. arXiv preprint arXiv:1806.04047.
4. Wainwright, M. J. (2019). High-dimensional statistics: A non-asymptotic viewpoint (Vol. 48). Cambridge University Press.

Available PhD Studentships

EPSRC studentship (Available for Home/International students)

For details, please see: https://www.kent.ac.uk/scholarships/search/FNADEPSRCS02

GTA studentship (Available for Home/International students)

For details, please see: https://blogs.kent.ac.uk/pgrteaching/graduate-teaching-assistantship-employment-and-scholarship/

China Scholarship Council (CSC)-Kent PhD Scholarships (Available for Chinese students)

For details, please see: https://www.kent.ac.uk/scholarships/search/FNADCSCKS002

Alumni

Sa (Sarah) Ren, PhD (2018-2022). First job: PDRA in Statistics, School of Health and Related Research, University of Sheffield (Thesis Topic: Inferring Network Structures Using Hierarchical Exponential Random Graph Models)