We present a novel machine learning framework for the analysis of heterogeneity in neuroimaging studies. We propose a semi-supervised learning framework that integrates classification and clustering. The anatomical and genetic heterogeneity of Alzheimer's disease is explored using the proposed framework. The anatomical and genetic subtypes that are revealed are clinically meaningful and match well with previous studies.