Home Science and Nature Unsupervised identification of significant lineages of SARS-CoV-2 through scalable machine learning methods