Finding MoRI: Accurately identifying Molecules of Relevant Interest (MORI) in AFM images using various machine learning methods
Poster presented as part of the Crick BioImage Analysis Symposium.
Atomic force microscopy (AFM) is unique in its ability to image single molecules in liquid with sub-molecular resolution, without the need for labelling or averaging. Enabling us to probe biomolecular structures in native-like states. However, the lack of automated analysis tools in AFM, and slow integration of machine learning (ML) pipelines limits the analysis of the powerful data produced. One limiting factor for the design and integration of these tools is AFM specific dataissues, including; varied image contents and small datasets (compared to e.g. Cryo EM). Raw AFM images undergomany cleaning steps where poorly generalisable threshold methods make identifying desired molecules of relative interest (MoRIs) difficult within a dataset.
Initial trials using weakly-supervised random forests, and a recursive DBSCAN algorithm, can identify multiple MoRIs in one pass with higher accuracy and less user oversight than the gold-standard[1,2] software. Once MoIs are identified, we encounter a second problem - the inherent heterogeneity of many biomolecular structures. In molecules such as DNA, heterogeneity is driven by its inherent flexibility[3], but how can one distinguish between one conformation and another?
Here, we guide ML instance segmentation labelling via surveys to empirically distinguishDNA minicircle conformations. Neural networks should then localise, classify and segment large AFM datasets to help unravel the role of structure in DNA interactions.
Permission has been given by authors to upload to Crick Figshare. Copyright remains with the original holders.
Funding
Targeting Twist: Single-molecule insights into supercoiled DNA-topoisomerase interactions for drug discovery
Medical Research Council
Find out more...