Article

The predictive power of data-processing statistics

Journal

IUCrJ
Volume 7, Pages 342-354

Publisher

International Union of Crystallography
DOI: 10.1107/S2052252520000895

Keywords

macromolecular crystallography; experimental phasing; machine learning; structure determination; phasing; X-ray crystallography

Funding

  1. Biotechnology and Biological Sciences Research Council [BB/L007398/1]
  2. BBSRC [BB/L007010/1, BB/L007398/1, BB/S007083/1] Funding Source: UKRI
  3. MRC [MC_UP_A025_1012] Funding Source: UKRI

Abstract

This study describes a method to estimate the likelihood of success in determining a macromolecular structure by X-ray crystallography and experimental single-wavelength anomalous dispersion (SAD) or multiple-wavelength anomalous dispersion (MAD) phasing based on initial data-processing statistics and sample crystal properties. Such a predictive tool can rapidly assess the usefulness of data and guide the collection of an optimal data set. The increase in data rates from modern macromolecular crystallography beamlines, together with a demand from users for real-time feedback, has led to pressure on computational resources and a need for smarter data handling. Statistical and machine-learning methods have been applied to construct a classifier that displays 95% accuracy for training and testing data sets compiled from 440 solved structures. Applying this classifier to new data achieved 79% accuracy. These scores already provide clear guidance as to the effective use of computing resources and offer a starting point for a personalized data-collection assistant.
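The gap between the accuracy on training and testing data and the accuracy on new data can be illustrated with a toy example. The sketch below is not the classifier described in the paper: it assumes a single made-up data-processing statistic per data set, fits a simple threshold rule (a decision stump) to invented training outcomes, and then scores that rule on invented new data. All names (`fit_threshold`, `predict`, `accuracy`) and all numbers are hypothetical.

```python
def accuracy(predicted, actual):
    """Fraction of predictions that match the known outcomes."""
    matches = sum(1 for p, a in zip(predicted, actual) if p == a)
    return matches / len(actual)


def predict(values, threshold):
    """Predict success (True) when the statistic reaches the threshold."""
    return [v >= threshold for v in values]


def fit_threshold(values, outcomes):
    """Return the candidate threshold with the highest training accuracy.

    Candidates are the observed values themselves; ties keep the lowest one.
    """
    best_threshold, best_score = None, -1.0
    for candidate in sorted(set(values)):
        score = accuracy(predict(values, candidate), outcomes)
        if score > best_score:
            best_threshold, best_score = candidate, score
    return best_threshold, best_score


# Hypothetical training data: one invented statistic per data set and
# whether phasing succeeded. These values are for illustration only.
train_values = [0.05, 0.10, 0.15, 0.20, 0.45, 0.50, 0.55, 0.60]
train_outcomes = [False, False, True, False, True, True, True, True]

# Hypothetical "new" data that the rule never saw during fitting.
new_values = [0.12, 0.18, 0.30, 0.52]
new_outcomes = [False, False, True, True]

threshold, train_accuracy = fit_threshold(train_values, train_outcomes)
new_accuracy = accuracy(predict(new_values, threshold), new_outcomes)

print(f"threshold={threshold}, training accuracy={train_accuracy:.3f}")
print(f"accuracy on new data={new_accuracy:.3f}")
```

A rule tuned on one collection of data sets usually scores lower on data it has not seen, which is why the accuracy on new data is the more informative figure when deciding how to spend computing resources.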
