The analysis of flexible multivalent oligomers on EM micrographs consists of the three following steps:
- Identifying and scoring oligomers from particle-picked micrographs.
- Distance-filtering oligomers to avoid crowded regions.
- Statistical correction procedure to ameliorate random proximity artifacts.
The theory behind this analysis as well as detailed pseudocodes outlining all procedures, such as defining input/output files, can be found in this publication: https://doi.org/10.1101/2020.06.16.154096
Run this code in the following way:
python score_oligomer.py <picks_file> <peaks_file> <cdfs_dir> <score_threshold>
where
<picks_file>
is the file with coordinates of picked particles (DoG picker output file, ‘star’ format)
<peaks_file>
is the micrograph image file (DoG picker output file, ‘png’ format)
<cdfs_dir>
is the path to the directory of discretized CDFs required for the scoring algorithm
<score_threshold>
is the the value below which identified oligomers are not considered.
The output is a compressed numpy array file (.npz
) with saved arrays of particle coordinates, oligomer coordinates, and corresponding oligomer scores.
Run this code in the following way:
python distance_filtering.py <file_in> <min_dist>
where
<file_in>
is the output file from score_oligomer.py
<min_dist>
is filtering distance for oligomers (in nm).
The output is a compressed numpy array file (.npz
) with saved arrays of free LC8 and oligomer coordinates, that are identified to be dinstance-filtered or not.
Run this code in the following way:
python statistical_correction.py <file_in>
where
<file_in>
is the output file from distance_filtering.py
.