PlmDCA

Pseudo-likelihood maximization in Julia. A complete description of the algorithm can be found at http://plmdca.csc.kth.se/. If you use this algorithm you should cite:

M. Ekeberg, C. Lovkvist, Y. Lan, M. Weigt, E. Aurell, Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models, Phys. Rev. E 87, 012707 (2013)
M. Ekeberg, T. Hartonen, E. Aurell, Fast pseudolikelihood maximization for direct-coupling analysis of protein structure from many homologous amino-acid sequences, arXiv:1401.4832 (supplementary material)

The present software is a Julia implementation of above mentioned papers, with no reference to the original MATLAB software implementation.

The code now requires at least Julia version 1.5 or later.

Install

To install just use the package manager and do a

(v1.?) pkg> add https://github.com/pagnani/PlmDCA

Overview

The code internally uses NLopt which provides a Julia interfaces to the free/open-source NLopt library.

Usage

To load the code just type

julia> using PlmDCA

The functions in this package are written to maximize performance. Most computationally-heavy functions can use multiple threads (start julia with the -t option or set the JULIA_NUM_THREADS environment variable). For more information on how set correctly the number of threads, please refer to the online Julia Documentation on Multi-Threading.

The program (only in its symmetric version plmdca_sym) can be run on multiple cores previous addprocs(nprocs) where nprocs should be some integer number np lower or equal to your (physical) number of cores.

The software provides two main functions plmdca(filename::String, ...) and plmdca_sym(filename::String,...) (resp. the asymmetric and symmetric coupling version of the algorithm). Empirically it turns out that the asymmetric version is faster and more accurate. This function take as input the name of a (possibly zipped) multiple sequence.

We also provide another function mutualinfo(filename::String,...) to compute the mutual information score.

There are a number of possible algorithmic strategies for the optimization problem. As long as local gradient-based optimization is concerned, this is a list of :symbols (associated to the different methods):

:LD_MMA, :LD_SLSQP, :LD_LBFGS, :LD_TNEWTON_PRECOND
:LD_TNEWTON_PRECOND_RESTART, :LD_TNEWTON, :LD_VAR2, :LD_VAR1

After some experiments we found that the best compromise between accuracy and speed is achieved by the Low Storage BFGS method :LD_LBFGS, which is the default method in the code. The other methods can be set changing the default optional argument (e.g. method=:LD_SLSQP).

There are more optional arguments that can be set (to be documented...).

Output

The functions output a type PlmOut (say X) with 4 fields:

X.Jtensor: the coupling matrix J[ri,rj,i,j] a symmetrized q x q x N x N array, where N is the number of residues in the multiple sequence alignment, and q is the alphabet "size" (typically 21 for proteins).
X.htensor: the external field h[r_i,i] q x N array.
X.pslike: the pseudolikelihood
X.score: a vector of Tuple{Int,Int,Float64} containing the candidate contacts in descending score order (residue1, residue2 , score12).

Requirements

The minimal julia version for using this code is 1.3 (package version <= v0.2.0)
From package versions 0.3.0 on the minimal julia requirement is 1.5

Todos

A lot!

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
.github/workflows		.github/workflows
data		data
src		src
test		test
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE.md		LICENSE.md
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md
REQUIRE		REQUIRE
appveyor.yml		appveyor.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PlmDCA

Install

Overview

Usage

Output

Requirements

Todos

About

Releases

Packages

Languages

License

giovact/PlmDCA

Folders and files

Latest commit

History

Repository files navigation

PlmDCA

Install

Overview

Usage

Output

Requirements

Todos

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages