process froze when calling next_batch #79

Open
yuanqidu opened this issue Dec 14, 2021 · 7 comments

yuanqidu commented Dec 14, 2021

When I call next_batch, the process freezes forever.

yuanqidu changed the title from "openbabel installed but cannot be found during cmake" to "process froze when calling next_batch" on Dec 14, 2021
@dkoes
Contributor

dkoes commented Dec 14, 2021

Can you provide example code and data? Do the tests pass?

@yuanqidu
Author

Thanks for your quick response. I just solved this problem by manually installing libmolgrid.

However, I have another question. The provided dataset has many files ending in .gninatypes. How can I get sdf.gz files from the gninatypes files?

@dkoes
Contributor

dkoes commented Dec 14, 2021

You can't. The gninatypes files contain only the bare minimum needed for efficient training (x, y, z coordinates and atom type). You can convert them to xyz files.
https://github.com/gnina/scripts/blob/master/types2xyz.py
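
For readers who want to see what such a conversion involves, here is a minimal sketch rather than the linked script itself. It assumes each gninatypes record is three 32-bit floats (x, y, z) followed by one 32-bit integer atom-type index, in native byte order. The names `read_gninatypes`, `write_xyz` and `element_of` are hypothetical, and the mapping from type index to element symbol is not reproduced here, so the caller has to supply it.

```python
import struct

# Assumed record layout: three 32-bit floats (x, y, z) followed by one
# 32-bit integer atom-type index, in native byte order.
RECORD = struct.Struct("fffi")


def read_gninatypes(path):
    """Return a list of (x, y, z, type_index) tuples read from a gninatypes file."""
    with open(path, "rb") as fh:
        data = fh.read()
    if len(data) % RECORD.size != 0:
        raise ValueError("file size is not a multiple of the assumed record size")
    return list(RECORD.iter_unpack(data))


def write_xyz(atoms, path, element_of):
    """Write atoms in xyz format.

    element_of is a caller-supplied function mapping a type index to an
    element symbol (hypothetical; the real mapping is not reproduced here).
    """
    with open(path, "w") as out:
        out.write(f"{len(atoms)}\n")  # line 1: atom count
        out.write("\n")               # line 2: comment line, left empty
        for x, y, z, t in atoms:
            out.write(f"{element_of(t)} {x:.4f} {y:.4f} {z:.4f}\n")
```

Bond orders, charges and residue information are not stored in these records, which is why an sdf file cannot be recovered from them.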

@yuanqidu
Author

Thanks for your help!

I have a further question: what is the struct object required for libmolgrid? Does this package support a pocket discovery step for protein-ligand binding?

@dkoes
Contributor

dkoes commented Dec 17, 2021

molgrid creates atomic density grids from molecules. That's it. You could use it as part of a neural network classifier for pocket identification, but that is up to you to develop.
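
To make "atomic density grid" concrete, here is a conceptual sketch in plain Python. It is not molgrid's implementation or API: the function name `density_grid`, its parameters, and the plain Gaussian density are all assumptions chosen for illustration. Each atom type gets its own channel, and every atom adds a smooth density around its position to the grid points of that channel.

```python
import math


def density_grid(atoms, num_types, center, dim=5, resolution=1.0, sigma=1.0):
    """Build a grid[type][i][j][k] of summed Gaussian densities.

    atoms: iterable of (x, y, z, type_index) tuples.
    center: (x, y, z) of the grid center; the grid spans dim points per axis.
    The Gaussian form and all defaults are illustrative assumptions.
    """
    half = (dim - 1) * resolution / 2.0
    origin = [c - half for c in center]  # coordinates of grid point (0, 0, 0)
    grid = [[[[0.0] * dim for _ in range(dim)] for _ in range(dim)]
            for _ in range(num_types)]
    for x, y, z, t in atoms:
        for i in range(dim):
            dx = origin[0] + i * resolution - x
            for j in range(dim):
                dy = origin[1] + j * resolution - y
                for k in range(dim):
                    dz = origin[2] + k * resolution - z
                    d2 = dx * dx + dy * dy + dz * dz
                    grid[t][i][j][k] += math.exp(-d2 / (2.0 * sigma ** 2))
    return grid
```

A tensor of this shape (channels by three spatial axes) is what a 3D convolutional network would consume, which is how such grids could feed a pocket classifier that you build yourself.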

@yuanqidu
Author

yuanqidu commented Dec 17, 2021

I see. Thanks again! How are we supposed to prepare the structs as specified by the example?

image

Also, may I ask whether the crossdock dataset identifies pockets, or does it just provide the full protein and ligand?

@dkoes
Contributor

dkoes commented Dec 20, 2021

If relative paths are provided in the training file (fname), then data_root is prepended to each file path. The training file can refer to regular molecular data files (e.g. pdb, sdf, mol2, xyz). Each line is one training example: the labels come first, followed by the file names of the molecular data.
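
The layout described above can be illustrated with a small parser. This is a sketch of the convention, not molgrid's own parsing code. The function name `parse_example_line` is hypothetical, and it assumes that the leading numeric tokens are labels and that the first non-numeric token starts the list of file names.

```python
import os


def parse_example_line(line, data_root=""):
    """Split one training-file line into (labels, file_paths).

    Assumption: leading tokens that parse as numbers are labels; the first
    token that does not parse as a number starts the file names.
    Relative file names get data_root prepended; absolute ones are kept.
    """
    labels, files = [], []
    for tok in line.split():
        if not files:
            try:
                labels.append(float(tok))
                continue
            except ValueError:
                pass  # first non-numeric token: switch to file names
        if data_root and not os.path.isabs(tok):
            tok = os.path.join(data_root, tok)
        files.append(tok)
    return labels, files
```

For example, with a hypothetical line `1 5.2 rec.pdb lig.sdf` and a data_root of `/data`, the labels would be `[1.0, 5.2]` and the two file names would be resolved under `/data`.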

When given a receptor and ligand, the ligand defines the binding pocket.
