BCC2020 has ended
➞ Set your timezone before doing anything else on this site (home page, on the right)
Limit what is shown by Type, Category, or Hemisphere
Registration closed July 15.

BCC2020 is online, global, and affordable. The meeting and training are now done, and the CoFest is under way.

The 2020 Bioinformatics Community Conference brings together the Bioinformatics Open Source Conference (BOSC) and the Galaxy Community Conference into a single event featuring training, a meeting, and a CollaborationFest. Events run from July 17 through July 25, and is held in both the eastern and western hemispheres.

Back To Schedule
Wednesday, July 22 • 01:10 - 01:25
Designing and executing workflows for virtual screening of the SARS-CoV-2 main protease 🍐

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!


The presenter(s) will be available for live Q&A in this session (BCC West).

Simon Bray 1, Tim Dudgeon 2, Björn Grüning 3

1 Bioinformatics Group, University of Freiburg. Email: sbray@informatik.uni-
2 Informatics Matters, Oxford.
3 Bioinformatics Group, University of Freiburg.

Project Website: https://cheminformatics.usegalaxy.eu (webserver);
https://covid19.galaxyproject.org/cheminformatics (project documentation)
Source Code: https://github.com/bgruening/galaxytools/tree/master/chemicaltoolbox;
License: Apache 2.0 (tools); MIT (documentation)

In silico analysis plays a vital role in drug discovery. Computational work allows
simulation and study of potential drug candidates on a scale that is impossible
experimentally. There are many different steps in virtual screening, each with a
different use-case, accuracy and computational cost. Thus, tools are often linked
into workflows, starting with low-accuracy, low-cost methods and filtering outputs
before applying high-accuracy, high-cost methods. The problem of how to link
multiple tools into a single workflow lends itself ideally to workflow management
systems such as Galaxy or Nextflow. There are already a wide range (over 100) of
cheminformatics tools available in Galaxy under the aegis of the ChemicalToolbox
(Bray et al., 2020, submitted, available at https://cheminformatics.usegalaxy.eu).
These draw on a range of open-source cheminformatics (OpenBabel, RDKit),
molecular dynamics (GROMACS) and molecular docking (rDock, AutoDock Vina)

Here, we present a case study in which the ChemicalToolbox was used to assemble
and run workflows for virtual screening of the SARS-CoV-2 main protease (Mpro).
Mpro is one of the main druggable targets for the SARS-CoV-2 virus, and in March
2020 the Diamond Light Source released the results of a crystallographic fragment
screen, providing crystal structures of Mpro in complex with over 50 small organic
molecules. These were used to calculate a list of ~40k candidate molecules and
Galaxy workflows were constructed for charge enumeration, 3D conformer
generation, docking into the Mpro active site with rDock, pose scoring with the
TransFS deep learning approach, and validation against the experimental
structures using the SuCOS structural similarity measure. Running these
workflows led to selection of a shortlist of 500 most promising molecules, to be
purchased for further in vitro study. During the course of the study, over 25
CPU/GPU years, provided by a Europe-wide compute infrastructure network, were
used to generate and score ~80 million docking poses.

In this presentation, the ChemicalToolbox will be briefly introduced, followed by
discussion of the workflow and results.

avatar for Simon Bray

Simon Bray

University of Freiburg
I'm a member of the European Galaxy Team at the University of Freiburg, interested in computational chemistry, molecular dynamics, and the use of workflow management systems for virtual screening.

Wednesday July 22, 2020 01:10 - 01:25 EDT