Industry-scale application and evaluation of deep learning for drug target prediction

dc.contributor.authorSturm, Noe
dc.contributor.authorMayr, Andreas
dc.contributor.authorVan, Thanh Le
dc.contributor.authorChupakhin, Vladimir
dc.contributor.authorCeulemans, Hugo
dc.contributor.authorWegner, Joerg
dc.contributor.authorGolib-Dzib, Jose-Felipe
dc.contributor.authorJeliazkova, Nina
dc.contributor.authorVandriessche, Yves
dc.contributor.authorBöhm, Stanislav
dc.contributor.authorCima, Vojtěch
dc.contributor.authorMartinovič, Jan
dc.contributor.authorGreene, Nigel
dc.contributor.authorVander Aa, Tom
dc.contributor.authorAshby, Thomas J.
dc.contributor.authorHochreiter, Sepp
dc.contributor.authorEngkvist, Ola
dc.contributor.authorKlambauer, Günter
dc.contributor.authorChen, Hongming
dc.date.accessioned2020-05-20T06:53:51Z
dc.date.available2020-05-20T06:53:51Z
dc.date.issued2020
dc.description.abstractArtificial intelligence (AI) is undergoing a revolution thanks to the breakthroughs of machine learning algorithms in computer vision, speech recognition, natural language processing and generative modelling. Recent works on publicly available pharmaceutical data showed that AI methods are highly promising for Drug Target prediction. However, the quality of public data might be different than that of industry data due to different labs reporting measurements, different measurement techniques, fewer samples and less diverse and specialized assays. As part of a European funded project (ExCAPE), that brought together expertise from pharmaceutical industry, machine learning, and high-performance computing, we investigated how well machine learning models obtained from public data can be transferred to internal pharmaceutical industry data. Our results show that machine learning models trained on public data can indeed maintain their predictive power to a large degree when applied to industry data. Moreover, we observed that deep learning derived machine learning models outperformed comparable models, which were trained by other machine learning algorithms, when applied to internal pharmaceutical company datasets. To our knowledge, this is the first large-scale study evaluating the potential of machine learning and especially deep learning directly at the level of industry-scale settings and moreover investigating the transferability of publicly learned target prediction models towards industrial bioactivity prediction pipelines.cs
dc.description.firstpageart. no. 26cs
dc.description.issue1cs
dc.description.sourceWeb of Sciencecs
dc.description.volume12cs
dc.identifier.citationJournal of Cheminformatics. 2020, vol. 12, issue 1, art. no. 26.cs
dc.identifier.doi10.1186/s13321-020-00428-5
dc.identifier.issn1758-2946
dc.identifier.urihttp://hdl.handle.net/10084/139495
dc.identifier.wos000529286600001
dc.language.isoencs
dc.publisherSpringer Naturecs
dc.relation.ispartofseriesJournal of Cheminformaticscs
dc.relation.urihttp://doi.org/10.1186/s13321-020-00428-5cs
dc.rightsCopyright © 2020, Springer Naturecs
dc.rights.accessopenAccesscs
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/cs
dc.subjectQSARcs
dc.subjectdeep learningcs
dc.subjectmachine learningcs
dc.subjectstructure-based virtual screeningcs
dc.subjectcheminformaticscs
dc.subjectBig datacs
dc.subjectChEMBLcs
dc.subjectPubChemcs
dc.subjectprospective evaluationcs
dc.subjectretrospective evaluationcs
dc.titleIndustry-scale application and evaluation of deep learning for drug target predictioncs
dc.typearticlecs
dc.type.statusPeer-reviewedcs
dc.type.versionpublishedVersioncs

Files

Original bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
1758-2946-2020v12i1an26.pdf
Size:
1.76 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
718 B
Format:
Item-specific license agreed upon to submission
Description: