Optimize Development Process of New Drug Candidates

The Dataiku Molecular Property Prediction Solution helps perform molecular virtual screening on target protein responses by querying small molecules with known bioactivity via ChEMBL and PubChem databases.

Query Public Chemical Databases

Explore previously studied and novel molecular structures (SMILES) for drug candidates. Easily query public chemical databases with ChEMBL and PubChem APIs. Build the solution flow by selecting a preferred chemical database, then query by target protein and specify feature generation and modeling options for bioactivity prediction.

Reuse and Extend Coding Functions

Power recipes in the flow using Python with open source chemoinformatics libraries. Create molecular descriptors and fingerprint features from SMILES chemical structures with RDKit and pre-trained transformer language models like ChemBERTA with HuggingFace.

Cluster and Predict Bioactivity

Gain deeper understanding of the molecular structure space with dimension reduction methods like t-SNE, PCA, and clustering, then train ML models to predict the bioactivity of small molecules for a given protein target.

Deeply Explore Molecular Properties

Use dynamic, interactive dashboards to understand previously studied and novel molecules. Understand chemical properties and molecular descriptors with interactive visualizations, analyses, and tables. Filter to review individual molecule summaries, and screen leading candidates in-silico through the Dataiku App.

Visualize Development Candidates

Assess novel small molecules and gain valuable insights by computing and visualizing molecular descriptors and fingerprints. Use these insights and the trained bioactivity prediction model to score new molecules, and find similar molecules in previous experiments.

Accelerate Molecular Prediction

Gain full flexibility by adjusting to specific research fields with a composable and extendable solution. Adapt and expand research to gain new insights and make progressively better molecular predictions.

Answer Key Molecular Research Questions

The Dataiku Solution for Molecular Property Prediction helps answer a broad range of questions like: