Tutorial

This tutorial walks you through the full TridentSynth workflow using a heptane (C7H16) example with PKS + biological pathways. Whether you work with polyketide synthases, organic chemistry, or computational biology, no prior PKS knowledge is needed to follow along.

1 Enter your target molecule

Start by telling TridentSynth which molecule you want to synthesize. There are two ways to do this:

Option A: Type a SMILES string

Enter a i SMILES string directly into the text field. For example, heptane is represented as: CCCCCCC

Option B: Draw the structure

Click "Draw structure" to open the Ketcher molecule editor, where you can draw your target molecule graphically. When you save, the SMILES string is automatically extracted and filled into the text field.

Screenshot of the TridentSynth query form showing the SMILES input field, Draw structure button, Run button, and synthesis strategy checkboxes

The query form: enter a SMILES string or draw a structure, then select your synthesis strategies.

Note: TridentSynth automatically neutralizes charged molecules and removes stereochemistry information before running the synthesis. Only one molecule can be submitted at a time (no "." characters in the SMILES).
2 Choose synthesis strategies

TridentSynth offers three synthesis routes that can be used individually or in combination. Select one or more checkboxes on the query page:

Strategy What it does Best for
PKS Uses type I modular polyketide synthases to build a carbon backbone forward toward the target by stitching together acyl-CoA building blocks. Designs are generated by RetroTide. Molecules with repeating C2 units, long carbon chains, polyketide-like scaffolds
Bio Uses monofunctional enzyme reaction rules. In PKS+Bio mode, works forward from the PKS product toward the target. In Bio-only mode, works backward from the target to known precursors. Powered by DORAnet. Functional group modifications, hydroxylations, oxidations, enzymatic tailoring
Chem Uses synthetic organic chemistry reaction rules. Same directional logic as Bio (forward after PKS, backward otherwise). Transformations not easily achieved by enzymes, broader reaction diversity
Common combinations
PKS + Bio Build the backbone with PKSs, then tailor with enzymes. Used in this tutorial.
PKS + Bio + Chem Maximum search space — combines all three routes
PKS only Quick check if the target is directly reachable by PKS assembly alone
Bio only Find enzymatic retrosynthesis pathways from target back to common metabolites
Chem only Find synthetic chemistry retrosynthesis pathways from target back to common metabolites
For this tutorial: We select PKS + Bio with 1 biological step to design pathways to heptane.
3 Configure advanced parameters

Click "Advanced (Optional)" on the query page to expand additional settings. The defaults work well for most cases, but here is what each parameter controls:

Max biological / chemical synthesis steps

Controls how many successive reaction steps DORAnet will explore when tailoring the PKS product (or when working backward from the target in non-PKS mode). Range: 1–3 steps. More steps broaden the search but increase runtime.

PKS release mechanism

Determines how the polyketide chain is released from the PKS enzyme at the end of assembly:

  • Thiolysis — Cleaves the thioester bond to release a free carboxylic acid. This is the most common mechanism and is used in this tutorial.
  • Cyclization — Forms an intramolecular ring (lactone) by attacking an internal hydroxyl or amine. Choose this for macrolide-like targets.
PKS starters

The acyl-CoA substrate loaded onto the first (loading) module of the PKS. Available options include malonyl-CoA, methylmalonyl-CoA, ethylmalonyl-CoA, and others. Selecting more starters increases the design space that RetroTide explores.

PKS extenders

The acyl-CoA substrates used by extension modules to elongate the polyketide chain. Each extension module adds a two-carbon unit to the growing chain, with the specific extender determining the side-chain substituent.

Max atoms filter (DORAnet)

Optional upper limits on the number of C, N, and O atoms in intermediate metabolites during Bio/Chem network expansion. This can speed up computation by pruning intermediates that are too large. Leave blank to allow intermediates of any size.

For this tutorial: Release mechanism: thiolysis. Starters: methylmalonyl-CoA. Extenders: malonyl-CoA + methylmalonyl-CoA. Bio steps: 1.
4 Submit and wait

Click "Run" to submit your synthesis job. Here is what happens behind the scenes:

  1. Validation — Your SMILES is checked for correctness, sanitized with RDKit, and screened against a controlled substances list.
  2. Job queued — The synthesis task is sent to a background worker (Celery). You receive a unique job ID and a bookmarkable link to your results.
  3. PKS design (RetroTide) — If PKS is selected, RetroTide combinatorially explores module designs to build a carbon backbone that is structurally similar to your target.
  4. Post-PKS tailoring (DORAnet) — If Bio and/or Chem is selected, DORAnet searches for reaction pathways to convert the PKS product into the final target.
  5. Feasibility scoring — Each predicted reaction is scored by an XGBoost feasibility classifier trained on known enzymatic reactions.
  6. Enzyme retrieval — For enzymatic reactions, TridentSynth searches a database of ~55,000 known reactions to find closely-matched natural enzymes.

Runtime depends on the complexity of your target and the strategies selected. PKS-only jobs typically complete in under a minute; adding Bio/Chem steps with higher step counts may take several minutes.

5 Interpret results

The results page presents your synthesis designs. Here is a guide to each section:

Screenshot of the TridentSynth results page showing the job summary, status alert, and synthesis parameters table

The job summary panel: target structure, status alert, and synthesis parameters used for the run.

Status alerts

A colored banner at the top tells you the outcome:

  • Green — Full pathways to your exact target were found
  • Yellow — The exact target was not reached, but pathways to the most structurally similar product are shown
  • Blue — The target can be synthesized by PKS assembly alone (no post-PKS tailoring needed)
PKS module designs

If PKS was selected, each design shows the engineered PKS assembly as a series of modules. Each module contains catalytic domains that determine how the polyketide chain is modified at that step:

DomainRole
KS / KSqKetosynthase — catalyzes the condensation reaction that extends the chain by two carbons
ATAcyltransferase — selects and loads the extender substrate (e.g., malonyl-CoA)
KRKetoreductase — reduces the ketone to a hydroxyl group (with stereospecificity: A, B, or C type)
DHDehydratase — removes water to form a double bond (requires an active KR)
EREnoylreductase — reduces the double bond to a saturated bond (requires an active DH)
ACPAcyl carrier protein — tethers the growing chain (present in every module)
Screenshot of a pathway design showing PKS and post-PKS products with similarity scores, and the PKS module grid with domain annotations

A pathway design: the PKS product (left), post-PKS product (right), and the PKS module architecture with domain annotations.

Similarity scores

Each design reports how structurally similar its product is to your target, measured using the i maximum common substructure (MCS) metric on a 0–1 scale. A score of 1.0 means the product is an exact structural match to the target. Designs are sorted by similarity, with the best matches shown first.

Post-PKS pathways

If Bio and/or Chem steps were used, the results show the predicted reaction pathways connecting the PKS product to the final target (or to the closest reachable product). Each pathway lists:

  • Reaction SMILES — The chemical transformation at each step, shown as both a rendered image and a text string
  • Reaction rules — The named rule that predicted each reaction (e.g., rule0001 for enzymatic rules). Enzymatic rules link to their associated UniProt enzyme entries.
  • Feasibility scores — Each reaction is scored 0–1 by an XGBoost classifier. The net feasibility is the product of all individual scores — higher is better.
Screenshot of post-PKS pathways showing reaction images, reaction rules, UniProt enzyme links, enzyme retrieval button, and feasibility scores

Post-PKS pathways: reaction images, reaction rules with UniProt links, feasibility scores, and enzyme retrieval buttons.

Enzyme retrieval

For each predicted reaction, you can click the search button to retrieve closely-matched natural reactions from a database of ~55,000 known enzymatic reactions. Matches are ranked by i RCMFP similarity and include:

These matched enzymes can serve as starting points for experimental validation of the predicted pathway.

Try it yourself

Ready to try the workflow? Use the buttons below to either load the heptane example into the query form (so you can explore the settings and submit it yourself) or jump straight to the pre-computed results.

Further reading

  • About / Help — Detailed background on TridentSynth's algorithms and design
  • Publications — Papers to cite when using TridentSynth
  • Common precursors — The ~300 metabolites used as targets in non-PKS retrosynthesis