TridentSynth Tutorial

In this tutorial

Enter your target molecule
Choose synthesis strategies
Configure advanced parameters
Submit and wait
Interpret results
Try it yourself

1 Enter your target molecule

Start by telling TridentSynth which molecule you want to synthesize. There are two ways to do this:

Option A: Type a SMILES string

Enter a i SMILES string directly into the text field. For example, heptane is represented as: CCCCCCC

Option B: Draw the structure

Click "Draw structure" to open the Ketcher molecule editor, where you can draw your target molecule graphically. When you save, the SMILES string is automatically extracted and filled into the text field.

Screenshot of the TridentSynth query form showing the SMILES input field, Draw structure button, Run button, and synthesis strategy checkboxes

The query form: enter a SMILES string or draw a structure, then select your synthesis strategies.

Note: TridentSynth automatically neutralizes charged molecules and removes stereochemistry information before running the synthesis. Only one molecule can be submitted at a time (no "." characters in the SMILES).

2 Choose synthesis strategies

TridentSynth offers three synthesis routes that can be used individually or in combination. Select one or more checkboxes on the query page:

Strategy	What it does	Best for
PKS	Uses type I modular polyketide synthases to build a carbon backbone forward toward the target by stitching together acyl-CoA building blocks. Designs are generated by RetroTide.	Molecules with repeating C₂ units, long carbon chains, polyketide-like scaffolds
Bio	Uses monofunctional enzyme reaction rules. In PKS+Bio mode, works forward from the PKS product toward the target. In Bio-only mode, works backward from the target to known precursors. Powered by DORAnet.	Functional group modifications, hydroxylations, oxidations, enzymatic tailoring
Chem	Uses synthetic organic chemistry reaction rules. Same directional logic as Bio (forward after PKS, backward otherwise).	Transformations not easily achieved by enzymes, broader reaction diversity

Common combinations

PKS + Bio	Build the backbone with PKSs, then tailor with enzymes. Used in this tutorial.
PKS + Bio + Chem	Maximum search space — combines all three routes
PKS only	Quick check if the target is directly reachable by PKS assembly alone
Bio only	Find enzymatic retrosynthesis pathways from target back to common metabolites
Chem only	Find synthetic chemistry retrosynthesis pathways from target back to common metabolites

For this tutorial: We select PKS + Bio with 1 biological step to design pathways to heptane.

3 Configure advanced parameters

Click "Advanced (Optional)" on the query page to expand additional settings. The defaults work well for most cases, but here is what each parameter controls:

Max biological / chemical synthesis steps

Controls how many successive reaction steps DORAnet will explore when tailoring the PKS product (or when working backward from the target in non-PKS mode). Range: 1–3 steps. More steps broaden the search but increase runtime.

PKS release mechanism

Determines how the polyketide chain is released from the PKS enzyme at the end of assembly:

Thiolysis — Cleaves the thioester bond to release a free carboxylic acid. This is the most common mechanism and is used in this tutorial.
Cyclization — Forms an intramolecular ring (lactone) by attacking an internal hydroxyl or amine. Choose this for macrolide-like targets.

PKS starters

The acyl-CoA substrate loaded onto the first (loading) module of the PKS. Available options include malonyl-CoA, methylmalonyl-CoA, ethylmalonyl-CoA, and others. Selecting more starters increases the design space that RetroTide explores.

PKS extenders

The acyl-CoA substrates used by extension modules to elongate the polyketide chain. Each extension module adds a two-carbon unit to the growing chain, with the specific extender determining the side-chain substituent.

Max atoms filter (DORAnet)

Optional upper limits on the number of C, N, and O atoms in intermediate metabolites during Bio/Chem network expansion. This can speed up computation by pruning intermediates that are too large. Leave blank to allow intermediates of any size.

For this tutorial: Release mechanism: thiolysis. Starters: methylmalonyl-CoA. Extenders: malonyl-CoA + methylmalonyl-CoA. Bio steps: 1.

4 Submit and wait

Click "Run" to submit your synthesis job. Here is what happens behind the scenes:

Validation — Your SMILES is checked for correctness, sanitized with RDKit, and screened against a controlled substances list.
Job queued — The synthesis task is sent to a background worker (Celery). You receive a unique job ID and a bookmarkable link to your results.
PKS design (RetroTide) — If PKS is selected, RetroTide combinatorially explores module designs to build a carbon backbone that is structurally similar to your target.
Post-PKS tailoring (DORAnet) — If Bio and/or Chem is selected, DORAnet searches for reaction pathways to convert the PKS product into the final target.
Feasibility scoring — Each predicted reaction is scored by an XGBoost feasibility classifier trained on known enzymatic reactions.
Enzyme retrieval — For enzymatic reactions, TridentSynth searches a database of ~55,000 known reactions to find closely-matched natural enzymes.

Runtime depends on the complexity of your target and the strategies selected. PKS-only jobs typically complete in under a minute; adding Bio/Chem steps with higher step counts may take several minutes.

5 Interpret results

The results page presents your synthesis designs. Here is a guide to each section:

Screenshot of the TridentSynth results page showing the job summary, status alert, and synthesis parameters table

The job summary panel: target structure, status alert, and synthesis parameters used for the run.

Status alerts

A colored banner at the top tells you the outcome:

Green — Full pathways to your exact target were found
Yellow — The exact target was not reached, but pathways to the most structurally similar product are shown
Blue — The target can be synthesized by PKS assembly alone (no post-PKS tailoring needed)

PKS module designs

If PKS was selected, each design shows the engineered PKS assembly as a series of modules. Each module contains catalytic domains that determine how the polyketide chain is modified at that step:

Domain	Role
KS / KSq	Ketosynthase — catalyzes the condensation reaction that extends the chain by two carbons
AT	Acyltransferase — selects and loads the extender substrate (e.g., malonyl-CoA)
KR	Ketoreductase — reduces the ketone to a hydroxyl group (with stereospecificity: A, B, or C type)
DH	Dehydratase — removes water to form a double bond (requires an active KR)
ER	Enoylreductase — reduces the double bond to a saturated bond (requires an active DH)
ACP	Acyl carrier protein — tethers the growing chain (present in every module)

A pathway design: the PKS product (left), post-PKS product (right), and the PKS module architecture with domain annotations.

Similarity scores

Each design reports how structurally similar its product is to your target, measured using the i maximum common substructure (MCS) metric on a 0–1 scale. A score of 1.0 means the product is an exact structural match to the target. Designs are sorted by similarity, with the best matches shown first.

Post-PKS pathways

If Bio and/or Chem steps were used, the results show the predicted reaction pathways connecting the PKS product to the final target (or to the closest reachable product). Each pathway lists:

Reaction SMILES — The chemical transformation at each step, shown as both a rendered image and a text string
Reaction rules — The named rule that predicted each reaction (e.g., rule0001 for enzymatic rules). Enzymatic rules link to their associated UniProt enzyme entries.
Feasibility scores — Each reaction is scored 0–1 by an XGBoost classifier. The net feasibility is the product of all individual scores — higher is better.

Post-PKS pathways: reaction images, reaction rules with UniProt links, feasibility scores, and enzyme retrieval buttons.

Enzyme retrieval

For each predicted reaction, you can click the search button to retrieve closely-matched natural reactions from a database of ~55,000 known enzymatic reactions. Matches are ranked by i RCMFP similarity and include:

A rendered image of the matched natural reaction
The EC number with links to BRENDA, KEGG, and MetaCyc
Protein reference links to UniProt or GenBank

These matched enzymes can serve as starting points for experimental validation of the predicted pathway.

Try it yourself

Ready to try the workflow? Use the buttons below to either load the heptane example into the query form (so you can explore the settings and submit it yourself) or jump straight to the pre-computed results.

Load heptane example in query form View pre-computed heptane results

Tutorial