Documentation

Technical Reference Guide

Platform Overview

T-GUIDE is a bioinformatics platform for the design and analysis of TnpB-mediated genome editing guides. The system identifies target sites, predicts on-target efficiency using deep learning models, and evaluates genome-wide specificity via Cas-OFFinder. It further automates the generation of experimental validation primers.

The system provides quantitative metrics for the efficiency and specificity of proposed guide RNAs, facilitating high-precision targeting across a broad spectrum of species and ortholog variants.

Design Workflow

Four-Step Process

  • 1. Select Genome: Choose from a wide range of high-quality genomes spanning vertebrates, plants, and microbial systems.
  • 2. Select Variant: Pick the specific ortholog you are using. Each variant has a unique TAM requirement.
  • 3. Define Target: Enter a standard Gene Symbol, paste raw DNA, or fetch via NCBI Accession.
  • 4. Analyze Results: Generate a list of guides sorted by efficiency and checked for off-targets.

Input Flexibility

T-GUIDE accepts standard gene symbols (e.g., BRCA1) and automatically retrieves coordinates. For non-coding regions or custom constructs, raw sequence input allows for precise control over the target area.

Activity & Specificity

CNN Efficiency Scoring

Efficiency prediction is powered by a Convolutional Neural Network (CNN) trained on experimentally verified TnpB screening data. The model produces an activity score (0-100) based on the 20nt spacer and flanking genomic context.

Recommended: Select guides with scores >50 and GC content between 40-60%.

Cas-OFFinder Integration

Genome-wide specificity is evaluated for every identified guide. The results table provides a mismatch matrix (MM0 to MM3), allowing users to prioritize guides with zero or minimal off-target risks across the selected genome.

Genomes & Variants

We support an extensive collection of high-quality genomes sourced from Ensembl Release 109 and NCBI. Each genome is indexed for rapid sequence retrieval and off-target screening.

Common Name Scientific Name Accession ID
VERTEBRATES & METAZOA
HumanHomo sapiensGCA_000001405
MouseMus musculusGCA_000001635
RatRattus norvegicusGCA_036323735
ZebrafishDanio rerioGCA_000002035
ChickenGallus gallusGCA_016699485
CowBos taurusGCA_002263795
PigSus scrofaGCA_000003025
Fruit FlyDrosophila melanogasterGCA_000001215
RoundwormCaenorhabditis elegansGCA_000002985
PLANTS
Thale CressArabidopsis thalianaGCA_000001735
Rice JaponicaOryza sativaGCA_001433935
Rice IndicaOryza indicaGCA_000004655
Corn / MaizeZea maysGCA_902167145
WheatTriticum aestivumGCA_900519105
SoybeanGlycine maxGCA_000004515
SorghumSorghum bicolorGCA_000003195
BarleyHordeum vulgareGCA_904849725
TomatoSolanum lycopersicumGCA_000188115
PotatoSolanum tuberosumGCA_000226075
RapeseedBrassica napusGCA_000686985
Field MustardBrassica rapaGCA_000309985
Mung BeanVigna radiataGCA_000741045
Green FoxtailSetaria viridisGCA_000347855
BananaMusa acuminataGCA_000313855
CassavaManihot esculentaGCA_001659605
FUNGI
YeastSaccharomyces cerevisiaeGCA_000146045
Fission YeastSchizosaccharomyces pombeGCA_000002945
Rice Blast FungusPyricularia oryzaeGCA_000002495
Aspergillus nidulansAspergillus nidulansGCA_000011425
Candida albicansCandida albicansGCA_000182965
Neurospora crassaNeurospora crassaGCA_000182925
Fusarium graminearumFusarium graminearumGCA_900044135
Botrytis cinereaBotrytis cinereaGCA_000832945
PROTISTS
Potato BlightPhytophthora infestansGCA_000142945
BACTERIA
E. coli K12Escherichia coliGCA_000005845
Bacillus subtilisBacillus subtilisGCA_000009045

An extensive collection of TnpB variants are supported, each with specific Transposon-Associated Motif (TAM) requirements for precise editing.

TnpB Variant Organism TAM (5')
ISEfa4Enterococcus faeciumTTTA
ISAs2Aeromonas methanosarcinaCCAT
ISCpe2Clostridium perfringensTTTAT
ISMma22Methanosarcina mazeiTTTAG
ISBce3Bacillus cereusTTAC
ISBh16Bacillus thuringiensisTTTA
ISAeme8Aeromonas mediaTTAG
IS605Helicobacter pyloriTTTAA
ISTfu1Thermobifida fuscaTGAT
ISCco1Campylobacter coliTTAT
ISLjo5Lactobacillus johnsoniiTTTAA
ISSco1Salmonella entericaTGAC
ISSoc3Synechococcus sp.CCAT
ISTel2Thermosynechococcus elongatusTGAC
ISis2Sulfolobus islandicusTGAC
ISCht1Clostridium botulinumTTAC
ISNsp3Nostoc sp.TTAC
ISMac7Methanosarcina acetivoransTTTAT
ISHp608Helicobacter pyloriTTAC
ISHma12Haloarcula marismortuiTTTAG
ISClte2Clostridium tetaniTTTAC
ISsen1Salmonella entericaCCAT
IS606Helicobacter pyloriTTTAT
ISMba17Methanosarcina barkeriTTTAT
ISkpn69Klebsiella pneumoniaeCCAT
ISCsa5Caldicellulosiruptor saccharolyticusTTTA
ISHwa17Haloquadratum walsbyiTCTA
ISHalt1Halostagnicola larseniiTTTAA
ISMan18Natromonas pharaonisTGAT
ISCbt2Clostridium botulinumTACAT
ISDge10Deinococcus geothermalisTTAT
ISEc41Escherichia coliCCAT
ISNsp2Nostoc sp.TTAC
ISRor9Raoultella ornithinolyticaTTAA
ISAba30Acinetobacter baumanniiTGAC
ISCysp13Cyanothece sp.TGAA
ISMmu1Mitsuokella multacidaTTCAA
ISNsp7Nostoc sp. NIES-4103TTAG
ISAam1Anoxybacillus amylolyticusTTAA
ISTva1Trichormus variabilisTTGAT
ISClsp11Clostridium sp.TTAA
ISYmu1Youngiibacter multivoransTTGAT
ISVga2Vagococcus sp.TTAT
ISBsp10Bacillus sp.TTTAT
ISCath1C. thermoamylovoransTTTAT
ISCysp1Clyobacillus sp.TTTA
ISCvi1Clostridium vitiobionisTTAT
ISCvi2Clostridium vitiobionisTTTAA
ISAep0Anaerobacillus salsusTTAA
ISViha1V. halodenitrificansTTAT
ISCno2Clostridium novyiTTTAA
ISDra2Deinococcus radioduransTTGAT
ISAam1-altAnoxybacillus amylolyticusTTTAA

Experimental Validation

Automated Primer Design

T-GUIDE generates PCR validation primers for every guide RNA using the Primer3 engine. Pairs are optimized for a 60°C melting temperature and amplicon sizes between 250-350 bp, facilitating rapid gel or sequencing-based validation.

Genome Visualization

An interactive browser visualizes guide distribution across gene structures (exons and introns). This helps in selecting targets that maximize the likelihood of functional modifications.