TECHNOLOGY

Bioinformatics

Integrated bioinformatics for high-confidence break nomination

Gene editing off-target mapping routinely generates 100s of nominated sites per experiment. The technical challenge is no longer simply detection, it’s discrimination of what’s critical.

The results? Weeks to months of analysis, fragmented insight and unclear next steps. As programs move toward IND-enabling studies, these limitations become critical.

Most current analysis approaches are:

• Loosely coupled to wet-lab methods 
•
Built as part of academic projects and are infrequently maintained
• Lack of actionable insights  
• Dependent on specialist bioinformatics expertise 
•
Computationally intensive requiring complex local infrastructure
• Focused on data output rather than decision prioritization  

Purpose built for INDUCE-seq®

INDUCE-seq® Analysis is the dedicated bioinformatics engine developed alongside the INDUCE-seq® assay. Together they form an end-to-end architecture designed specifically for break mapping at scale, from read structure through to statistical nomination.

Tight coupling between assay chemistry and analytical framework enables higher-confidence interpretation of genome-wide breaks.

What this enables

Interactive reports for rapid interpretation 

Gradient icon of a tick

Easy to understand yet detailed graphical outputs  

Gradient icon of a tick

No demand for specialist bioinformatics team 

Gradient icon of a tick

No local infrastructure required

Gradient icon of a tick

Platform capabilities

Digital illustration of a person using a mousepad

Fully integrated and
user-friendly

Cloud with an upward arrow symbol

On-demand cloud platform

Simple stopwatch icon with a black background and gradient purple to orange clock face.

Typical analysis time< 2 hours  

Icon of a document with a checkmark

Decision-ready outputs 

Core analytical workflow

An arrow hitting the bullseye of a colorful target.

Precise break localization

INDUCE-seq® library reads map DNA breaks to exact genomic locations.

• FASTQ reads undergo QC and align to a reference genome
• Breaks are resolved at base-level precision
• Genome-wide break counts comparable across sites

Produces a genome-wide break frequency map.

An isometric illustration of a balance scale with two pans hanging from a central beam.

Normalized analysis  

Break sites are compared directly to matched controls.

• Sites are paired with equivalent control positions
• Treated:control ratios are calculated
• Background signal is filtered from endogenous breaks

This increases confidence in identifying true treatment-induced events.

A checklist on a white paper with three items marked with orange checkmarks, attached to a gray paperclip.

Break site annotation

Each detected break site is annotated to provide biological context.

• Gene and repeat region overlap 
• Reproducibility across biological replicates
• Proximity to guide-like sequences

This enables clear interpretation of break behavior.

Isometric illustration of a financial calculator with a percentage symbol on the screen and multiple buttons.

Probabilistic nomination  

Statistical modeling is applied to prioritize relevant sites.

• Distinguishes treatment-induced events from background
• Accounts for variation in high-background regions
• Prioritizes high-confidence candidate off-target sites

This supports focused downstream validation and risk assessment.

Report with confidence

Table 1: Showing nominated putative on- and off-target break sites resulting from treatment with CRISPR-Cas9 and an AAVS1 guide RNA. The rationale column demonstrates the criteria for nomination and the other columns include additional annotations that can be found in the full interactive report.

Figure 1: Showing the distribution and counts of breaks at the on-target site for an editing experiment using CRISPR-Cas9 and an EMX1 guide RNA. 1 base pair 5' overhang break structure can be observed.

Figure 2: Showing the distribution and counts of breaks at the on-target site for an editing experiment using CRISPR-Cas12a and an HPRT1 guide RNA. A broad 5' overhang break cutting pattern can be observed.