Helix Insight

Documentation / FAQ

Frequently Asked Questions

What file formats does Helix Insight accept?

Single-sample VCF files in version 4.1 or 4.2 format, either plain text (.vcf) or bgzipped (.vcf.gz). Multi-sample VCFs must be split into individual samples before upload.

Which genome builds are supported?

GRCh38 (hg38) is the primary build. GRCh37 (hg19) files are accepted and automatically lifted over to GRCh38 using CrossMap. Earlier builds are not supported.

How long does analysis take?

Processing time depends on the input type: gene panels typically complete in 1-2 minutes, whole exome sequencing in 2-5 minutes, and whole genome sequencing in under 15 minutes. Screening and phenotype matching add less than one minute each.

Where is patient data stored?

All data is processed and stored on EU-based infrastructure in Helsinki, Finland (Hetzner AX162R dedicated server). No patient data is transmitted to external services, cloud providers, or AI APIs.

Is patient data deleted after analysis?

Yes. VCF files and analysis results are retained only for the duration needed for clinical review. Data deletion policies comply with GDPR requirements.

Which ACMG criteria are automated?

Helix Insight automates 19 of 28 ACMG criteria: PVS1, PS1, PM1, PM2, PM4, PM5, PP2, PP3, PP4, PP5, BA1, BS1, BS2, BP1, BP3, BP4, BP5, BP6, and BP7. The remaining 9 require clinical judgment, functional studies, or family segregation data.

What computational predictor is used for PP3/BP4?

BayesDel_noAF with ClinGen SVI-calibrated thresholds (Pejaver et al. 2022). The "_noAF" variant excludes allele frequency from its model to avoid circular reasoning with PM2, BA1, and BS1. SpliceAI is used independently for splice impact (PP3_splice).

Can Helix Insight detect structural variants or CNVs?

No. The platform currently analyzes single nucleotide variants (SNVs) and small insertions/deletions (indels). Structural variants, copy number variants, repeat expansions, and mitochondrial variants require specialized pipelines.

Does the platform support somatic variant interpretation?

No. Helix Insight is designed for germline variant interpretation in Mendelian disease contexts. Somatic variant analysis requires tumor-normal paired analysis with different classification frameworks (AMP/ASCO/CAP).

What databases are used for annotation?

gnomAD v4.1 (population frequencies), ClinVar 2025-01 (clinical significance), dbNSFP 4.9c (functional predictions), SpliceAI (splice impact), HPO (gene-phenotype associations), ClinGen (dosage sensitivity), and Ensembl VEP Release 113 (consequence annotation). All databases are stored locally.

How does phenotype matching work?

Patient HPO terms are compared against gene-phenotype associations using Lin semantic similarity within the HPO ontology graph. Genes are ranked by the strength of phenotype correlation and assigned clinical priority tiers. See the Phenotype Matching documentation for details.

What does the AI clinical assistant use for its model?

A large language model hosted on dedicated GPU infrastructure within the EU. All inference happens on-premise through a secure internal connection. No patient data is sent to external AI services.

Can the AI assistant modify variant classifications?

No. ACMG classifications are determined by the automated pipeline. The AI assistant can explain why criteria were triggered and discuss the evidence, but it cannot change classifications. Only the reviewing geneticist can override classifications.

Is the clinical interpretation a medical diagnosis?

No. The AI-generated clinical interpretation is a decision support tool. All findings must be independently validated by a qualified clinical geneticist before being used in patient care. Reports include a standard disclaimer stating this requirement.

How often are reference databases updated?

ClinVar is updated quarterly or more frequently for clinically significant changes. gnomAD major releases are adopted within 3 months of publication. All updates undergo validation testing with a reference cohort before deployment. See the Database Update Policy for the complete schedule.

Can I use Helix Insight for research purposes?

Yes. The platform is suitable for both clinical and research use. For research applications, the same analytical rigor applies but reporting requirements may differ from clinical diagnostic settings.

What happens if ClinVar and the automated classification disagree?

Helix Insight applies a transparent priority system: BA1 always overrides ClinVar, expert-panel ClinVar assertions can override the automated classification, and conflicting evidence is flagged for manual review. See the ClinVar Integration documentation for the complete decision logic.