- Strains: 766
- WGS strains: 429
- Isotypes: 330
- Genome Version WS263
Wild isolate genomes are aligned and stored using the BAM format. BAMs are available in the table below.
Downloading All Alignment Data
You can download all alignment data using the script below. Before this script will work, you need to download and install wget. We recommend using Homebrew for this installation (Unix/Mac OS), or Cygwin on windows. See the FAQ for details on installing wget.
Information regarding alignment, variant calling, and annotation are available here.
We used samtools to identify single-nucleotide variant (SNV) sites as compared to the N2 reference genome. Variant data are provided as VCF or tab-delimited files.
VCFs generated from the variant calling pipeline are provided below.
- Soft-filter - Includes all variants and annotations. The QC status of variants is included.
- Hard-filter - Variants and genotypes that fail QC are removed.
- Imputed - An imputed dataset generated from the hard-filter VCF.
You can programmatically access specific regions of VCF files (rather then the entire file) from the command line:
Download Strain Data
Currently, we have performed variant calling across all wild isolates. We are working to add additional variant classes including insertion/deletion, structural, transposon, and additional variant classes.
We have recently performed an anaysis characterizing Transposon variation in C. elegans. The dataset will be further integrated with the site resources as time goes on. For now, the raw data are available below.Download Transposon Data
The following statistics were generated with
bcftools stats. The soft-filtered VCF for this release has had records and genotypes annotated but no data has been removed. The hard-filtered VCF removes records and genotypes that have been annotated with filters.
The hard-filtered VCF has stripped records and genotypes that have had filters applied.