This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Find out more here.

How 23andMe reports genotypes

In the raw data, you can view your genotype at a particular location in your DNA. The raw data is presented in an uninterpreted format. This data has undergone a general quality review, however only a subset of markers have been individually validated for accuracy. As such, the data from 23andMe's Browse Raw Data feature is suitable only for research, educational, and informational use and not for medical or other use.

Base Pairs (A, C, T, G)  |  Insertions and Deletions (II, DD)  |  Strandedness 

Base Pairs

For most SNPs, genotypes will be reported as the set of nucleotides or the base pair found at that location. There are four types of bases: adenine (A), thymine (T), guanine (G), and cytosine (C).

Insertions and Deletions

On occasion, one or more bases may be inserted into or deleted from the genetic code at a particular location. In the 23andMe raw data, an insertion is reported as "I" and a deletion as "D".

Depending on where in the genome the change is located, either “I” or “D” could represent the normal version of the variant. In other words, there are some places in the genome where having an extra base (insertion, or "I") is the normal variant, and having a deletion, or "D", is the rare variant. Conversely, there are some places in the genome where having an insertion is rare, making a deletion the normal variant at that location. The raw data does not indicate which version is considered the normal version.

Genotyping does not report on all possible insertions or deletions. In general, the ones reported on are small, spanning only one or a few bases.


The SNP genotypes in the Browse Raw Data feature might not match what you learn about the SNP from other sources such as dbSNP. This is because every SNP can be represented using either of the two DNA strands, and this representation will often differ from database to database or publication to publication.

For example, 23andMe might report that a SNP has two versions, G and A. But other sources may report that the versions are C and T. Because of the double-stranded nature of DNA, both ways of reporting the SNP are correct: G pairs with C on the opposite DNA strand, while A pairs with T.

All of the genotypes displayed in Browse Raw Data are oriented with respect to the positive strand on the reference assembly of the human genome (build 37). Note that this could be different from how the SNP is oriented in dbSNP, or how it might be presented in a publication.

Was this article helpful?
101 out of 121 found this helpful

Didn't find what you were looking for?

Submit a request

Or call 1-800-239-5230
Monday through Friday, 9:00am to 12:00pm and 1:00pm to 4:00pm PST/PDT

Let us know what you think of our Help Center by taking a quick survey.