Loading…

Allele Frequency from Genotype Counts

Biology • Population Genetics

View all topics
This calculator computes allele frequencies from genotype data: \[ p=\frac{2\cdot AA + Aa}{2N},\qquad q=\frac{2\cdot aa + Aa}{2N} \]
If one genotype is missing, you can still solve it if N is provided (the missing value is inferred by subtraction). If you only have phenotype counts (dominant vs recessive), allele frequencies are not identifiable without extra assumptions.

Inputs: genotype counts

Paste (CSV)

Paste one row with three values (AA, Aa, aa). Delimiters: comma, semicolon, or tab. Headers allowed.

Upload CSV

Upload a small CSV file containing AA, Aa, aa (one row or column).

I only have phenotype counts (not genotypes)
With only phenotype counts (dominant phenotype vs recessive phenotype), the genotype breakdown (AA vs Aa among dominants) is unknown. Therefore, p and q are not identifiable without assumptions (for example, assuming Hardy–Weinberg equilibrium).
Ready

Rate this calculator

0.0 /5 (0 ratings)
Be the first to rate.
Your rating
You can update your rating any time.

Frequently Asked Questions

How do you calculate allele frequency from genotype counts?

First compute N = AA + Aa + aa. Then p = (2*AA + Aa)/(2*N) and q = (2*aa + Aa)/(2*N), where 2*N is the total number of alleles in the sample.

How do I compute p and q from genotype frequencies instead of counts?

If you have fAA, fAa, and faa with fAA + fAa + faa = 1, then p = fAA + 0.5*fAa and q = faa + 0.5*fAa.

Can I solve allele frequencies if one genotype count is missing?

Yes, if you also provide the population size N. The missing genotype count can be inferred by subtracting the known counts from N before computing p and q.

Why are phenotype counts not enough to determine allele frequencies?

With only dominant vs recessive phenotype counts, the split between AA and Aa among dominants is unknown. Without extra assumptions (such as Hardy-Weinberg equilibrium), p and q are not identifiable from phenotype counts alone.