The need for averages

Averages arise everywhere. In sports, we want to project the average number of games that a team is expected to win. In gambling, we want to project the average losses incurred when playing blackjack. In business, companies want to calculate their average expected sales for the next quarter.

Molecular biology is not immune from the need for averages. Researchers need to predict the expected number of antibiotic-resistant pathogenic bacteria in a future outbreak, estimate the predicted number of locations in the genome that will match a given motif, and study the distribution of alleles throughout an evolving population. In this problem, we will begin discussing the third issue. First, we need to have a better understanding of what it means to average a random process.

Assignment

For a random variable $$X$$ taking integer values between 1 and $$n$$, the expected value of $$X$$ is \[E(X) = \sum_{k=1}^n k \cdot P(X=k)\] The expected value offers us a way of taking the long-term average of a random variable over a large number of trials.

As a motivating example, let $$X$$ be the number on a six-sided die. Over a large number of rolls, we should expect to obtain an average of 3.5 on the die (even though it's not possible to roll a 3.5). The formula for the expected value confirms that \[E(X) = \sum_{k=1}^6 k \cdot P(X = k) = \frac{1}{2} \cdot (1 + 2 + 3 + 4 + 5 + 6) = 3.5\]

More generally, a random variable for which every one of a number of equally spaced outcomes has the same probability is called a uniform random variable (in the die example, this "equal spacing" is equal to 1). We can generalize our die example to find that if $$X$$ is a real-valued uniform random variable with minimum possible value $$a$$ and maximum possible value $$b$$, then $$E(X) = \frac{a + b}{2}$$. You may also wish to verify that for the dice example, if $$Y$$ is the random variable associated with the outcome of a second die roll, then $$E(X+Y) = 7$$.

Write a function expectedDominantOffspring that takes six positive integers. The integers correspond to the number of couples in a population possessing each genotype pairing for a given factor. In order, the six given integers represent the number of couples having the following genotypes:

argument# genotype pairing
1 $$AA$$ - $$AA$$
2 $$AA$$ - $$Aa$$
3 $$AA$$ - $$aa$$
4 $$Aa$$ - $$Aa$$
5 $$Aa$$ - $$aa$$
6 $$aa$$ - $$aa$$

The function must return the expected number of offspring displaying the dominant phenotype in the next generation, under the assumption that every couple has exactly two offspring.

Example

>>> expectedDominantOffspring(1, 0, 0, 1, 0, 1)
3.5