Lorem ipsum dolor sit amet, consectetur adipisicing elit. Odit molestiae mollitia laudantium assumenda nam eaque, excepturi, soluta, perspiciatis cupiditate sapiente, adipisci quaerat odio voluptates consectetur nulla eveniet iure vitae quibusdam? Excepturi aliquam in iure, repellat, fugiat illum voluptate repellendus blanditiis veritatis ducimus ad ipsa quisquam, commodi vel necessitatibus, harum quos a dignissimos.
Close Save changesHelp F1 or ? Previous Page ← + CTRL (Windows) ← + ⌘ (Mac) Next Page → + CTRL (Windows) → + ⌘ (Mac) Search Site CTRL + SHIFT + F (Windows) ⌘ + ⇧ + F (Mac) Close Message ESC
We are still working towards finding the theoretical mean and variance of the sample mean:
If we re-write the formula for the sample mean just a bit:
we can see more clearly that the sample mean is a linear combination of the random variables \(X_1, X_2, \ldots, X_n\). That's why the title and subject of this page! That is, here on this page, we'll add a few a more tools to our toolbox, namely determining the mean and variance of a linear combination of random variables \(X_1, X_2, \ldots, X_n\). Before presenting and proving the major theorem on this page, let's revisit again, by way of example, why we would expect the sample mean and sample variance to have a theoretical mean and variance.
A statistics instructor conducted a survey in her class. The instructor was interested in learning how many siblings, on average, the students at Penn State University have? She took a random sample of \(n=4\) students, and asked each student how many siblings he/she has. The resulting data were: 0, 2, 1, 1. In an attempt to summarize the data she collected, the instructor calculated the sample mean and sample variance, getting:
The instructor realized though, that if she had asked a different sample of \(n=4\) students how many siblings they have, she'd probably get different results. So, she took a different random sample of \(n=4\) students. The resulting data were: 4, 1, 2, 1. Calculating the sample mean and variance once again, she determined:
Hmmm, the instructor thought that was quite a different result from the first sample, so she decided to take yet another sample of \(n=4\) students. Doing so, the resulting data were: 5, 3, 2, 2. Calculating the sample mean and variance yet again, she determined:
That's enough of this! I think you can probably see where we are going with this example. It is very clear that the values of the sample mean \(\bar\)and the sample variance \(S^2\) depend on the selected random sample. That is, \(\bar\) and \(S^2\) are continuous random variables in their own right. Therefore, they themselves should each have a particular:
We are still in the hunt for all three of these items. The next theorem will help move us closer towards finding the mean and variance of the sample mean \(\bar\).
Suppose \(X_1, X_2, \ldots, X_n\) are \(n\) independent random variables with means \(\mu_1,\mu_2,\cdots,\mu_n\) and variances \(\sigma^2_1,\sigma^2_2,\cdots,\sigma^2_n\).
Then, the mean and variance of the linear combination \(Y=\sum\limits_^n a_i X_i\), where \(a_1,a_2, \ldots, a_n\) are real constants are:
\(\mu_Y=\sum\limits_^n a_i \mu_i\)
\(\sigma^2_Y=\sum\limits_^n a_i^2 \sigma^2_i\)
Let's start with the proof for the mean first:
Now for the proof for the variance. Starting with the definition of the variance of \(Y\), we have:
Now, substituting what we know about \(Y\) and the mean of \(Y\) Y, we have:
\(\sigma^2_Y=E\left[\left(\sum\limits_^n a_i X_i-\sum\limits_^n a_i \mu_i\right)^2\right]\)
Because the summation signs have the same index (\(i=1\) to \(n\)), we can replace the two summation signs with one summation sign:
\(\sigma^2_Y=E\left[\left(\sum\limits_^n( a_i X_i-a_i \mu_i)\right)^2\right]\)
And, we can factor out the constants \(a_i\):
\(\sigma^2_Y=E\left[\left(\sum\limits_^n a_i (X_i-\mu_i)\right)^2\right]\)
Now, let's rewrite the squared term as the product of two terms. In doing so, use an index of \(i\) on the first summation sign, and an index of \(j\) on the second summation sign:
\(\sigma^2_Y=E\left[\left(\sum\limits_^n a_i (X_i-\mu_i)\right) \left(\sum\limits_^n a_j (X_j-\mu_j)\right) \right]\)
Now, let's pull the summation signs together:
\(\sigma^2_Y=E\left[\sum\limits_^n \sum\limits_^n a_i a_j (X_i-\mu_i) (X_j-\mu_j) \right]\)
Then, by the linear operator property of expectation, we can distribute the expectation:
\(\sigma^2_Y=\sum\limits_^n \sum\limits_^n a_i a_j E\left[(X_i-\mu_i) (X_j-\mu_j) \right]\)
Now, let's rewrite the variance of \(Y\) by evaluating each of the terms from \(i=1\) to \(n\) and \(j=1\) to \(n\). In doing so, recognize that when \(i=j\), the expectation term is the variance of \(X_i\), and when \(i\ne j\), the expectation term is the covariance between \(X_i\) and \(X_j\), which by the assumed independence, is 0:
Simplifying then, we get:
\(\sigma^2_Y=a_1^2 E\left[(X_1-\mu_1)^2\right]+a_2^2 E\left[(X_2-\mu_2)^2\right]+\cdots+a_n^2 E\left[(X_n-\mu_n)^2\right]\)
And, simplifying yet more using variance notation:
\(\sigma^2_Y=a_1^2 \sigma^2_1+a_2^2 \sigma^2_2+\cdots+a_n^2 \sigma^2_n\)
Finally, we have:
\(\sigma^2_Y=\sum\limits_^n a_i^2 \sigma^2_i\)