To do data analysis in Excel, you have to install add-in “Analysis ToolPak” on
Office Icon (left top corner) –> Excel options –> Add-Ins : highlight “analysis ToolPak” and click GO button
then we will have ‘Dana Analysis’ icon on Analysis group on DATA tab
The correlation coefficient between two series, say x and y, equals
Covariance(x,y) / [Sqrt(Variance(x)) * Sqrt(Variance(y))]
- Covariance(x,y) is the sample covariance between x and y: (1/(n-1)) × Σ i (xi – xbar)(yi – ybar)
- Variance(x) is the sample variance of x: (1/(n-1)) × Σ i (xi – xbar)2
- Variance(x) is the sample variance of y: (1/(n-1)) × Σ i (yi – ybar)2
CALCULATION USING THE DATA ANALYSIS ADD-IN
This requires the Data Analysis Add-in: see Excel 2007: Access and Activating the Data Analysis Add-in
The data used are in carsdata.xls
We consider only two series, but we could do the same for more than two series.
- In the Data Group select the Data Analysis Add-in
- Select Correlation
- Fill out the Correlation dialog box as below
Hit enter yields
The correlation coefficient is 0.894427.
This can be extended to several series.
For example if there are data in columns A, B, C, D and E then the array chosen is A1:E6 and produces a 5 x 5 table of correlations.
CALCULATION USING THE CORREL FUNCTION
This does not require the Data Analysis Add-in
- Click on the cell you want result to appear on.
- On the Formula Tab select the Function Library group and More Functions and Statistical
- Select Correlation and fill out the dialog box as below
Alternatively directly type = CORREL(A1:A6,B1:B6) which yields 0.894427.
Note that Excel dropped the first row (or labesl).
= CORREL(A2:A6,B2:B6) yields the same result.
This is obtained in a similar way to correlation.
- We can use Data Analysis Add-in and Covariance
- We can use function CORREL
IFor example, = COVAR(A1:A6,B1:B6) yields 0.8.