t test for unmatched data.
To come to this conclusion you should follow the steps outlined in appendix b.
B.1 What type of investigation am I designing?
This is an experiment, you are starting out with a question (hypothesis) (go to B.2).
B.2 Which type of hypotheses am I testing?
There are three types of hypotheses which you need to choose between. If you are not sure which type of hypotheses you will be testing read the information in B2.1 - B.2.3 before deciding. For more information about hypotheses and hypothesis testing read Chapter 4.
In our example the student does not have an expectation. She only had one treatment variable for each metal tested (streams) and so did not want to test for an association. Therefore she wished to test for differences (Hypotheses type 3).
B2.3 Do samples come from the same or different populations?
These are the type of hypotheses that are most frequently investigated by undergraduates. There are many tests that will test this type of hypotheses. These tests fall into parametric tests (Chapter 7) to be used when you have Normally distributed data and non-parametric tests (Chapter 8) when your data are not Normally distributed. To tell if your data are Normally distributed refer to BOX 3.2.
In this example the observations recorded will be 'concentration of a particular metal' (µg metal g-1 sediment ). Although this is a derived variable it is an interval scale and therefore at this stage we should consider parametric tests.
B.2.3.1 Parametric tests
These are largely selected on the basis of the experimental design - how many variables, how many categories in each variable, how many replicates in each category. In this example there is one treatment variable (streams), two samples (an urban stream and a rural stream) and each with 10 observations. From the table it would appear that a t or z test for unmatched data may be appropriate.
Experimental design |
Test |
You have one treatment variable. You are going to compare two samples. The data is unmatched. |
t or z test for unmatched data (7.1 or 7.2). |
You have one treatment variable. You are going to compare two samples. The data is matched. |
t or z test for matched data (7.3) |
You have one treatment variable. You are going to compare two or more samples. You wish to test general and specific hypotheses. |
One-way parametric ANOVA and Tukey's test (7.5 and 7.6) |
You have two treatment variables. Each variable has at least two categories or classes and all categories from one variable are combined with all categories from the second variable. You wish to test general and specific hypotheses. |
Two-way parametric ANOVA and Tukey's test (7.7 and 7.8) |
You have two treatment variables. Each variable has at least two categories. One variable is randomised or nested with respect to the second variable. You wish to test general hypotheses. |
Two-way nested ANOVA (7.9) |
You have three treatment variables. Each variable has at least two categories and all categories from each variable are combined with all other categories from the other variables. You wish to test general and specific hypotheses. |
Three-way parametric ANOVA (7.10) |
|
None of the above
|
Chapter 8. and Sokal & Rohlf, 1981. |
When we examine the criteria for the t and z tests for unmatched data it is clear that given there are only 10 observations in each sample we should consider the t test. To use the t test for unmatched data (7.2.1) you:
- Wish to test for differences in population means.
- Have one treatment variable and two samples.
- Have unmatched data.
- Have parametric data.
- Have fewer than 30 observations in each sample but the sample sizes need not be equal.
- Have homogeneous variances.
We can see that criteria 1, 2, 3 and 5 are met by the current design. It is not possible to confirm with any confidence that the data are parametric until they are available for checking. Similarly, you cannot carry out an F test to confirm that the variances are homogeneous until you have the data. At this point however it appears that a t test for unmatched data may be suitable. This test would be used to compare the concentration of each metal in the two streams in turn.