Section 8.2 Investigation 2.8: Turbidity
Another measure of water quality is turbidity, the degree of opaqueness produced in water by suspended particulate matter. Turbidity can be measured by seeing how light is scattered and/or absorbed by organic and inorganic material. Larger nephelometric turbidity units (NTU) indicate increased turbidity and decreased light penetration. If there is too much turbidity, then not enough light may be penetrating the water, affecting photosynthesis to the surface and leading to less dissolved oxygen.

CRiggs (2002) provides 244 turbidity monthly readings that were recorded between 1980-2000 from a reach of the Mermentau River in Southwest Louisiana (
MermentauTurbidity.txt). The unit of analysis was the monthly mean turbidity (NTU) computed from each monthβs systematic sample of 21 turbidity measurements. The investigators wanted to determine whether the mean turbidity was greater than the local criterion value of 150 NTU.
Checkpoint 8.2.1. Analyze original turbidity distribution.
Checkpoint 8.2.2. Apply log transformation.
Carry out a log transformation (See SectionΒ 6.2) and report the mean, median, and standard deviation of the transformed data. Verify that a log-normal probability model is reasonable for these data.
Checkpoint 8.2.3. Compare original and transformed statistics.
Are the mean and median values in checkpoint 8.2.2 similar to each other? Is the mean of the logged turbidity values similar to the log of the mean of the turbidity values? Is the median of the logged turbidity values similar to the log of the median of the turbidity values?
| Original scale | Log of original values | Transformed scale |
|---|---|---|
| Mean = | Log (mean) = | Mean (log turbidity) = |
| Median = | Log (median) = | Median (log turbidity) = |
Explanation:
Solution.
The mean and median values in (b) are similar as we would expect with the transformed data now being symmetric.
The median turbidity value (before transforming) was 75.5. The natural log of this value is 4.3241. This is the same as the median of the log-turbidity values.
The mean turbidity value (before transforming) was 100.307. The natural log of this value is 4.6082, which is not the same as the mean of the log-turbidity values.
| Original scale | Log of original values | Transformed scale |
|---|---|---|
| Mean = 100.3 | Log (mean) = 4.608 | Mean (log turbidity) = 4.301 |
| Median = 75.5 | Log (median) = 4.324 | Median (log turbidity) = 4.324 |
Checkpoint 8.2.6. Explain median vs. mean transformation.
Solution.
We expect the medians to match because the median is essentially the middle value (slightly untrue with an even number of observations) and the same observation will still be the median value after the log transformation. However, the mean utilizes the numerical values and the relative distances between observations is changed with the log transformation.
Checkpoint 8.2.7. Calculate confidence interval for transformed data.
Checkpoint 8.2.8. Back-transform and interpret interval.
Back-transform the endpoints of this interval to return to the original units and interpret the interval in context.
Discussion: Transformations and Back-Transformation.
Once you create a symmetric distribution, the mean and median will be similar. However, although transforming the data does not affect the ordering of the observations, it does impact the scaling of the values. So whereas the median of the transformed data is equal to taking the log of the median of the original data (at least with an odd number of observations), this does not hold for the mean. So when we back-transform our interval for the center of the population distribution, we will interpret this in terms of the median value rather than the mean value.
Study Conclusions.
A 95% confidence interval based on the transformed data equals (4.20, 4.40). Therefore, we will say we are 95% confident that the median turbidity in this river is between 66.69 NTU and 81.45 NTU, clearly less than the 150 NTU regulation. However, another condition for the validity of this procedure is that the observations are independent. Further examination of these data reveals seasonal trends. Adjustments need to be made to account for the seasonality before these data are analyzed.
Subsection 8.2.1 Practice Problem 2.8
Return to the
honking.txt data from Investigation 2.2.
Checkpoint 8.2.9.
Use the log transformation to calculate a 95% confidence interval for the mean log response time.
Checkpoint 8.2.11.
Outline a method for verifying that this procedure results in a 95% confidence interval for the population median.
You have attempted of activities on this page.



