Self help essay: The normal distribution Essays

The normal distribution Essays The normal distribution Essay The normal distribution Essay When many measures are taken of something (eg, scores in a test, peoples heights, pollution levels in rivers) the spread of the values will have a bell shape, called the normal distribution. A number of statistical tests use this characteristic distribution (or dispersion) of values to test whether two samples are the same or different. There are several basic terms that are commonly used with the normal distribution. Average (mean) A measure of the average score in a set of data. The mean is found by adding up all the scores and then dividing by the number of scores. Range The difference between the largest core and the and smallest score. Median If a set of scores are arranged from lowest to highest the median is the score in the middle, with half above and half below. Mode The value that occurs most often Standard deviation s A measure of the standard (average) deviation of the scores from the mean.The larger the standard deviation the larger the range of values/variation in the data 1. Subtract each score from the mean 2. Times each difference by itself (negs turn positive) 3. Add up all the squared differences 4. Divide the total by the number of scores minus 1 5. Take the square root Standard deviation The normal distribution at right shows the percentage of scores/observations that lie within one, two or three standard deviations either side of the mean. That is, 68%, 95% and 99.7%. The 95% value, which is used as the standard in tests of significance) lies between 1.96 standard deviations either side of the mean. Standard deviation of a sample is the squared difference between each score and the mean ? means the sum of all the squared differences (add them all up) n 1 means the number of scores minus 1 The table below shows how to calculate the average and the standard deviation of a set of seven example scores in the first column. The average is 39.93 and the standard deviation is 2.73. Sample scores Score average (average = 39.93) (Score average)2 41 1.07 1.14 43 3.07 9.42 37.5 -2.43 5.90 38.5 -1.43 2.04 44 4.07 16.56 38 -1.93 3.72 37.5 -2.43 5.90 Average = 39.93 44.71 Total up all the squared differences Range = Median = Mode = 6.5 38.5 37.5 Then calculate the square root to get the standard deviation Comparing two samples: using the t test The average, standard deviation and the number of scores in each sample are the three things needed to do a t test. A t test is used with two samples of data to test whether they are significantly different (ie, whether one is truly higher or lower than the other). The same sample of scores as used above is now compared with another sample of scores. Sample 1 scores Sample 2 scores 41 38 43 32 37.5 35.5 38.5 33 44 31.5 38 40.5 37.5 34 Average () 39.93 34.93 Standard deviation (s) 2.73 3.31 Number of scores (n) 7 7 1. Put the values into the equation and work it out carefully! 2. Note down the value of t found. In this case it is 3.08. 3. You will also need to know how many degrees of freedom to use with the critical values of t table. Degrees of freedom = (nsample1 + nsample2) 2 . In this example this equals 7 + 7 -2 = 12. 4. Find the value of t for that number of degrees of freedom using the table supplied (it is 2.179). Since the value calculated for your data is higher than it the difference is judged to be significant/real (at the 5% level). That is, the difference between the samples has less than a 5% chance of occurring by chance (being a fluke). It doesnt matter if the value of t is negative or positive: just use the positive value to work significance. Mathematics assignment (Due 12.00 pm Monday the 13th January 2003, and must be handed in to Jeanette Bray or Jean Worrell at the Help Desk in UH 255). I need to see how each question was worked out. I will not give marks for answers without calculations. I will be available for revision for the exam in GN101 10-12 and 2-4 on Fri 3rd. 1. The relationship between wombat weight and their production of methane gas is shown below. a) Draw a line of best fit through the data points and use it to derive the equation for the line (y = mx + c). b) Rearrange it to solve for x. That is x = .. c) Use the equation from part b to predict the weight of a wombat that produced 12.2 mg methane per hour. 2. Calculate the average, range, median and mode for the following set of data (a random set of your exam results from the last exam): 66.25, 15, 32.5, 26.25, 48.75, 48.75, 36.25, 35, 68.75, 72.5, 43.75, 40, 20, 48.75, 12.5, 41.25, 53.75, 50, 31.25, 95, 22.5, 33.75, 27.5, 55, 12.5, 45, 18.75, 42.5, 62.5, 85, 75 Degrees of freedom Value of t that must be exceeded (5% level) 1 12.706 2 4.303 3 3.182 4 2.776 5 2.571 6 2.447 7 2.365 8 2.306 9 2.262 10 2.228 11 2.201 12 2.179 13 2.160 14 2.145 15 2.131 16 2.120 17 2.110 18 2.101 19 2.093 20 2.086 22 2.074 24 2.064 26 2.056 28 2.048 30 2.042 40 2.021 60 2.000 120 1.980 3. The two sets of data given below are resting heart rates for a group of students and a group of professional athletes. Use the t test to find out if they are significantly different (using the table at right to test the value of t with the appropriate number of degrees of freedom). I need to see how the mean, standard deviation and t value were calculated. Professional Students athletes 57.1 61.7 47.6 47.0 58.0 55.5 74.8 62.6 66.4 41.8 51.9 60.8 64.2 50.2 49.6 44.2 67.2 45.4 62.6 39.3

Self help essay

Wednesday, March 18, 2020

The normal distribution Essays

No comments:

Post a Comment

Blog Archive

About Me