Præsentation er lastning. Vent venligst

Præsentation er lastning. Vent venligst

Agenda Informationer Skalaer Deskriptiv statistik Dagens øvelser

Lignende præsentationer


Præsentationer af emnet: "Agenda Informationer Skalaer Deskriptiv statistik Dagens øvelser"— Præsentationens transcript:

1 Agenda Informationer Skalaer Deskriptiv statistik Dagens øvelser
Supplerende litteratur ”Pitch-dag” den 4/3 Skalaer Bedst På Nettet skema Deskriptiv statistik Teori Små øvelser i Excel Dagens øvelser Pitch-tale

2 Spørgeskema fra www.bedstpaanettet.dk
2

3 Både tilfreds og utilfreds
Eksempler på skalaer Spørgsmål: Hjemmesiden giver alt i alt et positivt helhedsindtryk Mulige skalaer: Meget enig Enig Uenig Meget uenig Ved ikke Meget tilfreds Tilfreds Både tilfreds og utilfreds Utilfreds Meget utilfreds I meget høj grad I høj grad I middel grad I nogen grad Slet ikke

4 Valg ved udformning af skalaer skal der vælges …
Antal svarkategorier (2, 3, 4, 5, 6, 7, ...) Lige eller ulige antal svarkategorier (= skal der være en midterkategori?) Navne / symboler på svarkategorier Rækkefølge (”positiv” først eller sidst) Scoring af svarkategorier, f.eks. 5 = meget tilfreds og 1 = meget utilfreds 4

5 Det vi har til hensigt at måle
Validitet Validitet Validitet, (af lat. validitas 'styrke'), gyldighed, korrekthed, sandhed. Inden for de empiriske videnskaber stilles krav om en målings validitet, dvs. at der måles det, man har til hensigt at måle. Kilde: Den Store Danske Encyklopædi. Det vi har til hensigt at måle Det vi faktisk måler

6 Operationalisering Def.: Flow er en tilstand, hvor man udnytter sine evner fuldt ud og er så opslugt af sine opgaver, at man glemmer tid og sted. Tilstanden bevirker stærke positive følelser, hvilket er kroppens belønning for, at man udnytter sine evner fuldt ud. (Professor Mihaly Csikszentmihalyi). I flow tilstanden er man maksimalt engageret og man udnytter sine kompetencer maksimalt. 1.1 Jeg ved hvad der forventes af mig på kurset. 1.2 Jeg føler mig fagligt "klædt på" til at løse opgaverne. 1.3 Jeg har tilstrækkeligt med faglige udfordringer. 1.4 Jeg har mulighed for at lære nyt. 1.5 Jeg har indflydelse på tilrette-læggelsen af mine opgaver. 1.6 Jeg har gode fysiske rammer for at løse mine opgaver (plads, lokaler, redskaber osv). Kompetencekrav Kompetencer Angst Stress Flow zone Bekym ring Ked som hed Afslap ning Kontrol

7 Exploring Data with Graphs and Numerical Summaries
Calculating the mean Calculating the median Comparing the Mean & Median Definition of Resistant

8 Mean (gennemsnittet) The mean is the sum of the observations divided by the number of observations n betegner antallet af observationer (=stikprøvestørrelsen) y1, y2, y3, … yi ,..., yn betegner de n observationer betegner gennemsnittet It is the center of mass. Do it in Excel

9 Median The median is the midpoint of the observations when they are ordered from the smallest to the largest (or from the largest to smallest) Order observations If the number of observations is: Odd, then the median is the middle observation Even, then the median is the average of the two middle observations

10 ______________________________
Median 1) Sort observations by size, n = number of observations ______________________________ 2,a) If n is odd, the median is observation (n+1)/2 down the list  n = 9 (n+1)/2 = 10/2 = 5 Median = 99 2,b) If n is even, the median is the mean of the two middle observations n = 10  (n+1)/2 = 5,5 Median = (99+101) /2 = 100

11 Comparing the Mean and Median
The mean and median of a symmetric distribution are close together, For symmetric distributions, the mean is typically preferred because it takes the values of all observations into account

12 Comparing the Mean and Median
In a skewed distribution, the mean is farther out in the long tail than is the median For skewed distributions the median is preferred because it is better representative of a typical observation

13 Resistant Measures A numerical summary measure is resistant if extreme observations (outliers) have little, if any, influence on its value The Median is resistant to outliers The Mean is not resistant to outliers Hvis I kun kan få én oplysning om løn-niveauet i en virksomhed med 4 ansatte, vil I så have gennemsnit eller median?

14 Exploring Data with Graphs and Numerical Summaries
Calculate the range (variationsbredden) Calculate the standard deviation Know the properties of the standard deviation, s Know how to interpret the standard deviation, s: The Empirical Rule

15 Range One way to measure the spread is to calculate the range. The range is the difference between the largest and smallest values in the data set; Range = max  min The range is strongly affected by outliers

16 Standard Deviation Find the mean
Find the deviation of each value from the mean Square the deviations Sum the squared deviations Divide the sum by n-1

17 Standard Deviation Gives a measure of variation by summarizing the deviations of each observation from the mean and calculating an adjusted average of these deviations. Site Obs. Sum n Gns. Std.afv. U 5 15 3 0,0 V 4 6 1,0 X 7 2,0 Y 2 8 3,0 Z 1 9 4,0

18 Properties of the Standard Deviation
s measures the spread of the data s = 0 only when all observations have the same value, otherwise s > 0, As the spread of the data increases, s gets larger, s has the same units of measurement as the original observations. The variance=s2 has units that are squared. s is not resistant. Strong skewness or a few outliers can greatly increase s.

19 Magnitude of s: Empirical Rule

20 Exploring Data with Graphs and Numerical Summaries
Obtaining quartiles and the 5 number summary Calculating interquartile range and detecting potential outliers Drawing boxplots Comparing Distributions

21 Percentile The pth percentile is a value such that p percent of the observations fall below or at that value

22 Finding Quartiles Splits the data into four parts
Arrange the data in order The median is the second quartile, Q2 The first quartile, Q1, is the median of the lower half of the observations The third quartile, Q3, is the median of the upper half of the observations

23 Measure of spread: quartiles
Quartiles divide a ranked data set into four equal parts. The first quartile, Q1, is the value in the sample that has 25% of the data at or below it and 75% above The second quartile is the same as the median of a data set, 50% of the obs are above the median and 50% are below The third quartile, Q3, is the value in the sample that has 75% of the data at or below it and 25% above Q1= first quartile = 2,2 M = median = 3,4 We are going to start out with a very general way to describe the spread that doesn’t matter whether it is symmetric or not - quartiles, Just as the word suggests - quartiles is like quarters or quartets, it involves dividing up the distribution into 4 parts, Now, to get the median, we divided it up into two parts, To get the quartiles we do the exact same thing to the two halves, Use same rules as for median if you have even or odd number of observations, Now, what an we do with these that helps us understand the biology of these diseases? Q3= third quartile = 4,35

24 Quartile Example Find the first and third quartiles
Number of downloaded apps the last 10 days: What is the correct answer? Q1 = 2 Q3 = 47 Q1 = 12 Q3 = 31 Q1 = 11 Q3 = 32 Q1 =12 Q3 = 33

25 Calculating Interquartile range
The interquartile range is the distance between the third quartile and first quartile: IQR = Q3  Q1 IQR gives spread of middle 50% of the data

26 5 Number Summary The five-number summary of a dataset consists of the
Minimum value First Quartile Median Third Quartile Maximum value

27 Criteria for identifying an outlier
An observation is a potential outlier if it falls more than 1,5 x IQR below the first quartile or more than 1,5 x IQR above the third quartile

28 Exploring Data with Graphs and Numerical Summaries
Distribution Graphs for categorical data. Overvej: Søjle diagram Lagkage diagram Graphs for quantitative data. Overvej : Box-plot Sum kurve

29 Boxplot A box goes from the Q1 to Q3
A line is drawn inside the box at the median A line goes from the lower end of the box to the smallest observation that is not a potential outlier and from the upper end of the box to the largest observation that is not a potential outlier The potential outliers are shown separately

30 Comparing Distributions
Box Plots are useful for making graphical comparisons of two or more distributions

31 Sum kurve

32 Agenda Informationer Skalaer Deskriptiv statistik Dagens øvelser
Supplerende litteratur ”Pitch-dag” den 4/3 Skalaer Bedst På Nettet skema Deskriptiv statistik Teori Små øvelser i Excel Dagens øvelser Pitch Deskriptiv analyse


Download ppt "Agenda Informationer Skalaer Deskriptiv statistik Dagens øvelser"

Lignende præsentationer


Annoncer fra Google