c:ma:2018:schedule
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
c:ma:2018:schedule [2018/11/27 02:53] – [Week13 (Nov. 27, 30)] hkimscil | c:ma:2018:schedule [2018/12/17 10:01] (current) – [Week15 (Dec. 11, 14)] hkimscil | ||
---|---|---|---|
Line 465: | Line 465: | ||
* name rows " | * name rows " | ||
* get means for each subject | * get means for each subject | ||
+ | * attach the above data to the matrix data and name it " | ||
* get standard deviation for each trial | * get standard deviation for each trial | ||
+ | * attach the above data to the matrix data, " | ||
+ | |||
< | < | ||
Line 478: | Line 481: | ||
* Use Cars93 data, get MPG.city mean by Origin. | * Use Cars93 data, get MPG.city mean by Origin. | ||
+ | __Using pnorm, qnorm__ | ||
+ | pnorm : get proportion out of normal distribution whose characteristics are mean and sd | ||
+ | < | ||
+ | * What is the value of the below? | ||
+ | < | ||
+ | * How would you get 68, 95, 99% from pnorm | ||
+ | * use ?pnorm and see the default option | ||
+ | |||
+ | * generate 10 random numbers with runif function | ||
+ | |||
+ | < | ||
+ | world.series <- data.frame(year)</ | ||
+ | * get 10 year samples out of world.series data with " | ||
+ | * how would you get the sample sample again latter? | ||
+ | |||
+ | < | ||
+ | * What would be the result from the above? | ||
+ | |||
+ | < | ||
+ | tbl = table(survey$Smoke, | ||
+ | tbl # the contingency table</ | ||
+ | |||
+ | < | ||
+ | </ | ||
+ | * read the above output and interpret | ||
+ | * what about the below one? | ||
+ | < | ||
+ | </ | ||
+ | |||
+ | see first [[: | ||
+ | see [[: | ||
+ | |||
+ | < | ||
+ | | ||
+ | | ||
+ | </ | ||
+ | * Can you say the types of cars are different by the Origins? | ||
+ | |||
+ | < | ||
+ | dur</ | ||
+ | * make the above data into z-score (zdur). | ||
+ | * get mean of the zdur | ||
+ | * get sd of the zdur | ||
+ | |||
+ | < | ||
+ | set.seed(1123) | ||
+ | x <- rnorm(50, mean=100, sd=15) | ||
+ | </ | ||
+ | * test x against population | ||
+ | * test x against population | ||
+ | * are they different from each other? | ||
+ | * what would you do if you want to see the different result from the second one? | ||
+ | |||
+ | < | ||
+ | |||
+ | > t.test(a, mu=60) | ||
+ | |||
+ | One Sample t-test | ||
+ | |||
+ | data: a | ||
+ | t = 2.3079, df = 9, p-value = 0.0464 | ||
+ | alternative hypothesis: true mean is not equal to 60 | ||
+ | 95 percent confidence interval: | ||
+ | | ||
+ | sample estimates: | ||
+ | mean of x | ||
+ | | ||
+ | </ | ||
+ | * find the t critical value with function qt. | ||
+ | * explain what happens in the next code | ||
+ | * read (or remind) what pnorm and qnorm do. | ||
+ | < | ||
+ | > m <- mean(x) | ||
+ | > n <- length(x) | ||
+ | > n | ||
+ | [1] 50 | ||
+ | > m | ||
+ | [1] 96.00386 | ||
+ | > s | ||
+ | [1] 17.38321 | ||
+ | > SE <- s / sqrt(n) | ||
+ | > SE | ||
+ | [1] 2.458358 | ||
+ | > E <- qt(.975, df=n-1)*SE | ||
+ | > E | ||
+ | [1] 4.940254 | ||
+ | > m + c(-E, E) | ||
+ | [1] 91.0636 100.9441 | ||
+ | > </ | ||
+ | |||
+ | |||
+ | * what's wrong with the below? | ||
+ | < | ||
+ | |||
+ | < | ||
+ | * using aggregate, get mean for each trnas. type. | ||
+ | * compare the difference of mileage between auto and manual cars. | ||
+ | * use t.test (two sample) | ||
+ | * "use var.equal=T" | ||
+ | |||
+ | < | ||
+ | b = c(185, 169, 173, 173, 188, 186, 175, 174, 179, 180) | ||
+ | </ | ||
+ | * stack them into data c | ||
+ | * convert colnames into score and trans | ||
+ | * t.test score by trans with var.equal option true. | ||
+ | * aov test | ||
+ | * see t.test t value, t = -0.9474 and F value, | ||
</ | </ | ||
Line 489: | Line 600: | ||
<WRAP half column> | <WRAP half column> | ||
===== Concepts and ideas ===== | ===== Concepts and ideas ===== | ||
+ | ANOVA | ||
+ | [[:r:oneway anova]] | ||
+ | [[:r:twoway anova]] | ||
+ | [[:r:linear regression]] | ||
+ | [[: | ||
+ | [[:partial and semipartial correlation]] | ||
+ | |||
+ | [[: | ||
+ | [[: | ||
+ | |||
+ | |||
+ | [[:factor analysis]] | ||
+ | |||
Linear Regression and ANOVA | Linear Regression and ANOVA | ||
http:// | http:// | ||
- | [[: | + | |
</ | </ | ||
<WRAP half column> | <WRAP half column> | ||
===== Assignment ===== | ===== Assignment ===== | ||
- | - 자신의 전공과 관심사에 맞는 아래의 테스트를 수행하기 위한 가설을 작성하시오. | ||
- | - T-test | ||
- | - F-test | ||
- | - factorial f-test | ||
- | - Simple regression | ||
- | - Multiple regression | ||
- | - 각 가설의 독립변인과 종속변인을 밝히고 이를 측정하는 방법에 대해서 논하시오. | ||
- | - 가설과 관련이 있는 논문을 찾아서 (적어도 하나 이상씩) 관련 논문이 밝힌 것을 설명하고 자신의 가설과의 연관성을 논하시오. | ||
- | - 각 가설에 필요한 데이터를 구한 후, 적절한 테스를 하시오 (r의 인풋과 아웃풋 필요). | ||
- | - 테스트 결과를 논하시오. | ||
</ | </ | ||
====== Week15 (Dec. 11, 14) ====== | ====== Week15 (Dec. 11, 14) ====== | ||
<WRAP half column> | <WRAP half column> | ||
- | Group Presentation | + | Final quiz |
+ | Part I (필기시험): | ||
+ | * [[: | ||
+ | * [[: | ||
+ | * [[:multiple regression]] | ||
+ | * [[: | ||
+ | * [[:factor analysis]] - 이론적인 이해와 관련된 부분 | ||
+ | * r 과 관련된 내용 중 통계에 대한 이해와 관련된 부분, 예를 들면 | ||
+ | * t-test, ANOVA, Factorial | ||
+ | * regression, multiple regression output에 대한 이해 등 | ||
+ | Part II (r 실기시험): | ||
+ | * [[: | ||
+ | * [[: | ||
+ | * [[: | ||
+ | * [[:r:input output]] | ||
+ | * [[:r:data structures]] | ||
+ | * [[:r:data transformations]] | ||
+ | * [[: | ||
+ | * [[: | ||
+ | * [[: | ||
+ | * [[: | ||
+ | * [[:r:linear regression]] | ||
+ | * [[: | ||
+ | * [[:partial and semipartial correlation]] | ||
+ | * [[: | ||
</ | </ | ||
<WRAP half column> | <WRAP half column> | ||
</ | </ | ||
- | <WRAP half column> | ||
====== Week16 (Dec. 18, 21) ====== | ====== Week16 (Dec. 18, 21) ====== | ||
- | Group Presentation | + | <WRAP half column> |
__**Final-term**__ | __**Final-term**__ | ||
</ | </ |
c/ma/2018/schedule.1543254809.txt.gz · Last modified: 2018/11/27 02:53 by hkimscil