Differences

This shows you the differences between two versions of the page.

--- c:ms:2020:schedule [2020/06/26 15:22] – [Week16 (June 22, 25)] hkimscil
+++ c:ms:2020:schedule [2020/06/27 14:00] – [Week15 (June 22, 25)] hkimscil
@@ Line 73: / Line 73: @@
 <WRAP half column>
 ===== Concepts and ideas =====
 <WRAP box>
 Some [[:b:r cookbook:basics|basics]]
@@ Line 487: / Line 485: @@
 <WRAP half column>
 ===== Concepts and ideas =====
-Do the following
-<code>S1 <- c(89, 85, 85, 86, 88, 89, 86, 82, 96, 85, 93, 91,
-, 87, 94, 77, 87, 98, 85, 89, 95, 85, 93, 93,
-, 71, 97, 93, 75, 68, 98, 95, 79, 94, 98, 95)
-S2 <- c(60, 98, 94, 95, 99, 97, 100, 73, 93, 91, 98,
-, 66, 83, 77, 97, 91, 93, 71, 91, 95, 100,
-, 96, 91, 76, 100, 97, 99, 95, 97, 77, 94,
-, 88, 100, 94, 93, 86)
-S3 <- c(95, 86, 90, 90, 75, 83, 96, 85, 83, 84, 81, 98,
-, 94, 84, 89, 93, 99, 91, 77, 95, 90, 91, 87,
-, 76, 99, 99, 97, 97, 97, 77, 93, 96, 90, 87,
-, 88)
-S4 <- c(67, 93, 63, 83, 87, 97, 96, 92, 93, 96, 87, 90,
-, 90, 82, 91, 85, 93, 83, 90, 87, 99, 94, 88,
-, 72, 81, 93, 93, 94, 97, 89, 96, 95, 82, 97)
-scores <- list(S1=S1,S2=S2,S3=S3,S4=S4)</code>
-  * find means for each element in "scores" in a list format
-  * find standard deviation for each element in "scores" in a data frame format
-  * find variance  for each element in "scores" in a data frame format without using "var" function
-<code>longdata<- c(-1.850152, -1.406571, -1.0104817, -3.7170704,
-           -0.2804896, 0.9496313, 1.346517, -0.1580926, 1.6272786,
-           -2.4483321, -0.5407272, -1.708678, -0.3480616, -0.2757667,
-           -1.2177024)</code>
-  * make "longdata" to a matrix whose size is  3 by 5
-  * name columns "trial1, trial2, . . . . trial5"
-  * name rows "subject1, subject2, subject3"
-  * get means for each subject
-  * attach the above data to the matrix data and name it "longtemp."
-  * get standard deviation for each trial
-  * attach the above data to the matrix data, "longtemp."
-<code>suburbs <- read.csv("http://commres.net/wiki/_export/code/r/data_transformations?codeblock=15", head=T, sep="	")</code>
-  * get subrubs data  as the above
-  * get population means by each state (listed in the data, suburbs)
-    * use aggregate and refer to the below e.g.
-<code>attach(Cars93)
-aggregate(MPG.city ~ Origin, Cars93, mean)</code>
-  * get population sum by each county with  tapply function.
-  * tapply(number, byfactor, function)
-  * how many counties are there?
-  * Use Cars93 data, get MPG.city mean by Origin.
-__Using pnorm, qnorm__
-pnorm : get proportion out of normal distribution whose characteristics are mean and sd
-<code>pnorm(84, mean=72, sd=15.2, lower.tail=FALSE)</code>
-  * What is the value of  the below?
-<code>pnorm(1)</code>
-  * How would you get 68, 95, 99% from pnorm
-      * use ?pnorm and see the default option
-  * generate 10 random numbers with runif function
-<code>year <- c(1900:2016)     # years in vector year
-world.series <- data.frame(year)</code>
-  * get 10 year samples out of world.series data with "sample" command
-  * how would you get the sample sample again latter?
-<code>pnorm(110, mean=100, sd=10)</code>
-  * What would be the result from the above?
-<code>library(MASS)       # load the MASS package
-tbl = table(survey$Smoke, survey$Exer)
-tbl                 # the contingency table</code>
-<code>summary(tbl)
-</code>
-  * read the above output and interpret
-  * what about the below one?
-<code>chisq.test(tbl)
-</code>
-see first [[:chi-square test]]
-see [[:b:r cookbook:chi-square test]] in r document space for more
-<code> library(MASS)
- cardata <- data.frame(Cars93$Origin, Cars93$Type)
- cardata
-</code>
-  * Can you say the types of cars are different by the Origins?
-<code>dur <- faithful$eruptions
-dur</code>
-  * make the above data into z-score (zdur).
-  * get mean of the zdur
-  * get sd of the zdur
-<code>
-set.seed(1123)
-x <- rnorm(50, mean=100, sd=15)
-</code>
-  * test x against population  mean 95.
-  * test x against population  mean 99.
-  * are they different from each other?
-  * what would you do if you want to see the different result from the  second one?
-<code>a = c(65, 78, 88, 55, 48, 95, 66, 57, 79, 81)
-> t.test(a, mu=60)
-	One Sample t-test
-data:  a
-t = 2.3079, df = 9, p-value = 0.0464
-alternative hypothesis: true mean is not equal to 60
-percent confidence interval:
-.22187 82.17813
-sample estimates:
-mean of x
-.2
-</code>
-  * find the t critical value with function qt.
-  * explain what happens in the next code
-  * read (or remind) what pnorm and qnorm do.
-<code>> s <- sd(x)
-> m <- mean(x)
-> n <- length(x)
-> n
-[1] 50
-> m
-[1] 96.00386
-> s
-[1] 17.38321
-> SE <- s / sqrt(n)
-> SE
-[1] 2.458358
-> E <- qt(.975, df=n-1)*SE
-> E
-[1] 4.940254
-> m + c(-E, E)
-[1]  91.0636 100.9441
-> </code>
-  * what's wrong with the below?
-<code>t.test(x)</code>
-<code>> mtcars</code>
-  * using aggregate, get mean for each trnas. type.
-  * compare the difference of mileage between auto and manual cars.
-    * use t.test (two sample)
-    * "use var.equal=T" option
-<code>a = c(175, 168, 168, 190, 156, 181, 182, 175, 174, 179)
-b = c(185, 169, 173, 173, 188, 186, 175, 174, 179, 180)
-</code>
-  * stack them into data c
-  * convert colnames into score and trans
-  * t.test score by trans with var.equal option true.
-  * aov test
-  * see  t.test t value, t = -0.9474 and F value,  F = ?
 </WRAP>
@@ Line 752: / Line 596: @@
 </WRAP>
 <WRAP half column>
+===== Assignment =====
+그룹과제 내용수정 ----
+그룹과제 내용을 아래와 같이 수정합니다.
+그룹 assignment: independent t-test, repeated measures t-test, ANOVA, Factorial ANOVA, repeated measures ANOVA, regression, multiple regression 와 관련된 가설을 만들고, <del>구글독스를 이용하여</del> 설문문항을 작성하시오. 이를 이용하여 <color #ed1c24>**일부**</color> 데이터를 수집한 후 검증을 하시오. 검증 결과를 최대한 자세하게 논하시오. 과제는 기본적으로 아래를 수행하여야 합니다.
+  * 가설은 일반상식, 알고있는 사회과학 이론 등에 기반을 해서 만듭니다
+    * 가설작성에는 가설에 대한 설명이 포함되어야 합니다. 즉, 가설만 만들어서는 부족합니다.
+    * <color #ed1c24>가설은 위에서 언급한 7가지 테스트를 하기 위한 가설을 모두 포함합니다.</color>
+    * <color #ed1c24>가설에 대응하는 서베이 문항 또한 모두 만듭니다 (MS word 파일에 작성)</color>
+  * <del>구글서베이를 이용하여</del> 서베이 문항을 만들 때 아래를 포함하여야 합니다.
+    * <del>응답자 학번, 이름, 이메일 (참여 평가를 위해서: 참여 + (불)성실응답)</del>
+    * 각 가설을 (7개, indepedent t-test, repeated measures t-test, ANOVA, etc.)  검증할 수 있는 문항들
+  * R을 이용하여 가설을 검증합니다.
+    * 데이터 수집은 각 그룹이 자의적으로 만듭니다  (즉, 서베이를 할 필요는 없습니다).
+    * 데이터 수집과 분석은 다음의 가설만을 포함하면 됩니다.
+      * Independent t-test
+      * ANOVA
+      * Factorial ANOVA
+      * Multiple Regression (두 가지 이상의 IV 포함한)
+  * 검증 결과를 의미있게 논합니다.
+----
+과제제출
+  * 가설 소개와 설명
+    * independent t-test
+    * repeated measures t-test
+    * ANOVA
+    * Factorial ANOVA
+    * repeated measures ANOVA
+    * regression
+    * multiple regression
+  * 가설에 따른 설문 문항과 이 때의 IV와 DV 파악 및 측정 수준에 대한 설명
+    * independent t-test
+    * repeated measures t-test
+    * ANOVA
+    * Factorial ANOVA
+    * repeated measures ANOVA
+    * regression
+    * multiple regression
+  * 각 가설검증 분석결과 및 논의
+    * independent t-test
+    * repeated measures t-test
+    * ANOVA
+    * Factorial ANOVA
+    * repeated measures ANOVA
+    * regression
+    * multiple regression
+첨부파일 제출
+  * 서베이 참여자 명단 (survey.participants.group.01.xlsx 와 같은 이름의 excel 파일 형식으로 따로 제출)
+    * 강사가 우선 클래스메이트 명단을 배포할 것입니다 (excel 파일로).
+    * 스프레드시트에 참여한 사람의 성과이름은 붙여서 적습니다.
+    * 자신의 조에 속한 조원들도 자기 조 서베이에 참여합니다.
+      * 완전참여 = 1
+      * 비참여 = 0
+      * 불완전참여 = 2
 </WRAP>
 ====== Week16 (June 22, 25) ======
@@ Line 757: / Line 657: @@
 __**Final-term**__
   * July 02. 목요일 Quiz 봅니다.
-  * 퀴즈 시간은 12:00 - 9:00 입니다. 퀴즈 시간은 한정되어 있습니다. 연장이나 늦게 제출되지 않도록 할 예정입니다. 60분이 제한시간이면 이 시간이 지나면 자동제출됩니다.
+  * 퀴즈 시간은 12:00 - 5:00 입니다. 퀴즈 시간은 한정되어 있습니다. 연장이나 늦게 제출되지 않도록 할 예정입니다. 60분이 제한시간이면 이 시간이 지나면 자동제출됩니다.
   * 범위는 다음과 같습니다.
     * t-test