第I部¶

第4章母集団と標本¶

options(repr.plot.width = 4, repr.plot.height = 4)

curve(dnorm(x,mean=0,sd=1),from=-4,to=4)

Warning message:
: Removed 68 rows containing missing values (geom_area).

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

[1] 48.97004
[1] 51.73457
[1] 50.53473

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

.
   0    1 
1132 8868

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

library(ggplot2)

ggplot(data.frame(x = c(-4, 4)), aes(x)) + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = 1))

Warning message:
: Removed 68 rows containing missing values (geom_area).

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

[1] 48.97004
[1] 51.73457
[1] 50.53473

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

.
   0    1 
1132 8868

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

last_plot() +  
    stat_function(fun = dnorm, args = list(mean = 1, sd = 1), colour = "red") + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = 2), colour = "green")

Warning message:
: Removed 68 rows containing missing values (geom_area).

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

[1] 48.97004
[1] 51.73457
[1] 50.53473

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

.
   0    1 
1132 8868

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

ggplot(data.frame(x = c(-3, 3)), aes(x)) + 
    stat_function(fun=function(x){y <- dnorm(x); y[x < -1|  x > 1] <- NA; return(y);} , geom="area", alpha = 0.5) + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = 1)) + 
    scale_x_continuous(breaks = c(-3:3, 1))

Warning message:
: Removed 68 rows containing missing values (geom_area).

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

[1] 48.97004
[1] 51.73457
[1] 50.53473

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

.
   0    1 
1132 8868

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

正規分布の確率密度関数

$$f(x) = \frac{1}{\sqrt{2\pi\sigma}}\exp\left(\frac{-(x - \mu)^2}{2\sigma^2}\right)$$

4.4.7¶

小標本の場合

samples = rnorm(n = 5, mean = 50, sd = 10)
ggplot(data.frame(samples), aes(x = samples)) + 
    geom_histogram()

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

[1] 48.97004
[1] 51.73457
[1] 50.53473

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

.
   0    1 
1132 8868

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

大標本の場合

samples.large <- rnorm(n = 10000, mean = 50, sd = 10)
ggplot(data.frame(samples.large), aes(x = samples.large)) + 
    geom_histogram()

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

[1] 48.97004
[1] 51.73457
[1] 50.53473

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

.
   0    1 
1132 8868

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

4.5 標本分布¶

4.5.3¶

for (i in 1:3) {
    x <- rnorm(n = 10, mean = 50, sd = 10)
    print(mean(x))
}

[1] 48.97004
[1] 51.73457
[1] 50.53473

4.5.4¶

sample_means <- numeric(length = 10000)
for(i in 1:10000){
    x <- rnorm(n = 10, mean = 50, sd = 10)
    sample_means[i] <- mean(x)
}

ggplot(data.frame(sample_means), aes(x = sample_means)) + 
    geom_histogram()

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

.
   0    1 
1132 8868

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

誤差絶対値5以下

library(pipeR)

abs(sample_means - 50) %>>% 
    sapply(function(x){ifelse(x <= 5, 1, 0)}) %>>%
    table()

.
   0    1 
1132 8868

$N(\mu, \sigma^2)$ の母集団からn標本を抽出したとき，標本平均の標本分布は $N(\mu, \frac{\sigma^2}{n})$ となる．
つまり，$N(50, 10^2)$ から n = 10 抽出すると， $N(50, 10^2 / 10) = N(50, 10)$ となる

mean(sample_means)

var(sample_means)

ggplot(data.frame(sample_means), aes(x = sample_means)) + 
    geom_histogram(aes(y = ..density..), col = "gray", alpha = 0.8) + 
    stat_function(fun = dnorm, args = list(mean = 50, sd = sqrt(10)))

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

4.5.6 標準誤差¶

標準誤差: 推定量の標本分布の標準偏差
$N(\mu, \sigma^2)$ の母集団からn標本を抽出したとき，標本平均の標本分布は $N(\mu, \frac{\sigma^2}{n})$ となるので，標準誤差は $\frac{\sigma}{\sqrt{n}}$ となる．

sample_means <- numeric(length = 10000)
for(i in 1:10000){
    x <- rnorm(n = 100, mean = 50, sd = 10)
    sample_means[i] <- mean(x)
}

var(sample_means)

ggplot(data.frame(sample_means), aes(x = sample_means)) + 
    geom_histogram()

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

4.6¶

母分散の推定量は不偏分散のほうが良い

varps <- numeric(length = 10^4)
vars <- numeric(length = 10^4)
for (i in 1:10^4) {
    x <- rnorm(n = 10, mean = 50, sd = 10)
    varps[i] <- mean((x - mean(x))^2)
    vars[i] <- var(x)
}

標本分散

mean(varps)

不偏分散

mean(vars)

不偏分散のほうがばらつきが大きい

sd(varps)

sd(vars)

library(tidyr)

library(Cairo)

options(repr.plot.width = 8, repr.plot.height = 4)

lab <- as_labeller(c(`varps` = "標本分散", `vars` = "不偏分散"))
data.frame(varps, vars) %>>% 
    gather(key, val) %>>% 
    ggplot(aes(val)) + 
        geom_histogram(breaks = seq(0, 500, 10)) + 
        ylab("Frequency") +
        theme(strip.text.x = element_text(family = "IPAexGothic"), axis.title.x = element_blank()) -> gp
Cairo(type = "raster")
gp + facet_wrap(~key, labeller = lab)
dev.off()

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

library(dplyr)

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union

分散の推定値が200以上

data.frame(varps, vars) %>>% 
    mutate_each(funs(whatever = ifelse(. >= 200, 1, 0))) %>>% 
    rename(標本分散=varps, 不偏分散=vars) %>>%
    gather(key, val) %>>% 
    group_by(key, val) %>>%
    tally %>>%
    spread(val, n)

不偏分散の平方根は母標準偏差の不偏推定量ではない

sqrt(vars) %>>% mean

4.6.2 中央値の標本分布¶

means <- numeric(length = 10^4)
medians <- numeric(length = 10^4)
for (i in 1:10^4) {
    x <- rnorm(n = 10, mean = 50, sd = 10)
    means[i] <- mean(x)
    medians[i] <- median(x)
}

平均

mean(means)

mean(medians)

標準誤差

sd(means)

sd(medians)

lab <- as_labeller(c(`means` = "標本平均", `medians` = "標本中央値"))
data.frame(means, medians) %>>% 
    gather(key, val) %>>% 
    ggplot(aes(val)) + 
        geom_histogram() + 
        ylab("Frequency") +
        theme(strip.text.x = element_text(family = "IPAexGothic"), axis.title.x = element_blank()) -> gp
Cairo(type = "raster")
gp + facet_wrap(~key, labeller = lab)
dev.off()

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

練習問題¶

（1）¶

sample_means <- numeric(length = 5000)
for(i in 1:5000) {
    x <- rnorm(n = 20, mean = 50, sd = 10)
    sample_means[i] <- mean(x)
}

options(repr.plot.width = 4, repr.plot.height = 4)

ggplot(data.frame(sample_means), aes(x = sample_means)) + 
    geom_histogram(aes(y = ..density..), col = "gray", alpha = 0.8) + 
    stat_function(fun = dnorm, args = list(mean = 50, sd = sqrt(10^2/20)))

`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

（2）¶

library(RColorBrewer)

options(repr.plot.width = 4, repr.plot.height = 8)

display.brewer.all()

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

options(repr.plot.width = 4, repr.plot.height = 4)

col = brewer.pal(n = 9, name = "YlGnBu")
ggplot(data.frame(x = c(-3, 3)), aes(x)) + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = sqrt(1/1)), col = col[1]) + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = sqrt(1/4)), col = col[3]) + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = sqrt(1/9)), col = col[5]) + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = sqrt(1/16)), col = col[7]) + 
    stat_function(fun = dnorm, args = list(mean = 0, sd = sqrt(1/25)), col = col[9])

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

devtools::session_info()

Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------
Session info -------------------------------------------------------------------
Packages -----------------------------------------------------------------------

 setting  value                       
 version  R version 3.2.3 (2015-12-10)
 system   x86_64, mingw32             
 ui       RTerm                       
 language en                          
 collate  Japanese_Japan.932          
 tz       Asia/Tokyo                  
 date     2016-05-16                  

 package      * version date       source                            
 assertthat     0.1     2013-12-06 CRAN (R 3.2.1)                    
 base64enc      0.1-3   2015-07-28 CRAN (R 3.2.2)                    
 Cairo        * 1.5-9   2015-09-26 CRAN (R 3.2.2)                    
 colorspace     1.2-6   2015-03-11 CRAN (R 3.2.1)                    
 DBI            0.3.1   2014-09-24 CRAN (R 3.2.1)                    
 devtools       1.10.0  2016-01-23 CRAN (R 3.2.3)                    
 digest         0.6.9   2016-01-08 CRAN (R 3.2.3)                    
 dplyr        * 0.4.3   2015-09-01 CRAN (R 3.2.2)                    
 evaluate       0.8     2015-09-18 CRAN (R 3.2.2)                    
 ggplot2      * 2.0.0   2015-12-18 CRAN (R 3.2.3)                    
 gtable         0.1.2   2012-12-05 CRAN (R 3.2.1)                    
 IRdisplay      0.3     2015-04-27 local                             
 IRkernel       0.6     2016-02-08 Github (IRkernel/IRkernel@40dc791)
 jsonlite       0.9.19  2015-11-28 CRAN (R 3.2.2)                    
 labeling       0.3     2014-08-23 CRAN (R 3.2.1)                    
 lazyeval       0.1.10  2015-01-02 CRAN (R 3.2.1)                    
 magrittr       1.5     2014-11-22 CRAN (R 3.2.1)                    
 memoise        1.0.0   2016-01-29 CRAN (R 3.2.3)                    
 munsell        0.4.2   2013-07-11 CRAN (R 3.2.1)                    
 pbdZMQ         0.2-1   2016-01-21 CRAN (R 3.2.3)                    
 pipeR        * 0.6.0.6 2015-07-08 CRAN (R 3.2.1)                    
 plyr           1.8.3   2015-06-12 CRAN (R 3.2.1)                    
 R6             2.1.2   2016-01-26 CRAN (R 3.2.3)                    
 RColorBrewer * 1.1-2   2014-12-07 CRAN (R 3.2.1)                    
 Rcpp           0.12.3  2016-01-10 CRAN (R 3.2.3)                    
 repr           0.4     2015-11-16 local                             
 scales         0.3.0   2015-08-25 CRAN (R 3.2.2)                    
 stringi        1.0-1   2015-10-22 CRAN (R 3.2.3)                    
 stringr        1.0.0   2015-04-30 CRAN (R 3.2.3)                    
 tidyr        * 0.4.1   2016-02-05 CRAN (R 3.2.3)                    
 uuid           0.1-2   2015-07-28 CRAN (R 3.2.2)

	key	0	1
1	標本分散	9810	190
2	不偏分散	9619	381