Goal: Practice statistical concepts and sampling methods.

For the next questions assume the population consists of all infraction details (rows) in the dinesafe data.

  1. Assume you are interested in the average number of significant/crucial infractions per establishment (SEVERITY equal to “S - Significant” or “C - Crucial”). Find the actual value of the parameter.

  2. Estimate the parameter using SRS of size \(n=150\), where your sampling frame is all establishments in the dinesafe data.

The function replicate() can be used to run an expression multiple times and return its results in a matrix, one column for each replication. E.g. the following command replicates sampling 5 values from the letters vector 10 times. The result is a 5-by-10 matrix, where each row contans the sampled letters.

letters
replicate( 10, sample( letters, size = 5 ) ) 
  1. Use replicate() to repeat the establishment sampling process 200 times. Plot a histogram of the resulting statistics (avg # of S/C-type infractions per establishment) and overlay it with the parameter value (use geom_vline()). Do the samples look biased? Justify your answer.

  2. Collect a convenience sample by choosing all establishments whose address is on Kingston road, and caclulate the sample statistic. Briefly explain what can go wrong with approach?

  3. Consider the following two survey reports published by the City of Toronto.

The first few pages (p. 1 & 3) contain information on the survey. Read them and fill in the following table as well as you can. If you cannot find the relevant information for a field you can leave it blank. The last row is about your personal assessment of the survey.

Area Survey 1 Survey 2
Target population
Sampling frame
Sample size
Response rate
Possible issues
  1. [Extra] Read about two interesting cases of sampling surveys that went terribly wrong!
LS0tDQp0aXRsZTogIlNUQUE1NyAtIFdvcmtTaGVldCA5Ig0KYXV0aG9yOiAnTmFtZTogICAgLCBJRCM6ICAgJw0Kb3V0cHV0OiBodG1sX25vdGVib29rDQotLS0NCg0KKioqIA0KDQoqKkdvYWwqKjogUHJhY3RpY2Ugc3RhdGlzdGljYWwgY29uY2VwdHMgYW5kIHNhbXBsaW5nIG1ldGhvZHMuDQoNCmBgYHtyLCBpbmNsdWRlPUZBTFNFfQ0KbGlicmFyeSh0aWR5dmVyc2UpDQpkaW5lc2FmZSA9IHJlYWRfY3N2KCIuLi9kYXRhL2RpbmVzYWZlLmNzdiIpDQpgYGANCg0KRm9yIHRoZSBuZXh0IHF1ZXN0aW9ucyBhc3N1bWUgdGhlICpwb3B1bGF0aW9uKiBjb25zaXN0cyBvZiBhbGwgaW5mcmFjdGlvbiBkZXRhaWxzIChyb3dzKSBpbiB0aGUgYGRpbmVzYWZlYCBkYXRhLg0KDQoxLiBBc3N1bWUgeW91IGFyZSBpbnRlcmVzdGVkIGluIHRoZSBhdmVyYWdlIG51bWJlciBvZiBzaWduaWZpY2FudC9jcnVjaWFsIGluZnJhY3Rpb25zIHBlciBlc3RhYmxpc2htZW50IChTRVZFUklUWSBlcXVhbCB0byAiUyAtIFNpZ25pZmljYW50IiBvciAiQyAtIENydWNpYWwiKS4gRmluZCB0aGUgYWN0dWFsIHZhbHVlIG9mIHRoZSBwYXJhbWV0ZXIuICANCg0KMi4gRXN0aW1hdGUgdGhlIHBhcmFtZXRlciB1c2luZyBTUlMgb2Ygc2l6ZSAkbj0xNTAkLCB3aGVyZSB5b3VyIHNhbXBsaW5nIGZyYW1lIGlzIGFsbCAqZXN0YWJsaXNobWVudHMqIGluIHRoZSBkaW5lc2FmZSBkYXRhLg0KDQpUaGUgZnVuY3Rpb24gYHJlcGxpY2F0ZSgpYCBjYW4gYmUgdXNlZCB0byBydW4gYW4gZXhwcmVzc2lvbiBtdWx0aXBsZSB0aW1lcyBhbmQgcmV0dXJuIGl0cyByZXN1bHRzIGluIGEgbWF0cml4LCBvbmUgY29sdW1uIGZvciBlYWNoIHJlcGxpY2F0aW9uLiBFLmcuIHRoZSBmb2xsb3dpbmcgY29tbWFuZCByZXBsaWNhdGVzIHNhbXBsaW5nIDUgdmFsdWVzIGZyb20gdGhlIGBsZXR0ZXJzYCB2ZWN0b3IgMTAgdGltZXMuIFRoZSByZXN1bHQgaXMgYSA1LWJ5LTEwIG1hdHJpeCwgd2hlcmUgZWFjaCByb3cgY29udGFucyB0aGUgc2FtcGxlZCBsZXR0ZXJzLg0KDQpgYGB7cn0NCmxldHRlcnMNCnJlcGxpY2F0ZSggMTAsIHNhbXBsZSggbGV0dGVycywgc2l6ZSA9IDUgKSApIA0KYGBgDQoNCjMuIFVzZSBgcmVwbGljYXRlKClgIHRvIHJlcGVhdCB0aGUgZXN0YWJsaXNobWVudCBzYW1wbGluZyBwcm9jZXNzIDIwMCB0aW1lcy4gUGxvdCBhIGhpc3RvZ3JhbSBvZiB0aGUgcmVzdWx0aW5nIHN0YXRpc3RpY3MgKGF2ZyAjIG9mIFMvQy10eXBlIGluZnJhY3Rpb25zIHBlciBlc3RhYmxpc2htZW50KSBhbmQgb3ZlcmxheSBpdCB3aXRoIHRoZSBwYXJhbWV0ZXIgdmFsdWUgKHVzZSBgZ2VvbV92bGluZSgpYCkuIERvIHRoZSBzYW1wbGVzIGxvb2sgYmlhc2VkPyBKdXN0aWZ5IHlvdXIgYW5zd2VyLg0KDQo0LiBDb2xsZWN0IGEgKmNvbnZlbmllbmNlKiBzYW1wbGUgYnkgY2hvb3NpbmcgYWxsIGVzdGFibGlzaG1lbnRzIHdob3NlIGFkZHJlc3MgaXMgb24gS2luZ3N0b24gcm9hZCwgYW5kIGNhY2x1bGF0ZSB0aGUgc2FtcGxlIHN0YXRpc3RpYy4gQnJpZWZseSBleHBsYWluIHdoYXQgY2FuIGdvIHdyb25nIHdpdGggYXBwcm9hY2g/ICANCg0KDQo1LiBDb25zaWRlciB0aGUgZm9sbG93aW5nIHR3byBzdXJ2ZXkgcmVwb3J0cyBwdWJsaXNoZWQgYnkgdGhlIENpdHkgb2YgVG9yb250by4NCiogW1N1cnZleSAxOiBUb3JvbnRvIEVtcGxveW1lbnRdKGh0dHBzOi8vd3d3LnRvcm9udG8uY2Evd3AtY29udGVudC91cGxvYWRzLzIwMTgvMDMvOTViOC1Ub3JvbnRvLUVtcGxveW1lbnQtU3VydmV5LTIwMTctQnVsbGV0aW4ucGRmKQ0KKiBbU3VydmV5IDI6IExpdmluZyBpbiB0aGUgQ2l0eV0oaHR0cHM6Ly93d3cudG9yb250by5jYS9sZWdkb2NzL21taXMvMjAxOC9wZy9iZ3JkL2JhY2tncm91bmRmaWxlLTExMDUyMC5wZGYpDQoNClRoZSBmaXJzdCBmZXcgcGFnZXMgKHAuIDEgJiAzKSBjb250YWluIGluZm9ybWF0aW9uIG9uIHRoZSBzdXJ2ZXkuIFJlYWQgdGhlbSBhbmQgZmlsbCBpbiB0aGUgZm9sbG93aW5nIHRhYmxlIGFzIHdlbGwgYXMgeW91IGNhbi4gSWYgeW91IGNhbm5vdCBmaW5kIHRoZSByZWxldmFudCBpbmZvcm1hdGlvbiBmb3IgYSBmaWVsZCB5b3UgY2FuIGxlYXZlIGl0IGJsYW5rLiBUaGUgbGFzdCByb3cgaXMgYWJvdXQgeW91ciBwZXJzb25hbCBhc3Nlc3NtZW50IG9mIHRoZSBzdXJ2ZXkuICANCg0KfCBBcmVhIHwgU3VydmV5IDEgfCBTdXJ2ZXkgMiB8DQp8IC0tLSB8IC0tLSB8IC0tLSB8IA0KVGFyZ2V0IHBvcHVsYXRpb24gfCB8ICB8DQpTYW1wbGluZyBmcmFtZSB8IHwgfCANClNhbXBsZSBzaXplIHwgfCB8IA0KUmVzcG9uc2UgcmF0ZSB8IHwgfCANClBvc3NpYmxlIGlzc3VlcyB8IHwgfCANCg0KDQo2LiBbRXh0cmFdIFJlYWQgYWJvdXQgdHdvIGludGVyZXN0aW5nIGNhc2VzIG9mIHNhbXBsaW5nIHN1cnZleXMgdGhhdCB3ZW50IHRlcnJpYmx5IHdyb25nISAgDQoqIFtDYXNlIDFdKGh0dHBzOi8vd3d3Lm1hdGgudXBlbm4uZWR1L35kZXR1cmNrL20xNzAvd2s0L2xlY3R1cmUvY2FzZTEuaHRtbCkgIA0KKiBbQ2FzZSAyXShodHRwczovL3d3dy5tYXRoLnVwZW5uLmVkdS9+ZGV0dXJjay9tMTcwL3drNC9sZWN0dXJlL2Nhc2UyLmh0bWwpICA=