Skip to content

Articles and resources for providing real world examples for discussion in undergraduate statistics courses.

Notifications You must be signed in to change notification settings

lizwillow/Real-World-Statistics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 

Repository files navigation

Real-World-Statistics

Real news articles and other resources to motivate discussion in undergraduate statistics courses.

New news cite: https://realworlddatascience.net/

STAT 200: Introductory Statistics

Visual Representations

Estimating Proportions and Simple Random Sampling

Confidence Intervals and Paired Samples

Hypothesis Testing

Standard Normal Distribution and Z Scores

Outliers

MATH/STAT 318: Elementary Probability

Relative Frequency

  • 2019 - How does Mike Soroka do it? The last plot shows the relative frequency of the different pitches for each count. The experiment is Mike Soroka throwing a pitch and the relative frequency of a specific pitch is the number of those pitches that he threw divided by the total number of pitches. You can imagine that if he threw an infinite number of pitches we could get the exact probabilities.
  • 2018 - When does the hottest day of the year usually occur? In the 2nd through 5th plot, the relative frequency is plotted in space. In figure 2, for example, we see the number of years where the hottest day was in June divided by the total number of years. The experiment is seeing when the hottest day is each year.
  • 2015 - The most distinctive words in online dating profiles, by state. They compare the relative frequency of words in dating profiles in the different states. The experiment would be each word and the relative frequency of a specific word would be the number of times that word was used divided by the total number of words.

Combinatorics

  • 2017 - Combinations? Permutations? Those words don't mean what you think they mean. "Suppose you are performing clickstream analysis for a company and there is a large number of ways in which a customer can navigate through the website. Assuming your data set is large, and there are many visits to the website, you're likely to apply machine learning (ML) in your investigations. A crucial point to consider very early on is whether you are interested in customers taking specific routes through the site (permutations), or just visiting groups of pages together (combinations) because that can significantly impact the choice of ML algorithm you might use."

Prosecuter's Fallacy, Ecological Fallacy

Bayes' Theorem

Hypergeometric Distribution

  • 2020 - Using the Hypergeometric distribution in the Magic game
  • 2009 - One place the Hypergeometric distribution is used is in representative drug sampling. Basically, the police seize a lot of packages of drugs, but testing each package to see if it contains controlled substances is time consuming and expensive. Instead they randomly sample some and look at the probability of all the packages in the sample containing a controlled substance given certain values of the parameter r. If you are extremely dedicated, there is a section in this document which described the method in detail.

Binomial Distribution

Moment Generating Functions

Poisson Distribution and Poisson Process

Product rule

About

Articles and resources for providing real world examples for discussion in undergraduate statistics courses.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published