Skip to content
This repository has been archived by the owner on Apr 22, 2022. It is now read-only.

ST-314-nilsstreedain/DA3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Analysis 3 - Probability and Distributions

Topics:

  • Discrete Random Variables, Probability Mass Functions, Expectations.

Grading:

  • Points are listed next to each question and should total 25 points overall.
  • Grading will be based on the content of the data analysis as well as the overall appearance of the document.
  • Late assignments will not be graded.

Instructions:

  • Clearly label and type answers to the questions on the proceeding pages in Word, Google Docs, or other word processing software.
  • Insert diagrams or plots as a picture in an appropriate location.
  • Math Formulas need to be typed with Math Type, LaTeX, or clearly using key board symbols such as +, -. *, /, sqrt() and ^
  • Submit assignment to the Gradescope link as a PDF. Indicate the pages to the individual questions and also verify the correct document has been uploaded. Failing to follow this direction may result in point deductions.
    • If you are unable to answer any questions before the submission deadline, please still match the question location to the correct page when submitting in Gradescope.

Allowances:

  • You may use any resources listed or posted on the Canvas page for the course.
  • You are encouraged to discuss the problems with other students, the instructor and TAs, however, all work must be your own words. Duplicate wording will be considered plagiarism.

Part 1 (6 points)

For each random variable, state whether the random variable should be modeled with a Binomial distribution or a Poisson distribution. Explain your reasoning. State the parameter values that describe the distribution and give the probability mass function.

Random Variable 1 (3 points)

A cabinet manufacturer tests the quality of its drawers by how easily the drawers slide open and shut. From prior studies, the manufacturer determined that 3% of drawers fail the easy slide test. A manufacturer samples 20 drawers from a batch. Assuming the chance of failure is independent between drawers, what type of distribution could be used to model the number of failed drawers from the sample of 20? Hint: the event of interest here is a drawer failing.

The random variable should be modeled with a binomial distribution because two
possible outcomes are being measured, successes and failures. The parameter n
(number of trials) is 20, and the parameter p (probability of failure) is 0.03.
PMF: P(X = x) = nCx(0.03^x)(0.97)^(20 - x)

Random Variable 2 (3 points)

The warranty for a particular model of car is 2 years, during which time there is no limit to the number of warranty claims per car. Historically, the average number of claims per car during the period is 0.65 claims. What type of distribution could be used to model the number of warranty claims per car?

This random variable would be modeled with a poisson distribution because it is
measuring a quantity of occurrences. The parameter ʎ (average quantity) is 0.65.
PMF: P(X = x) = (e^-0.65)(0.65^x)/(x!)

Part 2: (11 points)

Wheel of Fortune is a popular game show on Television. Contestants spin a wheel and try to guess a correct letter from a word puzzle. If they guess correctly, they earn the dollar amount from the wheel. If they spin “bankrupt” or “lose a turn” they get nothing and can’t play. To the right is an example of the wheel. Watch this video to see an example of someone spinning the wheel.

image

The outcome of a spin on the wheel is a discrete random variable. Consider X the dollar amount spun on the wheel, where Bankrupt and Lose a Turn = $0, and Free Play = $500. There are 24 wedges on the wheel.

image

The following table provides the probability mass function for X. Round values to 3 decimal places.

x $0 $300 $350 $400 $450 $500 $550 $600 $700 $800 $900 $5000
Count 2 5 1 2 1 3 1 3 1 2 2 1
P(x) 0.083 0.208 0.042 0.083 0.042 0.125 0.042 0.125 0.042 0.083 0.083 0.042
  1. (1 point) What is the most likely dollar amount?
$300
  1. (1 point) How likely is it to spin Lose a turn or Bankrupt?
0.083
  1. (3 points) What is the average dollar amount? Show work!
= [2(0) + 5(300) + 1(350) + 2(400) + 1(450) + 3(500) + 1(550) + 3(600) + 1(700) + 2(800) + 2(900) + 1(5000)]
  / [2 + 5 + 1 + 2 + 1 + 3 + 1 + 3 + 1 + 2 + 2 + 1]
= 16,050/24
= 668.75
  1. (3 points) Suppose a contestant spins the wheel three times, how likely is it they spin $0 each time? Show work!
p(x = 3) = p3 = (2/24)3 = (0.083)3 = 0.00057870̅3̅7̅ ≈ 5.788x10-4
  1. (3 points) Suppose a contestant spins the wheel three times, how likely is it they spin $0 at least one time? Show work!
p(x ≤ 1) = 1 - p’3 = 1 - (22/24)3 = 1 - (0.917)3 ≈ 0.23

Part 3: (6 points)

The PMF in Part 2 is based on probability theory. Do these probabilities stand up when a contestant actually spins the wheel? Go back to the Data Analysis #3 instructions page on Canvas, download the R script titled: Wheel_of_Fortune_Spin_Script.R , open the file it will automatically open in R. You need R software on the computer to open the script window. Follow the instructions in the code then answer the following:

  1. (1 point) What value did you spin?
300
  1. (1 point) What is the average of the 1000 simulated spins? How different is this from part 2c?
634.5
  1. (1 point) Paste the probability mass function and the plot of the probability mass function from R.
x $0 $300 $350 $400 $450 $500 $550 $600 $700 $800 $900 $5000
P(x) 0.090 0.204 0.032 0.078 0.049 0.125 0.045 0.102 0.042 0.099 0.102 0.032

image

  1. (1 point) How different are the simulated probabilities to the theoretical probabilities in part 2?
They are extremely close, some of which being exactly the same down to the thousandth.
  1. (1 point) Based on the plot is the most likely outcome the same as it is in part 2a?
The most likely outcome is 300 in both the theoretical and experimental.
  1. (1 point) In general, what action will make the simulated values more like the theoretical ones?
An increased sample size would reduce error, making the values even closer to the theoretical.

Gradescope Page Matching (2 points)

When you upload your PDF file to Gradescope, you will need to match each question on this assignment to the correct pages. Video instructions for doing this are available in the Start Here module on Canvas on the page “Submitting Assignments in Gradescope”. Failure to follow these instructions will result in a 2-point deduction on your assignment grade. Match this page to outline item “Gradescope Page Matching”.

Releases

No releases published

Packages

No packages published

Languages