> Thank you. With the help of your summary I have been able to work through the exercises at the end of the chapter. Could you please work through a couple of them so I may check my solutions?

Of course! I'd be happy to work through a couple of the exercises from Chapter 1. Let's start with Exercise 3 and then move on to Exercise 8.

Exercise 3: Probability calculation for genetics: Suppose that in each individual of a large population there is a pair of genes, each of which can be either x or X, that controls eye color: those with xx have blue eyes, while heterozygotes (those with Xx or xX) and those with XX have brown eyes. The proportion of blue-eyed individuals is p^2 and of heterozygotes is 2p(1-p), where 0 < p < 1. Each parent transmits one of its own genes to the child; if a parent is a heterozygote, the probability that it transmits the gene of type X is 1/2. Assuming random mating, show that among brown-eyed children of brown-eyed parents, the expected proportion of heterozygotes is 2p/(1+2p).

Solution: The genotype proportions are p^2 (xx, blue-eyed), 2p(1-p) (Xx, brown-eyed), and (1-p)^2 (XX, brown-eyed). A randomly chosen brown-eyed individual is therefore a heterozygote with probability 2p(1-p) / [2p(1-p) + (1-p)^2] = 2p/(1+p), and XX with probability (1-p)/(1+p). Note that this simple conditioning is not yet the answer: the question asks about brown-eyed children of brown-eyed parents, so we must also condition on the parents' genotypes.

Consider two brown-eyed parents mating at random, and let H denote "the child is a heterozygote" and B denote "the child is brown-eyed". The three possible parental pairings give:

- Both parents Xx, with probability [2p/(1+p)]^2: the child is Xx with probability 1/2 and brown-eyed with probability 3/4 (only xx, probability 1/4, is blue-eyed).
- One parent Xx and one XX, with probability 2 * [2p/(1+p)] * [(1-p)/(1+p)]: the child is Xx with probability 1/2 and brown-eyed with probability 1.
- Both parents XX, with probability [(1-p)/(1+p)]^2: the child is XX, hence brown-eyed, with probability 1.

Therefore

Pr(H and B) = (1/2) * [2p/(1+p)]^2 + (1/2) * 2 * [2p(1-p)/(1+p)^2] = [2p^2 + 2p(1-p)] / (1+p)^2 = 2p/(1+p)^2

Pr(B) = (3/4) * [2p/(1+p)]^2 + 2 * [2p(1-p)/(1+p)^2] + [(1-p)/(1+p)]^2 = [3p^2 + 4p(1-p) + (1-p)^2] / (1+p)^2 = (1+2p)/(1+p)^2

Dividing, Pr(H|B) = 2p/(1+2p). Thus, among brown-eyed children of brown-eyed parents, the expected proportion of heterozygotes is 2p/(1+2p).

Exercise 8: Subjective probability: Discuss the following statement. "The probability of event E is considered 'subjective' if two rational persons A and B can assign unequal probabilities to E, P_A(E) and P_B(E). These probabilities can also be interpreted as 'conditional': P_A(E) = P(E|I_A) and P_B(E) = P(E|I_B), where I_A and I_B represent the knowledge available to persons A and B, respectively."
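Before turning to Exercise 8, the 2p/(1+2p) result for Exercise 3 can be double-checked by direct simulation of random mating. This is an illustrative sketch, not part of the book's solution; the function name and the choice p = 0.3 are mine:

```python
import random

def simulate_heterozygote_fraction(p, n_trials=200_000, seed=1):
    """Sample random-mating couples, keep those where both parents and the
    child are brown-eyed, and return the fraction of such children who are
    heterozygotes. Should approach 2p/(1+2p)."""
    rng = random.Random(seed)

    def genotype():
        # each allele is x with probability p, X with probability 1 - p
        a = 'x' if rng.random() < p else 'X'
        b = 'x' if rng.random() < p else 'X'
        return a + b

    brown = lambda g: 'X' in g  # xx is blue; Xx, xX, XX are brown
    het_count = brown_count = 0
    for _ in range(n_trials):
        mother, father = genotype(), genotype()
        if not (brown(mother) and brown(father)):
            continue
        # each parent transmits one of its two alleles uniformly at random
        child = rng.choice(mother) + rng.choice(father)
        if brown(child):
            brown_count += 1
            if child in ('xX', 'Xx'):
                het_count += 1
    return het_count / brown_count

p = 0.3
print(simulate_heterozygote_fraction(p))  # close to the theoretical value
print(2 * p / (1 + 2 * p))                # 2p/(1+2p) = 0.375
```

With 200,000 trials the simulated fraction agrees with 2p/(1+2p) to within a few thousandths.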
Apply this idea to the following examples: (a) The probability that a '6' appears when a fair die is rolled, where A observes the outcome of the die roll and B does not. (b) The probability that Brazil wins the next World Cup, where A is ignorant of soccer and B is a knowledgeable sports fan.

Solution:

(a) Before the die is rolled, both A and B would assign a probability of 1/6 to the event "a '6' appears", based on the fairness of the die. After the die is rolled and A observes the outcome, A's probability for the event changes to either 0 (if the outcome is not '6') or 1 (if it is), while B's probability remains 1/6, since B has no additional information. So P_A(E) = P(E|I_A) and P_B(E) = P(E|I_B) differ after the roll because of the two observers' different states of knowledge.

(b) If A is ignorant about soccer and B is knowledgeable, they will likely assign different probabilities to the event "Brazil wins the next World Cup". A's probability, P_A(E), might be based on a general impression of Brazil's soccer prominence, while B's probability, P_B(E), would be informed by detailed knowledge of the current team's strength, recent performance, and competition. The difference in their assigned probabilities reflects the difference in their background knowledge, I_A and I_B.

These examples illustrate how probabilities can be subjective and conditional on the information available to the person assigning them.

> Thank you. Can you please work exercises 1, 2 and 9 also?

Certainly! Let's work through exercises 1, 2, and 9.

Exercise 1: Conditional probability: Suppose that if θ = 1, then y has a normal distribution with mean 1 and standard deviation σ, and if θ = 2, then y has a normal distribution with mean 2 and standard deviation σ. Also suppose Pr(θ = 1) = Pr(θ = 2) = 0.5. (a) For σ = 2, write the formula for the marginal probability density for y and sketch it. (b) What is Pr(θ = 1 | y = 1), again supposing σ = 2?
Solution:

```
(a) The marginal probability density for y is

    p(y) = p(y|θ=1) Pr(θ=1) + p(y|θ=2) Pr(θ=2)
         = 0.5 N(y|1, 2^2) + 0.5 N(y|2, 2^2),

an equal-weight mixture of two normal densities centered at 1 and 2. Because
σ = 2 is large relative to the distance between the two means, the components
overlap heavily, so the sketch is a single, roughly bell-shaped curve centered
near y = 1.5, slightly wider than either component alone.

(b) By Bayes' rule, with the equal priors cancelling,

    Pr(θ=1|y=1) = p(y=1|θ=1) Pr(θ=1) / p(y=1)
                = N(1|1, 2^2) / [N(1|1, 2^2) + N(1|2, 2^2)]
                = 0.1995 / (0.1995 + 0.1760)
                = 1 / (1 + e^(-1/8))
                ≈ 0.531

So Pr(θ=1|y=1) ≈ 0.531 when σ = 2.
```

Exercise 2: Conditional means and variances: Show that (1.8) and (1.9) hold if u is a vector.

Solution:

```
For a vector u and a related quantity v, we want to show that

    (1.8) E(u) = E(E(u|v))
    (1.9) var(u) = E(var(u|v)) + var(E(u|v)),

where var(u) = E(u u^T) - E(u) E(u)^T is the covariance matrix of u.

For (1.8):

    E(u) = ∫∫ u p(u,v) du dv
         = ∫ [∫ u p(u|v) du] p(v) dv
         = ∫ E(u|v) p(v) dv
         = E(E(u|v))

For (1.9):

    var(u) = E(u u^T) - E(u) E(u)^T
           = ∫ [∫ u u^T p(u|v) du] p(v) dv - E(u) E(u)^T
           = ∫ E(u u^T|v) p(v) dv - E(u) E(u)^T
           = ∫ [var(u|v) + E(u|v) E(u|v)^T] p(v) dv - E(u) E(u)^T
           = E(var(u|v)) + E(E(u|v) E(u|v)^T) - E(E(u|v)) E(E(u|v))^T
           = E(var(u|v)) + var(E(u|v)),

using (1.8) in the second-to-last step to replace E(u) by E(E(u|v)).

Thus, (1.8) and (1.9) hold when u is a vector.
```

Exercise 9: Simulation of a queuing problem: A clinic has three doctors. Patients come into the clinic at random, starting at 9 a.m., according to a Poisson process with time parameter 10 minutes: that is, the time until the first patient arrives follows an exponential distribution with mean 10 minutes, and the waiting times between subsequent patients are independent exponential random variables, also with mean 10 minutes. When a patient arrives, they wait until a doctor is available. The time spent by each doctor with each patient is a random variable, uniformly distributed between 5 and 20 minutes.
The office stops admitting new patients at 4 p.m. and closes when the last patient is through with the doctor.

Solution (partial): This problem is well suited to simulation. Here's an outline of how the simulation could be structured:

```
1. Initialize variables:
   - Set current time to 9 a.m.
   - Set doctor_available times to 9 a.m. for all three doctors
   - Set patient_count to 0

2. While current time is before 4 p.m.:
   - Generate next_arrival_time by adding a random exponential (mean 10) wait
     time to the current time
   - If next_arrival_time is after 4 p.m., stop accepting new patients and go
     to step 3
   - Else:
     - Set current time to next_arrival_time and increment patient_count
     - Find the next_available_doctor (the one with the minimum
       doctor_available time)
     - Set start_time to max(current time, doctor_available time for
       next_available_doctor); the patient waits whenever
       start_time > current time
     - Generate service_time from Uniform(5, 20)
     - Update doctor_available time for next_available_doctor to
       start_time + service_time

3. Set closing time to max(current time, maximum doctor_available time)

4. Record:
   - patient_count
   - number of patients who had to wait (those with start_time > arrival_time)
   - average wait time for those who had to wait
   - closing time
```

(Note that each patient's start_time must be compared against that patient's own arrival time, which is why the current time is advanced to next_arrival_time before the patient is assigned to a doctor.) This process can be repeated many times (e.g., 100 or more) to generate distributions for the quantities of interest.
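The outline above can be turned into a short runnable simulation. This is a sketch under the stated assumptions, measuring time in minutes with t = 0 at 9 a.m. (so the clinic stops admitting at t = 420); the function and variable names are mine:

```python
import random

def simulate_clinic(rng):
    """One day of the clinic: Poisson arrivals (mean gap 10 min) until
    t = 420, three doctors, Uniform(5, 20) service times. Returns
    (patient count, number who waited, average wait, closing time)."""
    doctors = [0.0, 0.0, 0.0]          # time each doctor next becomes free
    t = rng.expovariate(1 / 10)        # first arrival time
    n_patients = n_waited = 0
    total_wait = close_time = 0.0
    while t < 420:                     # stop admitting at 4 p.m.
        n_patients += 1
        d = min(range(3), key=lambda i: doctors[i])   # next free doctor
        start = max(t, doctors[d])     # patient waits if no doctor is free
        if start > t:
            n_waited += 1
            total_wait += start - t
        doctors[d] = start + rng.uniform(5, 20)
        close_time = max(close_time, doctors[d])
        t += rng.expovariate(1 / 10)   # next arrival
    avg_wait = total_wait / n_waited if n_waited else 0.0
    return n_patients, n_waited, avg_wait, close_time

rng = random.Random(42)
# repeat the simulation to estimate the distributions of the summaries
days = [simulate_clinic(rng) for _ in range(100)]
mean_patients = sum(d[0] for d in days) / len(days)
print(mean_patients)  # roughly 42: one arrival per 10 minutes over 7 hours
```

Medians and quantiles of the other three summaries (number who waited, average wait, closing time) can be read off the `days` list in the same way.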