Gene Dan's Blog

Monthly Archives: June 2012


No. 63: St. Petersburg Paradox

26 June, 2012 1:32 AM / Leave a Comment / Gene Dan

Hey everyone,

Work has kept me busy, as I’ve spent the last three weeks catching up on the tasks I pushed back while studying for my exam. I didn’t have any time to work on anything creative, so I decided to pull up an old project I had done a year ago on the St. Petersburg Paradox. The St. Petersburg Paradox is a famous problem in mathematics, proposed in 1713 by Nicolas Bernoulli in a letter to fellow mathematician Pierre Raymond de Montmort. The problem consists of a game where the player repeatedly flips a fair coin until it lands tails. The payout is equal to 2 raised to the number of consecutive heads obtained. For example, if the player gets 1 head, he receives a payout of 2. If he gets 2 heads in a row, he receives a payout of 4. If he gets 3 heads in a row, he receives a payout of 8, and so on. More formally, we can express the expected payout as:

$latex \displaystyle \mathrm{E}[\mbox{payout}] = \sum_{i=1}^\infty (\mbox{payout for }i\mbox{ consecutive heads})\times\mathrm{Pr}(i\mbox{ consecutive heads}) $

$latex \displaystyle \mathrm{E}[\mbox{payout}] = \sum_{i=1}^\infty 2^i (.5)^i $

$latex \displaystyle = 2\times\frac{1}{2} + 4\times\frac{1}{4} + 8\times\frac{1}{8}+\cdots $

$latex \displaystyle \mathrm{E}[\mbox{payout}] = 1+1+1+\cdots = \infty$

As you can see, the expected payout is infinite. So, how much would the typical person pay to play this game? It turns out that you might be able to get a person to pay $1 or $5 to play, but once the fee gets into the double-digits, people start getting reluctant. Why is this so? Shouldn’t a rational person be willing to put down any finite amount of money to play the game? That’s why the problem is called the St. Petersburg Paradox. Even though the expected payout is infinite, most people aren’t willing to gamble a large sum of money to play the game.

Intuitively, this makes sense. If you’ve ever tried flipping a penny over and over again, you’ll notice that getting just 5 heads in a row is a rare event. Even then, the payout is just 32. The odds of getting a payout of 1024 are just under one in a thousand. So, unless you have something better to do than flip coins all day (and flip them really fast), you’re better off flipping burgers if you want to make a living wage (keep in mind you have to pay each time you play).

I wrote a couple of VBA subroutines to simulate the problem and posted a video of the execution on YouTube, a little more than a year ago (I believe I used a payout of 2^(i-1) instead of 2^i for this simulation):

[youtube=http://www.youtube.com/watch?v=QAizn9gqhO8]

Here, I allow the user to set some values for the simulation, like the game price, starting balance, and number of games. In addition, I added a progress bar, data bars, and sparklines to test out some new features in Excel 2010. The subroutine runs much faster without these features (like a thousand times faster), since the computer gets interrupted each time it has to update the progress bar and spreadsheet features. The columns to the right compare the theoretical distribution to the empirical distribution obtained through the simulation. As you can see, large payouts are very rare, and the player spends most of the time with a negative balance. The important lesson to glean from this is that it would take a ridiculously large number of games to get a large payout, and relying on expected payouts alone would be a foolish choice to make in a business situation (especially in insurance).
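The original subroutines were written in VBA inside Excel; for readers who want to reproduce the experiment without a spreadsheet, here is a minimal Python sketch of the same game. The function names are my own, and I pay 2 to the power of the number of heads (so a payout of 1 when the very first flip is tails), which is one of the two conventions mentioned above:

```python
import random

def play_once(rng):
    """Flip a fair coin until tails; payout is 2^(number of consecutive heads)."""
    heads = 0
    while rng.random() < 0.5:  # treat < 0.5 as heads
        heads += 1
    return 2 ** heads  # assumption: payout 1 when tails comes up immediately

def simulate(n_games, price, balance, seed=None):
    """Play n_games at a fixed price per game, tracking the running balance."""
    rng = random.Random(seed)
    for _ in range(n_games):
        balance += play_once(rng) - price
    return balance
```

Even over thousands of games, the sample mean payout stays small, because the occasional huge payout that drives the infinite expectation almost never shows up.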

Posted in: Logs, Mathematics / Tagged: st. petersburg paradox, st. petersburg paradox simulation, youtube st. petersburg paradox

No. 62: The Dowry Problem

19 June, 2012 2:30 AM / 1 Comment / Gene Dan

Hey everyone,

Let me introduce you to a famous problem in mathematics. Suppose you’re looking for someone to marry. In order to find that special someone, you, like most people, go on a series of dates, and then pick the best candidate amongst those whom you’ve dated. Unfortunately, while you have the ability to rank the suitors in retrospect, you have no way to determine if your ultimate choice will have been the best one out of all the available options. For example, if you were to pick the 8th person whom you’ve dated as your lifelong partner, until death do you part, you would be able to rank that person amongst the 7 previous candidates, but you would have no idea if anyone better would have come along had you rejected that 8th person and kept looking instead. Hence, the dowry problem. (Also called The Secretary Problem, The Fussy Suitor Problem, The Marriage Problem, The Sultan’s Dowry Problem, The Gogol Game, and The Best Choice Problem).

More formally, suppose you have 100 rankable suitors whom you will date one at a time, in succession. After each date, you have the choice to either reject or accept the suitor. If you accept the suitor, you cannot date any more candidates and are stuck with that person for life. If you reject the suitor, you will not have the chance to date that candidate again and will move on to the next suitor. With each additional suitor, you have the ability to rank that suitor unambiguously amongst your previous suitors. The process continues until you either accept a suitor, or reject all 100 suitors and doom yourself to a life of lonely bachelorhood or spinsterhood.

So, how do you maximize your chance of selecting the best candidate? It turns out the solution is to employ an optimal stopping rule: date X suitors, rank them, and then choose the next suitor who is at least as good as the best of those X. In this problem the optimal stopping rule is to date 37 candidates, and then choose the next suitor who is at least as good as the best of the 37.

It turns out the problem has an elegant solution when n suitors are involved. Generally, the optimal stopping rule is to date n/e candidates (where e is Euler’s number), and then choose the next candidate who is at least as good as the best of the first n/e candidates. This means you should date about 37% of the candidate pool, and then choose the next person who is at least as good as the best of that 37%. So it works no matter how big n is:

100: date the first 37 candidates
1,000: date the first 368 candidates
10,000: date the first 3,679 candidates
100,000: date the first 36,788 candidates
1,000,000: date the first 367,879 candidates

And so on.
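The cutoffs in the list above are just n/e rounded to the nearest integer; a quick sanity check (my own snippet, not from the original post):

```python
import math

# Each cutoff is n/e rounded to the nearest whole candidate.
for n in [100, 1_000, 10_000, 100_000, 1_000_000]:
    cutoff = round(n / math.e)
    print(f"{n}: date the first {cutoff} candidates")
```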

I decided to use VBA to run some simulations on this problem to see if I could confirm what we have seen here. Unfortunately, I was not successful, but after looking more carefully, I think the problem had to do with the built-in random number generator, as well as the shuffling algorithm I employed to randomize the order of the suitors. I basically generated an array of 100 integers from 1 to 100, shuffled them, applied the stopping rule from 1 to 100, and repeated the procedure to see if the optimal stopping point (determined by the maximum empirical probability of success) matched 37.

As you can see, as the number of iterations increases, the curve approaches that of the true distribution.

After many thousands of iterations I discovered the optimal stopping point centered around 60, and I spent about an hour trying to correct the issue without success. Then I tried counting the proportion of iterations in which the best candidate (integer 100) wound up within the first 33 indices of the array. It turned out that the best candidate ended up in the first 33 only 17%–20% of the time, when it ought to be 33%. This would explain why the optimal stopping point for this distribution ended up further to the right than 100/e. From here I concluded that the shuffling algorithm I used was not shuffling the array uniformly. Here is the code below:

Sub ShuffleArrayInPlace(InArray() As Variant)
''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''
' ShuffleArrayInPlace
' This shuffles InArray to random order, randomized in place.
''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''
    Dim N As Long
    Dim Temp As Variant
    Dim J As Long

    Randomize
    For N = LBound(InArray) To UBound(InArray)
        J = CLng(((UBound(InArray) - N) * Rnd) + N)
        If N <> J Then
            Temp = InArray(N)
            InArray(N) = InArray(J)
            InArray(J) = Temp
        End If
    Next N
End Sub

I obtained the shuffling algorithm from Chip Pearson’s VBA page. I’m willing to give the problem another shot with one that shuffles the array more uniformly, so if anyone can help me out that would be great!
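One thing worth checking: VBA’s CLng rounds to the nearest integer rather than truncating, so the endpoints of each pass are picked only half as often as the interior positions, which would bias the shuffle; a Fisher-Yates shuffle needs a floor (Int in VBA). As a sanity check under a uniform shuffle, here is a Python sketch of the same experiment using random.shuffle (which is a uniform Fisher-Yates); the helper below is my own, not a translation of the original subroutine:

```python
import random

def best_choice_success(n, cutoff, trials, seed=None):
    """Estimate P(the stopping rule with the given cutoff picks the best of n suitors)."""
    rng = random.Random(seed)
    wins = 0
    for _ in range(trials):
        suitors = list(range(1, n + 1))
        rng.shuffle(suitors)  # uniform Fisher-Yates shuffle
        # Reject the first `cutoff` suitors, remembering the best seen so far,
        # then accept the next suitor who beats that benchmark.
        best_seen = max(suitors[:cutoff]) if cutoff else 0
        pick = next((s for s in suitors[cutoff:] if s > best_seen), None)
        wins += (pick == n)
    return wins / trials
```

With n = 100 and a cutoff of 37, the empirical success rate should settle near 1/e, about 37%, and sweeping the cutoff from 1 to 100 should put the maximum at 37 rather than 60.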

Posted in: Logs, Mathematics / Tagged: best choice problem, fussy suitor problem, marriage problem, simulation, sultan's dowry problem, the secretary problem

No. 61: Pass

5 June, 2012 3:48 AM / 2 Comments / Gene Dan

Hey everyone,

I’ve got some good news – I passed exam C/4! In retrospect, it felt like one of the most difficult exams I’ve ever taken, if not the hardest. I sacrificed three weeks of training and pretty much studied nonstop until I walked into the testing center for my sitting. The exam was a computerized test – the questions were drawn randomly with the passing score calibrated based on the difficulty of the questions. That means a test with easy questions will on average have a higher passmark than a test with hard questions. I must have gotten unlucky this time around because almost all of my questions were hard…there weren’t many questions I knew how to do right away, and I had to skip several when time ran out. I just felt relieved when I found out I had passed after submitting my answers.

I won’t get my score until mid-July, but my preliminary passing result has a 99.9975% chance of matching my final result. The one (or two) exams in the history of computerized actuarial exams whose preliminary results didn’t match the final results were due to human error in handling the scores. So, I’m confident that I passed…maybe barely. In my opinion, I didn’t study very well – I didn’t follow through with my plan to memorize every single definition and theorem from Loss Models. I did, however, put in the recommended 300-400 hours, which is probably more than what most people did, but I think at my level, the efficiency of study was somewhat lacking. I ought to have improved over last time, but I didn’t get rid of the bad habit of holding off until the last two months to study…I kind of got away with it since I’m really familiar with the way standardized tests work. The raw probability of any one individual passing 4 exams in a row is only 2.6%…I ought to consider myself lucky for having a smooth path so far, but I aim to make my next pass based on hard work.

There might have been a slight improvement over the last sitting as I was seriously looking at the material at least two months ahead of time, compared to 6 weeks before the exam date last time and 4 weeks the time before that. Those are merely rough guesses, though, so for my next test, MFE/3F, which I’ll be taking in November, I’ve made it a point to start early (I started studying the day after I passed), to record each day what I’ve studied, and to use linear interpolation to predict when I’ll finish the material. I suppose in this sense I shouldn’t expect a miraculous improvement in studying…if I can just do one thing better than I did last time (like memorizing all of the definitions) for each of the next 6 exams, the cumulative effect of these small improvements should pay off over time.

I’m working on some interesting things – I’m currently designing some simulations on the dowry problem that I’ll write about next week. But for now I’m pretty tired. I finally got around to reading The Cyclist’s Training Bible and I’m almost a third of the way through it. There are a lot of things I learned from that book, and I’m looking forward to building a new training program in the coming months. It looks like I’m too out of shape to participate in the summer races, and I think I might have sacrificed too much of my fitness to target the late-season races…then again, I wouldn’t be able to race at all if I didn’t make the income to support it, so in the end I think it was worth it.

By the way, it looks like I finally broke my New Year’s resolution. I managed to update this blog 18 weeks in a row but I stopped because of the exam. Now I’m back.

Posted in: Logs
