<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" version="2.0">

<channel>
	<title>Messy Matters</title>
	
	<link>http://messymatters.com</link>
	<description>Bring Your Own Data</description>
	<lastBuildDate>Thu, 29 Jul 2010 19:35:17 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/messymatters" /><feedburner:info xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" uri="messymatters" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>What Your Friends Don’t Know About You</title>
		<link>http://messymatters.com/2010/06/30/friendsense/</link>
		<comments>http://messymatters.com/2010/06/30/friendsense/#comments</comments>
		<pubDate>Thu, 01 Jul 2010 03:56:37 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[homophily]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[social networks]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=859</guid>
  		<description><![CDATA[<p><a href="http://messymatters.com/wp-content/upl...]]></description>
	  		<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/06/friends.gif"><img class="aligncenter size-full wp-image-866" title="Elephants and donkeys. This is a little subtle!" src="http://messymatters.com/wp-content/uploads/2010/06/friends.gif" alt="Elephants and donkeys. This is a little subtle!" width="362" height="269" /></a></p>
<p>How well do you know your friends&#8217; political views? According to <a href="http://www.cam.cornell.edu/~sharad/papers/friendsense.pdf">recent work</a> by <a href="http://smallsocialsystems.com/" target="_blank">Winter Mason</a>, <a href="http://research.yahoo.com/Duncan_Watts" target="_blank">Duncan Watts</a>, and myself, you probably don&#8217;t know them as well as you think. In particular, we found that <strong>when friends disagree on a political issue, they are unaware of that disagreement about 60% of the time. Even close friends who discuss politics are typically unaware of their differences in opinions.</strong></p>
<p><a href="http://en.wikipedia.org/wiki/Homophily" target="_blank">Homophily</a>, the tendency for individuals to associate with similar others, is one of the more consistently observed phenomena of the social world. While reports of homophily have focused on socio-demographic attributes&#8212;like age, gender, and race&#8212;it&#8217;s reasonable that homophily would extend to attitudes as well, in part because individuals likely seek out those who agree with them, and in part because social pressure often breeds conformity. However, <em>measuring</em> the extent to which friends agree with one another is a tough empirical problem, and quantifying the differences between actual and perceived homophily was, until recently, prohibitively difficult.</p>
<p>Enter Facebook, the social science experimentation platform with a subject pool of <a href="http://www.facebook.com/press/info.php?statistics" target="_blank">400 million strong</a> (and <a href="http://blog.comscore.com/2010/01/strong_year_for_facebook.html" target="_blank">growing</a>). To examine the differences between real and perceived attitude agreement, we built a Facebook game where players both answered a series of questions about their own attitudes, and attempted to guess how particular friends would respond. We collected 12,000+ responses in which we had a player&#8217;s answer to a given question, their friend&#8217;s answer to that same question, and the player&#8217;s guess as to what their friend would say.</p>
<p>The figure below shows that <strong>people consistently overestimate the likelihood that their friends agree with them on political issues</strong>. Notably, even though close friends (so-called strong ties<a id="refTIES" href="#TIES"><sup>[1]</sup></a>) are in reality more likely to agree with one another than distant friends, people do not appropriately adjust their perceptions. In other words, <strong>though we think close and distant friends are about equally likely to agree with us on political issues, in reality we are much more likely to agree with close friends</strong>.</p>
<p><a href="http://messymatters.com/wp-content/uploads/2010/06/agreement.gif"><img class="aligncenter size-full wp-image-862" title="agreement" src="http://messymatters.com/wp-content/uploads/2010/06/agreement.gif" alt="" width="300" height="300" /></a></p>
<p>Upon reflection, it&#8217;s perhaps not so surprising that we often don&#8217;t know where our friends stand on hot-button political issues. How many times have you actually asked your friends how they feel about abortion, capital punishment, or affirmative action? Inferring friends&#8217; attitudes is further complicated by the fact that political positions tend to be only weakly correlated across issues. For example, according to the 2008 <a href="http://www.norc.org/GSS+Website/" target="_blank">General Social Survey</a>, 65% of people who support abortion rights also support capital punishment, compared to 68% among those who oppose abortions. That is, someone&#8217;s position on abortion tells you nearly nothing about their view on capital punishment.</p>
<p>I think society would benefit from friends and neighbors engaging in more active political deliberation, discussions that at the very least leave participants aware of each other&#8217;s views. On the other hand, if your friends really knew what you thought, you might have a lot fewer friends.</p>
<p><em>N.B.</em> For more details, check out our <a href="http://www.cam.cornell.edu/~sharad/papers/friendsense.pdf">paper</a>.</p>
<p><em>Illustration by </em><em><a href="http://krsavage.com" target="_blank">Kelly Savage</a></em></p>
<h3>Footnotes</h3>
<p><a id="TIES" href="#refTIES">[1]</a> We define the tie strength between a pair of individuals by considering the number of mutual friends that they have, and whether or not the pair discusses politics.</p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/BnxLnVPzxgI" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/06/30/friendsense/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Extremely Weak Cryptography: Rot13 for Numbers</title>
		<link>http://messymatters.com/2010/05/31/sealedbids/</link>
		<comments>http://messymatters.com/2010/05/31/sealedbids/#comments</comments>
		<pubDate>Tue, 01 Jun 2010 03:00:50 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[auctions]]></category>
		<category><![CDATA[crypto]]></category>
		<category><![CDATA[mechanism design]]></category>
		<category><![CDATA[sealed bids]]></category>
		<category><![CDATA[yootles]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=640</guid>
  		<description><![CDATA[<p><img src="http://messymatters.com/wp-content/up...]]></description>
	  		<content:encoded><![CDATA[<p><img src="http://messymatters.com/wp-content/uploads/2010/02/crypt-300x211.png" alt="Two people giving (encrypted) arguments about various encryption schemes. ... 'EBG13 vf terng!' ... 'Double ROT13 is better!'" title="Two people giving (encrypted) arguments about various encryption schemes. ... 'EBG13 vf terng!' ... 'Double ROT13 is better!'" width="300" height="211" class="aligncenter size-medium wp-image-641" /></p>
<p>The idea of <a href="http://rot13.com">rot13</a> is to obscure text, for example, to prevent spoilers. It&#8217;s not meant to be cryptographically secure but simply to make sure that only people who are sure they want to read something will read it.</p>

<p>We, being pretty extreme nerds, sometimes find ourselves wanting to do something similar for numbers.  This is typically in the context of a sealed-bid auction where you want to submit a bid that the other party can unseal whenever they&#8217;re ready, with no further coordination needed.  Most recently, Sharad and I had to decide who would be presenting our <a href="http://messymatters.com/predmarkets">Prediction Without Markets</a> paper at the upcoming <a href="http://www.sigecom.org/ec10/">ACM Ecommerce conference</a>.  To determine who wanted to present more (i.e., who minded least), I sent him my sealed bid (how much I would pay him, at most, to present) and told him to unseal it when he had committed to his own bid.  (Eliminating the need for further coordination was especially useful here because Sharad was about to get on a plane.  It&#8217;s similarly handy when coordinating by text message when one or both parties are about to get on the subway.)</p>
<p>The outcome in that case was that Sharad bid lower and so I paid him that lower bid to be the one to present our paper.</p>

<p>In general what we want is a way for me to send you a number and trust you to pick your own number, uninfluenced by mine. And then you should be able to reveal mine when you&#8217;re ready, without requiring further input from me or any third party. Note the assumption that the recipient is being trusted not to cheat.</p>
<p>So how do we do that?  It&#8217;s not as simple as rot13 because certain numbers, like 1 and 2, will recur often enough that you might remember that, say, &#8220;gjb&#8221; is really 2.</p>
<p>To be more specific, we seek a function, seal(), that maps a real number to a real number (or a string). It should <i>not</i> be deterministic &#8212; seal(7) should not map to the same thing every time. But the corresponding function, reveal(), should be deterministic &#8212; reveal(seal(x)) should equal x for all x.</p>
<p>For example: </p>
<pre>
   &gt; seal(7)              # Some random-seeming
   429964                 # number or string.

   &gt; seal(7)              # A completely different
   749932                 # random-seeming number.

   &gt; reveal(seal(7))      # Revealing always uncovers
   7                      # the original number.
</pre>
<p>Can you think of a way to implement seal() and reveal()?</p>
<h2>How to Seal and Reveal a Secret Number</h2>
<p>There are <a href="http://stackoverflow.com/questions/809137/rot13-for-numbers">many ways</a>, it turns out, to accomplish this but here, in lay terms, is an elegant scheme that can be carried out easily with a pocket calculator:</p>
<p>Pick a random number (from your head is fine) between 9 and 99 and multiply it by 9999, then add your secret number. This will yield a 5 or 6-digit number that encodes your secret number. To decode it, divide by 9999, subtract the part to the left of the decimal point, then multiply by 9999. (This is known to children and mathematicians as &#8220;finding the remainder when dividing by 9999&#8221; and &#8220;mod&#8217;ing by 9999&#8221;, respectively.)</p>
<p>This works for nonnegative numbers less than 9999 (if that&#8217;s not enough, use 99999 or as many nines as you want). If you want to allow negative numbers, then the magic 9999 number needs to be twice the biggest possible number. And when decoding, if the result is greater than half of 9999, i.e., 5000 or more, then subtract 9999 to get the actual (negative) number.</p>
<p>Again, this is not a true <a href="http://en.wikipedia.org/wiki/Commitment_scheme">commitment scheme</a>. But for cases where the honor system suffices and the only goal is to not be influenced by each other&#8217;s number, we find this quite handy.</p>
<h2>Pseudocode</h2>
<pre>
   M = 9999  # Numbers between -M/2 and M/2 can be sealed.

   seal(x): M * randInt(9, 99) + x

   reveal(x): m = mod(x, M);
              if m &gt; M/2 return m - M else return m
</pre>
</p>
<p><i>Image by <a href="http://www.moserware.com">Jeff Moser</a>.</i></p>
<p><i>Thanks to <a href="http://stackoverflow.com/users/31615/svante">Svante v. Erichsen</a> for the idea behind this scheme, and to a dozen other <a href="http://stackoverflow.com/questions/809137/rot13-for-numbers">Stack Overflow</a> users for alternative schemes.</i></p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/fwYYQ6CeO3Y" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/05/31/sealedbids/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Misleading Means: On Drugs and Discrimination</title>
		<link>http://messymatters.com/2010/04/30/misleading-means/</link>
		<comments>http://messymatters.com/2010/04/30/misleading-means/#comments</comments>
		<pubDate>Fri, 30 Apr 2010 11:15:31 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[discrimination]]></category>
		<category><![CDATA[RDS]]></category>
		<category><![CDATA[variance]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=745</guid>
  		<description><![CDATA[<p><a href="http://messymatters.com/wp-content/upl...]]></description>
	  		<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/04/diffusion.jpeg"><img class="aligncenter size-full wp-image-749" title="diffusion" src="http://messymatters.com/wp-content/uploads/2010/04/diffusion.jpeg" alt="" width="335" height="242" /></a></p>
<p>How do you estimate the prevalence of HIV among drug addicts? In a perfect (though somewhat Orwellian) world you&#8217;d have a giant list of all drug users, and you&#8217;d randomly select a subset of them to test for HIV. Alas, the world is messy, and we&#8217;re left to <a href="http://www.respondentdrivensampling.org/reports/AIDS_2005.pdf">less rigorous methods</a>, for example time-location sampling, where surveyors recruit drug users from the local &#8220;shooting gallery&#8221; (heroin users&#8217; version of a crack house) at randomly selected times.</p>
<p>A few years ago, sociologists <a href="http://www.soc.cornell.edu/faculty/heckathorn.html" target="_blank">Doug Heckathorn</a> and <a href="http://www.princeton.edu/~mjs3/" target="_blank">Matthew Salganik</a> proposed a new approach to surveying these so-called hard-to-reach or hidden populations: <a href="http://www.respondentdrivensampling.org/" target="_blank">respondent-driven sampling</a>. RDS is a variant of snowball sampling&#8212;where existing study participants recruit the next wave of participants, usually in exchange for payment&#8212;with a few clever tweaks. Matt and Doug realized that popular drug users (i.e., those with a lot of friends) are disproportionately likely to be in the final snowball sample, for the simple reason that popular users know more people who could potentially recruit them. In RDS, recruits are thus down-weighted by the number of <a href="http://pubs.amstat.org/doi/abs/10.1198/jasa.2009.ap08518" target="_blank">people they know</a>. In just a few short years, RDS has already been applied in more than 20 countries, and it is currently used by the <a href="http://www.cdc.gov/" target="_blank">Centers for Disease Control and Prevention</a> (CDC) to help track the HIV epidemic in the United States. Now for the bad news: Matt and I just wrote a <a href="http://www.cam.cornell.edu/~sharad/papers/assessing-rds.pdf">paper</a> showing that <strong>RDS may not be suitable for key aspects of public health surveillance where it is now extensively applied</strong>.</p>
<p>Our critique of RDS boils down to a simple empirical reality: <strong>outcomes are typically not typical</strong>. To give an example, while the mean <a href="http://en.wikipedia.org/wiki/Household_income_in_the_United_States" target="_blank">household income</a> in the United States is around $60K, half of households earn either less than $30K or more than $100K. In other words, households typically have incomes that are far from typical. So what does this have to do with RDS? The trick of down-weighting popular drug users guarantees&#8212;albeit under <a href="http://arxiv.org/abs/0904.1855" target="_blank">strong assumptions</a>&#8212;that RDS will on average yield the true infection level. (In statistical parlance, RDS is an asymptotically <a href="http://en.wikipedia.org/wiki/Bias_of_an_estimator" target="_blank">unbiased estimator</a>.) In the RDS community, this theoretical property was widely interpreted as meaning that RDS is a generally reliable method of estimation. What we show is that even in situations where RDS estimates are correct on average, these estimates are typically so far from the truth that meaningful statistical inference is difficult. To clarify the point, suppose we estimate the mean age in New York by   sampling one person uniformly at random. Even though our estimate would be perfect on  average (i.e., unbiased), it could hardly be called accurate. Since RDS participants tend to know, and hence recruit, people who are similar to themselves, this caricatured example is actually not so far from the <a href="http://www.cam.cornell.edu/~sharad/papers/assessing-rds.pdf">truth</a> about why RDS estimates are so volatile.</p>
<p>I suspect that this tendency to neglect the variance also plays a role in discriminatory attitudes. Consider the following statistic, culled from the 2008 <a href="http://www.norc.org/GSS+Website/" target="_blank">General Social Survey</a>: white Americans on average watch 2.8 hours of television daily, compared to 4.4 hours watched on average by black Americans. The difference in group means is large, both on a relative scale&#8212;blacks on average watch nearly 60% more television than whites&#8212;and on an absolute scale&#8212;1.6 hours a day. (Citing a similar statistic, conservative radio host, and Rush Limbaugh protege, <a href="http://www.michaelmedved.com/" target="_blank">Michael Medved</a> has <a href="http://www.wnd.com/index.php?pageId=13623" target="_blank">pontificated</a> on the subject.) Now, if you&#8217;re walking down the street and you see a random black person and a random white person, how likely is it that the black person watches more TV? Given the large group differences, you might be tempted to conclude it&#8217;s pretty likely. As it turns out, however, individual variation is so large that it&#8217;s basically a coin flip (about 60%) as to who watches more TV&#8212;in other words, race is a <a href="http://en.wikipedia.org/wiki/Cue_validity" target="_blank">poor cue</a>. The plot below makes this phenomenon visually apparent: despite the substantial difference in group means, whites and blacks have distributions of TV watching that are tough to distinguish. Just as in RDS, the means are  misleading.</p>
<p style="text-align: center;"><a href="http://messymatters.com/wp-content/uploads/2010/04/tv.png"><img class="aligncenter size-full wp-image-810" title="TV viewing habits for whites and blacks" src="http://messymatters.com/wp-content/uploads/2010/04/tv.png" alt="" width="570" height="285" /></a><em> </em></p>
<div id="magicdomid13">In  fictional <a href="http://en.wikipedia.org/wiki/Lake_Wobegon" target="_blank">Lake Wobegon</a>, all the children are above average. Though that&#8217;s not <a href="http://agoraphilia.blogspot.com/2007/01/more-than-50-percent-can-be-above.html" target="_blank">quite true</a> in the real world, it actually is the case that most people are far from average.</div>
<p>NB: For more details on RDS, check out our <a href="http://www.cam.cornell.edu/~sharad/papers/assessing-rds.pdf">paper</a>.</p>
<p><em>Illustration by <a href="http://krsavage.com" target="_blank">Kelly Savage</a>: A stylized depiction of an RDS recruitment tree, where nodes correspond to sample members and links indicate who recruited whom. Based on an RDS study of drug users in New York City, presented in <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2527186/" target="_blank">Abdul-Quader, et al, 2006.</a><br />
</em></p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/kR5J7I8yKT0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/04/30/misleading-means/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yes, You Are (Maybe) Overconfident</title>
		<link>http://messymatters.com/2010/03/31/calibration-results/</link>
		<comments>http://messymatters.com/2010/03/31/calibration-results/#comments</comments>
		<pubDate>Thu, 01 Apr 2010 03:57:57 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[calibration]]></category>
		<category><![CDATA[decision theory]]></category>
		<category><![CDATA[overcoming bias]]></category>
		<category><![CDATA[overconfidence]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[rationality]]></category>
		<category><![CDATA[wisdom of crowds]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=715</guid>
  		<description><![CDATA[<p><a href="http://messymatters.com/wp-content/upl...]]></description>
	  		<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/03/stumped.jpg"><img src="http://messymatters.com/wp-content/uploads/2010/03/stumped-300x199.jpg" alt="" title="Confidence intervals are hard. Let&#039;s go shopping." width="300" height="199" class="aligncenter size-medium wp-image-716" /></a></p>
<p>218 of you took our <a href="http://messymatters.com/calibration">calibration quiz</a>, not counting the 10% of submissions that had to be thrown out for not being complete or giving ranges with the min greater than the max or other sanity check failures. (<a href="http://messymatters.com/calib-quiz/quiz-results.txt">Here&#8217;s the raw data.</a>)</p>
<p>The bad news is that you&#8217;re terrible at making 90% confidence intervals. For example, not a single person had all 10 of their intervals contain the true answer, which, if everyone were perfectly calibrated, should&#8217;ve happened by chance to 35% of you. Getting less than 6 good intervals should, statistically, not have happened to anyone. How many actually had 5 or fewer good intervals?  76% of you.</p>
<p>Here&#8217;s a histogram of the number of good intervals you got, out of 10:</p>
<p><a href="http://messymatters.com/wp-content/uploads/2010/03/histwi.png"><img src="http://messymatters.com/wp-content/uploads/2010/03/histwi.png" alt="" title="Histogram of number of good intervals out of 10." width="360" height="229" class="aligncenter size-full wp-image-717" /></a></p>

<p>The overlaid phantom histogram is what it would look like if it were really the case that every interval people gave had a 90% chance of containing the true answer. In other words, you should&#8217;ve made your intervals much wider. When we ask for a 90% confidence interval there&#8217;s in fact only a 41% chance that your interval contains the true answer.</p>
<p>We ran this quiz on <a href="http://mturk.com">Mechanical Turk</a> as well and you marginally outperformed the turkers. The histogram of turkers&#8217; good intervals is indicated by the red dots in the above graph. They failed our sanity checks at almost twice the rate (19%) of Messy Matters readers and of the remaining responses, the mean number of good intervals was 3.5 out of 10.</p>
<p>The more we&#8217;ve thought about (and read the literature on &#8212; or rather, consulted endlessly with <a href="http://www.dangoldstein.com/">Dan Goldstein</a>, who knows the literature on) these kinds of overconfidence results, however, the less clear it is that the moral of this quiz is simply &#8220;people are overconfident&#8221;. For one thing, overconfidence depends on the question. The fraction of good intervals in your responses ranged from 23% (the length of the Nile and the gestation period of an Asian elephant) to 75% (number of OPEC countries). Of course, even 75% is not the 90% that was asked for.</p>
<p>More interestingly, in an ongoing follow-up study on Mechanical Turk we&#8217;re finding that after you get people&#8217;s intervals, more than half of them realize in retrospect that too few of their intervals are good. This suggests that people can learn to perform much better at this task.</p>
<h3>Obligatory Wisdom of Crowds Demonstration</h3>
<p>It&#8217;s not a fair demonstration since people weren&#8217;t asked for their best guesses, but here&#8217;s a table of median lower bounds, upper bounds, and midpoints of everyone&#8217;s ranges. Interestingly, people&#8217;s upper bounds are overall most accurate.</p>
<style>
table {
  border:1px solid black;  
  border-collapse: collapse;
  font: 12px arial,helvetica,sans-serif;
}  
th {
  font-size: 1.1em;
}  
td,th {
  margin:0px;
  padding: 5px;
  border:1px solid black;
}
</style>
<table>
<tr>
<th></th>
<th>MLK</th>
<th>Nile</th>
<th>OPEC</th>
<th>Bible</th>
<th>Moon</th>
<th>747</th>
<th>Mozart</th>
<th>Elephant</th>
<th>Tokyo</th>
<th>Ocean</th>
</tr>
<tr>
<td><b>True</b></td>
<td>39</td>
<td>4132</td>
<td>12</td>
<td>39</td>
<td>2160</td>
<td>390000</td>
<td>1756</td>
<td>645</td>
<td>5959</td>
<td>35994</td>
</tr>
<tr>
<td><b>Min</b></td>
<td>35</td>
<td>900</td>
<td>6</td>
<td>8</td>
<td>1000</td>
<td>20000</td>
<td>1700</td>
<td>180</td>
<td>5000</td>
<td>13500</td>
</tr>
<tr>
<td><b>Mid</b></td>
<td>45</td>
<td>1750</td>
<td>13</td>
<td>15</td>
<td>3500</td>
<td>63250</td>
<td>1725</td>
<td>320</td>
<td>8000</td>
<td>30000</td>
</tr>
<tr>
<td><b>Max</b></td>
<td>55</td>
<td>3000</td>
<td>20</td>
<td>20</td>
<td>5000</td>
<td>100000</td>
<td>1790</td>
<td>400</td>
<td>10000</td>
<td>40000</td>
</tr>
</table>
<p><br/><br/></p>
<p><em>Thanks to <a href="http://www.cam.cornell.edu/~sharad/">Sharad Goel</a>, <a href="http://www.dangoldstein.com/">Dan Goldstein</a>, <a href="http://bethany.pirateship.org/">Bethany Soule</a>, <a href="http://twitter.com/dakami">Dan Kaminsky</a>, and <a href="http://www.google.com/profiles/michael.tiffany">Michael J.J. Tiffany</a>.</em></p>
<p><em>Image: Kelly Savage</em></p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/wgurDDJu2jM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/03/31/calibration-results/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Are You Overconfident?</title>
		<link>http://messymatters.com/2010/02/28/calibration/</link>
		<comments>http://messymatters.com/2010/02/28/calibration/#comments</comments>
		<pubDate>Mon, 01 Mar 2010 04:52:20 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[calibration]]></category>
		<category><![CDATA[decision theory]]></category>
		<category><![CDATA[overconfidence]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[rationality]]></category>
		<category><![CDATA[wisdom of crowds]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=672</guid>
  		<description><![CDATA[<p><a href="http://messymatters.com/wp-content/upl...]]></description>
	  		<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/02/moon.jpg"><img class="aligncenter size-medium wp-image-673" title="How to estimate the distance from the Earth to the Moon, if you're the Little Prince." src="http://messymatters.com/wp-content/uploads/2010/02/moon-300x231.jpg" alt="How to estimate the distance from the Earth to the Moon, if you're the Little Prince." width="300" height="231" /></a></p>
<p>We shall now find out if Messy Matters readers are smarter than <a href="http://mturk.com">Mechanical Turkers</a>.  For each of the questions below, provide a numerical range that you are 90% sure contains the correct answer. In particular, if you have “no idea” then give a very wide range; and if you happen to be quite certain then give a narrow range.</p>
<p>If you are perfectly calibrated then 90% of your intervals (no more, no less) should contain the right answers.</p>
<p><strong>Please try your best, but don&#8217;t look up the answers!</strong></p>
<p><iframe src="http://messymatters.com/calib-quiz" width=470 height=700></iframe><br />
(If the quiz doesn&#8217;t show up in your news reader, <a href="http://messymatters.com/calibration">click through to the post on messymatters.com</a>.)</p>
<p>We&#8217;ll discuss the results in the next post.</p>
<p>PS: <a href="http://messymatters.com/2010/03/31/calibration-results/">Here are the results.</a></p>
<p><i>Quiz from <a href="http://www.amazon.com/Decision-Traps-J-Edward-Russo/dp/0385248350">Decision Traps</a> by Russo and Shoemaker.</i><br />
<i>Image by Kelly Savage.</i></p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/1u6ctqLxOpQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/02/28/calibration/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Prediction Without Markets</title>
		<link>http://messymatters.com/2010/01/14/prediction-without-markets/</link>
		<comments>http://messymatters.com/2010/01/14/prediction-without-markets/#comments</comments>
		<pubDate>Thu, 14 Jan 2010 06:28:44 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[economics]]></category>
		<category><![CDATA[fallacies]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[sports]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=581</guid>
  		<description><![CDATA[<p><a href="http://messymatters.com/wp-content/upl...]]></description>
	  		<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2010/01/photo-finish.jpg"><img class="aligncenter size-full wp-image-625" title="photo-finish" src="http://messymatters.com/wp-content/uploads/2010/01/photo-finish.jpg" alt="photo-finish" width="388" height="273" /></a></p>
<p><span>In the 2008 Summer Olympics <a href="http://en.wikipedia.org/wiki/Usain_Bolt" target="_blank">Usain Bolt</a> ran 100 meters in <a href="http://sports.yahoo.com/olympics/beijing/track_field/results/ATM001/full" target="_blank">9.69 seconds</a>, earning the gold medal and garnering the international attention that comes with being the &#8220;<a href="http://en.wikipedia.org/wiki/100_metres" target="_blank">fastest man in the world</a>.&#8221; While Bolt became a household name, his competitors didn&#8217;t fare nearly as well: far fewer people know that <a href="http://en.wikipedia.org/wiki/Richard_Thompson_(athlete)" target="_blank">Richard Thompson</a> and <a href="http://en.wikipedia.org/wiki/Walter_Dix" target="_blank">Walter Dix</a> received silver and bronze, and I suspect that 8th place <a href="http://en.wikipedia.org/wiki/Darvis_Patton" target="_blank">Darvis Patton</a> is practically unknown outside the sprinting world. The 340 milliseconds that separated Bolt from Patton&#8212;the <a href="http://en.wikipedia.org/wiki/Blink">duration of a blink of the eye</a>&#8212;was the difference between celebrity and obscurity. <strong>While a fascination with rank is perhaps justifiable for sports, such focus&#8212;let&#8217;s call it the Gold Medal Syndrome&#8212;is often problematic in statistical analysis.</strong></span></p>
<p><span> </span>Consider the case of prediction markets. In these markets, participants buy and sell securities that realize a value based on the occurrence of some future outcome, such as the result of an election, the box office revenue of an upcoming film, or the market share of a new product. For example, the day before the 2008 U.S. presidential election you could have paid $0.92 for a contract in the <a href="http://www.biz.uiowa.edu/iem">Iowa Electronic Markets</a> that yielded $1 when Barack Obama won, implying a 92% market-estimated probability that Obama would win. There are compelling theoretical reasons to expect prediction markets to outperform all other available forecasting methods. As formalized by the <a href="http://en.wikipedia.org/wiki/Efficient-market_hypothesis" target="_blank">efficient-market hypothesis</a>, if there were some way to beat the market then at least one savvy trader would presumably exploit that advantage to make money; hence, market prices should update to eliminate any performance disparity.</p>
<p>Inspired by such theoretical arguments, and also by a growing body of empirical findings that show markets beat alternatives, several researchers have called for widespread application of prediction markets to real-world business strategy and policy development problems. In a 2007 <a href="http://online.wsj.com/article/SB117885086047199534.html" target="_blank">Wall Street Journal op-ed</a>, economists Robert Hahn and Paul Tetlock write:</p>
<blockquote><p><em>&#8220;Imagine the president had a crystal ball to predict more accurately the impact of broader prescription coverage on the Medicare budget, the effect of more frequent audits on tax compliance&#8212;or even the consequences of a political settlement in Iraq on oil prices. Now, stop imagining: Such crystal balls [prediction markets] are within our grasp.&#8221;</em></p></blockquote>
<p>The work on which these appeals are based, however, primarily addresses the relative ranking of prediction methods. By contrast, the magnitude of the differences in question has received much less attention, and as such, it remains unclear whether the performance improvement associated with prediction markets is meaningful from a practical perspective.</p>
<p><strong>In a new <a href="http://www.cam.cornell.edu/~sharad/papers/pred-wo-markets.pdf">study</a>, <a href="http://ai.eecs.umich.edu/people/dreeves/" target="_blank">Daniel Reeves</a>, <a href="http://research.yahoo.com/Duncan_Watts" target="_blank">Duncan Watts</a>, <a href="http://dpennock.com/">Dave Pennock</a> and I compare the performance of prediction markets to conventional means of forecasting, namely polls and statistical models. Examining thousands of sporting and movie events, we find that the relative advantage of prediction markets is remarkably small.</strong> For example, the Las Vegas market for professional football is only 3% more accurate in predicting final game scores than a simple, three parameter statistical model, and the market is only 1% better than a poll of football enthusiasts. The plot below shows how the three methods perform on the complementary task of estimating the probability the home team wins.</p>
<p><span><a href="http://messymatters.com/wp-content/uploads/2010/01/mm_calibration.png"><img class="aligncenter size-full wp-image-598" title="mm_calibration" src="http://messymatters.com/wp-content/uploads/2010/01/mm_calibration.png" alt="mm_calibration" width="504" height="186" /></a></span></p>
<p>Given that sports and entertainment markets are among the most mature and successful, our results challenge the view that prediction markets are substantively superior to alternative forecasting mechanisms. Nevertheless, it is certainly possible that there are forecasting applications where either the relative advantage of markets is larger, or that such differences in performance are consequential. <strong>Thus, while prediction markets may yet prove to be useful, it would seem the enthusiasm for their predictive prowess has outpaced the evidence.</strong></p>
<p>NB: Check out our <a href="http://www.cam.cornell.edu/~sharad/papers/pred-wo-markets.pdf" target="_blank">paper</a> for more details.</p>
<p><em>Illustration by Kelly Savage</em></p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/5zYboDd7goM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2010/01/14/prediction-without-markets/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Scroogenomics vs Ulterior Motives (and Other Justifications for Gift Giving)</title>
		<link>http://messymatters.com/2009/12/31/scrooge/</link>
		<comments>http://messymatters.com/2009/12/31/scrooge/#comments</comments>
		<pubDate>Fri, 01 Jan 2010 04:57:09 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[christmas]]></category>
		<category><![CDATA[economics]]></category>
		<category><![CDATA[social efficiency]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=552</guid>
  		<description><![CDATA[<p><img class="aligncenter size-medium wp-image-56...]]></description>
	  		<content:encoded><![CDATA[<p><img class="aligncenter size-medium wp-image-564" src="http://messymatters.com/wp-content/uploads/2009/12/scrooge_v_santa-300x219.jpg" alt="scrooge vs santa" title="Scrooge and Santa fighting" width="300" height="219" /></p>
<p>As a <a href="http://cheeptalk.wordpress.com/2009/12/23/the-real-economics-of-holiday-gift-giving/">poser economist</a> (as Jeff Ely calls me), I love to complain about the social inefficiency of gift giving. It&#8217;s a terrible idea, guys! <a href="http://scroogenomics.com">We waste 13 billion dollars a year on it!</a> <a id="refONE" href="#ONE"><sup>[1]</sup></a> But I do appreciate that I have a tendency to <a href="http://xkcd.com/592/">pooh-pooh social conventions far too blithely</a>.  Nonetheless, social conventions change.  Drastically so, given enough time.  As <a href="http://marketdesigner.blogspot.com/2009/12/give-cash-next-year.html">Al Roth points out</a>, wedding registries, gift receipts, and, more <a href="http://www.nytimes.com/2009/12/12/your-money/credit-and-debit-cards/12money.html?_r=1">regrettably, gift cards</a> represent a shift towards cash as gifts, and thus greater efficiency.</p>
<p>But as much as I applaud this trend and agree that holiday gift giving makes us all poorer and generally worse off, I&#8217;m sure Joel Waldfogel&#8217;s estimate (that the value we get from gifts is 25 cents on the dollar) is overblown. So, in the spirit of the holiday season, here are some reasons why gift giving isn&#8217;t quite so bad as it seems:</p>
<ol>
<li> <strong>Ulterior Motives</strong>. When put that way it doesn&#8217;t exactly sound holiday spirited but I think ulterior motives is the most overlooked and perhaps most legitimate justification for gift giving.  If I give you a ticket to see a show with me because I really want to see it with you, then it really doesn&#8217;t matter how much you would&#8217;ve paid for that ticket yourself.  I wanted your company and this is a socially acceptable way for me to pay for it. <a id="refTWO" href="#TWO"><sup>[2]</sup></a> Homer Simpson giving Marge a bowling ball (or a power screwdriver or somesuch) is another example.  Nothing&#8217;s wasted if Homer gets enough use out of those gifts himself! A more common example is giving one&#8217;s girlfriend lingerie.  But the ulterior motives need not be selfish, exactly.  It might well be socially efficient for me to buy you a book you wouldn&#8217;t have bought, if I have a lot of value for discussing it with you.</li>
<p/>
<li> <strong>Signaling Value</strong>.  I don&#8217;t pretend to understand the intricacies of this, but presumably it&#8217;s socially useful to prove our thoughtfulness.  We may also find it socially valuable to be mutually indebted to each other (and in ways hard to quantify).</li>
<p/>
<li> <strong>The Ritual</strong>.  The process itself may have value to us, including the process of figuring out each other&#8217;s desires well enough to buy gifts.</li>
<p/>
<li> <strong>Extrospection</strong>.  In rare cases you might actually know the recipient&#8217;s utility function better than they do.  Usually this is ill-advised, from an efficiency standpoint.  Why not just recommend your new favorite music to your friend and let them decide whether to buy it?  The one case where extrospection is a slam dunk, of course, is giving gifts to small children (all the other excuses for gift giving &#8212; except perhaps number 2 &#8212; also apply in this case).</li>
<p/>
<li> <strong>Transactional Efficiency</strong>. Related to number 4, it may be easier for you to get your friend something than it is for them to get it.  Maybe you&#8217;re buying a book for yourself and it&#8217;s just as easy to get a second copy.  Or you may have access to something the recipient doesn&#8217;t &#8212; for example, if you&#8217;re visiting a foreign country.</li>
</ol>
<p>The last two don&#8217;t really count, of course, since Waldfogel&#8217;s analysis accounts for those. But the first three may put a legitimate dent in that 13 billion dollar deadweight loss.</p>
<p><strong>Added:</strong> <a href="http://www.cs.duke.edu/~conitzer">Vincent Conitzer</a> adds a great one in the comments:</p>
<ol start="6">
<li><strong>Plausible Deniability</strong>.  There are some things that you just have embarrassingly high utility for.  And &#8220;hey, it was a gift!&#8221; can be a nice airtight excuse.</li>
</ol>
<p/>
&nbsp;</p>
<p/>
<h3>Addendum: Other Inefficiencies of the Season</h3>
<p>Sharad pointed out that &#8220;after-Christmas sale&#8221; is a euphemism for &#8220;pre-Christmas price-gouging.&#8221;  This is not an inefficiency if you include the utility of the retailers but from consumers&#8217; perspective, it&#8217;s inefficient to concentrate demand around a single day of the year.  Bah humbug!</p>
<p>I recently encountered an inefficiency related to holiday travel.  Many outbound NYC flights were canceled the weekend before Christmas due to a snowstorm.  People were generally rebooked a few days later.  But on Monday everything was back to normal.  I know because I flew out of New York on Monday.  My aunt was supposed to fly out on Saturday but got stranded until Tuesday.  She clearly wanted to depart on Monday more than we did.  It&#8217;s a shame that there&#8217;s essentially no secondary market for airline tickets! (Though there is in a very limited sense: You can volunteer to be bumped and the airline will compensate you with travel vouchers, worth deceptively little. But naturally the airlines are not bumping willing travelers in order to rebook people sooner when their original flights were canceled.)</p>
<p>
&nbsp;
</p>
<h3>Footnotes</h3>
<p><a id="ONE" href="#refONE">[1]</a> That&#8217;s not a claim that we <em>spend</em> $13B/year on gifts.  We spend 17 billion, 13 billion of which is pure waste. That&#8217;s Joel Waldfogel&#8217;s estimate of the difference between what we spend on holiday gifts and what the recipients would have been willing to spend to buy those items themselves.</p>
<p><a id="TWO" href="#refTWO">[2]</a> Alright, it sounds much ickier when you put it that way!</p>
<p/>
<p><em>Thanks to <a href="http://paul.rutgers.edu/~strehl/">Alex Strehl</a>, Eleanor Strehl, <a href="http://www.davidreiley.com">David Reiley</a>, Jill Sazama, <a href="http://bethany.pirateship.org">Bethany Soule</a>, and <a href="http://www.cam.cornell.edu/~sharad/">Sharad Goel</a> for helpful discussion.</em></p>
<p><em>Illustration by Kelly Savage</em></p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/ZJ0ESPTETWg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2009/12/31/scrooge/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>What Can Search Predict?</title>
		<link>http://messymatters.com/2009/11/30/what-can-search-predict/</link>
		<comments>http://messymatters.com/2009/11/30/what-can-search-predict/#comments</comments>
		<pubDate>Tue, 01 Dec 2009 02:16:47 +0000</pubDate>
		<dc:creator>Sharad Goel</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[web search]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=520</guid>
  		<description><![CDATA[<p><a href="http://messymatters.com/wp-content/upl...]]></description>
	  		<content:encoded><![CDATA[<p><a href="http://messymatters.com/wp-content/uploads/2009/11/moviesearch.jpg"><img class="aligncenter size-full wp-image-524" title="moviesearch" src="http://messymatters.com/wp-content/uploads/2009/11/moviesearch.jpg" alt="moviesearch" width="388" height="283" /></a></p>
<p>Statistical wisdom is sometimes found in unusual places. Take, for example, the following exchange in, “<a href="http://www.imdb.com/title/tt1172233/" target="_blank">Whip It</a>,” a story of roller-derby loving misfits:</p>
<p>Team: <em>(chanting, after a game)</em> We&#8217;re number two! We&#8217;re number two!<br />
Coach: You guys came in second out of two teams.<br />
Team: Woo!</p>
<p>So what’s the moral? Well, there&#8217;s been a fair amount of buzz this year over the finding that web search behavior is correlated with offline outcomes, including <a href="http://www.journals.uchicago.edu/doi/abs/10.1086/593098" target="_blank">flu</a> <a href="http://www.nature.com/nature/journal/v457/n7232/full/nature07634.html" target="_blank">incidence</a> and <a href="http://googleresearch.blogspot.com/2009/04/predicting-present-with-google-trends.html" target="_blank">economic activity</a> (e.g., auto and home sales). For example, an increase in queries for &#8220;flu&#8221; and &#8220;cold&#8221; is often associated with a rise in actual flu caseloads. The correlation between search volume and flu levels in fact seems quite good, hovering around 0.95. My <a href="http://messymatters.com/2009/03/21/the-future-is-yesterday/" target="_blank">initial reaction</a> to these results was somewhat critical, and boils down to the observation that simple statistical models are comparable to, often even better than, search-based predictions. Invoking the first lesson of Whip It: <strong>Success is best evaluated in context. </strong>To give another example of this principle, it may sound impressive to predict with 97% accuracy whether it&#8217;ll be cloudy in Cold Bay, AK  &#8212; at least until you learn Cold Bay is only sunny <a href="http://www.climate-zone.com/climate/united-states/alaska/cold-bay/" target="_blank">10 days a year</a>. This assessment, however, should be weighed against the second lesson of Whip It: <strong>Number two can still be cool.</strong> The fact that search is so highly correlated with health and economic outcomes says something interesting about human behavior, regardless of whether or not it&#8217;ll help <a href="http://freakonomics.blogs.nytimes.com/2009/03/17/google-will-save-the-world/">save the world</a>.</p>
<p>Motivated by these examples, fellow yahoos (<a href="http://www.jakehofman.com/" target="_blank">Jake Hofman</a>, <a href="http://research.yahoo.com/Sebastien_Lahaie">Sébastien Lahaie</a>, <a href="http://dpennock.com/" target="_blank">Dave Pennock</a>, and <a href="http://research.yahoo.com/Duncan_Watts" target="_blank">Duncan Watts</a>) and I recently <a href="http://www.cam.cornell.edu/~sharad/papers/searchpreds.pdf" target="_blank">investigated</a> the extent to which search behavior predicts the success of cultural products, namely movies, video games, and music. In a departure from past work that has focused on real-time reporting of current activity (e.g., <a href="http://www.google.org/flutrends/" target="_blank">flu trends</a>) &#8212; what Choi and Varian cleverly call &#8220;<a href="http://googleresearch.blogspot.com/2009/04/predicting-present-with-google-trends.html" target="_blank">predicting the present</a>&#8221; &#8212; our objective was to predict future events, typically days to weeks in advance. Specifically, we use query volume to forecast opening weekend box-office revenue for feature films, first month sales of video games, and the rank of songs on the <a href="http://www.billboard.com/charts/hot-100" target="_blank">Billboard Hot 100</a>. In all cases that we consider, we find <strong>search volume on its own is predictive of future outcomes, but search is nevertheless outperformed by baseline models trained on publicly available data; combining search and baseline models generally leads to modest improvements</strong>.</p>
<p><a href="http://messymatters.com/wp-content/uploads/2009/11/searchpreds.jpg"><img class="aligncenter size-full wp-image-522" title="searchpreds" src="http://messymatters.com/wp-content/uploads/2009/11/searchpreds.jpg" alt="searchpreds" width="425" height="250" /></a></p>
<p>Whether web search is useful in predicting real-world activity is therefore likely a matter of circumstance and necessity: On the one hand, across a variety of domains, adding search to baseline models does not dramatically boost performance; but on the other hand, in certain situations, such baseline estimates may be difficult to generate, and for some applications, even small gains in performance are valuable. In other words, <strong>the benefit of web search as a prediction tool may have less to do with its superiority over other methods than with its generality, low cost, and real-time nature</strong>.</p>
<p>NB: For more details, check out our <a href="http://www.cam.cornell.edu/~sharad/papers/searchpreds.pdf">paper</a>.</p>
<p><em>Illustration by Kelly Savage</em></p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/P5hkM61fodU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2009/11/30/what-can-search-predict/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Precious, Precious Daylight</title>
		<link>http://messymatters.com/2009/10/31/dst/</link>
		<comments>http://messymatters.com/2009/10/31/dst/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 03:47:13 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[dst]]></category>
		<category><![CDATA[opinion]]></category>
		<category><![CDATA[psa]]></category>
		<category><![CDATA[time]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=498</guid>
  		<description><![CDATA[<p><img class="aligncenter size-full wp-image-499"...]]></description>
	  		<content:encoded><![CDATA[<p><img class="aligncenter size-full wp-image-499" title="Oink!" src="http://messymatters.com/wp-content/uploads/2009/10/piggy.jpg" alt="piggy" width="361" height="257" /></p>
<p>Tomorrow, after-work or after-school activities that require daylight come to an abrupt end for most of us in the US and Canada.  <strong>(Remember to &#8220;fall back&#8221; tonight!)</strong></p>
<p>There seems to be a lot of misdirected vitriol surrounding <a href="http://en.wikipedia.org/wiki/Daylight_saving_time">daylight saving time</a> (DST). First, let&#8217;s get this out of way: no one (worth arguing with) thinks daylight saving time <a id="refONE" href="#ONE"><sup>[1]</sup></a> literally saves daylight. (I would&#8217;ve said that no one worth arguing with even thinks that anyone thinks that, but people arguing against DST keep dragging out that straw man. &#8220;The sun doesn&#8217;t care what we do with our clocks!&#8221; they cry.  Debates don&#8217;t tend to be productive when one side thinks the other are imbeciles.) The actual debate is about how or whether to shift our schedules to make the best use of daylight. I&#8217;m starting with the assumption that it is both energy inefficient and socially inefficient to waste daylight hours asleep. So as the days get longer, you want to start the day earlier. The point is to shift our schedules to stay in sync with the sun. If you&#8217;re in the middle of your timezone and as far from the equator as London then night and day look like this:</p>
<p><img class="aligncenter size-medium wp-image-500" title="Nominal hours of daylight vs time of the year, in London, courtesy of Wikipedia." src="http://messymatters.com/wp-content/uploads/2009/10/dst-chart-300x235.png" alt="dst-chart" width="300" height="235" /></p>
<p>(With a different x-axis for the southern hemisphere, but <a href="http://en.wikipedia.org/wiki/Daylight_saving_time_around_the_world">the southern hemisphere isn&#8217;t so into DST</a>.)</p>

<p>As the diagram makes clear, the value of DST is the value of having daylight at 8pm instead 5am, roughly. Sure, we could change business hours and train schedules and whatnot to be earlier during the summer months, but changing the clocks twice a year is easier than that.  Realistically, we&#8217;re just not going to start waking up earlier to take advantage of the increased daylight. Unless we trick ourselves into doing so by changing the clocks.</p>
<p>I view daylight savings time as a clever <a href="http://en.wikipedia.org/wiki/Hack_%28technology%29">hack</a> to circumvent an otherwise intractable coordination problem: getting everyone to shift their schedules to not waste all that extra morning daylight.</p>
<h2>Addendum: We can&#8217;t abolish time zones either</h2>
<p>Many techie types fantasize about abolishing both daylight savings time and time zones.  &#8220;Let&#8217;s use UTC [universal coordinated time, aka GMT] everywhere,&#8221; says <a href="http://rudary.com">Matt Rudary</a>, echoing a sentiment I hear a lot in the circles I hang out in. &#8220;I have no problem working 14-22 instead of 9-5.&#8221;  That would be convenient in a lot of ways, but I think Matt underestimates the power of <a href="http://en.wikipedia.org/wiki/Focal_point_%28game_theory%29">focality</a>.  This is what I predict would happen if both time zones and DST were scrapped:</p>
<p>Everyone in what used to be the eastern time zone would settle on 14-22 as standard business hours. (And similarly for every other time zone &#8212; just make the one-time conversion to figure out what the old business hours are called in the new system.)  We&#8217;d then need DST as much as ever because there would be no hope of shifting standard business hours earlier in the summer to make better use of the extra early-morning daylight.</p>
<p>Note that it doesn&#8217;t even matter if most people&#8217;s jobs are flexible enough not to care about standard business hours. There will still be a too-sizable minority that will balk at having softball practice (or whatever) at what we now call 4:30pm on a weekday.  But rename 4:30 to &#8220;5:30&#8221;  and everyone&#8217;s fine with it. And for good reason. Because it&#8217;s all about coordination. We can&#8217;t shift standard business hours unless we all do it in sync. I&#8217;m not sure anything but DST could do that.  Maybe something like DST but more fine-grained, or even continuous, in the not-too-near future when all clocks are computerized.</p>
<p><em>Illustration by Kelly Savage.</em></p>
<h3>Footnote:</h3>
<p><a id="ONE" href="#refONE">[1]</a> You know what?  Screw it. &#8220;Daylight savings time&#8221; just sounds much more natural to me.  Suck it, <a href="http://messymatters.com/2009/09/30/scarequotes">Prescriptivists</a>.</p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/8ubQNGBiFpo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2009/10/31/dst/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Grammatical Interlude: The Two Acceptable Uses of Quotation Marks</title>
		<link>http://messymatters.com/2009/09/30/scarequotes/</link>
		<comments>http://messymatters.com/2009/09/30/scarequotes/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 03:58:31 +0000</pubDate>
		<dc:creator>dreeves</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[grammar]]></category>
		<category><![CDATA[pedantry]]></category>

		<guid isPermaLink="false">http://messymatters.com/?p=460</guid>
  		<description><![CDATA[<p><img src="http://messymatters.com/wp-content/up...]]></description>
	  		<content:encoded><![CDATA[<p><img src="http://messymatters.com/wp-content/uploads/2009/09/scarequotes.jpg" title="Scary scare quotes surrounding 'scare quotes'." width="360" height="167" class="aligncenter size-full wp-image-483" /></p>
<p>Grammatical prescriptivism annoys me to no end.  (No ending sentences with prepositions? Ridiculous! Don&#8217;t split infinitives? Poppycock!) Well, except when I do it&#8230;</p>
<p><strong>There are exactly two valid uses of quotation marks: when you mean something literally, and when you mean something nonliterally.</strong> It ought to be hard to mess that up.  But the following are <em>not</em> valid uses:</p>
<ol start="0">
<li><em>I really, truly, extra mean this!</em><br />
I wasn&#8217;t going to mention this one because I presume that you (dear reader of Messy Matters) have never done this, but the <a href="http://www.unnecessaryquotes.com/">blog of unnecessary quotes</a> is just too funny.  And I suppose it is a bit confusing: italics can sometimes serve the function of quotation marks (read on) but not the other way around.</li>
</ol>
<p>On to more the more literate misuses (admit it, you&#8217;ve done these):</p>
<ol>
<li><em>I&#8217;m not going to tell you what I mean by this.</em><br />
E.g., &#8220;the result is &#8216;nearly&#8217; optimal&#8221; by which the author means &#8220;the result is nearly optimal, for some definition of &#8216;nearly&#8217;, which I can&#8217;t be bothered to specify.&#8221;</li>
<p></p>

<li><em>This sounds too colloquial/informal/idiomatic.</em><br />
E.g., &#8220;The subjects got &#8216;carried away&#8217;.&#8221; The author here is thinking, &#8220;this sounds too colloquial or idiomatic for formal writing but it&#8217;s OK because I&#8217;m not really writing it, I&#8217;m just quoting a more informal version of myself.&#8221; Please. Just write it or don&#8217;t write it.  The only exception is when you expect the idiom to be unknown to the reader.  In that case, the second acceptable usage below applies.</li>
<p></p>
<li><em>I&#8217;m coining this term/phrase or introducing terminology.</em><br />
Use italics instead. (&#8220;We say that an allocation is <em>satisficing</em> if&#8230;&#8221;)  Exception: it is acceptable if the usage is also literal.  (&#8220;We use the term &#8216;satisficing&#8217; to refer to&#8230;&#8221; Though italics are acceptable here as well.)</li>
</ol>
<p>Which brings us to the two acceptable usages:</p>
<ol>
<li><em>I mean this literally. I&#8217;m referring to the word or phrase itself.</em><br />
E.g., &#8220;&#8216;Promulgate&#8217; is hard to pronounce.&#8221; Or, &#8220;Alice said &#8216;Look out!&#8217;&#8221;  Another instance of this rule is referring to works by title, with the <a href="http://www1.umn.edu/urelate/style/italics.html#Anchor-TITLES-47383">slew of exceptions</a> that certain titles, like books and movies, should be italicized rather than quoted.  There are also other times when italics may substitute for quotation marks, such as in the first example with the word <em>promulgate</em>. <a id="refONE" href="#ONE"><sup>[1]</sup></a></li>
<p></p>
<li><em>I don&#8217;t mean this literally.</em><br />
You can use <a href="http://en.wikipedia.org/wiki/Scare_quotes">scare quotes</a> when something is not the right word or phrase. Of course what I really mean by &#8220;not the right word&#8221; is that it simultaneously is and isn&#8217;t the right word. E.g., &#8220;the player then &#8216;wins&#8217; with a loss of $100&#8221; (they technically won but really they lost). Or if the word is accurate only loosely by analogy, e.g., &#8220;the program bids &#8216;spitefully&#8217;&#8221; (there&#8217;s a tit-for-tat aspect to its bidding but, being a computer, it doesn&#8217;t actually have spite).  Wrapping something in scare quotes is like adding a parenthetical &#8220;not really&#8221;.  It should be done sparingly; typically only when the nonliteral usage might be confusing if not &#8220;called out&#8221;.</li>
</ol>
<p><strong>P.S.</strong> If the quotation marks in the last sentence were like fingernails on a blackboard to you then you didn&#8217;t need to read any of this. And, ironically, you&#8217;re likely the only one who didn&#8217;t find this all a painfully pedantic waste of time!</p>
<p><strong>EDIT:</strong> Per <a href="http://itre.cis.upenn.edu/~myl/languagelog/archives/002035.html">Hartman&#8217;s Law of Prescriptivist Retaliation</a> I had to fix a couple errors since this went to press (embarrassingly, &#8220;slough&#8221; instead of &#8220;slew&#8221;).  Also, thank you to Eva Revesz, <a href="http://www.eecs.umich.edu/~klochner/">Kevin Lochner</a>, and Reema Bennekaa for commenting on drafts of this.  </p>
<p><em>Illustration by Kelly Savage.</em></p>
<h3>Footnote:</h3>
<p><a id="ONE" href="#refONE">[1]</a> There&#8217;s little obvious rhyme or reason to these quotes-vs-italics rules but I believe the principle is just this:  Both quotation marks and italics are serving the same functional purpose (marking a word or phrase as literal) but with different forms to suggest various dichotomies, such as book titles vs. chapters within a book. Although even I couldn&#8217;t bring myself to care if you violated these conventions, as long as you&#8217;re internally consistent.</p>
<img src="http://feeds.feedburner.com/~r/messymatters/~4/Y_-1JDz0XcE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://messymatters.com/2009/09/30/scarequotes/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
	</channel>
</rss>
