<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>David Simmons-Duffin</title>
	
	<link>http://davidsd.org</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Fri, 28 Sep 2012 19:26:39 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/davidsd" /><feedburner:info uri="davidsd" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><geo:lat>42.379146</geo:lat><geo:long>-71.128031</geo:long><item>
		<title>The arXiv According to arXiv vs. snarXiv</title>
		<link>http://feedproxy.google.com/~r/davidsd/~3/jLDplelGm-k/</link>
		<comments>http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/#comments</comments>
		<pubDate>Fri, 17 Sep 2010 14:29:13 +0000</pubDate>
		<dc:creator>davidsd</dc:creator>
				<category><![CDATA[Humor]]></category>
		<category><![CDATA[Physics]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://davidsd.org/?p=2074</guid>
		<description><![CDATA[After more than 3/4 of a million guesses, in over 50,000 games played in 67 countries, the results are clear: Science sounds like gobbledygook. arXiv vs. snarXiv has been live for 6 months now, and it’s time to take a look at the results. Here’s how the game works. The user sees two titles: one [...]]]></description>
				<content:encoded><![CDATA[<p class='lead'>After more than 3/4 of a million guesses, in over 50,000 games played in 67 countries, the results are clear: Science sounds like gobbledygook.</p>
<p><a href="http://snarxiv.org/vs-arxiv/">arXiv vs. snarXiv</a> has been live for 6 months now, and it’s time to take a look at the results.  Here’s how the game works.  The user sees two titles: one is the title of an actual theoretical high energy physics paper on <a href="http://arxiv.org">the arXiv</a>, and the other is a completely fake title randomly generated by <a href="http://davidsd.org/2010/03/the-snarxiv/">the snarXiv</a>.  The user guesses which one is real, finds out if they’re right or wrong, and then starts over with a new pair of titles.</p>
<p><a href="http://snarxiv.org/vs-arxiv/"><img src="http://davidsd.org/wp-content/uploads/2010/09/arXiv-vs-snarXiv.png" alt="" title="arXiv-vs-snarXiv" width="515" height="333" class="aligncenter size-full wp-image-2122" /></a></p>
<p>I’ve been recording the result of each guess, originally just out of curiosity.  I never expected to get reasonable statistics on the over 120,000 high energy theory papers on the arXiv.  But after more than <a href="http://snarxiv.org/vs-arxiv/highscores/">750,000 guesses</a>, that’s exactly what I’ve got, which means we can do some fun stuff.<span id="more-2074"></span></p>
<h3>The Most Fake-Sounding Papers</h3>
<p>First, let’s take a look at the most fake-sounding papers on the arXiv.  These are the papers whose titles get the lowest percentage of correct guesses when users try to distinguish them from a randomly generated title.  I designed arXiv vs. snarXiv to cycle such papers through the game more often, generating better statistics for them.  Here are the 15 most fake-sounding papers with at least 30 guesses.</p>
<div class='full'>
<table class='papers' id='most-fake'>
<tr>
<th>guesses</th>
<th>percent</th>
<th>paper</th>
</tr>
<tr>
<td class='stat'>36/138</td>
<td class='percent'>26%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/9806285'>Highlights of the Theory</a><span class='author'>B. Z. Kopeliovich and R. Peschanski</span><span class='date'>(June 1998)</span></td>
</tr>
<tr>
<td class='stat'>41/153</td>
<td class='percent'>26%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0408121'>Heterotic on Half-flat</a><span class='author'>Sebastien Gurrieri, Andre Lukas and Andrei Micu</span><span class='date'>(August 2004)</span></td>
</tr>
<tr>
<td class='stat'>13/48</td>
<td class='percent'>27%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0611070'>Relativistic confinement of neutral fermions with a trigonometric   tangent potential</a><span class='author'>Luis B. Castro and Antonio S. de Castro</span><span class='date'>(November 2006)</span></td>
</tr>
<tr>
<td class='stat'>13/47</td>
<td class='percent'>27%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0612302'>Toric Kahler metrics and AdS_5 in ring-like co-ordinates</a><span class='author'>Bobby S. Acharya, Suresh Govindarajan and Chethan N. Gowdigere</span><span class='date'>(December 2006)</span></td>
</tr>
<tr>
<td class='stat'>9/32</td>
<td class='percent'>28%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/0507226'>Aspects of U_A(1) breaking in the Nambu and Jona-Lasinio model</a><span class='author'>Alexander A. Osipov, Brigitte Hiller, Veronique Bernard and Alex H. Blin</span><span class='date'>(July 2005)</span></td>
</tr>
<tr>
<td class='stat'>35/116</td>
<td class='percent'>30%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/0912.4258'>Energy’s and amplitudes’ positivity</a><span class='author'>Alberto Nicolis, Riccardo Rattazzi and Enrico Trincherini</span><span class='date'>(December 2009)</span></td>
</tr>
<tr>
<td class='stat'>16/53</td>
<td class='percent'>30%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/9603226'>A covariant diquark-quark model of the nucleon in the Salpeter approach</a><span class='author'>Volker Keiner</span><span class='date'>(March 1996)</span></td>
</tr>
<tr>
<td class='stat'>51/167</td>
<td class='percent'>30%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0603053'>Noncommutative Bundles and Instantons in Tehran</a><span class='author'>Giovanni Landi and Walter van Suijlekom</span><span class='date'>(March 2006)</span></td>
</tr>
<tr>
<td class='stat'>38/124</td>
<td class='percent'>30%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/0905.2511'>Baby steps beyond rainbow-ladder</a><span class='author'>Richard Williams and Christian S. Fischer</span><span class='date'>(May 2009)</span></td>
</tr>
<tr>
<td class='stat'>13/42</td>
<td class='percent'>30%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0501073'>Transverse force on a moving vortex with the acoustic geometry</a><span class='author'>Peng-ming Zhang, Li-ming Cao, Yi-shi Duan and Cheng-kui Zhong</span><span class='date'>(January 2005)</span></td>
</tr>
<tr>
<td class='stat'>44/138</td>
<td class='percent'>31%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/0111235'>Testing factorization</a><span class='author'>Gudrun Hiller</span><span class='date'>(November 2001)</span></td>
</tr>
<tr>
<td class='stat'>19/59</td>
<td class='percent'>32%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/0604192'>Prospects for Mirage Mediation</a><span class='author'>Aaron Pierce and Jesse Thaler</span><span class='date'>(April 2006)</span></td>
</tr>
<tr>
<td class='stat'>49/152</td>
<td class='percent'>32%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0607069'>Determining the dual</a><span class='author'>Arjan Keurentjes</span><span class='date'>(July 2006)</span></td>
</tr>
<tr>
<td class='stat'>11/34</td>
<td class='percent'>32%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/9309106'>Gravitational Dressing of Renormalization Group</a><span class='author'>I. R. Klebanov, I. I. Kogan and A. M. Polyakov</span><span class='date'>(September 1993)</span></td>
</tr>
<tr>
<td class='stat'>53/163</td>
<td class='percent'>32%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/0809.0154'>Charging Black Saturn?</a><span class='author'>Brenda Chng, Robert Mann, Eugen Radu and Cristian Stelea</span><span class='date'>(September 2008)</span></td>
</tr>
</table>
</div>
<p>A tip for future arXiv vs. snarXiv players: the snarXiv is more grammatical than the arXiv.  If you see “Heterotic on Half-flat” and think “uh… Half-flat what?” then you can be nearly certain it’s a real scientific paper that was written to advance the boundaries of human knowledge.</p>
<p>As a bonus, here are some up-and-comers: papers with between 10 and 30 guesses, a spectacularly low percentage of which were correct.</p>
<div class='full'>
<table class='papers' id='up-and-comers'>
<tr>
<th>guesses</th>
<th>percent</th>
<th>paper</th>
</tr>
<tr>
<td class='stat'>2/13</td>
<td class='percent'>15%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0306066'>Actions and Fermionic symmetries for D-branes in bosonic backgrounds</a><span class='author'>Donald Marolf, Luca Martucci and Pedro J. Silva</span><span class='date'>(June 2003)</span></td>
</tr>
<tr>
<td class='stat'>3/15</td>
<td class='percent'>20%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/0112336'>CERN LEP2 constraint on 4D QED having dynamically generated spatial   dimension</a><span class='author'>Gi-Chol Cho, Etsuko Izumi and Akio Sugamoto</span><span class='date'>(December 2001)</span></td>
</tr>
<tr>
<td class='stat'>6/27</td>
<td class='percent'>22%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/0001072'>Families as Neighbors in Extra Dimension</a><span class='author'>G. Dvali and M. Shifman</span><span class='date'>(January 2000)</span></td>
</tr>
<tr>
<td class='stat'>6/25</td>
<td class='percent'>24%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/9310283'>The Greening of Quantum Field Theory: George and I</a><span class='author'>Julian Schwinger</span><span class='date'>(October 1993)</span></td>
</tr>
<tr>
<td class='stat'>3/12</td>
<td class='percent'>25%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/0904.4357'>Dyonic solution of Horava-Lifshitz Gravity</a><span class='author'>Eoin Ó Colgáin and Hossein Yavartanoo</span><span class='date'>(April 2009)</span></td>
</tr>
</table>
</div>
<h3>Stump the Experts</h3>
<p>People with all sorts of backgrounds play arXiv vs. snarXiv.  My guess is that non-physicists are extremely suspicious of ridiculous-sounding words that have been co-opted for technical purposes — “‘Mirage Mediation?’ that can’t be real!” High energy physicists, on the other hand, are used to their own unfortunate verbiage.  Unfortunately for them, however, there are still plenty of papers on the arXiv that sound like they were written by a computer.</p>
<p>Let’s define an “expert” game to have least 5 guesses and a score of 80% or higher.  So far, there have been 3,916 expert games out of 49,258 total (as of September 16, 2010).  Tallying up all the guesses in these games, we can get a sense for which papers stymie even those who excel in arXiv vs. snarXiv.  Here are the top 10.</p>
<div class='full'>
<table class='papers' id='stump-the-experts'>
<tr>
<th>guesses</th>
<th>percent</th>
<th>paper</th>
</tr>
<tr>
<td class='stat'>1/6</td>
<td class='percent'>16%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/9912205'>Fields</a><span class='author'>W. Siegel</span><span class='date'>(December 1999)</span></td>
</tr>
<tr>
<td class='stat'>1/5</td>
<td class='percent'>20%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0211215'>An extended model for monopole catalysis of nucleon decay</a><span class='author'>Y. Brihaye, D. Yu. Grigoriev, V. A. Rubakov and D. H. Tchrakian</span><span class='date'>(November 2002)</span></td>
</tr>
<tr>
<td class='stat'>1/5</td>
<td class='percent'>20%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0407257'>Space-time symmetry restoration in cosmological models with Kalb–Ramond   and scalar fields</a><span class='author'>E. Di Grezia, G. Mangano and G. Miele</span><span class='date'>(July 2004)</span></td>
</tr>
<tr>
<td class='stat'>1/5</td>
<td class='percent'>20%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/9511045'>Towards Evaluation of Stringy Non-Perturbative Effects</a><span class='author'>R. Brustein and B. A. Ovrut</span><span class='date'>(November 1995)</span></td>
</tr>
<tr>
<td class='stat'>1/5</td>
<td class='percent'>20%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/9807044'>Dimensional Reduction</a><span class='author'>Corinne A. Manogue and Tevian Dray</span><span class='date'>(July 1998)</span></td>
</tr>
<tr>
<td class='stat'>1/5</td>
<td class='percent'>20%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/0812.1506'>The Ridge, the Glasma and Flow</a><span class='author'>Larry McLerran</span><span class='date'>(December 2008)</span></td>
</tr>
<tr>
<td class='stat'>1/5</td>
<td class='percent'>20%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/9606335'>Generalized Bunching Parameters and Multiplicity Fluctuations in   Restricted Phase-Space Bins</a><span class='author'>S. V. Chekanov, W. Kittel and V. I. Kuvshinov</span><span class='date'>(June 1996)</span></td>
</tr>
<tr>
<td class='stat'>2/7</td>
<td class='percent'>28%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/0906.3305'>BGWM as Second Constituent of Complex Matrix Model</a><span class='author'>A. Alexandrov, A. Mironov and A. Morozov</span><span class='date'>(June 2009)</span></td>
</tr>
<tr>
<td class='stat'>3/10</td>
<td class='percent'>30%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-ph/0111235'>Testing factorization</a><span class='author'>Gudrun Hiller</span><span class='date'>(November 2001)</span></td>
</tr>
<tr>
<td class='stat'>2/6</td>
<td class='percent'>33%</td>
<td class='paperinfo'><a href='http://arxiv.org/abs/hep-th/0104070'>Supersymmetric Potentials in Einstein-Cartan-Brans-Dicke Cosmology</a><span class='author'>L. C. Garcia de Andrade</span><span class='date'>(April 2001)</span></td>
</tr>
</table>
</div>
<p>I have to say, some of these definitely sound like they came straight from the snarXiv.  I really don’t know what Glasma is, though.  The snarXiv does not have Glasma.</p>
<h3>Famous Physicists</h3>
<p>Ok, so the papers above sound especially ridiculous.  However, the average across all 750,000 guesses on all papers is still only 59% correct.  While better than a monkey, this is not particularly good.  Who’s responsible?  Surely not the world’s top minds?</p>
<p>Here’s a ranking of some of the most-highly cited physicists on the arXiv (<a href="http://en.wikipedia.org/wiki/H-index">H-index</a> of 40 or higher, with a few other notable folks thrown in), according to the percentage of correct guesses on their papers.<sup>[<a href="http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/#footnote_0_2074" id="identifier_0_2074" class="footnote-link footnote-identifier-link" title="I couldn&#039;t find an h-index ranking of physicists that was more recent than this one from 2005. I&#039;m probably missing lots of names. Let me know and I&#039;ll add them. ">1</a>]</sup>  A smaller percentage means that their papers sound more like complete flapdoodle.  I should note for the sake of my career that this has absolutely nothing to do with the quality of said papers.</p>
<div class='full'>
<table id='topcite' class='ranking'>
<tr>
<td class='stat'>178/360</td>
<td class='percent'>49%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Denef_F/0/1/0/all/0/1'>Frederik Denef</a></td>
<td class='stat'>246/429</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Turok_N/0/1/0/all/0/1'>Neil Turok</a></td>
<td class='stat'>289/488</td>
<td class='percent'>59%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Polchinski_J/0/1/0/all/0/1'>Joseph Polchinski</a></td>
</tr>
<tr>
<td class='stat'>129/253</td>
<td class='percent'>50%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Polyakov_A/0/1/0/all/0/1'>A. M. Polyakov</a></td>
<td class='stat'>310/539</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Randall_L/0/1/0/all/0/1'>Lisa Randall</a></td>
<td class='stat'>573/963</td>
<td class='percent'>59%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Witten_E/0/1/0/all/0/1'>Edward Witten</a></td>
</tr>
<tr>
<td class='stat'>69/135</td>
<td class='percent'>51%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Weinberg_S/0/1/0/all/0/1'>Steven Weinberg</a></td>
<td class='stat'>353/612</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Nanopoulos_D/0/1/0/all/0/1'>Dimitri Nanopoulos</a></td>
<td class='stat'>1194/1981</td>
<td class='percent'>60%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Ellis_J/0/1/0/all/0/1'>John Ellis</a></td>
</tr>
<tr>
<td class='stat'>189/357</td>
<td class='percent'>52%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Georgi_H/0/1/0/all/0/1'>Howard Georgi</a></td>
<td class='stat'>209/362</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Verlinde_H/0/1/0/all/0/1'>Herman Verlinde</a></td>
<td class='stat'>368/610</td>
<td class='percent'>60%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Kachru_S/0/1/0/all/0/1'>Shamit Kachru</a></td>
</tr>
<tr>
<td class='stat'>764/1419</td>
<td class='percent'>53%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Vafa_C/0/1/0/all/0/1'>Cumrun Vafa</a></td>
<td class='stat'>450/778</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Dvali_G/0/1/0/all/0/1'>Gia Dvali</a></td>
<td class='stat'>315/520</td>
<td class='percent'>60%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Jackiw_R/0/1/0/all/0/1'>Roman Jackiw</a></td>
</tr>
<tr>
<td class='stat'>332/615</td>
<td class='percent'>53%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Susskind_L/0/1/0/all/0/1'>Leonard Susskind</a></td>
<td class='stat'>297/513</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Arkani_Hamed_N/0/1/0/all/0/1'>Nima Arkani-Hamed</a></td>
<td class='stat'>311/510</td>
<td class='percent'>60%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Ooguri_H/0/1/0/all/0/1'>Hirosi Ooguri</a></td>
</tr>
<tr>
<td class='stat'>6/11</td>
<td class='percent'>54%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Politzer_H/0/1/0/all/0/1'>H. David Politzer</a></td>
<td class='stat'>416/717</td>
<td class='percent'>58%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Seiberg_N/0/1/0/all/0/1'>Nathan Seiberg</a></td>
<td class='stat'>592/966</td>
<td class='percent'>61%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Murayama_H/0/1/0/all/0/1'>Hitoshi Murayama</a></td>
</tr>
<tr>
<td class='stat'>63/115</td>
<td class='percent'>54%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Hooft_G/0/1/0/all/0/1'>Gerard t Hooft</a></td>
<td class='stat'>94/162</td>
<td class='percent'>58%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Krauss_L/0/1/0/all/0/1'>Lawrence M. Krauss</a></td>
<td class='stat'>118/192</td>
<td class='percent'>61%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Gross_D/0/1/0/all/0/1'>David J. Gross</a></td>
</tr>
<tr>
<td class='stat'>282/514</td>
<td class='percent'>54%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Green_M/0/1/0/all/0/1'>Michael B. Green</a></td>
<td class='stat'>368/633</td>
<td class='percent'>58%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Banks_T/0/1/0/all/0/1'>Thomas Banks</a></td>
<td class='stat'>381/606</td>
<td class='percent'>62%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Hall_L/0/1/0/all/0/1'>Lawrence J. Hall</a></td>
</tr>
<tr>
<td class='stat'>348/622</td>
<td class='percent'>55%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Wilczek_F/0/1/0/all/0/1'>Frank Wilczek</a></td>
<td class='stat'>535/917</td>
<td class='percent'>58%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Klebanov_I/0/1/0/all/0/1'>Igor R. Klebanov</a></td>
<td class='stat'>118/186</td>
<td class='percent'>63%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Hawking_S/0/1/0/all/0/1'>Stephen Hawking</a></td>
</tr>
<tr>
<td class='stat'>164/293</td>
<td class='percent'>55%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Verlinde_E/0/1/0/all/0/1'>Erik Verlinde</a></td>
<td class='stat'>412/705</td>
<td class='percent'>58%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Gubser_S/0/1/0/all/0/1'>Steven S. Gubser</a></td>
<td class='stat'>383/600</td>
<td class='percent'>63%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Manohar_A/0/1/0/all/0/1'>Aneesh V. Manohar</a></td>
</tr>
<tr>
<td class='stat'>109/194</td>
<td class='percent'>56%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Glashow_S/0/1/0/all/0/1'>Sheldon Glashow</a></td>
<td class='stat'>356/603</td>
<td class='percent'>59%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Maldacena_J/0/1/0/all/0/1'>Juan Maldacena</a></td>
<td class='stat'>170/257</td>
<td class='percent'>66%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Peskin_M/0/1/0/all/0/1'>Michael Peskin</a></td>
</tr>
<tr>
<td class='stat'>406/718</td>
<td class='percent'>56%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Dimopoulos_S/0/1/0/all/0/1'>Savas Dimopoulos</a></td>
<td class='stat'>481/813</td>
<td class='percent'>59%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Strominger_A/0/1/0/all/0/1'>Andrew Strominger</a></td>
<td class='stat'>222/326</td>
<td class='percent'>68%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Schwarz_J/0/1/0/all/0/1'>John H. Schwarz</a></td>
</tr>
<tr>
<td class='stat'>351/615</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Wise_M/0/1/0/all/0/1'>Mark B. Wise</a></td>
<td class='stat'>174/294</td>
<td class='percent'>59%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Greene_B/0/1/0/all/0/1'>Brian R. Greene</a></td>
</tr>
</table>
</div>
<p>I’d especially like to congratulate Frederik on his anomalously low 49 percent.  You make us all worse than a monkey, Frederik.<sup>[<a href="http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/#footnote_1_2074" id="identifier_1_2074" class="footnote-link footnote-identifier-link" title="In addition to being a top-notch physicist, Frederik also happens to be one of the world&#039;s best arXiv vs. snarXiv players.">2</a>]</sup></p>
<h3>The Blogosphere</h3>
<p>Now let’s turn to even more famous people: physics bloggers (and authors).  Here are some of the most prominent, ranked from most fake-sounding papers (smallest percentage) to least fake-sounding papers (largest percentage).  I think there’s a lesson here somewhere, though it’s hard to be sure in some cases, due to small statistics.</p>
<div class="full">
<table id='blogosphere' class='ranking'>
<tr>
<td class='stat'>117/217</td>
<td class='percent'>53%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Carroll_S/0/1/0/all/0/1'>Sean M. Carroll</a> <a href="http://blogs.discovermagazine.com/cosmicvariance/">[blog]</a></td>
<td class='stat'>68/112</td>
<td class='percent'>60%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Motl_L/0/1/0/all/0/1'>Lubos Motl</a> <a href="http://motls.blogspot.com/">[blog]</a></td>
</tr>
<tr>
<td class='stat'>129/234</td>
<td class='percent'>55%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Distler_J/0/1/0/all/0/1'>Jacques Distler</a> <a href="http://golem.ph.utexas.edu/~distler/blog/">[blog]</a></td>
<td class='stat'>202/331</td>
<td class='percent'>61%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Trodden_M/0/1/0/all/0/1'>Mark Trodden</a> <a href="http://blogs.discovermagazine.com/cosmicvariance/">[blog]</a></td>
</tr>
<tr>
<td class='stat'>343/609</td>
<td class='percent'>56%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Johnson_C/0/1/0/all/0/1'>Clifford V. Johnson</a> <a href="http://asymptotia.com/">[blog]</a></td>
<td class='stat'>98/159</td>
<td class='percent'>61%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Hossenfelder_S/0/1/0/all/0/1'>Sabine Hossenfelder</a> <a href="http://backreaction.blogspot.com/">[blog]</a></td>
</tr>
<tr>
<td class='stat'>56/97</td>
<td class='percent'>57%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Baez_J/0/1/0/all/0/1'>John C. Baez</a> <a href="http://johncarlosbaez.wordpress.com/">[blog]</a></td>
<td class='stat'>220/352</td>
<td class='percent'>62%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Smolin_L/0/1/0/all/0/1'>Lee Smolin</a> <a href="http://www.leesmolin.com/">[site]</a></td>
</tr>
<tr>
<td class='stat'>306/505</td>
<td class='percent'>60%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Hewett_J/0/1/0/all/0/1'>JoAnne Hewett</a> <a href="http://blogs.discovermagazine.com/cosmicvariance/">[blog]</a></td>
<td class='stat'>6/8</td>
<td class='percent'>75%</td>
<td class='name'><a href='http://arxiv.org/find/grp_physics/1/au:+Woit_P/0/1/0/all/0/1'>Peter Woit</a> <a href="http://www.math.columbia.edu/~woit/wordpress/">[blog]</a></td>
</tr>
</table>
</div>
<h3>Fake-Sounding and Real-Sounding Words</h3>
<p>Suppose you’re writing a scientific paper, and you want to ensure that the general public doesn’t think it’s complete malarkey.  How do you do it?  Here are the 10 words with the lowest percentage of correct guesses (most fake-sounding) for titles containing those words (to ensure no single paper dominates this percentage, I’m requiring that each word appear in at least 5 titles).</p>
<div class='full'>
<table id='fake-words' class='ranking'>
<tr>
<td class='stat'>174/521</td>
<td class='percent'>33%</td>
<td class='name'>Saturn</td>
<td class='stat'>76/195</td>
<td class='percent'>38%</td>
<td class='name'>multiskyrmions</td>
</tr>
<tr>
<td class='stat'>66/196</td>
<td class='percent'>33%</td>
<td class='name'>half-flat</td>
<td class='stat'>100/252</td>
<td class='percent'>39%</td>
<td class='name'>secret</td>
</tr>
<tr>
<td class='stat'>69/189</td>
<td class='percent'>36%</td>
<td class='name'>charging</td>
<td class='stat'>99/249</td>
<td class='percent'>39%</td>
<td class='name'>perturbing</td>
</tr>
<tr>
<td class='stat'>54/147</td>
<td class='percent'>36%</td>
<td class='name'>caustic</td>
<td class='stat'>78/194</td>
<td class='percent'>40%</td>
<td class='name'>pollution</td>
</tr>
<tr>
<td class='stat'>80/208</td>
<td class='percent'>38%</td>
<td class='name'>highlights</td>
<td class='stat'>87/214</td>
<td class='percent'>40%</td>
<td class='name'>enough</td>
</tr>
</table>
</div>
<p>Avoid these words! Turns out people don’t believe in “multiskyrmions.”  Also, you shouldn’t mention “Saturn,” or use normal english words like “secret” or “enough.”  By contrast, here’s a list of the 10 words with the highest percentage of correct guesses (most realistic-sounding) for titles containing those words.</p>
<div class='full'>
<table id='real-words' class='ranking'>
<tr>
<td class='stat'>76/100</td>
<td class='percent'>76%</td>
<td class='name'>cp-even</td>
<td class='stat'>87/122</td>
<td class='percent'>71%</td>
<td class='name'>spin-spin</td>
</tr>
<tr>
<td class='stat'>75/101</td>
<td class='percent'>74%</td>
<td class='name'>Argon</td>
<td class='stat'>90/127</td>
<td class='percent'>70%</td>
<td class='name'>anomaly-free</td>
</tr>
<tr>
<td class='stat'>140/191</td>
<td class='percent'>73%</td>
<td class='name'>two-particle</td>
<td class='stat'>127/180</td>
<td class='percent'>70%</td>
<td class='name'>atlas</td>
</tr>
<tr>
<td class='stat'>74/102</td>
<td class='percent'>72%</td>
<td class='name'>self-coupling</td>
<td class='stat'>70/100</td>
<td class='percent'>70%</td>
<td class='name'>supersymmetry-breaking</td>
</tr>
<tr>
<td class='stat'>84/117</td>
<td class='percent'>71%</td>
<td class='name'>unusual</td>
<td class='stat'>128/183</td>
<td class='percent'>69%</td>
<td class='name'>naked</td>
</tr>
</table>
</div>
<p>In other words, if you want to be taken seriously as a scientist, you should call your next paper <em>Unusual Naked, but Anomaly-Free</em>.</p>
<h3>Incidence of Apparent Hooey in Various Subfields</h3>
<p>Papers on the arXiv can be associated with one or more physics subfields.  Here’s a ranking of subfields with at least 50 guesses from most fake-sounding to least fake-sounding.</p>
<div class='full'>
<table id='subjects' class='ranking'>
<tr>
<td class='stat'>39/81</td>
<td class='percent'>48%</td>
<td class='name'>Adaptation and Self-Organizing Systems</td>
<td class='stat'>134/226</td>
<td class='percent'>59%</td>
<td class='name'>Combinatorics</td>
</tr>
<tr>
<td class='stat'>48/98</td>
<td class='percent'>48%</td>
<td class='name'>Popular Physics</td>
<td class='stat'>638/1075</td>
<td class='percent'>59%</td>
<td class='name'>Other</td>
</tr>
<tr>
<td class='stat'>172/340</td>
<td class='percent'>50%</td>
<td class='name'>Data Analysis, Statistics and Probability</td>
<td class='stat'>116/195</td>
<td class='percent'>59%</td>
<td class='name'>Accelerator Physics</td>
</tr>
<tr>
<td class='stat'>212/414</td>
<td class='percent'>51%</td>
<td class='name'>History of Physics</td>
<td class='stat'>140/235</td>
<td class='percent'>59%</td>
<td class='name'>Soft Condensed Matter</td>
</tr>
<tr>
<td class='stat'>167/307</td>
<td class='percent'>54%</td>
<td class='name'>Operator Algebras</td>
<td class='stat'>1898/3172</td>
<td class='percent'>59%</td>
<td class='name'>Differential Geometry</td>
</tr>
<tr>
<td class='stat'>136/248</td>
<td class='percent'>54%</td>
<td class='name'>Rings and Algebras</td>
<td class='stat'>114/190</td>
<td class='percent'>60%</td>
<td class='name'>Group Theory</td>
</tr>
<tr>
<td class='stat'>101/184</td>
<td class='percent'>54%</td>
<td class='name'>Disordered Systems and Neural Networks</td>
<td class='stat'>195/324</td>
<td class='percent'>60%</td>
<td class='name'>Fluid Dynamics</td>
</tr>
<tr>
<td class='stat'>282/512</td>
<td class='percent'>55%</td>
<td class='name'>Pattern Formation and Solitons</td>
<td class='stat'>216/358</td>
<td class='percent'>60%</td>
<td class='name'>Functional Analysis</td>
</tr>
<tr>
<td class='stat'>128/227</td>
<td class='percent'>56%</td>
<td class='name'>Classical Analysis and ODEs</td>
<td class='stat'>94/155</td>
<td class='percent'>60%</td>
<td class='name'>Dynamical Systems</td>
</tr>
<tr>
<td class='stat'>295/523</td>
<td class='percent'>56%</td>
<td class='name'>Representation Theory</td>
<td class='stat'>96/158</td>
<td class='percent'>60%</td>
<td class='name'>Algebraic Topology</td>
</tr>
<tr>
<td class='stat'>39/69</td>
<td class='percent'>56%</td>
<td class='name'>Probability</td>
<td class='stat'>500/820</td>
<td class='percent'>60%</td>
<td class='name'>Atomic Physics</td>
</tr>
<tr>
<td class='stat'>67/118</td>
<td class='percent'>56%</td>
<td class='name'>Geophysics</td>
<td class='stat'>180/295</td>
<td class='percent'>61%</td>
<td class='name'>Geometric Topology</td>
</tr>
<tr>
<td class='stat'>138/242</td>
<td class='percent'>57%</td>
<td class='name'>Number Theory</td>
<td class='stat'>206/337</td>
<td class='percent'>61%</td>
<td class='name'>Symplectic Geometry</td>
</tr>
<tr>
<td class='stat'>757/1327</td>
<td class='percent'>57%</td>
<td class='name'>Strongly Correlated Electrons</td>
<td class='stat'>41/67</td>
<td class='percent'>61%</td>
<td class='name'>Symbolic Computation</td>
</tr>
<tr>
<td class='stat'>4724/8190</td>
<td class='percent'>57%</td>
<td class='name'>Quantum Algebra</td>
<td class='stat'>103/168</td>
<td class='percent'>61%</td>
<td class='name'>Classical Physics</td>
</tr>
<tr>
<td class='stat'>2696/4666</td>
<td class='percent'>57%</td>
<td class='name'>Exactly Solvable and Integrable Systems</td>
<td class='stat'>54/88</td>
<td class='percent'>61%</td>
<td class='name'>Complex Variables</td>
</tr>
<tr>
<td class='stat'>766/1317</td>
<td class='percent'>58%</td>
<td class='name'>Superconductivity</td>
<td class='stat'>758/1229</td>
<td class='percent'>61%</td>
<td class='name'>Mesoscopic Systems and Quantum Hall Effect</td>
</tr>
<tr>
<td class='stat'>88/151</td>
<td class='percent'>58%</td>
<td class='name'>Computational Physics</td>
<td class='stat'>50/81</td>
<td class='percent'>61%</td>
<td class='name'>Materials Science</td>
</tr>
<tr>
<td class='stat'>1763/3011</td>
<td class='percent'>58%</td>
<td class='name'>Algebraic Geometry</td>
<td class='stat'>50/80</td>
<td class='percent'>62%</td>
<td class='name'>Category Theory</td>
</tr>
<tr>
<td class='stat'>8306/14161</td>
<td class='percent'>58%</td>
<td class='name'>Mathematical Physics</td>
<td class='stat'>76/120</td>
<td class='percent'>63%</td>
<td class='name'>K-Theory and Homology</td>
</tr>
<tr>
<td class='stat'>2072/3511</td>
<td class='percent'>59%</td>
<td class='name'>Statistical Mechanics</td>
<td class='stat'>59/93</td>
<td class='percent'>63%</td>
<td class='name'>Instrumentation and Detectors</td>
</tr>
<tr>
<td class='stat'>279/472</td>
<td class='percent'>59%</td>
<td class='name'>Chaotic Dynamics</td>
<td class='stat'>55/86</td>
<td class='percent'>63%</td>
<td class='name'>Spectral Theory</td>
</tr>
<tr>
<td class='stat'>58/98</td>
<td class='percent'>59%</td>
<td class='name'>Analysis of PDEs</td>
<td class='stat'>82/121</td>
<td class='percent'>67%</td>
<td class='name'>Optics</td>
</tr>
<tr>
<td class='stat'>99/167</td>
<td class='percent'>59%</td>
<td class='name'>Plasma Physics</td>
</tr>
</table>
</div>
<h3>Performance by Country</h3>
<p>These last few statistics have less to do with the arXiv, and more to do with arXiv vs. snarXiv itself.  I have location data for the most recent quarter-million guesses.<sup>[<a href="http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/#footnote_2_2074" id="identifier_2_2074" class="footnote-link footnote-identifier-link" title="When I told my software-startup friend about arXiv vs. snarXiv and mentioned that I wasn&#039;t logging ip addresses, he looked at me very seriously and said: &quot;You&#039;re not logging ip addresses?  You should always log ip addresses.&quot;">3</a>]</sup>  So let’s look at how performance varies across the the globe.  Here’s a ranking of correct guesses from countries with at least 2000 total guesses.<sup>[<a href="http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/#footnote_3_2074" id="identifier_3_2074" class="footnote-link footnote-identifier-link" title="The high-scores leader &quot;Ed&quot; is from Portugal, which would have done very well in the rankings had I included his guesses.  Unfortunately, &quot;Ed&quot; was cheating (probably already obvious to everyone, though I can also prove it with certitude), so I&#039;ve removed all his guesses from this analysis.">4</a>]</sup></p>
<div class='full'>
<table id='countries' class='ranking'>
<tr>
<td class='stat'>1400/2256</td>
<td class='percent'>62%</td>
<td class='name'>Austria</td>
<td class='stat'>2393/4189</td>
<td class='percent'>57%</td>
<td class='name'>Japan</td>
</tr>
<tr>
<td class='stat'>10665/17467</td>
<td class='percent'>61%</td>
<td class='name'>Germany</td>
<td class='stat'>89632/158059</td>
<td class='percent'>56%</td>
<td class='name'>United States</td>
</tr>
<tr>
<td class='stat'>3111/5183</td>
<td class='percent'>60%</td>
<td class='name'>Israel</td>
<td class='stat'>12137/21471</td>
<td class='percent'>56%</td>
<td class='name'>United Kingdom</td>
</tr>
<tr>
<td class='stat'>1658/2825</td>
<td class='percent'>58%</td>
<td class='name'>Spain</td>
<td class='stat'>7233/13053</td>
<td class='percent'>55%</td>
<td class='name'>Canada</td>
</tr>
<tr>
<td class='stat'>4134/7080</td>
<td class='percent'>58%</td>
<td class='name'>Italy</td>
<td class='stat'>2281/4183</td>
<td class='percent'>54%</td>
<td class='name'>India</td>
</tr>
<tr>
<td class='stat'>5804/9960</td>
<td class='percent'>58%</td>
<td class='name'>France</td>
<td class='stat'>1652/3037</td>
<td class='percent'>54%</td>
<td class='name'>Finland</td>
</tr>
<tr>
<td class='stat'>2355/4071</td>
<td class='percent'>57%</td>
<td class='name'>Switzerland</td>
<td class='stat'>2270/4266</td>
<td class='percent'>53%</td>
<td class='name'>Russian Federation</td>
</tr>
<tr>
<td class='stat'>3485/6083</td>
<td class='percent'>57%</td>
<td class='name'>Australia</td>
<td class='stat'>3496/6593</td>
<td class='percent'>53%</td>
<td class='name'>Netherlands</td>
</tr>
<tr>
<td class='stat'>1690/2958</td>
<td class='percent'>57%</td>
<td class='name'>Sweden</td>
<td class='stat'>1061/2001</td>
<td class='percent'>53%</td>
<td class='name'>Argentina</td>
</tr>
</table>
</div>
<p>It looks like having English as a first language is not particularly helpful.</p>
<h3>Performance by School</h3>
<p>Finally, universities account for about 1/8th of the total number of guesses on arXiv vs. snarXiv.  Altogether, their performance is almost exactly average (59%).  However, there are variations… Here’s a ranking of schools with at least 400 total guesses.</p>
<div class='full'>
<table id='schools' class='ranking'>
<tr>
<td class='stat'>1145/1388</td>
<td class='percent'>82%</td>
<td class='name'>University of Colorado at Boulder</td>
<td class='stat'>560/963</td>
<td class='percent'>58%</td>
<td class='name'>The University of Chicago</td>
</tr>
<tr>
<td class='stat'>553/785</td>
<td class='percent'>70%</td>
<td class='name'>University of Regensburg</td>
<td class='stat'>278/481</td>
<td class='percent'>57%</td>
<td class='name'>UC Santa Barbara</td>
</tr>
<tr>
<td class='stat'>317/481</td>
<td class='percent'>65%</td>
<td class='name'>University of Washington</td>
<td class='stat'>264/461</td>
<td class='percent'>57%</td>
<td class='name'>Madison</td>
</tr>
<tr>
<td class='stat'>718/1097</td>
<td class='percent'>65%</td>
<td class='name'>Penn State</td>
<td class='stat'>542/967</td>
<td class='percent'>56%</td>
<td class='name'>University of Cambridge</td>
</tr>
<tr>
<td class='stat'>1001/1538</td>
<td class='percent'>65%</td>
<td class='name'>Berkeley</td>
<td class='stat'>237/426</td>
<td class='percent'>55%</td>
<td class='name'>Cornell University</td>
</tr>
<tr>
<td class='stat'>549/849</td>
<td class='percent'>64%</td>
<td class='name'>Princeton University</td>
<td class='stat'>476/861</td>
<td class='percent'>55%</td>
<td class='name'>UC Davis</td>
</tr>
<tr>
<td class='stat'>1277/1981</td>
<td class='percent'>64%</td>
<td class='name'>MIT</td>
<td class='stat'>471/855</td>
<td class='percent'>55%</td>
<td class='name'>Columbia University</td>
</tr>
<tr>
<td class='stat'>444/691</td>
<td class='percent'>64%</td>
<td class='name'>Imperial College London</td>
<td class='stat'>859/1577</td>
<td class='percent'>54%</td>
<td class='name'>California Institute of Technology</td>
</tr>
<tr>
<td class='stat'>349/544</td>
<td class='percent'>64%</td>
<td class='name'>Monash University</td>
<td class='stat'>393/723</td>
<td class='percent'>54%</td>
<td class='name'>Harvard University</td>
</tr>
<tr>
<td class='stat'>376/597</td>
<td class='percent'>62%</td>
<td class='name'>University of Illinois at Urbana-Champaign</td>
<td class='stat'>363/671</td>
<td class='percent'>54%</td>
<td class='name'>Stanford University</td>
</tr>
<tr>
<td class='stat'>287/457</td>
<td class='percent'>62%</td>
<td class='name'>Hebrew University of Jerusalem</td>
<td class='stat'>219/423</td>
<td class='percent'>51%</td>
<td class='name'>Yale University</td>
</tr>
<tr>
<td class='stat'>284/461</td>
<td class='percent'>61%</td>
<td class='name'>The University of Edinburgh</td>
<td class='stat'>308/599</td>
<td class='percent'>51%</td>
<td class='name'>University of Minnesota</td>
</tr>
<tr>
<td class='stat'>261/435</td>
<td class='percent'>60%</td>
<td class='name'>Boston University</td>
<td class='stat'>281/551</td>
<td class='percent'>50%</td>
<td class='name'>University of Warwick</td>
</tr>
</table>
</div>
<p>Congratulations to the University of Colorado at Boulder, which is the clear winner here.<sup>[<a href="http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/#footnote_4_2074" id="identifier_4_2074" class="footnote-link footnote-identifier-link" title="Or rather, congratulations to the two dudes at the University of Colorado at Boulder who together played over 118 games with a total of 1215 guesses and an average score of 85%.">5</a>]</sup>  Also, I just wanted to say: seriously Harvard?  Seriously?</p>
<h3>Disclaimer</h3>
<p>Finally, before heading into the comments, let me do a crapload of disclaiming.  This is obviously the least scientific survey of science ever conducted.  The ranking of a paper as “fake-sounding” or “realistic-sounding” has as much to do with the peculiarities of the snarXiv as with the arXiv itself.<sup>[<a href="http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/#footnote_5_2074" id="identifier_5_2074" class="footnote-link footnote-identifier-link" title="I already got slammed for this on marginalrevolution.com, and I&#039;ll surely get slammed again.">6</a>]</sup>  Also, although 750,000 guesses is a lot in total — such that I’m fairly certain that the 59% overall average isn’t going anywhere — the statistics get dicey when chopped into small bits (see what I did there?).  To be sure of anything, I guess we’ll just have to wait until the blogosphere writes more papers.</p>
<ol class="footnotes"><li id="footnote_0_2074" class="footnote">I couldn’t find an h-index ranking of physicists that was more recent than <a href="http://www.mazalien.nl/weblog/archives/2006/10/01/the-h-index/">this one</a> from 2005. I’m probably missing lots of names. Let me know and I’ll add them. </li><li id="footnote_1_2074" class="footnote">In addition to being a top-notch physicist, Frederik also happens to be one of the world’s best arXiv vs. snarXiv players.</li><li id="footnote_2_2074" class="footnote">When I told my software-startup friend about arXiv vs. snarXiv and mentioned that I wasn’t logging ip addresses, he looked at me very seriously and said: “You’re not logging ip addresses?  You should always log ip addresses.”</li><li id="footnote_3_2074" class="footnote">The <a href="http://snarxiv.org/vs-arxiv/highscores/">high-scores</a> leader “Ed” is from Portugal, which would have done very well in the rankings had I included his guesses.  Unfortunately, “Ed” was cheating (probably already obvious to everyone, though I can also prove it with certitude), so I’ve removed all his guesses from this analysis.</li><li id="footnote_4_2074" class="footnote">Or rather, congratulations to the two dudes at the University of Colorado at Boulder who together played over 118 games with a total of 1215 guesses and an average score of 85%.</li><li id="footnote_5_2074" class="footnote">I already got <a href="http://www.marginalrevolution.com/marginalrevolution/2010/06/assorted-links-8.html">slammed for this</a> on marginalrevolution.com, and I’ll surely get slammed again.</li></ol><img src="http://feeds.feedburner.com/~r/davidsd/~4/jLDplelGm-k" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		<feedburner:origLink>http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/</feedburner:origLink></item>
		<item>
		<title>Lie Group Computations With Python</title>
		<link>http://feedproxy.google.com/~r/davidsd/~3/MOOc3qr1N68/</link>
		<comments>http://davidsd.org/2010/03/lie-group-computations-with-python/#comments</comments>
		<pubDate>Wed, 31 Mar 2010 06:08:12 +0000</pubDate>
		<dc:creator>davidsd</dc:creator>
				<category><![CDATA[Code]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[Physics]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://davidsd.org/?p=2000</guid>
		<description><![CDATA[lie is a python module for computations with Lie groups, Lie algebras, representations, root systems, and more. I based it on the computer algebra package LiE, written by M. A. A. van Leeuwen, A. M. Cohen and B. Lisser in the early 90’s. They chose to implement a proprietary scripting language as a wrapper for [...]]]></description>
				<content:encoded><![CDATA[<p class="lead"><a href="http://github.com/davidsd/lie"><em>lie</em></a> is a <a href="http://www.python.org">python</a> module for computations with <a href="http://en.wikipedia.org/wiki/Lie_group">Lie groups</a>, <a href="http://en.wikipedia.org/wiki/Lie_algebra">Lie algebras</a>, <a href="http://en.wikipedia.org/wiki/Representation_theory">representations</a>, <a href="http://en.wikipedia.org/wiki/Root_system">root systems</a>, and more.</p>
<p>I based it on the computer algebra package <a href="http://www-math.univ-poitiers.fr/~maavl/LiE/">LiE</a>, written by M. A. A. van Leeuwen, A. M. Cohen and B. Lisser in the early 90’s.  They chose to implement a proprietary scripting language as a wrapper for all the fancy mathematical algorithms.  While this language is useful for interactive computations and short scripts, python is more expressive and powerful — definitely what you want when exploring your favorite exceptional group.</p>
<h3>A Fun Example</h3>
<p>Here’s an example of using <em>lie</em> to do a calculation that’s near and dear to every high energy theorist’s heart. We’ll show how the 10 + 5bar + 1 representation of SU(5) <a href="http://en.wikipedia.org/wiki/Grand_Unified_Theory">contains a single standard model generation</a>.  First we’ll fire up python and import the <em>lie</em> module.<span id="more-2000"></span></p>
<pre class="brush: python; title: ; notranslate">Python 3.1 (r31:73578, Jun 27 2009, 21:49:46)
&gt;&gt;&gt; from lie import *
</pre>
<p>Let’s let g be the GUT group SU(5) (A4 in Cartan’s classifiation).</p>
<pre class="brush: python; title: ; notranslate">
&gt;&gt;&gt; g = A4; g.pprint()
'SU(5)'
</pre>
<p>We’ll let the representation r be a single GUT generation: 10 + 5bar + 1 of SU(5).</p>
<pre class="brush: python; title: ; notranslate">
&gt;&gt;&gt; r = g.fund().dual() + g.alt_tensor(2) + g.trivial()
&gt;&gt;&gt; r.pprint()
'(1) + (5b) + (10)'
</pre>
<p>Here, we’ll check that SU(2)×SU(3) is a subgroup of SU(5). Then we’ll get the restriction matrix for SU(2)×SU(3) in SU(5), and add the U(1) part by hand:</p>
<pre class="brush: python; title: ; notranslate">
&gt;&gt;&gt; [h.pprint() for h in g.max_subgrps()]
['SU(4)', 'SO(5)', 'SU(2)xSU(3)']
&gt;&gt;&gt; m = g.res_mat(A1*A2)
&gt;&gt;&gt; m = m.transpose().concat(mat([[-2,1,-1,2]])).transpose()
</pre>
<p>Finally, we decompose r under the group SU(2)×SU(3)×U(1). The first two numbers are the SU(2) and SU(3) rep dimensions, and the third is the U(1) charge (which differs from the conventional hypercharge by a factor of 6).</p>
<pre class="brush: python; title: ; notranslate">
&gt;&gt;&gt; r.branch(A1*A2*T1, m).pprint() 
'(1,1,0) + (1,1,6) + (1,3b,-4) + (1,3b,2) + (2,1,-3) + (2,3,1)'
</pre>
<p>These are indeed the representations of a generation of standard model fermions (written as left-handed Weyl spinors).  In order, we have: a heavy neutrino, the positron, the up quark, the down quark, the lepton doublet, and the left-handed quark doublet.  Yay!</p>
<h3>Background</h3>
<p>I decided I needed something like <em>lie</em> last summer while working on <em><a href="http://arxiv.org/abs/0910.4585">Superconformal Flavor Simplified</a></em> with David Poland.  Since none of the mathematical tools that physicists commonly use (e.g. Mathematica) had any routines for Lie group representation theory, I was delighted to stumble upon LiE, which implemented basically all the algorithms I wanted.  However, the authors’ choice to create their own scripting language was a little silly, and definitely not future-proof.  For instance, while LiE implements lists of numbers (vectors) and lists of lists of numbers (matrices), it doesn’t know anything about lists of lists of lists of numbers.  This might seem insignificant, but it was frustrating for two reasons:</p>
<ul>
<li>Every real programming language on earth implements lists of lists of lists.</li>
<li>I needed lists of lists of lists.</li>
</ul>
<p>LiE uses a list of integers to represent the highest weight vector of a representation, so if I want lists of representations, like in a model, I need lists of lists.  And if I want lists of models, I need lists of lists of lists.  Turns out I was trying to write a whole paper about lists of models.</p>
<h3>Future Work</h3>
<p>I developed <em>lie</em> to the point where it was capable of doing what I needed for <em>Superconformal Flavor Simplified</em>.  There’s definitely some debugging and restructuring to be done.  For instance, the sourcecode currently includes the entire lexer and parser from the original LiE, just because I haven’t bothered to extract them from everything else (I originally took the approach of trying to modify the LiE source as little as possible).  It’s a work in progress, but lie has already been useful to me, so I figure others might benefit from it too.</p>
<p>So check out <a href="http://github.com/davidsd/lie">the source</a>, give it a whirl, and if it doesn’t do what you want, help me make it better!</p>
<img src="http://feeds.feedburner.com/~r/davidsd/~4/MOOc3qr1N68" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://davidsd.org/2010/03/lie-group-computations-with-python/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		<feedburner:origLink>http://davidsd.org/2010/03/lie-group-computations-with-python/</feedburner:origLink></item>
		<item>
		<title>The snarXiv</title>
		<link>http://feedproxy.google.com/~r/davidsd/~3/T9TtnRbvlrs/</link>
		<comments>http://davidsd.org/2010/03/the-snarxiv/#comments</comments>
		<pubDate>Thu, 11 Mar 2010 02:32:51 +0000</pubDate>
		<dc:creator>davidsd</dc:creator>
				<category><![CDATA[Code]]></category>
		<category><![CDATA[Humor]]></category>
		<category><![CDATA[Physics]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://davidsd.org/?p=1959</guid>
		<description><![CDATA[The snarXiv is a random high-energy theory paper generator incorporating all the latest trends, entropic reasoning, and exciting moduli spaces. The arXiv is similar, but occasionally less random.[1] Actually, the snarXiv only generates tantalizing titles and abstracts at the moment, while the arXiv delivers matching papers as well. Details of the implementation are below.[2] I’m [...]]]></description>
				<content:encoded><![CDATA[<p class="lead">The <a href="http://snarxiv.org">snarXiv</a> is a random high-energy theory paper generator incorporating all the latest trends, entropic reasoning, and exciting moduli spaces.  The <a href="http://arxiv.org">arXiv</a> is similar, but occasionally less random.<sup>[<a href="http://davidsd.org/2010/03/the-snarxiv/#footnote_0_1959" id="identifier_0_1959" class="footnote-link footnote-identifier-link" title=" For those who aren&#039;t high energy physicists, and are still interested (though I can&#039;t imagine who that would be), the &quot;X&quot; in arXiv or snarXiv is supposed to be a greek chi.  We&#039;re meant to pronounce them like archive (as in &quot;archive of physics papers&quot;) and snarchive (as in &quot;snarky archive of physics papers&quot;). ">1</a>]</sup></p>
<p>Actually, the snarXiv only generates tantalizing titles and abstracts at the moment, while the arXiv delivers matching papers as well.  Details of the implementation are below.<sup>[<a href="http://davidsd.org/2010/03/the-snarxiv/#footnote_1_1959" id="identifier_1_1959" class="footnote-link footnote-identifier-link" title=" Please don&#039;t sue me, arXiv.org, for stealing your CSS file and your beautiful color scheme.  Also, Werner Heisenberg, if you&#039;re still alive, please don&#039;t sue me or my computer for libel. ">2</a>]</sup> <a href="http://davidsd.org">I’m</a> the author, and I don’t remember exactly why I decided to do this.  I did already have the framework lying around from <a href="http://davidsd.org/2009/01/the-real-theorem-generator-a-context-free-grammar/">a previous project</a>, and I swear I spent more time doing research last weekend than implementing snarXiv.org.</p>
<h3>Suggested Uses for the snarXiv<sup>[<a href="http://davidsd.org/2010/03/the-snarxiv/#footnote_2_1959" id="identifier_2_1959" class="footnote-link footnote-identifier-link" title=" If someone pretentious is annoying you, and you use the theorem generator instead, you could try something like this. ">3</a>]</sup></h3>
<ul>
<li>If you’re a graduate student, gloomily read through the abstracts, thinking to yourself that you don’t understand papers on the real arXiv any better.</li>
<li>If you’re a post-doc, reload until you find something to work on.</li>
<li>If you’re a professor, get really excited when a paper claims to solve the hierarchy problem, the little hierarchy problem, the mu problem, and the confinement problem.  Then experience profound disappointment.</li>
<li>If you’re a famous physicist, keep reloading until you see your name on something, then claim credit for it. <span id="more-1959"></span></li>
<li>Everyone else should play <a href="http://snarxiv.org/vs-arxiv/">arXiv vs. snarXiv</a>.<sup>[<a href="http://davidsd.org/2010/03/the-snarxiv/#footnote_3_1959" id="identifier_3_1959" class="footnote-link footnote-identifier-link" title="And check out the results.  Also, pick up the unofficial arXiv vs. snarXiv wallpaper.">4</a>]</sup> </li>
</ul>
<h3>Context-Free Grammars</h3>
<p>The snarXiv is based on a <a href="http://en.wikipedia.org/wiki/Context-free_grammar">context free grammar</a> (CFG) — basically a set of rules for computer-generated mad libs.<sup>[<a href="http://davidsd.org/2010/03/the-snarxiv/#footnote_4_1959" id="identifier_4_1959" class="footnote-link footnote-identifier-link" title=" I first encountered these in freshman year of college in an assignment for CS51: Abstraction and Design in Computer Programming.  We had to implement a CFG in LISP, and the cleverest won its author lunch at the faculty club.  The eventual winner was my friend Matt Gline&#039;s theorem generator, which has since been enhanced with LaTeX, commutative diagrams, ajax, and stuff like that. ">5</a>]</sup>  Each rule in a CFG consists of a term, and a set of choices for how to make that term.  The choices can contain text, or other terms, or even refer recursively to the term being defined.  The CFG syntax used on the snarXiv is a collection of statements “term ::= <em>choices</em>”, where <em>choices</em> is a list of possibilities separated by “|”.  Some possibilities are just text, but the ones that look like “&lt;<em>newterm</em>&gt;” are directions to go find the definition for <em>newterm</em> and fill it in.  For instance, the following grammar</p>
<pre class="brush: grammar; title: ; notranslate">
nounphrase ::= &lt;noun&gt; | &lt;adj&gt; &lt;adj&gt; &lt;noun&gt; | super &lt;nounphrase&gt;
noun ::= apple | pear | mailman
adj ::= smelly | chartreuse | enormous
</pre>
<p>can produce nounphrases like “apple,” “enormous smelly mailman,” or “super super smelly chartreuse mailman.”  The <a href="http://snarxiv.org/grammar/snarxiv.gram">snarxiv’s grammar</a> is 622 lines long, and ends like this:</p>
<pre class="brush: grammar; title: ; notranslate">
...
morecomments ::= &lt;smallinteger&gt; figures | JHEP style | Latex file
  | no figures | BibTeX | JHEP3 | typos corrected
  | &lt;nzdigit&gt; tables | added refs | minor changes
  | minor corrections | published in PRD
  | reference added | pdflatex
  | based on a talk given on &lt;physicistname&gt;'s &lt;nzdigit&gt;0th birthday
  | talk presented at the international &lt;pluralphysconcept&gt; workshop
comments ::= &lt;smallinteger&gt; pages | &lt;comments&gt;, &lt;morecomments&gt;

primarysubj ::= High Energy Physics - Theory (hep-th)|
                High Energy Physics - Phenomenology (hep-ph)|
secondarysubj ::= Nuclear Theory (nucl-th)|
      Cosmology and Extragalactic Astrophysics (astro-ph.CO)|
      General Relativity and Quantum Cosmology (gr-qc)|
      Statistical Mechanics (cond-mat.stat-mech)
papersubjects ::= &lt;primarysubj&gt; | &lt;papersubjects&gt;; &lt;secondarysubj&gt;

paper ::= &lt;title&gt; \\ &lt;authors&gt; \\ &lt;comments&gt; \\ &lt;papersubjects&gt; \\ &lt;abstract&gt;
...
</pre>
<p>The coolest and most natural thing to do with a CFG is exploit recursiveness as much as possible.  The more recursion built in, the less predictable and richer the output.  For instance, the following definition of a “space” has three rules: <em>space</em>, <em>singspace</em>, <em>pluralspace</em>, which refer recursively to each other in many different ways, allowing for a huge number of possibilities.</p>
<pre class="brush: grammar; title: ; notranslate">
space ::= &lt;pluralspace&gt; | &lt;singspace&gt; | &lt;mathspace&gt;

singspace ::= a &lt;spacetype&gt; | a &lt;spaceadj&gt; &lt;spacetype&gt;
   | &lt;properspacename&gt; | &lt;spaceadj&gt; &lt;properspacename&gt;
   | &lt;mathspace&gt; | &lt;mathspace&gt;
   | a &lt;bundletype&gt; bundle over &lt;space&gt;
   | &lt;singspace&gt; fibered over &lt;singspace&gt;
   | the moduli space of &lt;pluralspace&gt;
   | a &lt;spacetype&gt; &lt;spaceproperty&gt;
   | the &lt;spacepart&gt; of &lt;space&gt;
   | a &lt;group&gt; &lt;groupaction&gt; of &lt;singspace&gt;
   | the near horizon geometry of &lt;singspace&gt;
pluralspace ::= &lt;spacetype&gt;s | &lt;spaceadj&gt; &lt;spacetype&gt;s
   | &lt;n&gt; copies of &lt;mathspace&gt;
   | &lt;pluralspace&gt; fibered over &lt;space&gt;
   | &lt;spacetype&gt;s &lt;spaceproperty&gt;
   | &lt;bundletype&gt; bundles over &lt;space&gt;
   | moduli spaces of &lt;pluralspace&gt;
   | &lt;group&gt; &lt;groupaction&gt;s of &lt;pluralspace&gt;
</pre>
<p>Of course, there’s also a danger that in a very small number of cases the output might be a little pathological.  The <em>nounphrase</em> example above, for instance, can produce any phrase of the form “super super … super enormous pear.”  The snarXiv similarly occasionally mentions QFTs living on “the moduli space of moduli spaces of moduli spaces of moduli spaces of moduli spaces of SU(3) bundles over elliptically fibered Enriques surfaces.”  Too much recursion can also quickly lead to exponentially long abstracts, which are even harder to read all the way through than the usual ones on the arXiv.</p>
<h3>The Guts</h3>
<p>To get some actual output from the grammar definition, the most straightforward thing would be to write a script that reads in the grammar, and works its way down the tree, starting with the top term, filling in definitions recursively until it gets a block of text.  Instead of using an external script, the snarXiv compiles each grammar into its own program, a technique that originated from a freshman CS project and evolved minimally from there — it’s less straightforward, not clearly better, but maybe a bit more fun.  A <a href="http://snarxiv.org/grammar/compile-grammar">perl script</a> compiles the grammar file into <a href="http://en.wikipedia.org/wiki/OCaml">OCaml</a> code (<a href="http://snarxiv.org/grammar/snarxiv.ml">snarxiv.ml</a>):</p>
<pre class="brush: ocaml; title: ; notranslate">
type phrase = Str of string | Opts of phrase array array

let _ = Random.self_init ()

let randelt a = a.(Random.int (Array.length a))
let rec print phr = match phr with
  Str  s       -&gt; print_string s
| Opts options -&gt;
    let parts = randelt options in
    Array.iter print parts

(* Grammar definitions *)
let rec top = Opts [|
  [| paper;|];
|]

...

and comments = Opts [|
  [| smallinteger; Str &quot; pages&quot;;|];
  [| comments; Str &quot;, &quot;; morecomments;|];
|]

and primarysubj = Opts [|
  [| Str &quot;High Energy Physics - Theory (hep-th)&quot;;|];
  [| Str &quot;High Energy Physics - Phenomenology (hep-ph)&quot;;|];
|]

and secondarysubj = Opts [|
  [| Str &quot;Nuclear Theory (nucl-th)&quot;;|];
  [| Str &quot;Cosmology and Extragalactic Astrophysics (astro-ph.CO)&quot;;|];
  [| Str &quot;General Relativity and Quantum Cosmology (gr-qc)&quot;;|];
  [| Str &quot;Statistical Mechanics (cond-mat.stat-mech)&quot;;|];
|]

and papersubjects = Opts [|
  [| primarysubj;|];
  [| papersubjects; Str &quot;; &quot;; secondarysubj;|];
|]

and paper = Opts [|
  [| title; Str &quot; \\\\ &quot;; authors; Str &quot; \\\\ &quot;; comments; Str &quot; \\\\ &quot;; papersubjects; Str &quot; \\\\ &quot;; abstract; Str &quot; &quot;;|];
|]

let _ = print top
let _ = print_string &quot;\n&quot;
</pre>
<p>And snarxiv.ml is now a specialized program that, when compiled and run, spits out a paper title and abstract.  This setup is more elaborate than necessary, but OCaml is a lovely language for recursive structures, and the code is nice and simple.  OCaml is also <a href="http://shootout.alioth.debian.org/u64q/which-programming-languages-are-fastest.php?calc=chart&#038;ocaml=on&#038;python3=on">fast</a>, allowing the snarXiv to generate papers even more swiftly than your favorite python script, or Ed Witten in the 80’s.</p>
<h3>Other CFGs</h3>
<p>A few years ago, the CFG-based CS paper generator <a href="http://pdos.csail.mit.edu/scigen/">SCIgen</a> made a splash by getting one of their papers accepted to the conference SCI 2005.  Their website has details, and links to some other random generators around the web.</p>
<ol class="footnotes"><li id="footnote_0_1959" class="footnote"> For those who aren’t high energy physicists, and are still interested (though I can’t imagine who that would be), the “X” in <em>arXiv</em> or <em>snarXiv</em> is supposed to be a <a href="http://en.wikipedia.org/wiki/Chi_(letter)">greek chi</a>.  We’re meant to pronounce them like <em>archive</em> (as in “archive of physics papers”) and <em>snarchive</em> (as in “snarky archive of physics papers”). </li><li id="footnote_1_1959" class="footnote"> Please don’t sue me, arXiv.org, for stealing your CSS file and your beautiful color scheme.  Also, Werner Heisenberg, if you’re still alive, please don’t sue me or my computer for libel. </li><li id="footnote_2_1959" class="footnote"> If someone pretentious is annoying you, and you use the <a href="http://davidsd.org/theorem">theorem generator</a> instead, you could try something like <a href="http://undergrad.davidsd.org/theorem/applications.html">this</a>. </li><li id="footnote_3_1959" class="footnote">And check out <a href="http://davidsd.org/2010/09/the-arxiv-according-to-arxiv-vs-snarxiv/">the results</a>.  Also, pick up the <a href="http://snarxiv.org/vs-arxiv/img/snarxraft.jpg">unofficial arXiv vs. snarXiv wallpaper</a>.</li><li id="footnote_4_1959" class="footnote"> I first encountered these in freshman year of college in an assignment for <a href="http://www.fas.harvard.edu/~lib51/">CS51: Abstraction and Design in Computer Programming</a>.  We had to implement a CFG in <a href="http://en.wikipedia.org/wiki/Lisp_programming_language">LISP</a>, and the cleverest won its author lunch at the faculty club.  The eventual winner was my friend Matt Gline’s <a href="http://davidsd.org/theorem/">theorem generator</a>, which has since <a href="http://davidsd.org/2009/01/the-real-theorem-generator-a-context-free-grammar/">been enhanced with LaTeX, commutative diagrams, ajax, and stuff like that.</a> </li></ol><img src="http://feeds.feedburner.com/~r/davidsd/~4/T9TtnRbvlrs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://davidsd.org/2010/03/the-snarxiv/feed/</wfw:commentRss>
		<slash:comments>33</slash:comments>
		<feedburner:origLink>http://davidsd.org/2010/03/the-snarxiv/</feedburner:origLink></item>
		<item>
		<title>Energy Secretary! Evolve!</title>
		<link>http://feedproxy.google.com/~r/davidsd/~3/6oqEIk6NOHY/</link>
		<comments>http://davidsd.org/2009/07/energy-secretary-evolve/#comments</comments>
		<pubDate>Tue, 07 Jul 2009 15:35:17 +0000</pubDate>
		<dc:creator>davidsd</dc:creator>
				<category><![CDATA[Cartoon]]></category>
		<category><![CDATA[Humor]]></category>

		<guid isPermaLink="false">http://davidsd.org/?p=1710</guid>
		<description />
				<content:encoded><![CDATA[<p><a href="http://davidsd.org/wp-content/uploads/2009/07/stevenchu.png"><img src="http://davidsd.org/wp-content/uploads/2009/07/stevenchu.png" alt="stevenchu" title="stevenchu" width="490" height="869" class="aligncenter size-full wp-image-1717" /></a></p>
<img src="http://feeds.feedburner.com/~r/davidsd/~4/6oqEIk6NOHY" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://davidsd.org/2009/07/energy-secretary-evolve/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://davidsd.org/2009/07/energy-secretary-evolve/</feedburner:origLink></item>
		<item>
		<title>The Real Theorem Generator: a Context Free Grammar</title>
		<link>http://feedproxy.google.com/~r/davidsd/~3/QLjgFqjKc-4/</link>
		<comments>http://davidsd.org/2009/01/the-real-theorem-generator-a-context-free-grammar/#comments</comments>
		<pubDate>Wed, 21 Jan 2009 03:14:02 +0000</pubDate>
		<dc:creator>davidsd</dc:creator>
				<category><![CDATA[Code]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://davidsd.org/?p=476</guid>
		<description><![CDATA[I should probably document the real origin of the Theorem of the Day and Philosophy of the Day. Coffee and Henry David Thoreau are perhaps less involved than originally indicated. The theorem generator was written by a good friend of mine, Matt Gline, as a project for CS51: Abstraction and Design in Computer Programming, which [...]]]></description>
				<content:encoded><![CDATA[<p class="lead">I should probably document the real origin of the <a href="/theorem">Theorem of the Day</a> and <a href="/2009/01/philosophy-of-the-day/">Philosophy of the Day</a>.  Coffee and Henry David Thoreau are perhaps less involved than originally indicated.
</p>
<p><a href="http://davidsd.org/wp-content/uploads/2009/01/nothoreau.png"><img src="http://davidsd.org/wp-content/uploads/2009/01/nothoreau-300x236.png" alt="nothoreau" title="nothoreau" width="200" class="alignright size-medium wp-image-564" /></a>The theorem generator was written by a good friend of mine, Matt Gline, as a project for <a href="http://www.fas.harvard.edu/~lib51/">CS51: Abstraction and Design in Computer Programming</a>, which we took together as freshmen.</p>
<p>The assignment was to use <a href="http://en.wikipedia.org/wiki/Lisp_programming_language">LISP</a> to implement a <a href="http://en.wikipedia.org/wiki/Context-free_grammar">context free grammar</a> — basically a set of rules for computer-generated mad libs.  The subject was whatever we wanted.  Good ones from past years include computer-generated mystery novellas, course-guide reports, and performance art directions.  Every year there’s a contest, and Matt’s theorem generator was hysterical enough to win him lunch at the faculty club.<span id="more-476"></span></p>
<h3>Context-Free Grammars</h3>
<p>Each rule in a context-free grammar consists of a term, and a set of choices for how to make that term.  The choices can contain text, or other terms, or even refer recursively to the term being defined.  For instance, a grammar like</p>
<pre class="brush: grammar; title: ; notranslate">
nounphrase ::= &lt;noun&gt; | &lt;adj&gt; &lt;adj&gt; &lt;noun&gt; | super &lt;nounphrase&gt;
noun ::= apple | pear | mailman
adj ::= smelly | chartreuse | enormous
</pre>
<p>can produce phrases like “apple,” “enormous smelly mailman,” or “super super smelly chartreuse mailman.”  The original grammars we wrote for CS51 were hundred-line clumps of LISP code that spit out un-punctuated paragraphs in all-caps.  Over the years, my roommate Mike and I spiffed-up the theorem generator a bit, resulting in the <a href="/theorem">current shiny ajax/latex version</a>.</p>
<p>The current theorem grammar is defined in a grammar file (<a href="/misc/theorem/grammar/thm.gram">thm.gram</a>) that looks like this</p>
<pre class="brush: grammar; title: ; notranslate">
theorem ::= \begin{theorem}[&lt;thname&gt;]&lt;thmstatement&gt;.
          \end{theorem} \begin{proof} &lt;proof&gt;
          \end{proof}

proof ::= &lt;statement&gt;. &lt;conclusion&gt;. | See &lt;citation&gt;.
    | &lt;statement&gt;. &lt;conclusion&gt;. | &lt;statement&gt;. &lt;conclusion&gt;.

conclusion ::= 
    The result follows by d\'evissage | The theorem follows trivially
    | The conclusion is self-evident | Clearly, the theorem holds
    | We leave the rest as an exercise | This is the desired result
    | The rest follows from &lt;citation&gt; | QED
    | A simple application of &lt;famoustheorem&gt; completes the proof

famoustheorem ::=
    &lt;mathguy&gt;'s theorem
    | Hilbert's problem &lt;nzdigit&gt;&lt;zdigit&gt;
    | the Riemann Hypothesis
    | Perelman's theorem (formerly the Poincar\'e Conjecture)
    | horizontal Iwasawa theory | Kummer theory | the different
    | Dynkin diagrams | Gegenbauer polynomials
    | trichotomy
...
</pre>
<p>A perl script (<a href="/misc/theorem/grammar/compile-grammar">compile-grammar</a>):</p>
<pre class="brush: perl; title: ; notranslate">
...
print &quot;parsing grammar '$gram' into '$out.ml'...&quot;;
foreach (@entries) {
    s/\s* \# .* $//xmg;
    if (/\s*(\S*)\s*::=\s*(.*)/s) {
        $lhs = $1;
        $rhs = $2;

        if (!defined ($mainphrase)) {
            $mainphrase = $lhs;
            print OUT &quot;let rec $lhs = Opts [|\n&quot;;
        } else {
            print OUT &quot;and $lhs = Opts [|\n&quot;;
        }

        @opts = split(/\s*\|\s*/, $rhs);
        foreach $opt (@opts) {
            print OUT &quot;  [|&quot;;
            $opt =~ s/\s*\n\s*/ /g;

            # split just before &lt; and after &gt;
...
</pre>
<p>processes the grammar file and compiles it into <a href="http://en.wikipedia.org/wiki/OCaml">OCaml</a> code (<a href="/misc/theorem/grammar/thm.ml">thm.ml</a>).</p>
<pre class="brush: ocaml; title: ; notranslate">
type phrase = Str of string | Opts of phrase array array

let _ = Random.self_init ()

let randelt a = a.(Random.int (Array.length a))
let rec print phr = match phr with
  Str  s       -&gt; print_string s
| Opts options -&gt;
    let parts = randelt options in
    Array.iter print parts

(* Grammar definitions *)
let rec top = Opts [|
  [| theorem;|];
|]

and theorem = Opts [|
  [| Str &quot;\\begin{theorem}[&quot;; thname; Str &quot;]&quot;; thmstatement; Str &quot;. \\end{theorem} \\begin{proof} &quot;;
 proof; Str &quot; \\end{proof}&quot;;|];
|]
...
</pre>
<p>This is almost certainly more elaborate than necessary, but OCaml is a lovely language for recursive structures, and the code is nice and simple.  Running thm.ml spits out a ready-made LaTeX-ed theorem, which gets run through <a href="http://redsymbol.net/software/l2p/">LaTeX to png</a>, and cached, ready for viewing.</p>
<h3>The Philosophy of the Day</h3>
<p>My entry into the CS51 contest was a <a href="http://davidsd.org/2009/01/philosophy-of-the-day/">philosophy generator</a>, which spits out semi-plausible definitions of philosophies from numerous cultural and intellectual traditions.  It was never as clever as the theorem generator (which still cracks me up every time it produces anything attributed to Lipschitz), and the philosophies tend to involve concepts that are only of interest to nerdy Harvard freshmen.  It’s still amusing, though.  Here’s the <a href="/misc/philosophy/grammar/philo.gram">grammar definition</a>.</p>
<h3> Have Fun!</h3>
<p>The system I set up to create and run context-free grammars is pretty easy to use.  Grammar files are simple to write, and from there all you need is compile-grammar and an <a href="http://caml.inria.fr/">OCaml installation</a>.  Here, once again, are the relevant files</p>
<ol>
<li>
Some example grammar files: <a href="/misc/theorem/grammar/thm.gram">thm.gram</a>, <a href="/misc/philosophy/grammar/philo.gram">philo.gram</a>
</li>
<li>
Parse grammar files into ml code: <a href="/misc/theorem/grammar/compile-grammar">compile-grammar</a>
</li>
</ol>
<p>Anyone’s absolutely welcome to send extensions to thm.gram and philo.gram, or any new grammars you might write.  I’ll ajaxify them and post them here, if you like. Have fun!</p>
<img src="http://feeds.feedburner.com/~r/davidsd/~4/QLjgFqjKc-4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://davidsd.org/2009/01/the-real-theorem-generator-a-context-free-grammar/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://davidsd.org/2009/01/the-real-theorem-generator-a-context-free-grammar/</feedburner:origLink></item>
		<item>
		<title>Philosophy of the Day</title>
		<link>http://feedproxy.google.com/~r/davidsd/~3/W6pXZ4tPD0Q/</link>
		<comments>http://davidsd.org/2009/01/philosophy-of-the-day/#comments</comments>
		<pubDate>Wed, 21 Jan 2009 02:37:06 +0000</pubDate>
		<dc:creator>davidsd</dc:creator>
				<category><![CDATA[Humor]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://davidsd.org/?p=525</guid>
		<description><![CDATA[New Philosophy How it Works You press button. Button tell robot hand “pick up Henry David Thoreau, move over next to pond.” Robot hand pick up Henry David Thoreau, put him next to pond. Result recorded.]]></description>
				<content:encoded><![CDATA[<p><script type="text/javascript" src="/misc/philosophy/ajax.js"></script><span id="loader"></span>
<p><a href="javascript:void(0);" id="generate" class="button">New Philosophy</a></p>
<p><span id="more-525"></span></p>
<h3>How it Works</h3>
<p>You press button. Button tell robot hand “pick up Henry David Thoreau, move over next to pond.” Robot hand pick up Henry David Thoreau, put him next to pond. Result recorded.</p>
<div id="attachment_526" class="wp-caption aligncenter" style="width: 570px"><a href="http://davidsd.org/wp-content/uploads/2009/01/philo.jpg"><img src="http://davidsd.org/wp-content/uploads/2009/01/philo.jpg" alt="How we get the philosophy of the day" title="philo" width="450" class="size-full wp-image-526 noborder" /></a><p class="wp-caption-text">How we get the philosophy of the day</p></div>
<img src="http://feeds.feedburner.com/~r/davidsd/~4/W6pXZ4tPD0Q" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://davidsd.org/2009/01/philosophy-of-the-day/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://davidsd.org/2009/01/philosophy-of-the-day/</feedburner:origLink></item>
		<item>
		<title>Honda Needs a Tune-Up</title>
		<link>http://feedproxy.google.com/~r/davidsd/~3/qKgULwRs1HA/</link>
		<comments>http://davidsd.org/2008/12/honda-needs-a-tune-up/#comments</comments>
		<pubDate>Tue, 23 Dec 2008 12:45:09 +0000</pubDate>
		<dc:creator>davidsd</dc:creator>
				<category><![CDATA[Humor]]></category>
		<category><![CDATA[Music]]></category>
		<category><![CDATA[Physics]]></category>

		<guid isPermaLink="false">http://davidsd.org/?p=12</guid>
		<description><![CDATA[This is the story of how Honda engineers screwed up a big expensive project with a simple arithmetic mistake, tried to fudge their result with sound editing software, and congratulated themselves for being totally awesome. When I was a kid, my family used to drive up to The Pinery in Ontario, a beautiful park by [...]]]></description>
				<content:encoded><![CDATA[<p class="lead">This is the story of how Honda engineers screwed up a big expensive project with a simple arithmetic mistake, tried to fudge their result with sound editing software, and congratulated themselves for being totally awesome.</p>
<p>When I was a kid, my family used to drive up to The Pinery in Ontario, a beautiful park by Lake Huron.  Very scenic.  My favorite part, though, was a stretch of road a half-hour outside of the park.  To discourage reckless Canadians from barreling past the houses and barns, the local government carved five sets of grooves in the road before every stop sign.  Drive over them, and the car would vibrate: <em>“vbvbvbvb… vbvbvbvb… vbvbvbvb… vbvbvbvb… vbvbvbvb.”</em>  The faster you drive, the higher the pitch.</p>
<p>My <a href="http://music.case.edu/duffin/">Dad</a> is a musicologist, with a <a href="http://www.amazon.com/Equal-Temperament-Ruined-Harmony-Should/dp/0393062279">particular</a> <a href="http://music.case.edu/duffin/Vallotti/default.html">interest</a> <a href="http://music.case.edu/duffin/JustTuning/Index.html">in</a> <a href="http://music.case.edu/duffin/BaroqueTemp/Default.html">tuning</a>.  So there was <em>no way</em> he was going to pass up the chance to experiment with this instrument.  Every time we approached some grooves, he’d start fast over the first set, and try to slow down by the last set, to play a descending scale: G-F-E-D-C.  If there was no oncoming traffic after the stop sign, he’d swing over to the other side of the road and play an ascending scale as we sped up.  <span id="more-12"></span></p>
<p>Ratios of speeds correspond to ratios of vibration frequencies, which correspond to intervals between notes.  To play an ascending scale C-D-E-F-G, you need to drive at these ratios to your starting speed: <tt>1 — 9/8 — 5/4 — 4/3 — 3/2</tt> (for example, <tt>24 — 27 — 30 — 32 — 36</tt> mph)<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_0_12" id="identifier_0_12" class="footnote-link footnote-identifier-link" title="If anyone&#039;s wondering what happened to the 1/12th powers of 2 in this whole tuning discussion, I&#039;m using what&#039;s called Just Intonation, which is an (often better-sounding) approximation to the Equal Temperament system most people know.  Actually, it&#039;s really the other way around: the reason we use 12 equal semitones is that it lets us approximate nice integer ratios like 3/2, 4/3, 5/4, etc..  This is a long story that I&#039;m not going to get into here.">1</a>]</sup>.</p>
<p>Playing a scale with a ’95 Toyota Previa is not easy.  The notes tend to come out a little wonky — we’d get the half-step between E and F too wide, and with not enough space between F and G.  It usually sounded kinda modal… but still awesome.</p>
<h3>Professionals?</h3>
<p>So imagine my delight when I heard about this <a href="http://reviews.cnet.com/8301-13746_7-10049007-48.html">musical road [CNET]</a> that Honda built in Lancaster, CA..  A team of engineers carved some grooves into a highway that were carefully spaced to play the <a href="http://en.wikipedia.org/wiki/William_Tell_Overture">William Tell Overture</a> as you drive over them at a constant speed.  Awesome, right?  The problem is, it’s <em>spectacularly</em> out of tune.</p>
<p><iframe width="500" height="375" src="http://www.youtube.com/embed/sJFGacuxcSM?feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>Here’s the original melody:</p>
<p>And here’s the Honda road again:</p>
<p>The Honda version isn’t simply “out of tune”… the notes are just wrong.  The original starts with a rising 4th, F-B♭<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_1_12" id="identifier_1_12" class="footnote-link footnote-identifier-link" title="Actually, the starting note in the recording is around a B&amp;#9837;.  I&#039;m going to pretend like everything is in the key of B&amp;#9837; (so the starting note is F), since that&#039;s the key they talk about in the making-of videos.a picture of Honda&amp;#039;s scoreSorry to the perfect-pitch people.">2</a>]</sup>, and eventually reaches an octave above the starting note before descending to the tonic F-E♭-D-B♭.<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_2_12" id="identifier_2_12" class="footnote-link footnote-identifier-link" title="The original melody actually has a run down to the B&amp;#9837;: F-E&amp;#9837;-D-C-B&amp;#9837;.  Honda apparently decided this was too complicated and used a simplified version.  That&#039;s what I&#039;ll stick to here.">3</a>]</sup>  But Honda’s version starts with a rising major 3rd, and its top note is a major 6th above the starting note.  Some might have noticed that the last few notes in Honda’s commercial sound OK.  That’s because they edited over them!  I can prove it.</p>
<p id="william-tell-melody" class="wp-caption-text">Basic melody in the William Tell Overture (schematic)</p>
<p>The CNET article above speculates that Honda designed the road specifically for the Honda civic driving at the speed limit, and other cars might need to drive at a different speed to make it sound better.  But if you’re going at a constant speed, all that matters is the spacing between grooves.  Speeding up or slowing down just transposes everything.  It would be theoretically possible to “correct” the melody by driving at different speeds (like on the road to the Pinery).  But the notes on the musical road are too closely spaced for all but consummate musician Mario Andretti.</p>
<p>It also doesn’t matter what car you drive<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_3_12" id="identifier_3_12" class="footnote-link footnote-identifier-link" title="With one exception that can&#039;t fix the tuning.  See my comment, below.">4</a>]</sup>.  The vibration frequency is <tt>f = v/d</tt>, where v is the car’s speed, and d is the distance over which the road pattern repeats.  There’s no place in the equation for wheel spacing, tire size, side-impact airbags, etc.  All of these things affect the quality of the sound, but not the pitch.</p>
<p>So why is the musical road so unmusical?</p>
<h3>The Error</h3>
<p>Honda posted a series of 5 ridiculous videos: <a href="http://www.youtube.com/watch?v=gRiJlEte9l0">[Part 1]</a><a href="http://www.youtube.com/watch?v=wVynYbGhDcs">[Part 2]</a><a href="http://www.youtube.com/watch?v=gPWQ_TM6rsU">[Part 3]</a><a href="http://www.youtube.com/watch?v=1qmOR9lS6Pw">[Part 4]</a><a href="http://www.youtube.com/watch?v=Z8aJduZeAFk">[Part 5]</a>, in which they talk about all the hard work they did and congratulate themselves for being so awesome.  There are lots of complicated sounding numbers, there’s a “Mathematician/Musician,” and plenty of experts.  I’m sure some people behind the project understood what was going on.  But I think they failed to anticipate a basic misunderstanding on the part of the groove-designers.</p>
<p>In the fourth “making of” video, they mention that the initial note, a low F, has a spacing of 4 inches (4in) between grooves (1:47):</p>
<p><iframe width="500" height="375" src="http://www.youtube.com/embed/CbZp1nqnZlg?feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>From the video, it looks like the grooves themselves are about 1in wide.  Now, suppose you want to make the B♭ a 4th above F.  A perfect 4th is a fequency ratio of <tt>4/3</tt>, so you should multiply the width by a factor of <tt>3/4</tt>…  But the width of what?</p>
<div id="attachment_2039" class="wp-caption aligncenter" style="width: 530px"><a href="http://davidsd.org/wp-content/uploads/2008/12/measurements.jpg"><img src="http://davidsd.org/wp-content/uploads/2008/12/measurements.jpg" alt="" title="measurements" width="520" height="285" class="size-full wp-image-2039" /></a><p class="wp-caption-text">Based on the Civic’s 106.3 inch wheelbase, we can see from this picture that s+g is about 5 inches. Honda says the lowest note has a 4 inch spacing, so that’s consistent with 1 inch grooves.</p></div>
<p>The width that really matters is the total width of the spacing plus groove (s+g).  That’s the distance over which the road pattern repeats, so that’s the distance over which the car completes one vibration.<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_4_12" id="identifier_4_12" class="footnote-link footnote-identifier-link" title="More precisely, once you know the force driving the vibrations is periodic with period T=d/v, it follows that the vibrations themselves have that periodicity, so the Fourier transform of any resultant sound is only nonzero at integer multiples of f=1/T.  For more explanation, see the second comment, below.">5</a>]</sup>  Suppose you didn’t know this, and only changed the spacing, from <tt>s = 4in</tt> to <tt>s’ = 3/4 × 4in = 3in</tt>.  Then the frequency ratio is <tt>(s+g)/(s’+g) = (4+1)/(3+1) = 5/4</tt>, a major 3rd, not a perfect 4th.  What about the octave above the starting note?  An octave is a frequency ratio of <tt>2/1</tt>, but if you only changed the spacing to <tt>s’ = 1/2 × 4in = 2in</tt>, you’d get an actual ratio of <tt>(s+g)/(s’+g) = (4+1)/(2+1) = 5/3</tt>, a major 6th, not an octave.</p>
<p>Oops.</p>
<div id="attachment_2034" class="wp-caption aligncenter" style="width: 460px"><a href="http://davidsd.org/wp-content/uploads/2008/12/octave-bad.png"><img src="http://davidsd.org/wp-content/uploads/2008/12/octave-bad.png" alt="" title="octave-bad" width="450" height="60" class="size-full wp-image-2034 noborder" /></a><p class="wp-caption-text">making an octave, incorrectly</p></div>
<p>There are two ways you could correct this problem:</p>
<ol>
<li>Adjust the groove width g as well as the spacing s.  For instance, to make an octave, use a spacing <tt>s’ = 2in</tt> and a groove <tt>g’ = .5in</tt>, giving a fequency ratio <tt>(s+g)/(s’+g’) = 5/2.5 = 2/1</tt>.  This is probably hard with typical cutting tools.  Also, the engineers may have found that they need to make the grooves bigger than some minimum width to get a good sound. So on to method 2…
</li>
<li>
Over-adjust the groove spacing so that the total <tt>g+s</tt> is correct.  For instance, to make an octave, adjust the groove spacing to <tt>s’ = 1.5in</tt>, so you get a frequency ratio of <tt>(s+g)/(s’+g) = 5/2.5 = 2/1</tt>.
</li>
</ol>
<div id="attachment_2035" class="wp-caption aligncenter" style="width: 460px"><a href="http://davidsd.org/wp-content/uploads/2008/12/octave-good.png"><img src="http://davidsd.org/wp-content/uploads/2008/12/octave-good.png" alt="" title="octave-good" width="450" height="119" class="size-full wp-image-2035 noborder" style="padding-bottom: 15px; background: white;" /></a><p class="wp-caption-text">making an octave, correctly</p></div>
<h3>The Coverup</h3>
<p>Armed with this theory for why the musical road sounds so bad, I crunched some numbers in Mathematica, and was able to reproduce Honda’s result, sort of…</p>
<p>Here’s Mathematica playing the correct William Tell Overture:</p>
<p>And here’s Mathematica programmed to make the mistake I think Honda’s engineers made:</p>
<p>And here’s honda’s commercial version again:</p>
<p>Notice that a few notes in the commercial sound different from Mathematica’s version.  Particularly at the end.  Honda’s last few notes are sort of… in tune!  Turns out that’s a bit of Hollywood magic.  Here’s a recording I stole from a <a href="http://jalopnik.com/5053214/hondas-musical-road-to-be-paved-over">different video</a> of someone driving down the Musical Road<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_5_12" id="identifier_5_12" class="footnote-link footnote-identifier-link" title="I&#039;ve actually transposed it up to be in approximately the same key as the other recordings in this article.  By the way, there are hundreds of such videos on Youtube.">6</a>]</sup>:</p>
<p>What happened to the ending?  It’s all funky again.  Go back and listen to the Mathematica version that mimics Honda’s mistake.  Same funky ending<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_6_12" id="identifier_6_12" class="footnote-link footnote-identifier-link" title="Aside from a single passing note.  If I change the closing notes from F-E&amp;#9837;-D-B&amp;#9837;-D-B&amp;#9837; to F-E&amp;#9838;-D-B&amp;#9837;-D-B&amp;#9837;, and apply the Honda miscalculation, it sounds almost exactly like the undoctored recording of the musical road.  So it appears that there are two errors at work here: the groove spacing miscalculation, and replacing an E&amp;#9837; with an E&amp;#9838;.">7</a>]</sup>.  Whoever put together the Honda commercial must have edited over the ending, assuming that as long as the last few notes were correct, no one would notice anything wrong.<sup>[<a href="http://davidsd.org/2008/12/honda-needs-a-tune-up/#footnote_7_12" id="identifier_7_12" class="footnote-link footnote-identifier-link" title="It seems like Honda fixed up some of the other notes, too, to get a more pleasant sound.  Some might object that it&#039;s easy to make the notes sound bad by speeding up or slowing down as you drive down the road.  However, I don&#039;t hear anything like that in the random person&#039;s recording.  The melody returns to previous notes with reasonable accuracy, which it wouldn&#039;t do if the speed were varying.">8</a>]</sup></p>
<p>What I don’t understand is: if they were going to doctor the sound, why didn’t they just correct the whole thing?  It’s not that hard.  My dad did this version in about 20 minutes:</p>
<h3>Aftermath</h3>
<p>I learned something else kind of ridiculous from this analysis: if Honda didn’t doctor the overall pitch of the melody in their commercial, then they were speeding.  The opening frequency is about 238Hz, which corresponds to a speed of about 67mph if the road pattern repeats over 5in.  But they mention in one of the videos that the speed limit is 55! Crap.</p>
<p>In fact, in <a href="http://www.youtube.com/watch?v=z9-s7RI4tOs">this youtube video</a>, where they explicitly state they’re going 55mph, the melody starts a minor third below the Honda commercial.  A minor third is a frequency ratio of <tt>6/5</tt>, so this is consistent with Honda’s driver doing <tt>6/5 × 55mph = </tt>more than 10mph over the speed limit…</p>
<p>Another funny point is that some of the intervals you get from Honda’s miscalculation are pretty bizarre.  The D, a major 6th above the starting F, should have a frequency ratio of <tt>5/3</tt> above the starting frequency.  Instead, it has a ratio <tt>5/(4 × 3/5+1) = 25/17</tt>.  This isn’t really in the western scale.  It’s about 2/3rds of the way between an augmented 4th and a pure 5th.  Microtonal composers like <a href="http://en.wikipedia.org/wiki/Easley_Blackwood_Jr.">Easley Blackwood</a> might have found a use for it, but I don’t think it’s what Honda was after.</p>
<p>If I were them, I’d seriously consider paving over the road.  In fact, it seems like some local residents <a href="http://cbs2.com/local/William.Tell.Road.2.822008.html">might do it for them</a>.  There is another option, though.  If they bring in the bulldozers, and shuffle around a few chunks of asphalt at the end of the road, they might get a decent rendition of “When The Saints Go Marching In.”</p>
<p>Update [12/30/08]: Added picture comparing grooves to Civic wheelbase</p>
<p>Update [5/2/11]: I am both sorry and delighted to hear that they rebuilt the musical road (see, e.g., <a href="http://9teen87spostcards.blogspot.com/2011/05/musical-road-in-lancaster-california.html">here</a>), and they fixed nothing.  Here it is on April 28, 2011:</p>
<p><iframe width="500" height="281" src="http://www.youtube.com/embed/e0mtomTCG4c?feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>Just… wow.</p>
<ol class="footnotes"><li id="footnote_0_12" class="footnote">If anyone’s wondering what happened to the 1/12th powers of 2 in this whole tuning discussion, I’m using what’s called <a href="http://en.wikipedia.org/wiki/Just_intonation">Just Intonation</a>, which is an (often better-sounding) approximation to the <a href="http://en.wikipedia.org/wiki/Equal_temperament">Equal Temperament</a> system most people know.  Actually, it’s really the other way around: the reason we use 12 equal semitones is that it lets us approximate nice integer ratios like 3/2, 4/3, 5/4, etc..  This is a <a href="http://www.amazon.com/Equal-Temperament-Ruined-Harmony-Should/dp/0393062279">long story</a> that I’m not going to get into here.</li><li id="footnote_1_12" class="footnote">Actually, the starting note in the recording is around a B♭.  I’m going to pretend like everything is in the key of B♭ (so the starting note is F), since that’s the key they talk about in the making-of videos.<span id="attachment_197" class="wp-caption aligncenter" style="width: 350px"><img src="http://davidsd.org/wp-content/uploads/2008/12/honda_tell.png" alt="a picture of Honda&#039;s score" title="honda_tell" width="350" height="169" class="size-full wp-image-197" /><span class="wp-caption-text">a picture of Honda’s score</span></span>Sorry to the perfect-pitch people.</li><li id="footnote_2_12" class="footnote">The original melody actually has a run down to the B♭: F-E♭-D-C-B♭.  Honda apparently decided this was too complicated and used a simplified version.  That’s what I’ll stick to here.</li><li id="footnote_3_12" class="footnote">With one exception that can’t fix the tuning.  See <a href="#comment-4">my comment</a>, below.</li><li id="footnote_4_12" class="footnote">More precisely, once you know the force driving the vibrations is periodic with period T=d/v, it follows that the vibrations themselves have that periodicity, so the Fourier transform of any resultant sound is only nonzero at integer multiples of f=1/T.  For more explanation, see the <a href="#comment-4">second comment</a>, below.</li><li id="footnote_5_12" class="footnote">I’ve actually transposed it up to be in approximately the same key as the other recordings in this article.  By the way, there are hundreds of such videos on Youtube.</li><li id="footnote_6_12" class="footnote">Aside from a single passing note.  If I change the closing notes from F-E♭-D-B♭-D-B♭ to F-E♮-D-B♭-D-B♭, and apply the Honda miscalculation, it sounds <em>almost exactly</em> like the undoctored recording of the musical road.  So it appears that there are two errors at work here: the groove spacing miscalculation, and replacing an E♭ with an E♮.</li><li id="footnote_7_12" class="footnote">It seems like Honda fixed up some of the other notes, too, to get a more pleasant sound.  Some might object that it’s easy to make the notes sound bad by speeding up or slowing down as you drive down the road.  However, I don’t hear anything like that in the random person’s recording.  The melody returns to previous notes with reasonable accuracy, which it wouldn’t do if the speed were varying.</li></ol><img src="http://feeds.feedburner.com/~r/davidsd/~4/qKgULwRs1HA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://davidsd.org/2008/12/honda-needs-a-tune-up/feed/</wfw:commentRss>
		<slash:comments>35</slash:comments>
		<feedburner:origLink>http://davidsd.org/2008/12/honda-needs-a-tune-up/</feedburner:origLink></item>
	</channel>
</rss>
