<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Rules of Reason</title>
	<atom:link href="http://rulesofreason.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://rulesofreason.wordpress.com</link>
	<description>Uses and abuses of research on public policy and administration</description>
	<lastBuildDate>Thu, 23 Feb 2012 01:21:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='rulesofreason.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/81d89830f5c794ba4ff19a736d012742?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Rules of Reason</title>
		<link>http://rulesofreason.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://rulesofreason.wordpress.com/osd.xml" title="Rules of Reason" />
	<atom:link rel='hub' href='http://rulesofreason.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Explanation and the quest for &#8216;significant&#8217; relationships. Part II</title>
		<link>http://rulesofreason.wordpress.com/2012/02/22/explanation-and-the-quest-for-significant-relationships-part-ii/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/22/explanation-and-the-quest-for-significant-relationships-part-ii/#comments</comments>
		<pubDate>Wed, 22 Feb 2012 20:11:45 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Causality]]></category>
		<category><![CDATA[Observational studies]]></category>
		<category><![CDATA[causality]]></category>
		<category><![CDATA[observational studies]]></category>
		<category><![CDATA[explanation]]></category>
		<category><![CDATA[statistical significance]]></category>
		<category><![CDATA[practical signiicance]]></category>
		<category><![CDATA[observational research]]></category>
		<category><![CDATA[observations]]></category>
		<category><![CDATA[model fit]]></category>
		<category><![CDATA[causal explanation]]></category>
		<category><![CDATA[quantitative research]]></category>
		<category><![CDATA[empiricism]]></category>
		<category><![CDATA[statistical vs. practical significance]]></category>
		<category><![CDATA[Uncategorized and tagged causal effects]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=336</guid>
		<description><![CDATA[In Part I I argue that the search and discovery of statistically significant relationships does not amount to explanation and is often misplaced in the social sciences because the variables which are purported to have effects on the outcome cannot be manipulated. &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/22/explanation-and-the-quest-for-significant-relationships-part-ii/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=336&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In <a href="http://rulesofreason.wordpress.com/2012/02/17/explanation-and-the-quest-for-significant-relationships-part-i/" target="_blank">Part I</a> I argue that the search and discovery of statistically significant relationships does not amount to explanation and is often misplaced in the social sciences because the variables which are purported to have <em>effects </em>on the outcome cannot be manipulated.</p>
<p>Just to make sure that my message is not misinterpreted &#8211; I am not arguing for a fixation on maximizing R-squared and other measures of model fit in statistical work, instead of the current focus on the size and significance of individual coefficients. R-squared has been rightly criticized as a standard of how good a model is** (see for example <a href="http://www.stat.cmu.edu/~cshalizi/uADA/12/lectures/ch02.pdf" target="_blank">here</a>). But I am not aware of any other measure or standard that can convincingly compare the explanatory potential of different models in different contexts. Predictive success might be one way to go, but prediction is altogether something else than explanation.</p>
<p>I don&#8217;t expect much to change in the future with regard to the problem I outlined. In practice, all one could hope for is some clarity on the part of the researchers whether their objective is to explain (account for) or find significant effects. The standards for evaluating progress towards the former objective (model fit, predictive success, &#8216;coverage&#8217; in the QCA sense) should be different than the standards for the latter (statistical &amp; practical significance <em>and</em> the practical possibility to manipulate the exogenous variables).</p>
<p>Take the so-called garbage-can regressions, for example. These are models with tens of variables all of which are interpreted causally if they reach the magic 5% significance level. The futility of this approach is matched only by its popularity in political science and public administration research. If the research objective is to explore a causal relationship, one better focus on that variable and  include covariates only if it is suspected that they are correlated with the outcome <em>and</em> with the main independent variable of interest. Including everything else that happens to be within easy reach not only leads to inefficiency in the estimation. One should refrain from  interpreting causally the significance of these covariates altogether. On the other hand, if the objective is to comprehensively explain (account for) a certain phenomenon, than including as many variables as possible might be warranted but than the significance of individual variables is of little interest.</p>
<p>The goal of research is important when choosing the research design and the analytic approach. Different standards apply to explanation, the discovery of causal effects, and prediction. <em></em></p>
<p><em>**Just one small example from my current work &#8211; a model with one dependent and one exogenous time-series variables in levels with a lagged dependent variable included on the right-hand side of the equation produces an R-squared of 0.93. The same model in first differences has an R-squared of 0.03 while the regression coefficient of the exogenous variable remains significant in both models. So we can &#8216;explain&#8217; 90% of the variation in the first case by reference to the past values of the outcome. Does this amount to an explanation in any meaningful sense? I guess that depends on the context. Does it provide any leverage to the researcher to manipulate the outcome? Not at all.</em></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/336/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/336/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/336/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=336&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/22/explanation-and-the-quest-for-significant-relationships-part-ii/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>
	</item>
		<item>
		<title>Explanation and the quest for &#8216;significant&#8217; relationships. Part I</title>
		<link>http://rulesofreason.wordpress.com/2012/02/17/explanation-and-the-quest-for-significant-relationships-part-i/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/17/explanation-and-the-quest-for-significant-relationships-part-i/#comments</comments>
		<pubDate>Fri, 17 Feb 2012 12:34:36 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Causality]]></category>
		<category><![CDATA[Observational studies]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[causal effects]]></category>
		<category><![CDATA[causal explanation]]></category>
		<category><![CDATA[empiricism]]></category>
		<category><![CDATA[explanation]]></category>
		<category><![CDATA[model fit]]></category>
		<category><![CDATA[observational research]]></category>
		<category><![CDATA[observations]]></category>
		<category><![CDATA[practical signiicance]]></category>
		<category><![CDATA[quantitative research]]></category>
		<category><![CDATA[statistical significance]]></category>
		<category><![CDATA[statistical vs. practical significance]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=331</guid>
		<description><![CDATA[The ultimate goal of social science is causal explanation*. The actual goal of most academic research is to discover significant relationships between variables. The two goals are supposed to be strongly related &#8211; by discovering (the) significant effects of exogenous (independent) variables, &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/17/explanation-and-the-quest-for-significant-relationships-part-i/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=331&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The ultimate goal of social science is causal explanation*. The actual goal of most academic research is to discover significant relationships between variables. The two goals are supposed to be strongly related &#8211; by discovering (the) significant effects of exogenous (independent) variables, one accounts for the outcome of interest. In fact, the working assumption of the empiricist paradigm of social science research is that the two goals are essentially the same &#8211; explanation<strong> is</strong> the sum of the significant effects that we have discovered. Just look at what all the academic articles with &#8216;explanation&#8217;, &#8216;determinants&#8217;, and &#8217;causes&#8217; in their titles do &#8211; they report significant effects, or associations, between variables.</p>
<p>The problem is that <strong>explanation and collecting significant associations are not the same</strong>. Of course they are not. The point is obvious to all uninitiated into the quantitative empiricist tradition of doing research, but seems to be lost to many of its practitioners. We could have discovered a significant determinant of X, and still be miles (or even light-years) away from a convincing explanation of why and when X occurs. This is <strong>not</strong> because of the difficulties of causal identification &#8211; we could have satisfied all conditions for causal inference from observational data, but the problem still stays. And it would not go away after we pay attention (as we should) to the fact that statistical significance is not the same as practical significance. Even the discovery of convincingly-identified causal effects, large enough to be of practical rather than only statistical significance, does not amount to explanation. A successful explanation needs to account for the variation in X, and causal associations need not to &#8211; they might be significant but not even make a visible dent in the unexplained variation in X. The difference I am talking about is partly akin to the difference between looking at the significance of individual regression coefficients and looking at the model fit as a whole (more on that will follow in Part II). The current standards of social science research tend to emphasize the former rather than the later which allows for significant relationships to be sold as explanations.</p>
<p>The objection can be made that the discovery of causal effects is all we should aim for, and all we could hope for. Even if a causal relationship doesn&#8217;t account for large amounts of variation in the outcome of interest, it still <strong>makes a difference.  </strong>After all, this is the approach taken in epidemiology, agricultural sciences and other fields (like beer production) where the statistical research paradigm has its origins. A pill might not treat all headaches but if it has a positive and statistically-significant effect, it will still help millions. But here is the trick &#8211; the quest for statistically significant relationships in epidemiology, agriculture, etc. is valuable because all these effects can be considered as <em>interventions &#8211; </em>the researchers have control over the formula of the pill, or the amount of pesticide, or the type of hops. In contrast, social science researchers too often seek and discover significant relationships between an outcome and variables that couldn&#8217;t even remotely be considered as interventions. So we end up with a pile of significant relationships which do not account for enough variation to count as a proper explanation and they have no value as interventions as their manipulation is beyond our reach. <strong>To sum up, observational social science has borrowed an approach to causality which makes sense for experimental research, and applied its standards (namely, statistical significance) to a context where the discovery of significant relationships is less valuable because the &#8217;treatments&#8217; cannot be manipulated. </strong>Meanwhile, what should really count &#8211; explaining when, how and why a phenomenon happens, is relegated to the background in the false belief that somehow the quest for significant relationships is a substitute. It is like trying to discover the fundamental function of the lungs with epidemiological methods, and claiming success when you prove that cold air reduces significantly lung capacity. While the inference might still be valuable, it is no substitue for the original goal.</p>
<p>In Part II, I will discuss what needs to be changed, and what can be changed in the current practice of empirical social science research to address the problem outlined above.</p>
<p><em>*In my understanding, all explanation is causal. Hence, &#8216;causal explanation&#8217; is tautology. Hence, I am gonna drop the &#8216;causal&#8217; part for the rest of the text.</em></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/331/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=331&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/17/explanation-and-the-quest-for-significant-relationships-part-i/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>
	</item>
		<item>
		<title>Google tries to find the funniest videos</title>
		<link>http://rulesofreason.wordpress.com/2012/02/15/google-tries-to-find-the-funniest-videos/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/15/google-tries-to-find-the-funniest-videos/#comments</comments>
		<pubDate>Wed, 15 Feb 2012 18:45:40 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Advertising research]]></category>
		<category><![CDATA[Causality]]></category>
		<category><![CDATA[Humour]]></category>
		<category><![CDATA[comedic potential]]></category>
		<category><![CDATA[funny videos]]></category>
		<category><![CDATA[predicting funny videos]]></category>
		<category><![CDATA[prediction vs. explanation]]></category>
		<category><![CDATA[quantifying comedy]]></category>
		<category><![CDATA[Sanketh Shetty]]></category>
		<category><![CDATA[science misinterpretation in the news]]></category>
		<category><![CDATA[viral videos]]></category>
		<category><![CDATA[youtube comedy slam]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=324</guid>
		<description><![CDATA[Following my recent post on the project which tries to explain why some video clips go viral, here is a report on Google&#8217;s efforts to find the funniest videos: You’d think the reasons for something being funny were beyond the &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/15/google-tries-to-find-the-funniest-videos/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=324&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Following my recent <a href="http://rulesofreason.wordpress.com/2011/11/28/what-makes-a-video-go-viral/" target="_blank">post</a> on the project which tries to explain why some video clips go viral, here is a <a href="http://www.dailymail.co.uk/sciencetech/article-2099286/Google-formula-says-No-No-No-No-Cat-funniest-YouTube-clip.html" target="_blank">report</a> on Google&#8217;s efforts to find the funniest videos:</p>
<blockquote><p>You’d think the reasons for something being funny were beyond the reach of science – but Google’s brain-box researchers have managed to come up with a formula for working out which YouTube video clips are the funniest.</p></blockquote>
<p>The Google researcher behind the project is quoted saying:</p>
<blockquote><p>‘If a user uses an “loooooool” vs an “loool”, does it mean they were more amused? We designed features to quantify the degree of emphasis on words associated with amusement in viewer comments.’</p></blockquote>
<p>Other factors taken into account are tags, descriptions, and &#8216;<em>whether audible laughter can be heard in the background</em>&#8216;. Ultimately, the algorithm gives a ranking of the funniest videos  (with <a href="http://www.youtube.com/watch?v=oKI-tD0L18A" target="_blank">No No No No Cat</a> on top, since you asked).</p>
<p>Now I usually have high respect for all things Google, but this &#8216;research&#8217; at first appeared to be a total piece of junk. Of course, it turned out that it is just the way it is reported by the Daily Mail (cited above), <a href="http://www.newscientist.com/blogs/onepercent/2012/02/google-algorithm-picks-funnies.html" target="_blank">New Scientist</a> and countless other more or less reputable outlets.</p>
<p>Google&#8217;s new algorithm does not provide a <strong>normative ranking</strong> of the funniest videos ever based on some objective criteria; it is a <strong>predictive</strong> score about the video&#8217;s comedic potential. Google trained the algorithm on a bunch of videos (it&#8217;s unclear from the original <a href="http://googleresearch.blogspot.com.au/?v=0" target="_blank">source</a> what the external &#8216;fun&#8217; measure used for the training part was) in order to inductively extract features  associated with the video being funny. Based on these features, the program can then score any possible video. But these scores are not normative measures, they are predictions. So <a href="http://www.youtube.com/watch?v=oKI-tD0L18A" target="_blank">No No No No Cat</a> is not the funniest video ever <em>[well, it might be, it's pretty hilarious actually]</em>, it is Google&#8217;s safest bet that the video would be considered funny.</p>
<p>The story is worth mentioning not only because it exposes yet another case of gross misinterpretation of a scientific project in the news, but because it nicely illustrates the differences between measurement, prediction, and explanation. The newspapers have taken Google&#8217;s project to be an exercise in <strong>measurement</strong>. As explained above, the goal is actually <strong>predictive</strong> in nature. But even if the algorithm has 100% success rate in identifying potentially funny videos, that would still not count as an <strong>explanation</strong> of what makes a video funny. Just think about it &#8211; would a boring video become funny if we just put funny tags, background laughter, and plenty of  loools in the comments? Not really. In that respect <a href="http://rulesofreason.wordpress.com/2011/11/28/what-makes-a-video-go-viral/" target="_blank">Brent Coker&#8217;s approach</a>, which I mentioned in a previous post, has real explanatory potential (although I doubt whether it has any explanatory power).</p>
<p>So, no need to panic, the <em>formula for something being funny</em> is as distant as ever.</p>
<p><em>P.S.</em> In an ironic turn of events, now that  <a href="http://www.youtube.com/watch?v=oKI-tD0L18A" target="_blank">No No No No Cat</a> <strong>has</strong> gone viral, Google would never know whether the algorithm was very good, or just everyone wanted to see the video Google declared the funnies ever. Ah, the joys of social science research!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/324/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/324/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/324/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/324/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/324/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/324/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/324/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/324/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/324/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/324/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/324/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/324/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/324/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/324/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=324&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/15/google-tries-to-find-the-funniest-videos/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>
	</item>
		<item>
		<title>Hyperlinks</title>
		<link>http://rulesofreason.wordpress.com/2012/02/14/hyperlinks-9/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/14/hyperlinks-9/#comments</comments>
		<pubDate>Tue, 14 Feb 2012 10:05:42 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Hyperlinks]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=320</guid>
		<description><![CDATA[Big data for evaluating education Should have been done long ago, no? Neanderthals painted Most relaxing song ever [?!?]  Testosterone, digit ratio, and abstract reasoning ability [via MindBlog] &#8216;North Korea&#8217; by Damir Šagolj 1st Prize World Press Photo 2012.  Daily Life &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/14/hyperlinks-9/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=320&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.insidehighered.com/news/2012/02/01/using-big-data-predict-online-student-success" target="_blank">Big data for evaluating education</a> <em>Should have been done long ago, no?</em></p>
<p><a href="http://www.newscientist.com/article/dn21458-first-neanderthal-cave-paintings-discovered-in-spain.html?DCMP=OTC-rss&amp;nsref=online-news" target="_blank">Neanderthals painted </a></p>
<p><a href="http://soundcloud.com/justmusiclabel/marconi-union-weightless/s-kttxT" target="_blank">Most relaxing song eve</a>r <em>[?!?] </em></p>
<p><a href="http://mindblog.dericbownds.net/2012/02/testosterone-digit-ratio-and-abstract.html" target="_blank">Testosterone, digit ratio, and abstract reasoning ability</a> <em>[via MindBlog]</em></p>
<p>&#8216;North Korea&#8217; <em>by <a href="http://damirsagolj.wordpress.com/" target="_blank">Damir Šagolj </a>1st Prize <a href="http://www.worldpressphoto.org/gallery/2012-world-press-photo" target="_blank">World Press Photo 2012</a>.  Daily Life Category</em><a href="http://rulesofreason.files.wordpress.com/2012/02/potd7-e1318434855867.jpeg"><img class="alignnone size-full wp-image-321" title="potd7-e1318434855867" src="http://rulesofreason.files.wordpress.com/2012/02/potd7-e1318434855867.jpeg?w=584&#038;h=438" alt="" width="584" height="438" /></a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/320/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=320&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/14/hyperlinks-9/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>

		<media:content url="http://rulesofreason.files.wordpress.com/2012/02/potd7-e1318434855867.jpeg" medium="image">
			<media:title type="html">potd7-e1318434855867</media:title>
		</media:content>
	</item>
		<item>
		<title>Weighted variance and weighted coefficient of variation</title>
		<link>http://rulesofreason.wordpress.com/2012/02/13/weighted-variance-and-weighted-coefficient-of-variation/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/13/weighted-variance-and-weighted-coefficient-of-variation/#comments</comments>
		<pubDate>Mon, 13 Feb 2012 20:22:53 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Measurement]]></category>
		<category><![CDATA[asylum policy]]></category>
		<category><![CDATA[coefficient of variation]]></category>
		<category><![CDATA[estimating variability]]></category>
		<category><![CDATA[measures of variability]]></category>
		<category><![CDATA[variance]]></category>
		<category><![CDATA[weighted coefficient of variation]]></category>
		<category><![CDATA[weighted mean]]></category>
		<category><![CDATA[weighted variance]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=290</guid>
		<description><![CDATA[Often we want to compare the variability of a variable in different contexts &#8211; say, the variability of unemployment in different countries over time, or the variability of height in two populations, etc. The most often used measures of variability are &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/13/weighted-variance-and-weighted-coefficient-of-variation/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=290&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Often we want to compare the <em>variability</em> of a variable in different contexts &#8211; say, the variability of unemployment in different countries over time, or the variability of height in two populations, etc. The most often used measures of variability are the <a href="http://en.wikipedia.org/wiki/Variance" target="_blank">variance</a> and the <a href="http://en.wikipedia.org/wiki/Standard_deviation" target="_blank">standard deviation</a> (which is just the square root of the <a href="http://en.wikipedia.org/wiki/Variance" target="_blank">variance</a>). However, for some types of data, these measures are not entirely appropriate. For example, when data is generated by a <a href="http://en.wikipedia.org/wiki/Poisson_distribution" target="_blank">Poisson process</a> (e.g. when you have counts of rare events) the mean equals the variance <strong>by definition</strong>. Clearly, comparing the variability of two Poisson distributions using the <em>variance</em> or the <em>standard deviation</em> would not work if the means of these populations differ. A common and easy fix is to use the <a href="http://en.wikipedia.org/wiki/Coefficient_of_variation" target="_blank">coefficient of variation</a> instead, which is simply the standard deviation divided by the mean. So far, so good.</p>
<p>Things get tricky however when we want to calculate the <em>weighted coefficient of variation</em>. The <em>weighted mean</em> is just the mean but <a href="http://en.wikipedia.org/wiki/Weighted_mean" target="_blank">some data points contribute more than others</a>. For example the mean of 0.4 and 0.8 is 0.6. If we assign the weights 0.9 to the first observation [0.4] and 0.1 to the second [0.8], the weighted mean is (0.9*0.4+0.1*0.8)/1, which equals to 0.44. You would guess that we can compute the weighted variance by analogy,  and you would be wrong.</p>
<p>For example, the sample variance of {0.4,0.8} is given by [Wikipedia]:<a href="http://rulesofreason.files.wordpress.com/2012/02/sample-variance.png"><img class="alignnone size-full wp-image-298" title="sample variance" src="http://rulesofreason.files.wordpress.com/2012/02/sample-variance.png?w=584" alt=""   /></a></p>
<p>or in our example <strong>(</strong>(0.4-0.6)^2+(0.8-0.6)^2<strong>)</strong> <strong>/</strong> <strong>(</strong>2-1<strong>)</strong> which equals to 0.02. But, the <em>weighted sample variance</em> <strong>cannot</strong> be computed by simply adding the weights to the above formula <strong>(0.9*</strong>(0.4-0.6)^2+<strong>0.1*</strong>(0.8-0.6)^2<strong>)</strong> <strong>/</strong> <strong>(</strong>2-1<strong>). </strong>The formula for the weighted variance is different [<a href="http://en.wikipedia.org/wiki/Weighted_mean" target="_blank">Wikipedia</a>]:<br />
<a href="http://rulesofreason.files.wordpress.com/2012/02/weighted-sample-variance1.png"><img class="alignnone size-full wp-image-300" title="weighted sample variance" src="http://rulesofreason.files.wordpress.com/2012/02/weighted-sample-variance1.png?w=584" alt=""   /></a><br />
where V1 is the sum of the weights and V2 is the sum of squared weights:<img title="v2" src="http://rulesofreason.files.wordpress.com/2012/02/v21.png?w=98&#038;h=48" alt="" width="98" height="48" />.<br />
The next steps are straightforward: the <em>weighted standard deviation</em> is the square root of the above, and the <em>weighted coefficient of variation</em> is the weighted standard deviation divided by the weighted mean.</p>
<p>Although there is nothing new here, I thought it&#8217;s a good idea to put it together because it appears to be causing some confusion.  For example, in the latest issue of <a href="http://eup.sagepub.com/" target="_blank">European Union Politics</a> you can find the <a href="http://eup.sagepub.com/content/13/1/70.abstract" target="_blank">article</a> &#8216;Measuring common standards  and equal responsibility-sharing in EU asylum outcome data&#8217;  by a team of scientists from LSE. On page 74, you can read that:</p>
<blockquote><p>The weighted variance <em>[of the set p={0.38, 0.42} with weights W={0.50,0.50}] </em>equals 0.5(0.38-.0.40)^2+0.5(0.42-0.40)^2 =0.0004.</p></blockquote>
<p>As explained above, this is not generally correct unless the biased (population) rather than the unbiased (sample)  weighted variance is meant. When calculated properly, the weighted variance turns out to be 0.0008. <a href="https://stat.ethz.ch/pipermail/r-help/2008-July/168762.html" target="_blank">Here</a> you can find the function Gavin Simpson has provided  for calculating the weighted variance in R and try for yourself.</p>
<p><em>P.S.</em> To be clear, the weighted variance issue is not central to the argument of the article cited above but is significant as the authors discuss at length the methodology for estimating variability in data and introduce the so-called Coffey-Feingold-Broomberg measure of variability which the authors  deem more appropriate for proportions.</p>
<p><em>P.P.S</em> On the internet, there is yet more confusion: for example, <a href="http://www.itl.nist.gov/div898/software/dataplot/refman2/ch2/weightsd.pdf" target="_blank">this document</a> (which pops high in the Google results) has yet a different formula, shown in a slightly different form <a href="http://stats.stackexchange.com/questions/6534/how-do-i-calculate-a-weighted-standard-deviation-in-excel" target="_blank">here</a>  as  well.</p>
<p><em>Disclaimer.</em> I have a forthcoming <a href="http://www.dimiter.eu/articles/europeanization%20of%20asylum.pdf" target="_blank">paper</a> on the same topic (asylum policy) as the EUP article mentioned above.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/290/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/290/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/290/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/290/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/290/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/290/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/290/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=290&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/13/weighted-variance-and-weighted-coefficient-of-variation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>

		<media:content url="http://rulesofreason.files.wordpress.com/2012/02/sample-variance.png" medium="image">
			<media:title type="html">sample variance</media:title>
		</media:content>

		<media:content url="http://rulesofreason.files.wordpress.com/2012/02/weighted-sample-variance1.png" medium="image">
			<media:title type="html">weighted sample variance</media:title>
		</media:content>

		<media:content url="http://rulesofreason.files.wordpress.com/2012/02/v21.png" medium="image">
			<media:title type="html">v2</media:title>
		</media:content>
	</item>
		<item>
		<title>The Good, the Bad, and the Stranger</title>
		<link>http://rulesofreason.wordpress.com/2012/02/10/the-good-the-bad-and-the-stranger/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/10/the-good-the-bad-and-the-stranger/#comments</comments>
		<pubDate>Fri, 10 Feb 2012 11:12:47 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Immigration and asylum policy]]></category>
		<category><![CDATA[capitalism]]></category>
		<category><![CDATA[emigration]]></category>
		<category><![CDATA[immigration policy]]></category>
		<category><![CDATA[nationalism]]></category>
		<category><![CDATA[socialism]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=312</guid>
		<description><![CDATA[Once upon a time, in a land far away, there lived two brothers. The first brother was like an ox: strong, dutiful and hard-working. The second brother was like a rotten apple – useless, menacing, and foul. The first brother &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/10/the-good-the-bad-and-the-stranger/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=312&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Once upon a time, in a land far away, there lived two brothers. The first brother was like an ox: strong, dutiful and hard-working. The second brother was like a rotten apple – useless, menacing, and foul. The first brother set up a small enterprise which quickly took root and sprawled. Soon, he needed to hire a helping hand. He could either employ his brother, who was wicked and lazy, but still a relation, or a Stranger who was diligent and qualified, but came from some distant God-forsaken place. At this point the story forks and you, the reader, have to choose which path to take:</p>
<p> - You hire the stranger. The enterprise grows and prospers. Your brother vanishes in misery. Every Christmas you send him a present to an address he has long abandoned. <em>This is the way of the capitalist.</em></p>
<p>- You hire the brother. He might be trouble, but he is your own blood. And, on his advice you close your community to strangers. Soon, your brother stops showing for work, and more often than not shows up drunk. You quarrel, and curse but you stay loyal, and the enterprise rapidly goes into wreck. But you go down together. <em>This is the way of the nationalist.</em></p>
<p>- You hire the stranger. Every month you take a generous slice from your profit, and a big cut from the stranger’s salary and you give them to your brother. Your brother acquires a big TV, junk food addiction and a feeling of entitlement which leads him to wrangle every time your contributions are late. But the enterprise survives and your conscience is clear. <em>This is the way of the socialist.</em></p>
<p>But whatever you chose, the good times come to an end, the fat years are over, and a long and painful crisis settles in the land. In the capitalist path of the story, the stranger, who has been saving during all the good years, buys the enterprise from you. You ask him to employ you, but he hires <em>his</em> brother instead and kicks you out of the door.</p>
<p>In the nationalist path of the story the good times were over long time ago anyways. You had long since reached the bottom and what only keeps you alive is the deep hatred of your neighbors, which is the one remaining thing that you share with your brother.</p>
<p>In the socialist path of the story, your brother suddenly feels the pain when your monthly contributions dry up. He accuses the stranger of stealing his job and having no right to be here and causing too much trouble altogether. He starts to pester you, to beg and to threaten. Finally, you succumb, kick out the stranger and hire your brother instead. But he has never done a day of work, so he quickly develops back problems and sues you for damage, which drives the enterprise to its end.</p>
<p>So what is the moral of this story? I don’t know, you tell me, I’m just the stranger.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/312/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/312/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/312/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=312&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/10/the-good-the-bad-and-the-stranger/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>
	</item>
		<item>
		<title>No use for big data in electioneering, according to Hollywood</title>
		<link>http://rulesofreason.wordpress.com/2012/02/08/no-use-for-big-data-in-electioneering-according-to-holywood/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/08/no-use-for-big-data-in-electioneering-according-to-holywood/#comments</comments>
		<pubDate>Wed, 08 Feb 2012 15:06:07 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Humour]]></category>
		<category><![CDATA[Voting and elections]]></category>
		<category><![CDATA[big data movies]]></category>
		<category><![CDATA[electioneering]]></category>
		<category><![CDATA[inside job]]></category>
		<category><![CDATA[margin call]]></category>
		<category><![CDATA[movies about politics]]></category>
		<category><![CDATA[movies about statistics]]></category>
		<category><![CDATA[nowcasts]]></category>
		<category><![CDATA[the ides of march]]></category>
		<category><![CDATA[the war room]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=307</guid>
		<description><![CDATA[Over the last year two major Hollywood movies that touch upon the use of big data and sophisticated data analysis hit the big screen. Which, of course, is two more than the mean (or was that the median). Moneyball shows &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/08/no-use-for-big-data-in-electioneering-according-to-holywood/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=307&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Over the last year two major Hollywood movies that touch upon the use of big data and sophisticated data analysis hit the big screen. Which, of course, is two more than the mean (or was that the median). <a href="http://www.imdb.com/title/tt1210166/" target="_blank">Moneyball</a> shows how crunching numbers helps win baseball games and <a href="http://www.imdb.com/title/tt1615147/" target="_blank">Margin Call</a> shows how crunching numbers helps ruin financial firms. It&#8217;s kind of fun to see Brad Pitt and Kevin Spacey stare at spreadsheets and nod approvingly while being explained some statistical subtleties. But watching someone stare at somebody else&#8217;s spreadsheets quickly becomes tiresome &#8230; which probably explains why Regressing with the Stars, Dotchart Master, and America&#8217;s Next Multilevel Model haven&#8217;t yet taken over reality TV.</p>
<p>So I was really disappointed to see that a third 2011 movie &#8211; <a href="http://www.imdb.com/title/tt1124035/" target="_blank">The Ides of March</a> &#8211; misses a golden opportunity to show the use of big data and sophisticated analysis for winning elections. The movie revolves around the primary presidential campaign of George Clooney (pardon, Governor Mike Morris) and the dirty politics behind the scenes. But for Hollywood in 2011, electioneering is still a game of horse-trading, media spinning and good-ol&#8217; stabs in the back. All these things about election campaigns are probably true, but I was disappointed that there were no fancy graphs plotting approval ratings and <a href="http://www.intrade.com/v4/home/" target="_blank">prediction market</a> quotes, no real-time election <a href="http://fivethirtyeight.blogs.nytimes.com/" target="_blank">forecasts</a> (or <a href="http://themonkeycage.org/blog/2011/12/15/13232/" target="_blank">nowcasts</a>) at which  George Clooney to stare and nod approvingly, no GIS-supported campaign targeting, not even focus groups, twits, facebook pages, not to speak of google circles. Now, I have never been involved  in an election campaign but I would have guessed that some of what political scientists are doing to analyze election outcomes and the effects of various elements of election campaigns has filtered through to campaign managers. But according to <a href="http://www.imdb.com/title/tt1124035/" target="_blank">The Ides of March</a>, electioneering is still stuck in the 1990-s. Someone get Hollywood a subscription to<a href="http://pan.oxfordjournals.org/" target="_blank"> Political Analysis</a>.</p>
<p>In fact, the only difference between <a href="http://www.imdb.com/title/tt1124035/" target="_blank">The Ides of March</a> and <a href="http://www.imdb.com/title/tt0108515/" target="_blank">The War Room</a> &#8211; the 1993 documentary about Bill Clinton&#8217;s 1992 presidential campaign &#8211; is that the actors in <a href="http://www.imdb.com/title/tt1124035/" target="_blank">The Ides of March</a> wear less hideous suits. And the intern is blond (just joking). Now when I think about it, the documentary <a href="http://www.imdb.com/title/tt0108515/" target="_blank">The War Room</a> actually packs more drama and suspense than the scripted <a href="http://www.imdb.com/title/tt1124035/" target="_blank">The Ides of March</a>. Which in fact is true about the documentary <a href="http://www.imdb.com/title/tt1645089/" target="_blank">Inside Job</a> vis-a-vis <a href="http://www.imdb.com/title/tt1615147/" target="_blank">Margin Call</a> as well.</p>
<p>P.S. My recent movie ratings can be found <a href="http://demetriodor.wordpress.com/2012/01/28/recently-watched-movies-january-february-2012/" target="_blank">here</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/307/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/307/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/307/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/307/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/307/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/307/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/307/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/307/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/307/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/307/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/307/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/307/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/307/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/307/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=307&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/08/no-use-for-big-data-in-electioneering-according-to-holywood/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>
	</item>
		<item>
		<title>Denial of refusal</title>
		<link>http://rulesofreason.wordpress.com/2012/02/07/denial-of-refusal/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/07/denial-of-refusal/#comments</comments>
		<pubDate>Tue, 07 Feb 2012 19:46:12 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Humour]]></category>
		<category><![CDATA[The profession]]></category>
		<category><![CDATA[job application]]></category>
		<category><![CDATA[rejection letter]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=294</guid>
		<description><![CDATA[via Failblog &#160;<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=294&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://rulesofreason.files.wordpress.com/2012/02/job-fails-monday-thru-friday-heres-hoping-that-they-accepted-his-refusal-of-his-denial.jpg"><img class="alignnone size-full wp-image-295" title="job-fails-monday-thru-friday-heres-hoping-that-they-accepted-his-refusal-of-his-denial" src="http://rulesofreason.files.wordpress.com/2012/02/job-fails-monday-thru-friday-heres-hoping-that-they-accepted-his-refusal-of-his-denial.jpg?w=584" alt=""   /></a></p>
<p>via <a href="http://failblog.org/2012/02/07/job-fails-monday-thru-friday-heres-hoping-that-they-accepted-his-refusal-of-his-denial/" target="_blank">Failblog</a></p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/294/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/294/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/294/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/294/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/294/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/294/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/294/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/294/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/294/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/294/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/294/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/294/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/294/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/294/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=294&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/07/denial-of-refusal/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>

		<media:content url="http://rulesofreason.files.wordpress.com/2012/02/job-fails-monday-thru-friday-heres-hoping-that-they-accepted-his-refusal-of-his-denial.jpg" medium="image">
			<media:title type="html">job-fails-monday-thru-friday-heres-hoping-that-they-accepted-his-refusal-of-his-denial</media:title>
		</media:content>
	</item>
		<item>
		<title>Hyperlinks</title>
		<link>http://rulesofreason.wordpress.com/2012/02/03/hyperlinks-8/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/03/hyperlinks-8/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 09:12:07 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Hyperlinks]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=287</guid>
		<description><![CDATA[Science visualization challenge 2011 192 answers to the question &#8216;What is your favorite deep, elegant, or beautiful explanation?&#8217; Higher education for the masses [commentary by Felix Salmon] Researchers feel pressure to cite superfluous papers<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=287&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sciencemag.org/site/special/vis2011/" target="_blank">Science visualization challenge 2011</a></p>
<p><a href="http://edge.org/responses/what-is-your-favorite-deep-elegant-or-beautiful-explanation" target="_blank">192 answers to the question &#8216;What is your favorite deep, elegant, or beautiful explanation?&#8217;</a></p>
<p><a href="http://www.udacity.com/" target="_blank">Higher education for the masses</a> [<a href="http://blogs.reuters.com/felix-salmon/2012/01/23/udacity-and-the-future-of-online-universities/" target="_blank">commentary by Felix Salmon</a>]</p>
<p><a href="http://www.nature.com/news/researchers-feel-pressure-to-cite-superfluous-papers-1.9968" target="_blank">Researchers feel pressure to cite superfluous papers</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/287/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/287/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/287/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/287/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/287/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/287/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/287/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/287/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/287/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/287/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/287/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/287/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/287/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/287/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=287&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/03/hyperlinks-8/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>
	</item>
		<item>
		<title>Writing with the rear-view mirror</title>
		<link>http://rulesofreason.wordpress.com/2012/02/02/writing-with-the-rear-view-mirror/</link>
		<comments>http://rulesofreason.wordpress.com/2012/02/02/writing-with-the-rear-view-mirror/#comments</comments>
		<pubDate>Thu, 02 Feb 2012 09:52:06 +0000</pubDate>
		<dc:creator>Dimiter Toshkov</dc:creator>
				<category><![CDATA[Academic publishing]]></category>
		<category><![CDATA[Observational studies]]></category>
		<category><![CDATA[Public opinion]]></category>
		<category><![CDATA[academic publishing]]></category>
		<category><![CDATA[european union politics]]></category>
		<category><![CDATA[explanation]]></category>
		<category><![CDATA[hypothetico-deductive research]]></category>
		<category><![CDATA[policy output]]></category>
		<category><![CDATA[political science]]></category>
		<category><![CDATA[public opinion]]></category>

		<guid isPermaLink="false">http://rulesofreason.wordpress.com/?p=282</guid>
		<description><![CDATA[Social science research is supposed to work like this: 1) You want to explain a certain case or a class of phenomena; 2) You develop a theory and derive a set of hypotheses; 3) You test the hypotheses with data; &#8230; <a href="http://rulesofreason.wordpress.com/2012/02/02/writing-with-the-rear-view-mirror/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=282&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Social science research is <strong>supposed to work</strong> like this:<br />
1) You want to explain a certain case or a class of phenomena;<br />
2) You develop a theory and derive a set of hypotheses;<br />
3) You test the hypotheses with data;<br />
4) You conclude about the plausibility of the theory;<br />
5) You write a paper with a structure (research question, theory, empirical analysis, conclusions) that <strong>mirrors</strong> the steps above.</p>
<p>But in practice, social science research <strong>often works</strong> like this:<br />
1) You want to explain a certain case or a class of phenomena;<br />
2) You test a number hypotheses with data;<br />
3) You pick the hypotheses that matched the data best and combine them in a theory;<br />
4) You conclude that this theory is plausible and relevant;<br />
5) You write a paper with a structure (research question, theory, empirical analysis, conclusions) that <strong>does not reflect</strong> the steps above.</p>
<p>In short, an inductive quest for a plausible explanation is masked and reported as deductive theory-testing. This fallacy is both well-known and rather common (at least in the fields of political science and public administration). And, in my experience, it turns out to be tacitly supported by the policies of some journals and reviewers.</p>
<p>For one of my previous research projects, I studied the relationship between public support and policy output in the EU. Since the state of the economy can influence both, I included levels of unemployment as a potential omitted variable in the empirical analysis. It turned out that lagged unemployment is positively related to the volume of policy output. In the paper, I mentioned this result in passing but didn&#8217;t really discuss it at length because 1) the original relationship between public support and policy output was not affected, and 2) although highly statistically significant, the result was quite puzzling.</p>
<p>When I submitted the paper at a leading political science journal, a large part of the reviewers&#8217; critiques focused on the fact that I do not have an explanation for the link between unemployment and policy output in the paper. But why should I? I did not have a good explanation why these variables should be related (with a precisely 4-year lag) when I did the empirical analysis, <strong>so why pretend?</strong> Of course, I suspected unemployment as a confounding variable for the original relationship I wanted to study, so I took the pains of collecting the data and doing the tests, still that certainly doesn&#8217;t count as <strong>an explanation</strong> for the observed statistical relationship between unemployment and policy output. But the point is, it would have been entirely possible to write the paper <strong>as if</strong> I had strong ex ante theoretical reasons to expect that rising unemployment increases the policy output of the EU, and that the empirical test supports (or more precisely, <em>does not reject</em>) this hypothesis. That would certainly have greased the review process, and it only takes moving a few paragraphs from the concluding section to the theory part of the paper. So, if your data has a surprising story to tell, make sure it looks like you anticipated it all along &#8211; you even had a theory that predicted it! This is what I call &#8216;writing with the rear-view mirror&#8217;.</p>
<p>Why is it a problem? After all, an empirical association is an empirical association no matter whether you theorized about it beforehand or not. So where is the harm? As I see it, by pretending to have theoretically anticipated an empirical association, you grant it undue credence. <strong>Not only</strong> is data consistent with a link between two variables, but there are strong <em>theoretical</em> grounds to believe the link should be there. A surprising statistical association, however robust, is just what it is &#8211; a surprising statistical association that possibly deserves speculation, exploration and further research. On the other hand, a robust statistical association &#8216;predicted&#8217; by a previously-developed theory is way more &#8211; it is a claim that we understand how the world works.</p>
<p>Until journals and reviewers act as if proper science never deviates from the hypothetico-deductive canon, writers will pretend that they follow it. While openly descriptive and exploratory research is frowned upon, sham theory-testing will prevail.</p>
<p>Eventually, my paper on the links between public support, unemployment and policy output in the EU got accepted (in a different journal). Surprisingly given the bumpy review process, it has just been selected as the best article published in that journal during 2011. Needless to say, <strong>an explanation</strong> why unemployment might be related to EU policy output is still wanting.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rulesofreason.wordpress.com/282/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rulesofreason.wordpress.com/282/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rulesofreason.wordpress.com/282/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rulesofreason.wordpress.com/282/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rulesofreason.wordpress.com/282/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rulesofreason.wordpress.com/282/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rulesofreason.wordpress.com/282/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rulesofreason.wordpress.com/282/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rulesofreason.wordpress.com/282/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rulesofreason.wordpress.com/282/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rulesofreason.wordpress.com/282/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rulesofreason.wordpress.com/282/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rulesofreason.wordpress.com/282/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rulesofreason.wordpress.com/282/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rulesofreason.wordpress.com&amp;blog=27908254&amp;post=282&amp;subd=rulesofreason&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rulesofreason.wordpress.com/2012/02/02/writing-with-the-rear-view-mirror/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/270a698905658429f9147d89905c97ed?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">eurosearch</media:title>
		</media:content>
	</item>
	</channel>
</rss>
