<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Pharma BI &#187; Visualization</title>
	<atom:link href="http://pharma-bi.com/category/analytics/visualization/feed/" rel="self" type="application/rss+xml" />
	<link>http://pharma-bi.com</link>
	<description>Business Intelligence Blog</description>
	<lastBuildDate>Mon, 26 Jul 2010 22:40:06 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How Three Graphs Can Tell Us Where to Focus our Attention</title>
		<link>http://pharma-bi.com/2010/05/how-three-graphs-can-tell-us-where-to-focus-our-attention/</link>
		<comments>http://pharma-bi.com/2010/05/how-three-graphs-can-tell-us-where-to-focus-our-attention/#comments</comments>
		<pubDate>Tue, 04 May 2010 22:22:00 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=644</guid>
		<description><![CDATA[<p>A while back, SEO guru Glenn Crocker and I were talking about how visual analytics can help with search engine optimization.   Getting useful SEO information usually requires crunching data for thousands of links, so it’s quite useful to have something that takes us from a quick overview to the interesting details.</p>
<p>To illustrate how visual analytics [...]]]></description>
			<content:encoded><![CDATA[<p>A while back, SEO guru <a href="http://www.glenncrocker.com/">Glenn Crocker</a> and I were talking about how visual analytics can help with search engine optimization.   Getting useful SEO information usually requires crunching data for thousands of links, so it’s quite useful to have something that takes us from a quick overview to the interesting details.</p>
<p>To illustrate how visual analytics can help with this, we decided to look at the web sites for two of our favorite charities: <a href="http://feedingamerica.org/">Feeding America</a>, formerly known as Second Harvest, and an affiliated organization called the <a href="http://harvesters.org/">Harvesters Community Food Network</a> in Kansas City.</p>
<p>Using data from <a href="http://www.seomoz.org/">SEOmoz</a>, we combined the links for both sites into one database and compared their link performance.  A quick glance at the graph below tells us that Feeding America has the better SEO profile: more links in general and also more high quality links.   This shouldn’t be too surprising since Feeding America is a national organization, while Harvesters serves the greater Kansas City area.  But comparing the two sites provided some interesting data for us to review.</p>
<p>Two things are worth noting here:</p>
<ul>
<li>Some      domains linking to Feeding America are very highly ranked – up to a rank      of 10, while the domains linked to the Harvester’s site top out at 8.  This is also reflected in the overall average for the domain      ranks &#8211;  5.5 and 4.5 respectively.</li>
<li>When      looking at the color near the peak of each curve, we notice that the rank      for domains with the most links to Feeding America is higher than for      highly linked domains at Harvesters.</li>
</ul>
<div id="attachment_645" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/05/mozRank01.jpg"><img class="size-medium wp-image-645" title="Comparing mozRank for Feeding America and the Harvesters Food Network" src="http://pharma-bi.com/wp-content/uploads/2010/05/mozRank01-300x246.jpg" alt="Comparing mozRank for Feeding America and the Harvesters Food Network" width="300" height="246" /></a><p class="wp-caption-text">The quality of the links is indicated by the color (green is better) and the Domain mozRank (10 is best).  The number of links is indicated by the height of each curve.</p></div>
<p style="text-align: center;">
<p>Next we tried to figure out which sites contributed the most to the SEO performance.  Below I have highlighted a few of them.  These scatter plots show domains that have 5 or more links across both sites.  In the interactive version we can hover over data points to see more details about each domain.  I am not sure that the folks at SEOmoz would be happy about me uploading their data, so I am showing just this picture to get the general idea across.</p>
<p><a href="http://pharma-bi.com/wp-content/uploads/2010/05/AvgDmTR5B.jpg"><img class="aligncenter size-medium wp-image-646" title="Domains With High Impact on SEO Performance" src="http://pharma-bi.com/wp-content/uploads/2010/05/AvgDmTR5B-300x188.jpg" alt="Domains With High Impact on SEO Performance" width="300" height="188" /></a></p>
<p>Depending on time and interest, we can perform even more fine grained analysis.  For example, just because a highly ranked domain sends us links doesn’t mean the links rank equally well.  The graph below shows that only two highly ranked domains send links of a similarly high rank.</p>
<p><a href="http://pharma-bi.com/wp-content/uploads/2010/05/HarvesterTop15.jpg"><img class="aligncenter size-medium wp-image-647" title="Top 15 Domains Linking to Harvesters" src="http://pharma-bi.com/wp-content/uploads/2010/05/HarvesterTop15-249x300.jpg" alt="Top 15 Domains Linking to Harvesters" width="249" height="300" /></a></p>
<p>Let’s pretend we are Harvesters and we want to reach an audience beyond the Kansas City area. By looking at these three graphs, we now know that</p>
<ul>
<li>we      need more and better links</li>
<li>we      need web content that attracts more highly trusted domains</li>
<li>we      have plenty of links from the local community (many of the Top 15 Domains      are from Kansas City      based organizations).  Maybe we      should broaden our horizons and reach out to the owners of more nationally      focused, highly trusted domains.</li>
</ul>
<p>P.S.:  Just in case anyone is curious: at least in the Kansas City area, Harvesters does rank at top for the search term “Harvesters”</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2010/05/how-three-graphs-can-tell-us-where-to-focus-our-attention/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Don’t be Fooled by Poor Analytics</title>
		<link>http://pharma-bi.com/2010/04/don%e2%80%99t-be-fooled-by-poor-analytics/</link>
		<comments>http://pharma-bi.com/2010/04/don%e2%80%99t-be-fooled-by-poor-analytics/#comments</comments>
		<pubDate>Mon, 19 Apr 2010 20:02:46 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Visualization]]></category>
		<category><![CDATA[Working With Consultants]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=631</guid>
		<description><![CDATA[<p>Today’s shrinking resources may tempt us into rushing things along, yet we need to be careful when relying on graphics to make decisions.  Good graphics make their point more quickly than a wall of text.  On the other hand, poor graphics easily create the wrong impression.  Distinguishing between the two is not as easy as [...]]]></description>
			<content:encoded><![CDATA[<p>Today’s shrinking resources may tempt us into rushing things along, yet we need to be careful when relying on graphics to make decisions.  Good graphics make their point more quickly than a wall of text.  On the other hand, poor graphics easily create the wrong impression.  Distinguishing between the two is not as easy as one might think.</p>
<p>Data analysis tools have advanced to the point where seemingly anyone with basic computer skills can develop meaningful insights.  As with any tool, however, operator skill determines the ultimate outcome.  Anyone can create graphs with trend lines, but knowing how to graph data properly requires skill.</p>
<p>For example, the graphic below appeared in a recent blog post about the impact of Health Care Reform on President Obama&#8217;s popularity.  While the trend lines look impressive, this graph misleads the reader.</p>
<div id="attachment_632" class="wp-caption aligncenter" style="width: 304px"><a href="http://pharma-bi.com/wp-content/uploads/2010/04/Scatter_Opinion_HCR_copy.jpg"><img class="size-medium wp-image-632" title="Scatter_Opinion_HCR_copy" src="http://pharma-bi.com/wp-content/uploads/2010/04/Scatter_Opinion_HCR_copy-294x300.jpg" alt="Four scatter graphs contrasting public opinion about Obama with public opinion about Health Care Reform" width="294" height="300" /></a><p class="wp-caption-text">Four scatter graphs contrasting public opinion about Obama with public opinion about Health Care Reform</p></div>
<p>It is true that statisticians use scatter plots to show the relationship between two variables, but in this case a third variable plays an overriding role.  This third variable is Time.  Public opinions shift over time, depending on  the headlines and the proposed changes in legislation.  A scatter plot  cannot take this into consideration. A timeline chart such as the one  below proves to be more informative: it shows trends over time and  provides possible explanations for shifts in public opinion.</p>
<div id="attachment_633" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/04/Bar_TimeLine_HCR_Poll.jpg"><img class="size-medium wp-image-633" title="Bar_TimeLine_HCR_Poll" src="http://pharma-bi.com/wp-content/uploads/2010/04/Bar_TimeLine_HCR_Poll-300x235.jpg" alt="Annotated timeline chart showing public opinion over time" width="300" height="235" /></a><p class="wp-caption-text">Annotated timeline chart showing public opinion over time</p></div>
<p>Think about the many graphics we consume on a daily basis, whether in business meetings or in the media. What do we really know about the skill and motivation of the author behind the graphic?  If a graphic supports an opinion we already hold, we may never question it &#8211; even if our opinion deserves questioning!</p>
<p>Herein lies the Catch-22: during times of change we have even less time than usual to deal with the intricacies of data analysis.  Yet, precisely at those times do we need to question our assumptions, adapt to new realities and update our opinions.  One way of solving this dilemma: delegate data analysis to people with the appropriate experience and skills.</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2010/04/don%e2%80%99t-be-fooled-by-poor-analytics/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Beware of Creative Analytics: Lies, Damned Lies and Statistics</title>
		<link>http://pharma-bi.com/2010/03/beware-of-creative-analytics-lies-damned-lies-and-statistics/</link>
		<comments>http://pharma-bi.com/2010/03/beware-of-creative-analytics-lies-damned-lies-and-statistics/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 15:41:05 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Know Your Data]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=623</guid>
		<description><![CDATA[<p>How often have we seen a graph in an opinion piece without knowing how it was created, but somewhere in the back of our mind we suspected that it was tweaked somehow to make a point?  How can we ferret out &#8220;creative analytics&#8221; from the true story?  Remember Mark Twain&#8217;s famous quote about [...]]]></description>
			<content:encoded><![CDATA[<p>How often have we seen a graph in an opinion piece without knowing how it was created, but somewhere in the back of our mind we suspected that it was tweaked somehow to make a point?  How can we ferret out &#8220;creative analytics&#8221; from the true story?  Remember Mark Twain&#8217;s famous quote about &#8221; &#8230; lies, damned lies and statistics.&#8221;  It is much more difficult to identify &#8220;lies&#8221; when we cannot inspect the data behind them.  </p>
<p>By necessity, we always make choices about how to present data.  After all, we *are* trying to make a point when we share information.  But even if we do not intent to spin the message, we may be unable to see the whole story until someone else adds their insight. By making our data available for download, we can level the debating field somewhat and hopefully reach better informed conclusions.    </p>
<p>Whether by accident or by design, one way to spin the message involves the use of data ranges.  In the example below, we have divided US obesity rates into three different ranges.  The first range uses intervals of 11, the second range uses intervals of 10 and the last range uses intervals of 5.  </p>
<p>Look at the graphs about soda taxes in vending machines and see how each graph may lead to a different conclusion about obesity and soda taxes in vending machines.  Then take a look at the graphs for the other taxes and notice how those graphs support similar conclusions regardless of the range size.  </p>
<p><script type="text/javascript" src="http://public.tableausoftware.com/javascripts/api/viz_v1.js"></script><object class="tableauViz" width="554" height="689" style="display:none;"><param name="name" value="OB_CalorieSource_Taxes_bin/DB2" /><param name="toolbar" value="yes" /></object><noscript>DB2 <br /><a href="#"><img alt="DB2 " src="http://public.tableausoftware.com/static/images/OB_CalorieSource_Taxes_bin-DB2_rss.png" height="100%" /></a></noscript>
<div style="width:554px;height:22px;padding:0px 10px 0px 0px; margin-top: -6px; color:black;font:normal 8pt verdana,helvetica,arial,sans-serif;">
<div style="padding-left: 438px;"><a href="http://www.tableausoftware.com/public?ref=http://public.tableausoftware.com/views/OB_CalorieSource_Taxes_bin/DB2" target="_blank">Powered by Tableau</a></div>
</div>
<p>When deciding how to present information we have to balance &#8220;information overload&#8221; with the need to present important details.  Which graph we choose ultimately depends on the point we are trying to make.  Some might call that spin, others call it effective communication.  If we are the audience, we need to be skeptical and ask questions.</p>
<p>Related Posts:</p>
<p><a href="http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data">http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data/</a><br />
<a href="http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data/">http://pharma-bi.com/2010/03/tableau-public-interactive-obesity-data-on-the-web/ </a></p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2010/03/beware-of-creative-analytics-lies-damned-lies-and-statistics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tableau Public: Interactive Obesity Data On The Web</title>
		<link>http://pharma-bi.com/2010/03/tableau-public-interactive-obesity-data-on-the-web/</link>
		<comments>http://pharma-bi.com/2010/03/tableau-public-interactive-obesity-data-on-the-web/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 04:26:19 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=584</guid>
		<description><![CDATA[<p>The viz below uses Tableau Public and allows visitors to explore differences in food consumption and food/soda taxes based on obesity rates in the US.  Notice the dramatic differences in soft drink consumption among populations with different obesity rates.  At first glance, it appears that cutting back on sodas is our best bet for reducing [...]]]></description>
			<content:encoded><![CDATA[<p>The viz below uses <a href="http://www.tableausoftware.com/public/how-it-works" target="_blank">Tableau Public</a> and allows visitors to explore differences in food consumption and food/soda taxes based on obesity rates in the US.  Notice the dramatic differences in soft drink consumption among populations with different obesity rates.  At first glance, it appears that cutting back on sodas is our best bet for reducing waistlines!</p>
<p>By selecting different obesity ranges, one can observe that soda taxes tend to be low in states with high obesity rates.  Just check out Mississippi, Alabama and South Carolina.  Low soda taxes also occur in states with low obesity rates, so it would be premature to conclude that soda taxes are a good way to reduce obesity. Another interesting observation about populations with the highest obesity rates: general food taxes are high while soda taxes are low.  Is this encouraging soda purchases over food?</p>
<p>Of course there may be issues with the underlying data <a href="http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data/" target="_blank">as I explain here</a>.  I do not know this data well enough to draw definitive conclusions from it.  Besides, this post is more about illustrating how we can move beyond static charts &#038; graphs when discussing issues on the web.  By the way, using the icons at the bottom of this viz, you can download this data and even change the cursor behavior so it will zoom in on a particular area on the map.  </p>
<p><script type="text/javascript" src="http://public.tableausoftware.com/javascripts/api/viz_v1.js"></script><object class="tableauViz" width="589" height="709" style="display:none;"><param name="name" value="OB_CalorieSource_Taxes_bin/DB1" /><param name="toolbar" value="yes" /></object><noscript>DB1 <br /><a href="#"><img alt="DB1 " src="http://public.tableausoftware.com/static/images/OB_CalorieSource_Taxes_bin-DB1_rss.png" height="100%" /></a></noscript>
<div style="width:589px;height:22px;padding:0px 10px 0px 0px; margin-top: -6px; color:black;font:normal 8pt verdana,helvetica,arial,sans-serif;">
<div style="padding-left: 473px;"><a href="http://www.tableausoftware.com/public?ref=http://public.tableausoftware.com/views/OB_CalorieSource_Taxes_bin/DB1" target="_blank">Powered by Tableau</a></div>
</div>
<p>Here are a few tips for using Tableau Public to share your data:</p>
<ul>
<li>focus on one issue per viz</li>
<li>decide how to guide visitors into exploring your data</li>
<li>include only relevant data and create an extract</li>
<li>determine how much display space you have</li>
<li>first set the display size in your Tableau workbook &#8211; <a href="http://www.tableausoftware.com/public/knowledgebase/sizing-dashboards-fit-your-webpage" target="_blank">here are some tips</a></li>
<li>then design your viz within that space</li>
<li>beware of borders and shading &#8211; they may look very differently in a web browser</li>
<li>preview your viz &amp; make sure everything works as planned</li>
<li>keep it simple and have fun!</li>
</ul>
<p>If you are using a blog with a theme, be sure you know how much display space your theme allows.  If you use an HTML editor like the one in WordPress, make sure to paste Tableau&#8217;s HTML code into the HTML section of the editor, not the visual editor.  I found it best to add the Tableau HTML at very end before publishing/updating the post.</p>
<p>Happy authoring and exploring.  Please shoot me a note with your comments.</p>
<p><strong>Acknowledgment:</strong></p>
<p>Although based on USDA data, the data set in the above viz comes from Tableau and can be found here: <a href="http://www.tableausoftware.com/public/viz-contest" target="_blank">http://www.tableausoftware.com/public/viz-contest</a></p>
<p><strong>Related Posts</strong></p>
<p>Just because a graph looks pretty doesn&#8217;t mean it tells the truth.  <a href="http://pharma-bi.com/2010/03/beware-of-creative-analytics-lies-damned-lies-and-statistics/">Here is an example</a>. </p>
<p>Visual exploration tools make it easy to create graphs, but we still need to spend time exploring the data to make sure we are drawing meaningful conclusions.  <a href="http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data/">Here is an example</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2010/03/tableau-public-interactive-obesity-data-on-the-web/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Avoid Misleading Conclusions: Explore Your Data</title>
		<link>http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data/</link>
		<comments>http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data/#comments</comments>
		<pubDate>Wed, 17 Mar 2010 08:29:39 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Know Your Data]]></category>
		<category><![CDATA[Modelling]]></category>
		<category><![CDATA[Six Sigma]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=537</guid>
		<description><![CDATA[<p>Often we have to work with data without knowing all the details of how it was collected and processed.  In those situations we first need to determine what information the data contains and what it can and cannot tell us.   We need to ask questions of the data and determine whether it makes sense, [...]]]></description>
			<content:encoded><![CDATA[<p>Often we have to work with data without knowing all the details of how it was collected and processed.  In those situations we first need to determine what information the data contains and what it can and cannot tell us.   We need to ask questions of the data and determine whether it makes sense, given what we already know.   To hone in on the time saving questions it helps to be a subject matter expert.  But even if we are unfamiliar with the subject area, we can start by inspecting the different pieces of data to see how everything fits together.  Visual analysis tools like <a href="http://www.tableausoftware.com/" target="_blank">Tableau software</a> make that job much easier than it used to be.</p>
<p>Here is an example of how such an exploration may look: we are exploring data about obesity, soda consumption and sales taxes on soda.  We are told this data came from the US Department of Agriculture and a quick look reveals that we are looking at county level data.  As one might expect, a scatter plot reveals a strong relationship between rising soda consumption and increased obesity.</p>
<div id="attachment_538" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/03/1_ob_drink.jpg"><img class="size-medium wp-image-538" title="1_ob_drink" src="http://pharma-bi.com/wp-content/uploads/2010/03/1_ob_drink-300x248.jpg" alt="Adult Obesity Rates and Soda Consumption by US County" width="300" height="248" /></a><p class="wp-caption-text">Adult obesity rates increase as soda consumption increases</p></div>
<p>Now we get to the real questions: do sales taxes on soda help with lowering obesity rates?  What relationship do we see between sales tax rates on soda and obesity? As luck would have it, the data we received also provides two measures about sales taxes for soda: one rate for vending machines and another rate for retail stores.</p>
<p>First we look at the relationship between soda taxes for retail stores versus obesity rates.  One might expect that taxes discourage soda consumption and, yes, there appears to be a small downward trend as tax rates increase.  Maybe soda taxes actually help with bringing down obesity?</p>
<div id="attachment_539" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/03/2_ob_drink_store_tax.jpg"><img class="size-medium wp-image-539" title="2_ob_drink_store_tax" src="http://pharma-bi.com/wp-content/uploads/2010/03/2_ob_drink_store_tax-300x256.jpg" alt="Adult Obesity Rates, Retail Sales Tax Rates and Soda Consumption by US County" width="300" height="256" /></a><p class="wp-caption-text">Adult Obesity Rates, Retail Sales Tax Rates and Soda Consumption by US County</p></div>
<p>Now let’s take a look at sales taxes on soda coming from vending machines.  Interesting observation: diabetes rates seem to increase slightly as these tax rates increase.  Counter intuitive?  How do vending machine purchases differ from purchases in a retail store?  Are we observing a real relationship here, or is the data fooling us?</p>
<div id="attachment_541" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/03/3_ob_drink_vending_tax1.jpg"><img class="size-medium wp-image-541" title="3_ob_drink_vending_tax" src="http://pharma-bi.com/wp-content/uploads/2010/03/3_ob_drink_vending_tax1-300x256.jpg" alt="Adult Obesity Rates, Vending Machine Sales Tax Rates and Soda Consumption by US County" width="300" height="256" /></a><p class="wp-caption-text">Adult Obesity Rates, Vending Machine Sales Tax Rates and Soda Consumption by US County</p></div>
<p>Before answering these questions, let’s take a closer look at all those data points on the y-axis.  Do they really indicate that these counties levy a 0% soda tax?  A quick inspection of the underlying data shows that, yes indeed, all records indicate a 0% tax rate.  Not a single “null” value among them.   However, without knowing how the data was processed, we cannot be sure that “zero” really means “no taxes” &#8211; it could also mean &#8220;no data.&#8221;</p>
<p>To explore further we start by placing the three graphs side by side.  This way we can see more easily what happens when we exclude “zeroes.”</p>
<div id="attachment_542" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/03/4_ob_tax_drink.jpg"><img class="size-medium wp-image-542" title="4_ob_tax_drink" src="http://pharma-bi.com/wp-content/uploads/2010/03/4_ob_tax_drink-300x100.jpg" alt="Soft Drink Consumption, Obesity and Soda Sales Taxes" width="300" height="100" /></a><p class="wp-caption-text">Soft Drink Consumption, Obesity and Soda Sales Taxes</p></div>
<p>First we exclude “zeroes” for retail sales taxes. Then we’ll do the same with taxes levied on soda in vending machines.  The following graphs illustrate this.</p>
<div id="attachment_543" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/03/5_ob_tax_drink_xr.jpg"><img class="size-medium wp-image-543" title="5_ob_tax_drink_xr" src="http://pharma-bi.com/wp-content/uploads/2010/03/5_ob_tax_drink_xr-300x106.jpg" alt="Soft Drink Consumption, Obesity and Sales Taxes: Excluding Records with 0% Soda Sales Tax Rate (Retail)" width="300" height="106" /></a><p class="wp-caption-text">Excluding Records with 0% Soda Sales Tax Rate (Retail).  The center graph shows the relationship between the remaining retail records and obesity. The trend line still points downward.</p></div>
<div id="attachment_544" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2010/03/6_ob_tax_drink_xv.jpg"><img class="size-medium wp-image-544" title="6_ob_tax_drink_xv" src="http://pharma-bi.com/wp-content/uploads/2010/03/6_ob_tax_drink_xv-300x106.jpg" alt="Soft Drink Consumption, Obesity and Sales Taxes: Excluding Records with 0% Soda Sales Tax Rate (Vending)" width="300" height="106" /></a><p class="wp-caption-text">Excluding Records with 0% Soda Sales Tax Rate (Vending).  The right hand graph shows the relationship between the remaining vending machine records and obesity.  We now see a downward trend as tax rates increase.</p></div>
<p>Wait a minute, though.  When we exclude “zeroes” from one set of taxes, all data points for “greater than 0% taxes” disappear from the other graph.  In other words, this data indicates that the two types of taxes are mutually exclusive!  Hmm, does this even make sense in real life?  Why would every US county tax soda either in retail stores or in vending machines but never in both?</p>
<p>Without further knowledge about this data we have to reframe our questions and conclusions:</p>
<ul>
<li>When soda taxes are levied, higher tax rates appear to go hand in hand with decreasing obesity      rates</li>
<li>We      cannot draw any conclusions about the impact of “no sales taxes” versus      “sales taxes”</li>
<li>Before we continue with a detailed analysis, we      probably need to ask questions about this data.  At first glance it makes little sense      that counties levy soda taxes either on vending machines or on retail      stores but never on both.  Then      again, I’m not a tax expert.</li>
</ul>
<p>Chances are that we will uncover other areas about which we need to ask questions.  Instead of taking the scattershot approach to learning about this data, data exploration helps us to develop very specific questions to ask.  With specific questions, we stand a better chance of finding the right subject matter experts to consult.</p>
<p>This was a quick example for exploring data about which we knew nothing when we started.  To gain new insights, we sometimes need to apply this &#8220;beginners mind&#8221; approach even to data about which we already know a lot.  After all, errors can happen, collection and processing systems can change without our knowledge and sometimes we find nuggets that were hidden until we started looking for them.  One final thought: the next time your boss or client asks to hurry up with the analysis, ask these two questions:</p>
<ul>
<li>What      are the consequences of making poor decisions because we hurried too quickly      through the data exploration?</li>
<li>Do      we need to go for more accuracy or is a ballpark analysis good enough at this time?</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2010/03/how-to-avoid-misleading-conclusions-explore-your-data/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>When Data Details Matter</title>
		<link>http://pharma-bi.com/2009/11/when-data-details-matter/</link>
		<comments>http://pharma-bi.com/2009/11/when-data-details-matter/#comments</comments>
		<pubDate>Tue, 24 Nov 2009 00:37:10 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Know Your Data]]></category>
		<category><![CDATA[Modelling]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=482</guid>
		<description><![CDATA[<p>Ted Cuzzillo, the author behind the datadoodle blog, got me thinking about data details today.  When do they matter and when do they distract from what matters?</p>
<p>Being a data analyst means that I love details: the more the better, so I can understand how they form the Big Picture.  Intrinsically, I am drawn to graphs [...]]]></description>
			<content:encoded><![CDATA[<p>Ted Cuzzillo, the author behind the <a href="http://datadoodle.com/">datadoodle</a> blog, got me thinking about data details today.  When do they matter and when do they distract from what matters?</p>
<p>Being a data analyst means that I love details: the more the better, so I can understand how they form the Big Picture.  Intrinsically, I am drawn to graphs like this one:</p>
<div id="attachment_483" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/Individual_Points_90th.jpg"><img class="size-medium wp-image-483" title="Individual_Points_90th" src="http://pharma-bi.com/wp-content/uploads/2009/11/Individual_Points_90th-300x188.jpg" alt="A scatter graph showing individual data points and 90th percentile reference lines with their respective values" width="300" height="188" /></a><p class="wp-caption-text">A scatter plot showing individual data points and 90th percentile reference lines with their respective values</p></div>
<p>The spray of dots and their colors actually tell me something.  They give me a feel for the data and point me toward what is driving the overall result.  I can dig into individual data points and learn from them.  On the other hand, many people need a more abstract view of the world &#8211; a view that boils down to the overall shape of things.  After all, meaningful abstractions – like the graph below – are needed to make strategic, big picture decisions.</p>
<div id="attachment_484" class="wp-caption aligncenter" style="width: 266px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/Line_18_Data_Points.jpg"><img class="size-medium wp-image-484" title="Line_18_Data_Points" src="http://pharma-bi.com/wp-content/uploads/2009/11/Line_18_Data_Points-256x300.jpg" alt="A line graph averaging out the data points from the previous graph" width="256" height="300" /></a><p class="wp-caption-text">A line graph averaging out the data points from the previous graph</p></div>
<p>The graph above only plots 18 data points and connects them through a line to show the overall shape of the data.  Of course, the more we abstract information, the more we loose the ability to derive meaningful insights.</p>
<p>In order to generate this line graph, I had to create bins into which I could group the many data points from the first graph.  This means I now only have 18 data points from which to differentiate between the bottom 90% and the top 10% of the data.  In the graph below, the numbers along each line indicate the number of records that have been binned to create each data point.  As we can see from the 90<sup>th</sup> percentile reference lines below, the bottom 90% of the handful of data points in each section fall below 9 and 8 respectively.</p>
<div id="attachment_485" class="wp-caption aligncenter" style="width: 269px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/Line_18_Data_Points_90th.jpg"><img class="size-medium wp-image-485" title="Line_18_Data_Points_90th" src="http://pharma-bi.com/wp-content/uploads/2009/11/Line_18_Data_Points_90th-259x300.jpg" alt="The same line graph as above, including 90th percentile reference lines" width="259" height="300" /></a><p class="wp-caption-text">The same line graph as above, including 90th percentile reference lines</p></div>
<p>However, the very first graph in this story shows us just how misleading the percentiles from the abstracted data are.  According to the more detailed data, the 90<sup>th</sup> percentile values come out to 6.083 and 5.334 respectively.  The abstracted values point in the right direction, but they are quite bit removed from the true values.  The more detail we use, the closer we get to the truth.</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2009/11/when-data-details-matter/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How to Create a Misleading Quadrant Analysis – by Accident</title>
		<link>http://pharma-bi.com/2009/11/how-to-create-a-misleading-quadrant-analysis-%e2%80%93-by-accident/</link>
		<comments>http://pharma-bi.com/2009/11/how-to-create-a-misleading-quadrant-analysis-%e2%80%93-by-accident/#comments</comments>
		<pubDate>Mon, 23 Nov 2009 22:23:29 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Know Your Data]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=475</guid>
		<description><![CDATA[<p>When we use analysis tools like Tableau software, it becomes very important to keep our bearings about the data we are investigating.  For example, we need to keep in mind that Tableau retrieves and calculates information based only on the data needed to generate the graph.  That statement sounds really, duh, obvious.   But we can [...]]]></description>
			<content:encoded><![CDATA[<p>When we use analysis tools like <a href="http://www.tableausoftware.com/" target="_blank">Tableau software</a>, it becomes very important to keep our bearings about the data we are investigating.  For example, we need to keep in mind that Tableau retrieves and calculates information based only on the data needed to generate the graph.  That statement sounds really, duh, obvious.   But we can get into trouble when we don’t think about it <img src='http://pharma-bi.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Let’s look at an example.  Below are two graphs based on exactly the same underlying data – but why do the colors look different?   Each graph appears to show a quadrant analysis that compares two web sites based on their search engine rank and trust.</p>
<div id="attachment_476" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/QuadsDB1.jpg"><img class="size-medium wp-image-476" title="QuadsDB1" src="http://pharma-bi.com/wp-content/uploads/2009/11/QuadsDB1-300x224.jpg" alt="Graphs comparing two web sites based on their SEO rank and trust" width="300" height="224" /></a><p class="wp-caption-text">Graphs comparing two web sites based on their SEO rank and trust</p></div>
<p>The difference lies in the way each graph is generated: the first graph really represents a set of eight data points, while the second graph represents two sets of four data points each &#8211; a subtle, but important distinction.  The second graph shows the quadrants for each individual site using a separate scale for each site.  This allows us to compare each site quadrant by quadrant without having to worry about one site having vastly more links than the other.  In other words, we can answer questions like: which site did a better job of getting high quality links vs. low quality links?</p>
<p>The first graph combines the data for both sites and plots each quadrant on a scale for the combined data.   If one site has many more links than the other site, it will skew the scale toward the higher linked site.  In essence, we are comparing all eight quadrants against each other as opposed to comparing how each site performed on a particular quadrant.</p>
<p>The second graph therefore is the “correct” quadrant analysis if we want to compare each site quadrant by quadrant.  But why even talk about the first graph?</p>
<p>That’s because in Tableau it may be tempting to generate the first graph to save time &#8211; especially when one is new to Tableau.  We only have to drag the “Site Name” dimension onto the column shelf and, voila, we can show both sites next to each other.  The problem is this: the shading is now determined based on all 8 data points together – rather than using a set of 4 data points for each individual site.  This becomes obvious once we add color scales to the graphs:</p>
<div id="attachment_477" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/QuadsDB2.jpg"><img class="size-medium wp-image-477" title="QuadsDB2" src="http://pharma-bi.com/wp-content/uploads/2009/11/QuadsDB2-300x224.jpg" alt="Graphs comparing two web sites based on their SEO rank &amp; trust - includes color scales" width="300" height="224" /></a><p class="wp-caption-text">Graphs comparing two web sites based on their SEO rank &amp; trust - includes color scales</p></div>
<p>The first graph really does not compare the two sites to each other. Instead it takes a look at all the links for both sites combined and creates 8 data points from all those links.  The second graph uses data from one site at a time.   A small – but critical – difference.</p>
<p>While this example may seem trivial, it actually has deep implications when we deal with more complex visualizations.  For example, when we use bins or when we filter records based on certain values, we may add misleading reference lines or create inaccurate charts &#8211; but that’s a topic for another day.</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2009/11/how-to-create-a-misleading-quadrant-analysis-%e2%80%93-by-accident/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Understanding Customer Feedback: How Visualizations Quickly Guide Us to Useful Conclusions</title>
		<link>http://pharma-bi.com/2009/11/understanding-customer-feedback-how-visualizations-quickly-guide-us-to-useful-conclusions/</link>
		<comments>http://pharma-bi.com/2009/11/understanding-customer-feedback-how-visualizations-quickly-guide-us-to-useful-conclusions/#comments</comments>
		<pubDate>Fri, 06 Nov 2009 23:43:10 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=461</guid>
		<description><![CDATA[<p>Have you ever had the “pleasure” of slogging through hundreds of feedback forms from a seminar or conference?  Have you ever noticed how the mind seems to dwell on the negative comments, maybe even to the point that all the positives seem to loose their luster?  That’s when crunching actual survey numbers can help put [...]]]></description>
			<content:encoded><![CDATA[<p>Have you ever had the “pleasure” of slogging through hundreds of feedback forms from a seminar or conference?  Have you ever noticed how the mind seems to dwell on the negative comments, maybe even to the point that all the positives seem to loose their luster?  That’s when crunching actual survey numbers can help put things into perspective: either there really were problems or we are about to fall prey to the naysayers and constant critics.</p>
<p>For example, this heat map makes it obvious that Session C was the least popular event in this conference – but notice also that the range of scores is fairly close together.  In this example, attendees used a 5 point scale, with 1 being the least favorable score and 5 being the most favorable score.   Seeing that the lowest average score was above 4 tells us that, overall, attendees were quite happy with this conference.</p>
<p style="text-align: center;">
<div id="attachment_462" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/Session_Evaluations.jpg"><img class="size-medium wp-image-462" title="Heatmap: Session Evaluations" src="http://pharma-bi.com/wp-content/uploads/2009/11/Session_Evaluations-300x137.jpg" alt="A heatmap showing how each individual session was rated along multiple criteria -- click on the picture to enlarge it" width="300" height="137" /></a><p class="wp-caption-text">A heatmap showing how each individual session was rated along multiple criteria -- click on the picture to enlarge it</p></div>
<p>Of course we also want to understand what worked well and what didn’t work so well.  Let us take a more detailed look at Session C.</p>
<div id="attachment_465" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/Session_C_Scores.jpg"><img class="size-medium wp-image-465" title="Individual Scores for Session C" src="http://pharma-bi.com/wp-content/uploads/2009/11/Session_C_Scores-300x189.jpg" alt="Click the picture to enlarge it" width="300" height="189" /></a><p class="wp-caption-text">Click the picture to enlarge it</p></div>
<p>Very quickly we can tell that the audience was critical of the session content and how it was presented rather than the speaker’s knowledge about the subject.  More than 90% felt that the speaker had an adequate background to present on this topic.  But less than 75% were happy with the way the information was presented and how it related to their job. A few people awarded low scores of 2 and 1, but it is encouraging that these low scores came from fewer than 10% of the audience members.</p>
<p>Just to provide a contrast, let’s also take a look at Session B, clearly the favorite of this event.</p>
<div id="attachment_467" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2009/11/Session_B_Scores.jpg"><img class="size-medium wp-image-467" title="Session B Scores" src="http://pharma-bi.com/wp-content/uploads/2009/11/Session_B_Scores-300x187.jpg" alt="Individual Scores for Session B" width="300" height="187" /></a><p class="wp-caption-text">Individual Scores for Session B</p></div>
<p>There is no doubt that the audience rated this Session very highly across the board: 74% or more audience members awarded it the highest score across all measures.  Notice that this session, too, received a handful of low scores.</p>
<p>As we can see here, sometimes we only need a few pictures to gain useful insights.   No doubt, if this were a more involved customer survey, we would need more and very likely different graphs to discern the finer points of attitudes and perceptions &#8212; especially when such information is tracked over time.  But pictures like these are a good start to find out where to focus such additional efforts.</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2009/11/understanding-customer-feedback-how-visualizations-quickly-guide-us-to-useful-conclusions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Expanding our Visual Vocabulary</title>
		<link>http://pharma-bi.com/2009/10/expanding-our-visual-vocabulary/</link>
		<comments>http://pharma-bi.com/2009/10/expanding-our-visual-vocabulary/#comments</comments>
		<pubDate>Tue, 06 Oct 2009 00:03:14 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=440</guid>
		<description><![CDATA[<p>Last week I wrote about the need to expand our visual vocabulary in line with software that allows us to graph complex data relationships and events in ever more meaningful ways.  Let me expand on this point with an example.</p>
<p>Just about any introductory statistics course covers the draft lottery of the Vietnam War era as [...]]]></description>
			<content:encoded><![CDATA[<p>Last week I wrote about <a href="../../../../../2009/10/02/visual-analytics-what%E2%80%99s-the-big-deal/">the need to expand our visual vocabulary</a> in line with software that allows us to graph complex data relationships and events in ever more meaningful ways.  Let me expand on this point with an example.</p>
<p>Just about any introductory statistics course covers the <a href="http://en.wikipedia.org/wiki/Draft_lottery_%281969%29">draft lottery of the Vietnam War era</a> as a case study for determining whether events occurred by chance or whether they were subject to some significant influence.  Typically this case study involves a scatter plot and discussions about statistical significance, p-values and regression lines.</p>
<p>The section of the draft lottery we are discussing here involved the following process: Each Birthday was given a number from 1 to 366 (including leap day) so that 1 = Jan 1<sup>st</sup>, 2 = Jan 2<sup>nd</sup>, 3 = Jan 3<sup>rd</sup> and so on until 366 = Dec 31<sup>st</sup>.  Each Birthday number was written on a piece of paper, put in a plastic capsule and then in a shoebox from which each was later drawn one at a time.  The Draft Number represents the order in which each birthday was drawn from the shoebox.</p>
<p>The scatter plot below shows Birthdays vs. Draft Numbers along with a trend line to indicate the relationship between the two.  The casual observer may not notice the relative sparsity of data points in the upper right hand quadrant, let alone grasp the significance of it.  It takes careful observation to notice that a significant number of birthdays that occur later in the year have lower draft numbers and visa versa.</p>
<div id="attachment_441" class="wp-caption aligncenter" style="width: 310px"><a href="http://pharma-bi.com/wp-content/uploads/2009/10/DL_Scatter.jpg"><img class="size-medium wp-image-441" title="DL_Scatter" src="http://pharma-bi.com/wp-content/uploads/2009/10/DL_Scatter-300x247.jpg" alt="Scatter plot and trend line showing Birthday vs. Draft Number." width="300" height="247" /></a><p class="wp-caption-text">Scatter plot and trend line showing Birthday vs. Draft Number.  P &lt; 0.0001 - Click the picture to enlarge it.</p></div>
<p>During business discussions or presentations we often do not have time to explain statistical models and their implications.  Sometimes we need to find more intuitive means to get our point across quickly and effectively.  When constructed properly, visual displays can convey a lot of information very quickly.</p>
<p>To illustrate the results from the Draft Lottery more intuitively, I divided all birthdays into three equal groups: Group 1 includes days 1 – 122, Group 2 includes days 123 to 244 and Group 3 includes days 245 to 366.  I then plotted the corresponding draft numbers in each Birthday Group.</p>
<p>At a glance one can see that the Birthdays tend to cluster toward the upper end of the Draft Numbers range in Group 1 and toward the lower end of the Draft Numbers range in Group 3.  There is no need to talk about p-values and regression lines, just a quick note pointing out that Birthdays in the early part of the year had a better chance of receiving a high draft number than Birthdays that occurred later in the year.</p>
<div id="attachment_442" class="wp-caption aligncenter" style="width: 242px"><a href="http://pharma-bi.com/wp-content/uploads/2009/10/BDay_Group_viz1b.jpg"><img class="size-medium wp-image-442" title="BDay_Group_viz1b" src="http://pharma-bi.com/wp-content/uploads/2009/10/BDay_Group_viz1b-232x300.jpg" alt="Draft Numbers by Birthday Group" width="232" height="300" /></a><p class="wp-caption-text">Draft Numbers by Birthday Group.  Click the picture to enlarge it.</p></div>
<p>An alternative representation may include reference lines for quartiles as in this illustration:</p>
<div id="attachment_443" class="wp-caption aligncenter" style="width: 226px"><a href="http://pharma-bi.com/wp-content/uploads/2009/10/BDay_Group_viz2b.jpg"><img class="size-medium wp-image-443" title="BDay_Group_viz2b" src="http://pharma-bi.com/wp-content/uploads/2009/10/BDay_Group_viz2b-216x300.jpg" alt="Draft Numbers by Birthday Group - including refernce lines for quartiles" width="216" height="300" /></a><p class="wp-caption-text">Draft Numbers by Birthday Group - including reference lines for quartiles.  Click the picture to enlarge it.</p></div>
<p>Business decisions, of course, are based on more than fancy pictures and we need to be able to back things up through detailed analysis.   But when time is short and we need to make our point quickly, the second graph helps us out much more than the scatter plot.</p>
<p><strong>Acknowledgements:</strong></p>
<p>A big “Thank You” goes to the authors of “<a href="http://onlinestatbook.com/" target="_blank">Online Statistics: An Interactive Multimedia Course of Study</a>” for providing the data and for the inspiration to visualize it in a format other than a scatter plot.  This free online course was developed at Rice University, University of Houston, Clear Lake, and University of Houston, Downtown, with partial support from the <a href="http://www.nsf.gov/" target="_blank">National Science Foundation</a>.</p>
<p>All graphs were generated using <a href="http://www.tableausoftware.com/" target="_blank">Tableau</a> software.</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2009/10/expanding-our-visual-vocabulary/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Visual Analytics: What’s The Big Deal?</title>
		<link>http://pharma-bi.com/2009/10/visual-analytics-what%e2%80%99s-the-big-deal/</link>
		<comments>http://pharma-bi.com/2009/10/visual-analytics-what%e2%80%99s-the-big-deal/#comments</comments>
		<pubDate>Sat, 03 Oct 2009 03:32:33 +0000</pubDate>
		<dc:creator>Christine Muser</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Current Topics]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://pharma-bi.com/?p=433</guid>
		<description><![CDATA[<p>During several conversations recently the following comment came up:  “What’s the big deal with visual analytics?  It’s just a bunch of pretty pictures!”  It took a while, but it finally dawned on me that we have reached the Xerox-GUI-Macintosh stage for data analysis.  The early versions of a graphical user interface (GUI) which were developed [...]]]></description>
			<content:encoded><![CDATA[<p>During several conversations recently the following comment came up:  “What’s the big deal with visual analytics?  It’s just a bunch of pretty pictures!”  It took a while, but it finally dawned on me that we have reached the <a href="http://en.wikipedia.org/wiki/History_of_the_graphical_user_interface" target="_blank">Xerox-GUI-Macintosh</a> stage for data analysis.  The early versions of a graphical user interface (GUI) which were developed at what was then called <a href="http://en.wikipedia.org/wiki/Xerox_PARC" target="_blank">Xerox-PARC</a> were no commercial success until <a href="http://en.wikipedia.org/wiki/Mac_OS_history" target="_blank">Macintosh</a> – now better known as Apple – created a computer that allowed everyone to point and click rather than write arcane computer instructions.</p>
<p>While point-and-click is much easier than writing code, it still requires computer users to become familiar with what the computer can do and how to accomplish various tasks.  It also requires standards about where to click and what should happen when certain actions are taken.  All this knowledge and these standards had to develop over time and often through trial and error.</p>
<p>In some ways we have reached a similar stage for data analysis: visual analytics provides a new language through which non-analysts can explore and answer business questions.  It frees the non-technical user from the analytic equivalent of writing code, that is, it frees them from the need to learn how to create graphics that – until now – required significant technical knowledge to generate.</p>
<p>As with any new technology, some mayhem ensues: we have to gain experience and learn through practice.  We need to become fluent in the appropriate use of less familiar – yet oddly intuitive – graphing techniques like <a href="http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001OR&amp;topic_id=1" target="_blank">sparklines</a>, <a href="http://en.wikipedia.org/wiki/Heatmap" target="_blank">heatmaps</a> or <a href="http://en.wikipedia.org/wiki/Small_multiple" target="_blank">small multiples</a>.  In short, we must develop a visual vocabulary beyond the bar charts and line graphs we know from Excel or PowerPoint.</p>
<p>And this brings me back to the Xerox-GUI-Macintosh comment from the beginning: as in those early days of learning the language of icons and point-and-click, we have now reached the point where more powerful ways of encoding data in a visual format is available to the lay person.  Just as with the graphical user interface, we will some day look back and say “I can’t imagine a world without seeing data in pictures.”  For at least a little while, those of us in the analyst professions need to act as interpreters and guides to those who are learning this new language.   Sooner rather than later we will all get there.</p>
]]></content:encoded>
			<wfw:commentRss>http://pharma-bi.com/2009/10/visual-analytics-what%e2%80%99s-the-big-deal/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
