<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>datadoodle &#187; analysis</title>
	<atom:link href="http://datadoodle.com/tag/analysis/feed/" rel="self" type="application/rss+xml" />
	<link>http://datadoodle.com</link>
	<description>Where the humans meet analytics and related subjects</description>
	<lastBuildDate>Fri, 13 Jan 2012 00:03:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Survey of people who analyze data</title>
		<link>http://datadoodle.com/2011/03/08/survey-people-who-analyze-data/</link>
		<comments>http://datadoodle.com/2011/03/08/survey-people-who-analyze-data/#comments</comments>
		<pubDate>Tue, 08 Mar 2011 19:25:33 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[Data analysis]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[analysts]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1656</guid>
		<description><![CDATA[Data analysts, data champions, and others who analyze data are some of the most interesting and valuable people in business today. If you think you might be part of this group, please take part in my new survey. Go to www.datadoodle.com/analysts/. Later, you get a preview report.]]></description>
			<content:encoded><![CDATA[<p></p><p>
The people who translate raw data into meaning &mdash; sometimes titled &#8220;data analyst&#8221; but often not &mdash; are some of the least understood in business and the most valuable. If you&#8217;ve read much of Datadoodle, you know that I&#8217;m trying to understand them better. Now I&#8217;ve launched a <a href="http://datadoodle.com/analysts/" target="_blank">survey</a>.
</p>
<p>
If you play any part in data analysis, please take part in the survey. You may analyze data or you may be a champion of data analysis, or both. You may or may not call yourself a &#8220;data analyst.&#8221; You may not even do any of this at work, only at home.
</p>
<p>
For your trouble, you get a preview of the final report when the preview&#8217;s ready.
</p>
<p>
<a href="http://datadoodle.com/analysts/" target="_blank">Please go now and take part. Tell your friends, too.</a></p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1656&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2011/03/08/survey-people-who-analyze-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New data analysts and teenage love</title>
		<link>http://datadoodle.com/2011/01/04/new-analysts-and-teenage-love/</link>
		<comments>http://datadoodle.com/2011/01/04/new-analysts-and-teenage-love/#comments</comments>
		<pubDate>Tue, 04 Jan 2011 17:37:02 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[creative analysis]]></category>
		<category><![CDATA[Data analysis]]></category>
		<category><![CDATA[muses]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[analyst]]></category>
		<category><![CDATA[analysts]]></category>
		<category><![CDATA[collaboration]]></category>
		<category><![CDATA[conversation]]></category>
		<category><![CDATA[Lyza]]></category>
		<category><![CDATA[Pete Warden]]></category>
		<category><![CDATA[Scott Davis]]></category>
		<category><![CDATA[Tableau]]></category>
		<category><![CDATA[tools]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1558</guid>
		<description><![CDATA[Search all the business literature you can and you&#8217;ll never find data analysis compared to romantic love. But, hey, why not? Love&#8217;s trajectories might hint at what the business world&#8217;s newly enabled generation of data analysts can expect. These data analysts tend to be independent, are often creative and at least partly self-trained. They&#8217;re strapped [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
Search all the business literature you can and you&#8217;ll never find data analysis compared to romantic love. But, hey, why not? Love&#8217;s trajectories might hint at what the business world&#8217;s newly enabled generation of data analysts can expect.
</p>
<p>
These data analysts tend to be independent, are often creative and at least partly self-trained. They&#8217;re strapped to rockets from Tableau, Lyzasoft, Predixion, and others, tools that are at first deceptively toy-like. Aren&#8217;t they analogous to the garden variety teenager? Bothg groups revel in newly discovered tools, while both pursuits are fundamentally social &mdash; as Lyzasoft CEO Scott Davis observes about data analysis. <a href="http://www.information-management.com/issues/20_7/information_management_strategic_intelligence_MDM-10019102-1.html" target="_blank">His blog post</a> got me thinking about this.
</p>
<p>
Everyone shows up ready to rumble. They&#8217;re fascinated with the possibilities, they experiment in private, later they have a blush of quick results followed by a long trail of self-training on finer points.
</p>
<p>
Each group&#8217;s toolset is potent and designed for early success but never early mastery. They make lots of mistakes. In love and analysis, people fall for the wrong data, mess up good data and dates, do all kinds of things they wish they hadn&#8217;t.
</p>
<p>
Without realizing, they face danger. I&#8217;ve noticed that behind most good trends comes a rotten sibling right behind it. Think of the history of other social events: Hippies begat the Summer of Love and then came Altamont. We celebrated &#8220;free love&#8221; and then came a surge of sexually transmitted diseases. Baseball begat the World Series and then came batters on steroids. PageMaker begat self-publishing but then came the ugliest lost-cat posters ever tacked on a telephone pole.
</p>
<p>
You may already wish that bad analysis would go away. Pete Warden, for one, <a href="http://petewarden.typepad.com/searchbrowser/2010/12/data-is-snake-oil.html" target="_blank">warns</a> of some fabulous ways people trip over new data. We could easily call this stuff &#8220;data porn&#8221; and ignore it.
</p>
<p>
But there are even more treacherous pitfalls. These potent tools can change everything in a flash (at the &#8220;speed of thought&#8221;). One minute you&#8217;re in orbit, and the next minute you wish you were dead. With sex comes the hazard of a painful breakup, and with data analysis comes the danger of unwanted speech that&#8217;s too hot for any public platform. Oops!
</p>
<p>
We have ways to deal with all that, but it&#8217;s never pleasant. The rejected lover picks up and leaves, and the analyst just finds his creative viz zapped off the cloud &mdash; by those who are themselves learning a new role.
</p>
<p>
The lover and the analyst both feel hurt, perhaps betrayed. Wasn&#8217;t each playing by the rules? Wasn&#8217;t each part of the group? Suddenly each one feels rejected for reasons that a hasty explanation doesn&#8217;t quite calm the hurt feelings.
</p>
<p>
In hindsight, we realize we shouldn&#8217;t have been surprised. Social pursuits can be like this.
</p>
<p>
By the way, who said good tools were the end of the story? Well, most vendors did. Some teenagers think so, too. But even slightly more advanced users know that technical proficiency is only the price of entry. We do the real work in many long conversations and collaborations with words, data, gestures, misunderstandings and reconciliations, and on and on.
</p>
<p>
Here the analogy breaks. The tools will keep getting better while the bodies fall apart. But the lesson&#8217;s the same: Tools enable, but conversation &mdash; better known in the business world as collaboration &mdash; is really at the heart of our work.</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1558&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2011/01/04/new-analysts-and-teenage-love/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Let your gray hair light your way through unfamiliar data</title>
		<link>http://datadoodle.com/2010/11/08/let-your-gray-hair-light-your-way-through-unfamiliar-data/</link>
		<comments>http://datadoodle.com/2010/11/08/let-your-gray-hair-light-your-way-through-unfamiliar-data/#comments</comments>
		<pubDate>Mon, 08 Nov 2010 23:16:09 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[analyst]]></category>
		<category><![CDATA[Dan Murray]]></category>
		<category><![CDATA[Michael Princi]]></category>
		<category><![CDATA[unfamiliar data]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1493</guid>
		<description><![CDATA[How do you approach unfamiliar data? An investment banker I talked to last week &#8212; one I know from a client&#8217;s whitepaper &#8212; rejects the &#8220;don&#8217;t think&#8221; method, advocated in my earlier post about Dan Murray. Instead, he thinks first, on paper. &#8220;My approach is driven by having a bunch of gray hair,&#8221; says Michael [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
How do you approach unfamiliar data? An investment banker I talked to last week &mdash; one I know from a <a href="http://www.tableausoftware.com/whitepapers/making-numbers-pop-visual-analytics" target="_blank">client&#8217;s whitepaper</a> &mdash; rejects the &#8220;don&#8217;t think&#8221; method, advocated in my <a href="http://datadoodle.com/2010/10/18/analyze-unfamiliar-data/" target="_blank">earlier post</a> about Dan Murray. Instead, he thinks first, on paper.
</p>
<p>
&#8220;My approach is driven by having a bunch of gray hair,&#8221; says <a href="http://www.thoughtstorm.com/about/our-team/" target="_blank">Michael Princi</a>, managing director of ThoughtStorm Strategic Capital, a boutique investment bank and advisory firm in northern New Jersey. &#8220;I want to use my business acumen to tease out what might be the underlying issues.&#8221; <a href="http://datadoodle.com/b2b_content/"><img alt="" src="http://datadoodle.com/wordpress/wp-content/themes/elements-of-seo_1.4/images/ad.png" title="Good writing works for whitepapers and other lead-generating journalism." class="alignright" width="106" height="178" /></a>
</p>
<p>
<strong>Experience counts</strong> The naive mind is prone to bad mistakes, he says. Take, for example, the 22-year-old analyst in India he employed who spotted this correlation: a firm&#8217;s revenue correlated with the number of Bobs in the workforce.
</p>
<p>
<strong>Hypothesize on paper</strong> &#8220;I first think through my hypothesis on paper,&#8221; he says. &#8220;&#8216;It gives you a starting point.&#8221; If the model is wrong, as it often is, he just tries another one.
</p>
<p>
<strong>Test and repeat</strong> If the actual numbers are somewhat close to his expectations, he knows he&#8217;s on the right track. It&#8217;s the traditional consulting confirm-or-deny method. When the data does confirm his hypothesis, he&#8217;s able to run through the data again and again in iterations.
</p>
<p>
<strong>Test and repeat</strong> If the actual numbers are somewhat close to his expectations, he knows he&#8217;s on the right track. &#8220;It&#8217;s the traditional consulting confirm-or-deny method. It&#8217;s the quickest way I know. When the data does confirm his hypothesis, he&#8217;s able to run through the data again and again in iterations.
</p>
<p>
Does Michael Princi really analyze data differently from Dan Murray? They&#8217;ve never met, but Michael guesses not. He says of Dan, &#8220;I think he&#8217;s probably mapping it out intuitively.&#8221;
</p>
<p style="font-weight: bold; border-style: dotted; border-width: 1px; padding: 5px; margin-bottom: 10px;">Do you have a routine for analyzing unfamiliar data? <a href="http://datadoodle.com/tell-datadoodle-3/" target="_blank">Please introduce yourself here.</a></p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1493&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2010/11/08/let-your-gray-hair-light-your-way-through-unfamiliar-data/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Feature lists miss the point</title>
		<link>http://datadoodle.com/2010/06/29/feature-lists-miss-the-point/</link>
		<comments>http://datadoodle.com/2010/06/29/feature-lists-miss-the-point/#comments</comments>
		<pubDate>Tue, 29 Jun 2010 17:34:47 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[BI industry]]></category>
		<category><![CDATA[creative analysis]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[Stephen McDaniel]]></category>
		<category><![CDATA[Tableau]]></category>
		<category><![CDATA[tools]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1287</guid>
		<description><![CDATA[So many people who should know better seem to miss the point when they mention Tableau. Why? I asked BI veteran Stephen McDaniel for his thoughts &#8212; which he gave, but then went on to suggest an almost unheard of challenge: a data analysis face-off among vendors. Consider this description by a BI analyst: &#8220;Tableau [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
So many people who should know better seem to miss the point when they mention <a href="http://www.tableausoftware.com/">Tableau</a>. Why? I asked BI veteran Stephen McDaniel for his thoughts &mdash; which he gave, but then went on to suggest an almost unheard of challenge: a data analysis face-off among vendors.
</p>
<p>
Consider this description by a BI analyst: &#8220;Tableau provides business analysts speed of thought visual analysis on data held in memory on their desktop machines.&#8221; All that&#8217;s fine, but it may as well have been about a whole bunch of other tools, too.
</p>
<p>
At the root of this fuzz, explained McDaniel, is that most analysts who concern themselves with tools don&#8217;t actually use the tools. They rely on demos , marketing, and hearsay.
</p>
<p>
Though much of McDaniel&#8217;s recent work has centered on Tableau &mdash; his second book is <a href="http://www.freakalytics.com/2009/07/12/rapid-graphs-01/"><i>Rapid Graphs with Tableau Software</i></a>  and he gives <a href="http://www.freakalytics.com/training/">training</a> sessions around the country &mdash; he also has a long, credible trail back through BI and data mining. He was director of analytics at Netflix, and has worked with more than 50 companies in BI. His first book was SAS for Dummies.
</p>
<p>
&#8220;I love SAS,&#8221; he says. Still, he remembers his sister in law&#8217;s reaction to his book on SAS. She was not an analyst but a &#8220;people manager.&#8221; These are the ones, he says, who have hated BI because &#8220;it had been made into a priesthood.&#8221; When she had looked through the book, she said, &#8220;Oh, this is great&#8221; and put it down. But she read the Tableau book for a half hour and said, &#8220;You should come talk to some people I work with.&#8221; She had recognized what she could do with the tool.
</p>
<p>
McDaniel&#8217;s sister in law and many like her don&#8217;t care whether the data is &#8220;in memory,&#8221; they don&#8217;t see themselves as business analysts, they take &#8220;desktop&#8221; for granted, and they know &#8220;speed of thought&#8221; is just gloss.
</p>
<p>
The list of features really doesn&#8217;t matter. All that really matters is whether someone can do what needs to be done with the tool.
</p>
<p>
McDaniel imagines a throw down, a data analysis match. It would be open to any BI vendor. Each vendor would send their best people, and each team would receive a uniform set of data. Over some defined period, teams would analyze and then present the results to a panel of vendor-neutral judges.
</p>
<p>
The reward? Perhaps a signed copy of a Stephen McDaniel book, or maybe a beer, possibly both. But certainly, repute.
</p>
<p>
What do you think of the face-off idea? Please write a comment.</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1287&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2010/06/29/feature-lists-miss-the-point/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Self tracking is business intelligence</title>
		<link>http://datadoodle.com/2010/05/10/self-tracking-is-business-intelligence/</link>
		<comments>http://datadoodle.com/2010/05/10/self-tracking-is-business-intelligence/#comments</comments>
		<pubDate>Mon, 10 May 2010 15:03:44 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[performance]]></category>
		<category><![CDATA[self tracking]]></category>
		<category><![CDATA[trends]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[FileMaker]]></category>
		<category><![CDATA[Gary Wolf]]></category>
		<category><![CDATA[new york times]]></category>
		<category><![CDATA[workday]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1262</guid>
		<description><![CDATA[Back when secretaries were common, you could have had yours track your day in 15-minute increments. In his book The Effective Executive, Peter Drucker suggested this as a way to find out what you really did all day. The picture usually wasn&#8217;t so pretty. Tracking your time then and now is personal, it&#8217;s messy, and [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
Back when secretaries were common, you could have had yours track your day in 15-minute increments. In his book The Effective Executive, Peter Drucker suggested this as a way to find out what you really did all day. The picture usually wasn&#8217;t so pretty.
</p>
<p>
Tracking your time then and now is personal, it&#8217;s messy, and it&#8217;s the essence of business intelligence: collecting data and reading it for guidance in business activities that matter. Is there anything that matters more to an organization than productivity of its people? For a small office or home-based business, this might be the best BI there is.
</p>
<p>
This gets no recognition in the BI industry that I can find, at least not in the conservative world of TDWI. At least not yet.
</p>
<p>
PI &#8212; for &#8220;private intelligence&#8221; &#8212; has different issues, starting with data collection. In BI, data comes from transactions, all recorded routinely. In PI, most of it has to come from a &#8220;secretary&#8221; or from our own, tedious notation.
</p>
<p>
I dabbled in it once. The insights were good, if painful, but mostly it was tedious. A few years ago, a confluence of personal events let me do something I&#8217;d always wanted to try: hole up for a few months in a Sicilian village I knew slightly. The food was good, I had relatives nearby, and the nearby church bells rang all day and all night, four times an hour. At the same time, I had a book to edit. To stay productive, I made a game out of the work, tracking my time to the minute in Filemaker.
</p>
<p>
I liked the local food and started to hate the book, an office manual that inadvertently revealed a con game. Even so, I threw myself at it every day. But no matter how hard I tried, no full day ever resulted in more than about two hours of actual, productive work. My &#8220;quick breaks&#8221; for walks and coffee with a friend actually took up more time.
</p>
<p>
I made a Filemaker database because I could find no off-the-shelf product that would do anything close. Each period, no matter how short, had a starting and ending times I entered with buttons, and a calculation field figured the duration. A drop-down menu offered my usual activites. I could make a report for any period.
</p>
<p>
I thought some product would do that better, but I could find nothing. Then the May 2 issue of the New York Times Magazine ran an article by Gary Wolf about this, <a href="http://www.nytimes.com/2010/05/02/magazine/02self-measurement-t.html?ref=magazine&amp;pagewanted=all">&#8220;The Data Driven Life.&#8221;</a> My Filemaker invention wasn&#8217;t too far from what others have used, and now new devices are coming along that could make all that seem so old hat. Some people are even sharing their data on the cloud.
</p>
<p>
But as in traditional BI, the technology just gets you in the door. The show has just begun.
</p>
<p>
Most people Wolf writes about do it for personal reasons. One wanted to know how his coffee consumption helped him focus, another tried to cure his sleep apnea, and still another noticed that flax seed oil, or just lots of butter, improved his cognitive performance.
</p>
<p>
As in good BI, the experiments often raised new questions. And sometimes the new questions are unexpected, as in Wolf&#8217;s own experience.
</p>
<blockquote><p>
Often, pioneering trackers struggle with feelings of being both aided and tormented by the very systems they have built. I know what this is like. I used to track my work hours, and it was a miserable process. With my spreadsheet, I inadvertently transformed myself into the mean-spirited, small-minded boss I imagined I was escaping through self- employment. Taking advantage of the explosion of self-tracking services available on the Web, I started analyzing my workday at a finer level. Every time I moved to a new activity &mdash; picked up the phone, opened a Web browser, answered e-mail &mdash; I made a couple of clicks with my mouse, which recorded the change. After a few weeks I looked at the data and marveled. My day was a patchwork of distraction, interspersed with valuable, but too rare, periods of focus. In total, the amount of uninterrupted close attention I was able to muster in a given workday was less than three hours. After I got over the humiliation, I came to see how valuable this knowledge was. The efficiency lesson was that I could gain significant benefit by extending my day at my desk by only a few minutes, as long as these minutes were well spent. But a greater lesson was that by tracking hours at my desk I was making an unnecessary concession to a worthless stereotype. Does anybody really believe that long hours at a desk are a vocational ideal? I got nothing from my tracking system until I used it as a source of critical perspective, not on my performance but on my assumptions about what was important to track.
</p></blockquote>
<p>
I wish Drucker were around to respond. Wolf&#8217;s insight sounds like important stuff for everyday knowledge workers, especially those who work alone. What&#8217;s more important to a knowledge worker than time?
</p>
<p>
These experiments are often haphazard and highly personal.
</p>
<blockquote><p>
Generally, when we try to change, we simply thrash about: we improvise, guess, forget our results or change the conditions without even noticing the results. Errors are possible in self-tracking and self-experiment, of course. It is easy to mistake a transient effect for a permanent one, or miss some hidden factor that is influencing your data and confounding your conclusions. But once you start gathering data, recording the dates, toggling the conditions back and forth while keeping careful records of the outcome, you gain a tremendous advantage over the normal human practice of making no valid effort whatsoever.
</p></blockquote>
<p>
Yes, just as analytics gives companies a tremendous advantage over those who make less effort.
</p>
<p>
Let the BI traditionalists pooh-pooh self-tracking. The very same people might have dismissed such things as visual analysis, agile development, and at one time even business intelligence itself. Sometimes it take a few pioneers and geeks, perhaps even a secretary, to prove a concept.</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1262&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2010/05/10/self-tracking-is-business-intelligence/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Basking in a dashboard&#8217;s warm glow</title>
		<link>http://datadoodle.com/2010/03/19/basking-in-a-dashboards-warm-glow/</link>
		<comments>http://datadoodle.com/2010/03/19/basking-in-a-dashboards-warm-glow/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 22:27:22 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[visual analysis]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[audience]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[dashboards]]></category>
		<category><![CDATA[Lyndsay Wise]]></category>
		<category><![CDATA[Neil Raden]]></category>
		<category><![CDATA[San Francisco]]></category>
		<category><![CDATA[Tom Davenport]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1201</guid>
		<description><![CDATA[When some people look at dashboards, they want to see patterns but not reasons. &#8220;They don&#8217;t want to read the fine print,&#8221; said one attendee in Lyndsay Wise&#8217;s dashboards seminar at Enterprise Data World in San Francisco yesterday. That&#8217;s what the man learned in one data-quality project for a human resources department. He was frank [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
When some people look at dashboards, they want to see patterns but not reasons. &#8220;They don&#8217;t want to read the fine print,&#8221; said one attendee in Lyndsay Wise&#8217;s <a href="http://edw2010.wilshireconferences.com/sessionPop.cfm?confid=38&amp;proposalid=2187">dashboards seminar at Enterprise Data World</a> in San Francisco yesterday. That&#8217;s what the man learned in one data-quality project for a human resources department.
</p>
<p>
He was frank enough to call drill-down &#8220;the fine print&#8221; &mdash; the suggestion that the &#8220;why?&#8221; is just noise. He slipped out before I could find out more.
</p>
<p>
Had his complacent users been victims of abusive parents or bad teachers? I&#8217;ve worked with such users. I trust them, I like them, and most businesses couldn&#8217;t do without them. But I&#8217;m curious about their incuriousness, as some of them might wonder about me.
</p>
<p>
There&#8217;s too much data, we know that. <a href="http://blogs.hbr.org/davenport/2010/03/analysis-without-analysts.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+harvardbusiness%2Fdavenport+%28Tom+Davenport+on+HBR.org%29">Tom Davenport ponders</a> the overwhelmingness of it all today. The Economist <a href="http://www.economist.com/specialreports/displaystory.cfm?story_id=15557443">reported on it</a> last month, and Neil Raden <a href="http://www.hiredbrains.com/artic2.html">wrote about it</a> 15 years ago. The casual users feel it more and more.
</p>
<p>
For the overwhelmed, there&#8217;s the palliative dashboard. It works the way Mozart does for who can&#8217;t tell Mozart from Schmozart: knowing it&#8217;s Mozart makes them feel good. The palliative dashboard can be contrary to every best practice we know of and still succeed.
</p>
<p>
One person in the audience told about a pre-dashboard-era CEO who prided himself on having no high school degree. He wanted yesterday&#8217;s sales figures on his desk at 8 a.m. every morning. What decisions did he make based on that data? None! It just made him feel good, someone discovered later. Even without his reading glasses on, the patterns on the paper must have looked nice against the wood grain on the desk.
</p>
<p>
Attention dashboard makers: mind the furniture.</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1201&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2010/03/19/basking-in-a-dashboards-warm-glow/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mapping the many faces of &#8220;retention&#8221;</title>
		<link>http://datadoodle.com/2010/01/15/mapping-the-many-faces-of-retention/</link>
		<comments>http://datadoodle.com/2010/01/15/mapping-the-many-faces-of-retention/#comments</comments>
		<pubDate>Fri, 15 Jan 2010 17:30:11 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[creative analysis]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[customers]]></category>
		<category><![CDATA[economy]]></category>
		<category><![CDATA[games]]></category>
		<category><![CDATA[games site]]></category>
		<category><![CDATA[Ken Rudin]]></category>
		<category><![CDATA[LucidEra]]></category>
		<category><![CDATA[metric]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[retention curve]]></category>
		<category><![CDATA[Zynga]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1124</guid>
		<description><![CDATA[Everybody knows what &#8220;retention&#8221; means until they have to design a metric. Ken Rudin, once of LucidEra and now general manager of analytics at the games site Zynga, thought that he and his team could &#8220;put something together&#8221; quickly &#8212; but it actually took &#8220;four solid weeks of discussion and debate.&#8221; About 50 million people [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
Everybody knows what &#8220;retention&#8221; means until they have to design a metric. Ken Rudin, once of LucidEra and now general manager of analytics at the games site Zynga, thought that he and his team could &#8220;put something together&#8221; quickly &mdash; but it actually took &#8220;four solid weeks of discussion and debate.&#8221;
</p>
<p>
About 50 million people play <a href="http://www.zynga.com/">Zynga</a> games every day. It&#8217;s the leading online social gaming platform, according to Ken, and it&#8217;s grown from zero in 2007 to revenues of &#8220;a few&#8221; hundred million dollars annual revenue. Every day, the company captures 20 to 30 billion records of data, and Ken and his team use that data to improve revenue, viral marketing &mdash; and customer retention.
</p>
<p>
Zynga players play free. The revenue comes in a few dollars at a time for &#8220;virtual goods.&#8221; In the popular game FarmVille, for example, a player might get tired of the old-fashioned plow. The tractor upgrade costs $2.
</p>
<p>
&#8220;There are tons of different ways you can think about retention,&#8221; he laughs, &#8220;and which one should we use?&#8221;
</p>
<p>
How do you know when a customer has left? &#8220;Unless we don&#8217;t get a note saying, &#8216;Hi, we&#8217;re no longer playing,&#8217; how do we know?&#8221;
</p>
<p>
Of course, no player&#8217;s going to make it that easy, so how long should Zynga wait before considering the player gone? A week? A man could have dropped his virtual pitchfork for a real vacation &mdash; or he could have plowed the last row.
</p>
<p>
Ken dealt with analytics all the time at LucidEra, but games were new to him. He&#8217;s learned a few things.
</p>
<p>
&#8220;It turns out, as you might imagine, that it depends on the game,&#8221; he says. The average simulation-game player tends to visit frequently, for example. Poker players, though, are much more likely to come back after, say, a three-month gap.
</p>
<p>
The retention curve also varies by the length of each player&#8217;s tenure. A new player who stays away 30 days is much less likely to return than a player who&#8217;s been at Zynga for years. Ken now puts users in three basic tenure buckets: &#8220;new,&#8221; &#8220;mature,&#8221; and &#8220;elder.&#8221;
</p>
<p>
Whatever question you try to answer, it has to be actionable. &#8220;There are metrics, and there are metrics that matter,&#8221; he says. If volume plunges, were the missing players mostly new ones? If so, it could indicate frustration; perhaps the games need better tutorials or less functionality at the beginning. Or were most of the missing the long-term customers? If so, perhaps the games haven&#8217;t offered enough challenge.
</p>
<p>
Ken expects growth when the economy improves. &#8220;When we look at what happens over holidays, such as July Fourth and Thanksgiving, usage really drops. Then it picks up as people go back to work,&#8221; he says. &#8220;[The games] are part of their routine. On vacation, players break their routines. They sleep late and spend more time with family. They don&#8217;t play the game.&#8221;
</p>
<p>
&#8220;It&#8217;s fascinating,&#8221; says Ken. &#8220;In analytics, so much of the problem is figuring out what the question really is.&#8221;
</p>
<p>
I think he means that it&#8217;s a great game.</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1124&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2010/01/15/mapping-the-many-faces-of-retention/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Hoping for Citizen 2.0</title>
		<link>http://datadoodle.com/2010/01/06/hoping-for-citizen-2-0/</link>
		<comments>http://datadoodle.com/2010/01/06/hoping-for-citizen-2-0/#comments</comments>
		<pubDate>Wed, 06 Jan 2010 19:57:09 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[culture]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[analyst]]></category>
		<category><![CDATA[government 2.0]]></category>
		<category><![CDATA[Lyza]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[Tableau]]></category>
		<category><![CDATA[tools]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1093</guid>
		<description><![CDATA[I like the sound of Government 2.0: Collaborate with citizens online and you can change government from a sewer-dwelling raccoon into a purring housecat. Social media lets us try for a kind of politics that was impossible until now. I hope for great results. For many, Government 2.0, or &#8220;collaborative government,&#8221; will mean just &#8220;friending&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
I like the sound of <a href="http://en.wikipedia.org/wiki/Government_2.0">Government 2.0</a>: Collaborate with citizens online and you can change government from a sewer-dwelling raccoon into a purring housecat.
</p>
<p>
Social media lets us try for a kind of politics that was impossible until now. I hope for great results. For many, Government 2.0, or &#8220;collaborative government,&#8221; will mean just &#8220;friending&#8221; a local cop. But in full flower, Government 2.0 can mean far better service, and far more government-and-citizen collaboration than ever before.
</p>
<p>
Even before we had social media, the glare of public attention was a proven antidote for bad politics. Citizens getting up their elbows in policymaking has always been another strong medicine.
</p>
<p>
Trouble is, that &#8220;sewer-dwelling raccoon&#8221; is always smarter than people think. When he&#8217;s hungry, he purrs like a housecat and covers stinky laws with high-minded names. Advertising fools just enough voters &mdash; so often complacent and impatient &mdash; to throw a new law onto the books. On and on it goes.
</p>
<p>
Such a stinky new law is just what Californians got in 2000. Proposition 34 was sold to voters as campaign finance reform. It was a ruse. (A few days ago, a report confirmed suspicions, and a major drafter of the proposition insisted he was &#8220;outraged.&#8221; Yeah, and round up the usual suspects.)
</p>
<p>
One other fix, more honest, came 100 years ago: California amended its constitution to give citizens the ballot proposition. It was the only way for voters to bypass the paralyzed Legislature and loosen the Southern Pacific Railroad&#8217;s grip. It worked. But more recently, ballot propositions have helped tie the state&#8217;s budget in knots.
</p>
<p>
In the long run, who knows how social media, visual analysis, and other tools may be used in government? What will matter most of all is who uses them. If it&#8217;s &#8220;the people,&#8221; which people?
</p>
<p>
I hope this new, pervasive politics mobilizes a new wave of smart activists &mdash; the way desktop publishing and, later, weblogs enabled editors and writers. Or the way tools like Tableau and Lyza are enabling independent-minded, creative analysts today.
</p>
<p>
As these activists learn about politics, I also hope that more citizens than ever before step up to watch, push, and verify. Such a voter would be Citizen 2.0, the real hope.
</p>
<p>
Otherwise, it&#8217;s going to be that raccoon again &mdash; this time on Twitter.</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1093&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2010/01/06/hoping-for-citizen-2-0/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>No wizard, just you and the data</title>
		<link>http://datadoodle.com/2009/11/03/no-wizard-just-you-and-the-data/</link>
		<comments>http://datadoodle.com/2009/11/03/no-wizard-just-you-and-the-data/#comments</comments>
		<pubDate>Tue, 03 Nov 2009 08:49:47 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[creative analysis]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[analysts]]></category>
		<category><![CDATA[business analysts]]></category>
		<category><![CDATA[Joe Mako]]></category>
		<category><![CDATA[Lyza]]></category>
		<category><![CDATA[Scott Davis]]></category>
		<category><![CDATA[Tableau]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=1024</guid>
		<description><![CDATA[What&#8217;s the hardest part of training a new data analyst? Resetting the trainee&#8217;s mindset. &#8220;They start out with the idea that there&#8217;s a right answer,&#8221; says Joe Mako. Joe&#8217;s leaving his job &#8212; where about one year ago he began analyzing data &#8212; to go work for the producer of Lyza. Lyzasoft CEO Scott Davis [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
What&#8217;s the hardest part of training a new data analyst? Resetting the trainee&#8217;s mindset.
</p>
<p>
&#8220;They start out with the idea that there&#8217;s a right answer,&#8221; says Joe Mako.
</p>
<p>
Joe&#8217;s leaving his job &mdash; where about one year ago he began analyzing data &mdash; to go work for the producer of Lyza. <a href="http://www.lyzasoft.com/">Lyzasoft</a> CEO Scott Davis sees him as a &#8220;prototype&#8221; of a kind of creative, resourceful analyst that Lyza was designed for. Joe will engage with other analysts to evangelize Lyza and to help new users ease into the flow.
</p>
<p>
Joe, 29 and a veteran of two Army tours in Iraq, started out on the help desk. He answered calls from within the company, an ISP. Many callers couldn&#8217;t or wouldn&#8217;t analyze their own data, so Joe did it for them. His boss also enlisted his help &mdash; and now won&#8217;t dare go without a backup.
</p>
<p>
The first people he&#8217;ll help get into the flow are the two women who&#8217;re replacing him, and he&#8217;s got to do before he starts at Lyzasoft on November 9. They&#8217;re some of only a few in the his group who applied. Most others refused the &#8220;boring&#8221; work with &#8220;ugly&#8221; data.
</p>
<p>
New users, he says, want to know, &#8220;Where&#8217;s my wizard?&#8221; There is none. &#8220;But that&#8217;s why I enjoy these tools.&#8221; He uses Lyza and <a href="http://www.tableausoftware.com/">Tableau</a> primarily. &#8220;They stay out of my way. They enable me. It&#8217;s just me and the data. &#8230; That&#8217;s what&#8217;s neat. But [new users] don&#8217;t know where to start.&#8221;
</p>
<p>
&#8220;I&#8217;m handed crazy files without any structure,&#8221; he says. The first thing new users have to know is that, no matter how ugly the data may be, it really can be cleaned up. He demonstrated to his new trainees, he says, and &#8220;they were blown away.&#8221; After that, he started showing them how they can clean up data on their own.</p>
<p>
He explained basic steps and functions. Then he showed them how to combine tools, such as how to use two functions in sequence. And deeper still.
</p>
<p>
&#8220;It takes time playing to figure out where you need to get to,&#8221; he says. &#8220;You have to just go and play. If one thing doesn&#8217;t work, you try something else.&#8221;
</p>
<p>
&#8220;I always thought that data was exact,&#8221; he says. &#8220;If not, it was garbage and I&#8217;d throw it out.&#8221; But he later learned that there&#8217;s usually only a portion that&#8217;s garbage &mdash; that somewhere within the crazy mess there&#8217;s a story. &#8220;Even if every data point is wrong, there still might be some trend you can see. If there&#8217;s a bunch of ugly data, how do you figure what he story is?&#8221; It takes a willingness to figure it out, to untangle it, to find out what&#8217;s in there.
</p>
<p>
That&#8217;s a skill, not a talent, he says. &#8220;I&#8217;ve watched [his two replacements] get it closer and closer, learning to merge other data in, to reshape it and finally produce the output.&#8221;
</p>
<p>
Closer and closer. Business will trudge ahead, training a Joe here and a Joe there until people don&#8217;t complain anymore about boring work with ugly data. Someday, many more people will welcome the chance to do this work.</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=1024&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2009/11/03/no-wizard-just-you-and-the-data/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Spam barely pays, says spamalytics</title>
		<link>http://datadoodle.com/2009/09/22/spam-barely-pays-says-spamalytics/</link>
		<comments>http://datadoodle.com/2009/09/22/spam-barely-pays-says-spamalytics/#comments</comments>
		<pubDate>Tue, 22 Sep 2009 10:01:18 +0000</pubDate>
		<dc:creator>Ted Cuzzillo</dc:creator>
				<category><![CDATA[creative analysis]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[spam]]></category>

		<guid isPermaLink="false">http://datadoodle.com/?p=933</guid>
		<description><![CDATA[With all that email that piles in for vi*gra and unlucky Nigerian princes, we assume that someone, somewhere, makes tons of money on it all. But some stealthy University of California researchers at Berkeley and San Diego concluded that spammers may be easier to thwart than we thought. They tell the story in &#8220;Spamalytics: An [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>
With all that email that piles in for vi*gra and unlucky Nigerian princes, we assume that someone, somewhere, makes tons of money on it all. But some stealthy University of California researchers at Berkeley and San Diego concluded that spammers may be easier to thwart than we thought.
</p>
<p><span id="more-933"></span></p>
<p>
They tell the story in &#8220;Spamalytics: An Empirical Analysis of Spam Marketing Conversion,&#8221; published about one year ago. (Download the PDF <a href="http://www.icir.org/vern/papers/CACMSpam09.pdf">here</a>.)
</p>
<p>
 To gather data, they devised trickery of their own: They infiltrated an existing spam botnet.
</p>
<blockquote><p>
By infiltrating the botnet parasitically, we convinced it to modify a subset of the spam it already sends, thereby directing any interested recipients to Web sites under our control.
</p></blockquote>
<p>
The team studied three campaigns: one selling pharmaceuticals and two propagating malware. They tracked nearly a half billion spam emails to count successful deliveries to mail servers, successful passes through anti-spam defenses, user visits to advertised sites, and sales and infections. Throughout, researchers were careful to avoid doing any harm; users responding to infiltrated bots could never actually buy drugs or download malware.
</p>
<p>
In the study&#8217;s small sample, only about a quarter of the all spam leaving their cave ever reached a mail server. Only about 16 percent reached users&#8217; inboxes that targeted Hotmail, Gmail, Yahoo, or Barracuda. Only about one in about 12,500,000 users originally targeted ever took the bait &mdash; a 0.00001 percent conversation rate.
</p>
<p>
Researchers extrapolated from their tiny sample &mdash; roughly 1.5 percent of all traffic on this network &mdash; that the spammers might gross about $3.5 million per year, and higher if users come back for more.
</p>
<p>
But low conversion is what we&#8217;ve always assumed. What&#8217;s critical, and tricky, is estimating costs. The researchers figured that even at $80 per million, an average derived from anecdotal evidence, costs would be too great unless the spammers were vertically integrated &mdash; the vi*gra and the spam delivery all operates under one roof.
</p>
<p>
This discovery is &#8220;heartening,&#8221; says the paper.
</p>
<blockquote><p>
&#8220;&#8230; profitable spam campaigns require organizations that can assemble complete &ldquo;soup-to-nuts&rdquo; teams. Put another way, the profit margin for spam (at least for this one pharmacy campaign) may be meager enough that spammers must be sensitive to the details of how their campaigns are run and are economically susceptible to new defenses.&#8221;
</p></blockquote>
<p>
Spam may be easier to beat than we thought. Anyone for a &#8220;surge&#8221;?</p>
<img src="http://datadoodle.com/wordpress/?ak_action=api_record_view&id=933&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://datadoodle.com/2009/09/22/spam-barely-pays-says-spamalytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

