<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Data Content Blog</title>
	
	<link>http://blogs.hds.com/datacontent</link>
	<description>Data Content Blog</description>
	<lastBuildDate>Fri, 18 May 2012 15:53:28 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/hds/datacontent" /><feedburner:info uri="hds/datacontent" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Hey!  You Guys Do That?!</title>
		<link>http://feedproxy.google.com/~r/hds/datacontent/~3/0sA4bPFQ4OM/hey-you-guys-do-that.html</link>
		<comments>http://blogs.hds.com/datacontent/2011/11/hey-you-guys-do-that.html#comments</comments>
		<pubDate>Tue, 15 Nov 2011 19:00:26 +0000</pubDate>
		<dc:creator>Frank Wilkinson</dc:creator>
				<category><![CDATA[Customer Success]]></category>
		<category><![CDATA[Tech Talk]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[content cloud]]></category>
		<category><![CDATA[data content]]></category>
		<category><![CDATA[Frank Wilkinson]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[Hitachi]]></category>
		<category><![CDATA[Hitachi Data Systems]]></category>
		<category><![CDATA[influencer summit]]></category>
		<category><![CDATA[Jack Domme]]></category>
		<category><![CDATA[khnaser]]></category>
		<category><![CDATA[nigel poulton]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/datacontent/?p=120</guid>
		<description><![CDATA[Last week HDS held its inaugural Influencer Summit in San Jose, California. It was a very big deal for our company, not to mention our invited guests, who by all accounts were about to get the one, two punch! (In a good way). The creation and preparations for this historic event were driven by our [...]]]></description>
			<content:encoded><![CDATA[<p>Last week HDS held its inaugural Influencer Summit in San Jose, California. It was a very big deal for our company, not to mention our invited guests, who by all accounts were about to get the one, two punch! (In a good way).</p>
<p><span id="more-120"></span></p>
<p><img class="alignleft size-full wp-image-121" title="jack" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/11/jack.jpg" alt="jack" width="368" height="246" />The creation and preparations for this historic event were driven by our marketing group and trusted advisors, not to mention half the company (well it seemed that way, I may be over exaggerating a little bit). We had our executives on hand to deliver the core messaging with some great insights, as well as folks from The Office of Technology and Planning. This was our first of many events which will enable HDS to share its strategy and technologies within an exclusive (for now anyway) invitation to the industries’ most prestigious analysts and bloggers.</p>
<p>There were many internal meetings to discuss our individual participation and also to cover each presenters’ topics, strategy materials, presentations, NDA clarification, blood type, first born, and dare I forget that I had to sign my name in secret ink (I am kidding, there was no secret ink).</p>
<p>All kidding aside, it was a great event.  Jack Domme (pictured above) was first to present, and he did a great job as always. (I will save the remaining details for my fellow colleagues and bloggers, who I am sure will do it better justice than I can).</p>
<p>One of the initiatives at the event was to decide who would be monitoring Twitter and responding. I of course volunteered, as did many of my peers and colleagues. The job was easy enough, as I am on Twitter (<a href="http://it.twitter.com/#!/FTWilkinson" target="_blank">@FTWilkinson</a>) as much as I can, and also I like to see instant feedback from our customers and peers in real-time.</p>
<p>As the event started off, the Twitter chatter was quiet, but started to pick up rather quickly with some tweets pointing out that HDS is the best kept secret…</p>
<ul>
<li><a href="https://twitter.com/#!/seepij" target="_blank">@seepij</a>: If you thought #HDS were just into storage &#8211; like I did &#8211; hearing impressive insights on new technology coming #hdsday</li>
<li><a href="https://twitter.com/#!/CIOmatters" target="_blank">@CIOmatters</a>: #HDS much more innovative and strategic than I realised -not content buying IP they build it and use/re-use it, ahead of the market #hdsday</li>
<li><a href="https://twitter.com/#!/nigelpoulton" target="_blank">@nigelpoulton</a>: Randy Demont saying customers are really pleased but telling HDS that they dont market well enough &lt;&#8211; only for the last 10+ years! #hdsday</li>
<li><a href="https://twitter.com/#!/ekhnaser" target="_blank">@ekhnaser</a>: Great message at #hdsday but y limited to 70 people? This should be heard by more partners customers influencers&#8230;.</li>
</ul>
<p>Ahem!&#8230;The presentations were great and overall were well received—at some level I think we shocked some folks by our candor as well as laying out our strategy and some possible future areas of focus and technologies (all covered under NDA, so no sharing in this blog forum). Nonetheless, the Summit was a great inaugural event that proved to those in attendance that we, HDS, do have a complete strategy, and a plan and vision for carrying them out.</p>
<p>So, YES, we do that!</p>
<p><em><span>For more content from HDS Analyst Day, visit our bit.ly bundle: </span><a href="http://bitly.com/u0mh27" target="_blank">http://bitly.com/u0mh27</a></em></p>
<img src="http://feeds.feedburner.com/~r/hds/datacontent/~4/0sA4bPFQ4OM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/datacontent/2011/11/hey-you-guys-do-that.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://blogs.hds.com/datacontent/2011/11/hey-you-guys-do-that.html</feedburner:origLink></item>
		<item>
		<title>HDS Information Cloud Vision and ParaScale</title>
		<link>http://feedproxy.google.com/~r/hds/datacontent/~3/ac0jgaqYAv4/hds-information-cloud-vision-and-parascale.html</link>
		<comments>http://blogs.hds.com/datacontent/2011/10/hds-information-cloud-vision-and-parascale.html#comments</comments>
		<pubDate>Mon, 31 Oct 2011 20:22:20 +0000</pubDate>
		<dc:creator>Cameron Bahar</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Tech Talk]]></category>
		<category><![CDATA[virtualization]]></category>
		<category><![CDATA[ACID]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Cameron Bahar]]></category>
		<category><![CDATA[content cloud]]></category>
		<category><![CDATA[data content]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[Hitachi]]></category>
		<category><![CDATA[Hitachi Data Systems]]></category>
		<category><![CDATA[information cloud]]></category>
		<category><![CDATA[parascale]]></category>
		<category><![CDATA[PB's]]></category>
		<category><![CDATA[RDBMS]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[TB's]]></category>
		<category><![CDATA[TCP-D]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/datacontent/?p=109</guid>
		<description><![CDATA[Last week, HDS unveiled its roadmap for the Information Cloud and stated that it is based on technology obtained through the acquisition of ParaScale in August 2010. In this blog post, I will explain how the ParaScale platform will serve as a foundation and enabler for the Information Cloud. In my last blog post, I [...]]]></description>
			<content:encoded><![CDATA[<p>Last week, HDS unveiled its roadmap for the Information Cloud and stated that it is based on technology obtained through the acquisition of ParaScale in August 2010. In this blog post, I will explain how the ParaScale platform will serve as a foundation and enabler for the Information Cloud.</p>
<p><span id="more-109"></span></p>
<p>In my <a href="http://blogs.hds.com/datacontent/2011/09/big-data-coming-down-the-pipe.html">last blog post</a>, I wrote about the impact and requirements of a new class of applications on storage and computing infrastructures. As this massive wave of unstructured data is created, we need platforms that are specifically architected to efficiently ingest, store and analyze this data.</p>
<p>In the early 90s I had the privilege of working on the second version of the Teradata data warehouse appliance which was used by leading companies around the world to mine their structured data sets. The interfaces were SQL and we implemented an ACID compliant RDBMS while leveraging a scale-out shared nothing architecture to achieve scale and performance for TCP-D (decision support) type queries. Teradata allowed companies to achieve record &#8220;time to answer&#8221; results to their most pressing business intelligence problems. In the process, Teradata gave these select companies a significant competitive advantage and in many cases led to these companies dominating their various industry segments.</p>
<p><img class="alignright size-full wp-image-111" title="cloud-storage-puzzle" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/10/cloud-storage-puzzle.jpg" alt="cloud-storage-puzzle" width="294" height="304" />An example of this is Walmart, which invested heavily in this technology in the 1990s and even sued Amazon for hiring their data warehouse experts as it considered this knowledge a competitive weapon. Walmart was able to analyze supplier and customer patterns and figure out what product to put on which shelf, in which city, in which month in order to maximize its revenue. It is interesting to compare the relative growth rates of Walmart to Kmart, which I believe did not invest aggressively in this area in the 1990s!</p>
<p>What we&#8217;re witnessing is the second wave of this paradigm. The fundamental difference is that the data sets are a few orders of magnitude larger and they&#8217;re all unstructured or semi-structured.</p>
<p>Many organizations are realizing that there&#8217;s tremendous hidden value in this data and it needs to be harnessed to enable businesses to get new insight about their businesses and customers. Proof points are leading web companies such as Google, Amazon, Facebook, Yahoo!, eBay and others that are mining user activity and patterns to sell ads or products or both.</p>
<p>Because the size of these data sets is relatively large (TB&#8217;s to PB&#8217;s), it is impractical and expensive to copy the data over a network into a data warehouse in order to process. It is much faster and cheaper to move the processing to where the data is stored, which brings me to why ParaScale is an important bridge to the Information Cloud.</p>
<p>The ParaScale platform will allow organizations to efficiently ingest, store and mine unstructured and semi-structured big data sets at scale and all in one place using a scale-out shared nothing architecture. This technology provides an elegant and flexible method to combine storage and compute in the same stack and achieve performance, efficiencies and capabilities required to solve this class of problems. The journey from the Content Cloud to the Information Cloud requires an array of technologies and we think the ParaScale technology is a key element required to bring this vision to reality.</p>
<p><em><span>Want to read more about our Cloud Roadmap? Visit our bit.ly bundle here: </span><a href="http://bitly.com/pCt5Gk">http://bitly.com/pCt5Gk</a></em></p>
<img src="http://feeds.feedburner.com/~r/hds/datacontent/~4/ac0jgaqYAv4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/datacontent/2011/10/hds-information-cloud-vision-and-parascale.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blogs.hds.com/datacontent/2011/10/hds-information-cloud-vision-and-parascale.html</feedburner:origLink></item>
		<item>
		<title>Big Data? Big Deal!</title>
		<link>http://feedproxy.google.com/~r/hds/datacontent/~3/ebq9hzkjdUg/big-data-big-deal.html</link>
		<comments>http://blogs.hds.com/datacontent/2011/10/big-data-big-deal.html#comments</comments>
		<pubDate>Wed, 19 Oct 2011 21:51:24 +0000</pubDate>
		<dc:creator>Frank Wilkinson</dc:creator>
				<category><![CDATA[IT Transformation]]></category>
		<category><![CDATA[Tech Talk]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[capacity]]></category>
		<category><![CDATA[content cloud]]></category>
		<category><![CDATA[data content]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Frank Wilkinson]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[Hitachi]]></category>
		<category><![CDATA[Hitachi Data Systems]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[IO]]></category>
		<category><![CDATA[michael hay]]></category>
		<category><![CDATA[NetAPP]]></category>
		<category><![CDATA[Object Stores]]></category>
		<category><![CDATA[starry night]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[unstructured content]]></category>
		<category><![CDATA[van Gogh]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/datacontent/?p=71</guid>
		<description><![CDATA[There have been many articles, opinions and positions written about the big data phenomenon in the past year, and if you were not confused before, you may be trying to decipher its impact to your organization and your business.  There is no doubt about what big data implies, and its affect in business and beyond [...]]]></description>
			<content:encoded><![CDATA[<p>There have been many articles, opinions and positions written about the big data phenomenon in the past year, and if you were not confused before, you may be trying to decipher its impact to your organization and your business.  There is no doubt about what big data implies, and its affect in business and beyond the data center.  It is an important milestone as we enter the next phase of the IT technology era, as we look to not only the data created by business applications, databases, machine generated data, email and file data, but social media generated data and business productivity tools (such as business social media applications).</p>
<p><span id="more-71"></span></p>
<p>They all have their impact to how we run business, and that is the easy part to discern from big data issues. What about insight?  How will we better utilize our data objects, object stores and their associated metadata to really help drive real opportunities and insight? This was part of the premise of business intelligence solutions, and what they were going to deliver, right?</p>
<p>Not so much!</p>
<p><strong>Seriously Big Data</strong></p>
<p>There are many reasons to take big data issues seriously, not to mention how to best manage it and integrate it, leverage it, and back it all up. Those are challenges all in themselves.  For me the greater issue is: how will we interact with the data and make decisions based upon factors that can have an immediate impact to our business?  Sure, integration, mobility, backup and exponential data growth are important and they do relate to how we can gain better<img class="alignright size-full wp-image-80" title="fw11" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/10/fw11.jpg" alt="fw11" width="230" height="130" /> insight for making more accurate business decisions, but the fact is, if we have not figured out how to manage the data growth by now, we are in serious trouble.  Data consolidation, converged infrastructures and scalable architectures are the beginning to solving how best to approach big data impacts, but what do you do with the data, how will you leverage its insights, and what will you use to extract the insight of the data?</p>
<p>Data mashup dashboards and business dashboards are not necessarily new concepts, and while there are some that have made impacts to the market, none of them   have breached beyond basic information without leaving you feeling like you forgot something (like that feeling you get when you leave the house in the morning).</p>
<p>Analytics are becoming more relevant in unstructured data, just as they did for structured data. This is such a big market for enterprise vendors such as IBM, EMC, NetApp and Hitachi, that we are all hard at work building better solutions to help this effort. But that does not mean all solutions will be created equal, nor does it mean that they will offer identical or similar approaches. The end result will be end user adoption, usability and customization.</p>
<p>The question I often ask myself is:”what is really needed in an architecture that will lend itself to provide greater clarity and insight of the data?”  This is a complex problem, and one with many different variants for the answer.  I believe that we are facing a dynamic shift in how solutions will be designed in the future with a heavier emphasis on truly understanding how businesses run and keying in on the requirements to help business make more intelligent decisions.  Throughout the past twenty years we have focused on helping to improve processes, IO, smarter applications and the underlining hardware technologies. I feel, however, the next step is determining the best way to marry hardware and software functions more tightly with more internal hand-off.  As an example of this, <a href="http://blogs.hds.com/technomusings/2009/06/autopoesis-fpga-ai-intelligence.html">Michael Hay has written a post</a> describing the essence of autopoiesis, the marriage and tighter integration between hardware and software functions.  This is a start. But how?</p>
<p>There is no doubt that our systems and solutions have become much more intelligent—as well as more complex—but at the same time they have become less dynamically integrated with no common connector to share information in a true usable way. This can be leveraged for greater insight to what is happening within the systems and applications, being able to report data into a business dashboard (which can be used as the single pane of glass view of the data and its relevance to the business).</p>
<p><strong>What Are We Searching For? What Will We Discover?</strong></p>
<p>If I want to understand a particular event, such as a sales engagement, what would I like to know about it?  Is there information already residing within my data stores?  What type of data?  Could I also have instant access to social media data and what kind of insight can I gain to assist my decisions going forward?</p>
<p><img class="alignright size-full wp-image-72" title="fw2" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/10/fw2.jpg" alt="fw2" width="201" height="251" />What I am talking about is how can we have a true 360 degree view of the data that we have access to, as well as data that we need to have access to?  While we are in the midst of a big data revolution, the issue is how can we find what we are searching for when the results are collected from various, and perhaps disparate, data silos presented in a correlated view?  This would enable data to be displayed with relevancy across data streams and data types.  As an example:  If I perform a search for <strong>John Dowe</strong>, the results returned should show all John’s activity related to the constructed query.  In this case, the results may show some relevant emails, documents, John’s network or server activity, log file data  (he was tying to gain access to a SharePoint site which he does not have access to), some voice mails and perhaps some security surveillance video of him while on the company campus and let’s add in his social network activities, to see if he has breached corporate security by discussing corporate secrets (Yes big brother is watching).  Alone, the data parts are not very interesting by themselves, but If we can tie the data together, it can show that John was up to something very nefarious due to the email correspondence to a competitor, voice mails that captured a conversation between John and a corporate spy, security footage with audio capturing me in the parking lot exchanging a sealed envelope.</p>
<p>Is that interesting?  Absolutely!</p>
<p><strong>How Do We Get There?</strong></p>
<p>The Search is NOT over!</p>
<p>Search is a term which is too widely used and not fully understood for its potential abilities.  Whether you are searching data as part of an eDiscovery process, corporate governance or simply to find relevance around a certain subject, search is the tool we want to leverage and has the results we want to gain insight from to make decisions based on.  Search is just the beginning, but to be fair, its just search, and what we need is more data and data types to help us see a true picture of what we are looking at.</p>
<p><a href="http://www.ibiblio.org/wm/paint/auth/gogh/starry-night/gogh.starry-night.jpg">Starry Night by Van Gogh</a> is a beautiful painting depicting a wondrous night sky.  But what if you only could see the Rhone river? What would we know about the rest of the painting?  Would we know that the artist wanted us to know that it is evening?  Would we know that there are other aspects and objects?  Of course not, we would only see what we have access to.  Much like search and discovery of data, we only know what we know. Nothing more.</p>
<p>There are often discussions around what it would take to get there, and the answer I hear quite often is that it is too hard and too costly. To get to where business needs us to be, we need to think about how we can develop <strong>THE NEXT</strong> smarter architectures and infrastructures leveraging open source solutions and common connectors, while leveraging PCIe, smarter controllers, FPGA’s, SSDS and imbed software functions more closely to the data streams.</p>
<p>Sounds hard right?  Yes, but not impossible.  We have spent the last two decades adding more and more complexity and infrastructure to handle large amounts of data we generate, but we fell short of true integration across the data center and business applications. I do agree that there are technical obstacles to overcome in order to have a better integration, and how we can push search functions and its associated IO down to the hardware layer to minimize the IO across the infrastructure.  This is where HDS shines, as we have the best scientists, engineers and resources that are working and solving these problems.  Let’s not forget that Hitachi develops some of the worlds best technology solutions and core IP that it utilized in almost every facet of life and is leveraged in some of the technology you have in your data center. We know a thing or two about taking the overtly complex and making it simple.</p>
<p>While I cannot go into great detail about our current endeavors around our research and development efforts, rest assured that we have been working to decipher the necessary technologies to bring <strong>THE NEXT</strong>—whatever it may be.</p>
<p>Check back soon for my next blog, which will discuss next generation discovery needs and data correlation.</p>
<img src="http://feeds.feedburner.com/~r/hds/datacontent/~4/ebq9hzkjdUg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/datacontent/2011/10/big-data-big-deal.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blogs.hds.com/datacontent/2011/10/big-data-big-deal.html</feedburner:origLink></item>
		<item>
		<title>Big Data – Coming Down The Pipe!</title>
		<link>http://feedproxy.google.com/~r/hds/datacontent/~3/TXYczRSjBto/big-data-coming-down-the-pipe.html</link>
		<comments>http://blogs.hds.com/datacontent/2011/09/big-data-coming-down-the-pipe.html#comments</comments>
		<pubDate>Thu, 29 Sep 2011 20:36:44 +0000</pubDate>
		<dc:creator>Cameron Bahar</dc:creator>
				<category><![CDATA[data solutions]]></category>
		<category><![CDATA[IT Transformation]]></category>
		<category><![CDATA[Tech Talk]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[bigtable]]></category>
		<category><![CDATA[cost]]></category>
		<category><![CDATA[data content]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[Hitachi Data Systems]]></category>
		<category><![CDATA[i/o characteristics]]></category>
		<category><![CDATA[machine generated data]]></category>
		<category><![CDATA[Machine Generated Data (MGD)]]></category>
		<category><![CDATA[MGD]]></category>
		<category><![CDATA[parascale]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[san/nas]]></category>
		<category><![CDATA[tco]]></category>
		<category><![CDATA[unstructured content]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/datacontent/?p=61</guid>
		<description><![CDATA[These are exciting times. I joined HDS through the acquisition of ParaScale, a startup I founded to focus on solving what the industry is now calling the Big Data problem.  When I visit customers, I notice a growing percentage are faced with a very challenging data problem.  The data in its original form is unstructured [...]]]></description>
			<content:encoded><![CDATA[<p>These are exciting times.</p>
<p>I joined HDS through the acquisition of ParaScale, a startup I founded to focus on solving what the industry is now calling the Big Data problem.  When I visit customers, I notice a growing percentage are faced with a very challenging data problem.  The data in its original form is unstructured as expected, but it&#8217;s not your typical &#8220;human generated data&#8221;. Human generated data is what I refer to as file based data such as documents, spreadsheets, presentations, medical records, or block based data such as transactions, customer records, sales and financials records that are usually stored in relational databases. These traditional data sets are adequately served with high performance SAN/NAS systems, which have excellent random I/O characteristics and can handle massive amounts of structured and unstructured content.<br />
<span id="more-61"></span><br />
<img class="alignright size-full wp-image-64" title="sfs1" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/09/sfs1.jpg" alt="sfs1" width="420" height="265" /></p>
<p>I noticed that this new workload and the corresponding data being produced was fundamentally different from traditional enterprise workloads.  This data was being generated by hundreds to thousands of machines, was rarely updated, often appended, and was fundamentally streaming in nature. I called this category &#8220;Machine Generated Data&#8221; and people have started to embrace this term over the last few years. We started looking at this problem as early as 2004.</p>
<p>Imagine a web company that stores and processes log files from one hundred thousand web servers to generate optimized advertisements to monetize its freemium business model.  Prime examples of such business models are Google, Facebook or Yahoo. Imagine a bio-informatics company sequencing millions of genomes in hopes of finding patterns in disease, or a security company scanning lots of high-def video and analyzing it looking for a specific person or object, or millions of sensors generating data that needs to be processed and analyzed.</p>
<p>Besides the workload being different for Machine Generated Data (MGD), what else is different? Notice that with MGD, companies store this data in order to be able to analyze it shortly after storing it.  The faster they can analyze the data, the better the payoff usually. Think of ad placement by analyzing log files or video analytics to find the right guy in the video just in time, or analyzing genome data for a cancer patient who doesn’t have a lot of time, or finding inefficiencies or trends in financial markets or looking for oil in all the wrong places.</p>
<p>This problem as it turns out is really about large scale data storage and mining of unstructured and semi-structured data to gather information and insight from a company&#8217;s vast datasets.  It&#8217;s nothing short of information processing, repurposing, and data transformations to uncover hidden patterns in data that will ultimately lead to better decisions.</p>
<p>So what are the high level requirements to build a system to solve this problem?</p>
<p style="padding-left: 30px;">1. <strong>Scale </strong>– the system has to scale. But scale to what and in which dimensions. I suppose it has to scale in capacity to be able to store petabytes or exabytes of data. But it also has to ingest this data pretty fast, since there are lots of concurrent streams of machine generated data that are coming through the pipe. So the pipe can’t be the bottleneck either.</p>
<p style="padding-left: 30px;">What role does system management play in this equation? If you have lots and lots of data, you can’t afford to hire lots of storage admins to sit around and manage this data; it costs too much! So what is one to do? The system should be self healing and self managing, right? It should handle most of the mundane data management tasks automatically instead of relying on people.  Ideally all people do is add new hardware or replace failed disks or power supplies.</p>
<p style="padding-left: 30px;">2. <strong>Processing</strong> – once you store petabytes of data into a storage system, how do you analyze it? You can’t very well load a few petabytes over the network into a compute farm, can you? How long would that take? Isn’t the network the bottleneck then? What if we were able to move the processing or the program to the data and run it there instead. The program is pretty small compared to the data set size and so you at least take the network out of the equation to a large degree.  Therefore, the platform should allow in-place data processing or analytics.</p>
<p style="padding-left: 30px;">3. <strong>Cost </strong>– when you store lots and lots of data, your TCO should be reasonable, so using commodity components as much as possible, virtualization, and automation will go a long way.</p>
<p>So when  you look around and you read and hear about people talk about or working on gfs, bigtable, hadoop , hdfs, cassandra, riak, couchbase, mongodb, hbase, zookeeper, flume, mahout, pig, hive, etc, etc, …  you know what I&#8217;m thinking about.</p>
<p>The information revolution is upon us and its pace is accelerating.</p>
<p>Stay tuned…</p>
<img src="http://feeds.feedburner.com/~r/hds/datacontent/~4/TXYczRSjBto" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/datacontent/2011/09/big-data-coming-down-the-pipe.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blogs.hds.com/datacontent/2011/09/big-data-coming-down-the-pipe.html</feedburner:origLink></item>
		<item>
		<title>Kicking Off a Blog Series on Object Stores</title>
		<link>http://feedproxy.google.com/~r/hds/datacontent/~3/rFvz1t-swq0/kicking-off-a-blog-series-on-object-stores.html</link>
		<comments>http://blogs.hds.com/datacontent/2011/09/kicking-off-a-blog-series-on-object-stores.html#comments</comments>
		<pubDate>Tue, 27 Sep 2011 16:08:11 +0000</pubDate>
		<dc:creator>Robert Primmer</dc:creator>
				<category><![CDATA[Best Practices]]></category>
		<category><![CDATA[How-To]]></category>
		<category><![CDATA[Object Stores]]></category>
		<category><![CDATA[object]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/datacontent/?p=44</guid>
		<description><![CDATA[I’ve worked on three commercial object stores: Centera, Atmos, and now HCP (Hitachi Content Platform). In that time, I’ve seen numerous misunderstandings about this particular brand of storage technology–not just in how they function, but even more fundamentally, on where, when and why such a system would be employed. In a new blog series starting [...]]]></description>
			<content:encoded><![CDATA[<p>I’ve worked on three commercial object stores: Centera, Atmos, and now HCP (<a href="http://www.hds.com/products/storage-systems/content-platform/?WT.ac=us_mg_pro_hcp">Hitachi Content Platform</a>). In that time, I’ve seen numerous misunderstandings about this particular brand of storage technology–not just in how they function, but even more fundamentally, on where, when and why such a system would be employed. In a new blog series starting today, I hope to answer these questions.</p>
<p><span id="more-44"></span></p>
<p>To that end, my co-authors and I will provide a series of tutorials on distributed object stores (DOS), essentially providing a short course on the subject with the occasional odd topic interspersed periodically.</p>
<p><img class="alignright size-full wp-image-46" title="directions" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/09/directions.png" alt="directions" width="256" height="301" />Where it makes sense, articles will be split along business and technical lines, as the two topics often will have completely different audiences. In some cases this division will result in wholly separate articles, but generally both should fit in a single article.</p>
<p>It’s always challenging to get the right level of technical detail with a diverse audience. Generally we’ll bias toward simplicity, as there&#8217;s an inverse relationship between how technical an article is and the number of possible readers. So, the typical article will strive to hit a moderate technical level, hopefully avoiding the arcane. However, if there&#8217;s sufficient interest in a particular topic, we&#8217;ll go back later and add greater technical and mathematical rigor to those specific areas of interest.</p>
<p>I&#8217;ll try to construct the series in such a way that each topic builds successively upon the previous. However, as this will be my first attempt at creating such a course I&#8217;m sure to get some topics out of sequence. Fortunately, web pages  – with their ability to readily point to other content outside the page displayed – allow for non-linear reading in a manner far superior to what is possible in print.</p>
<p>Here is the Topic Index as I see it at this juncture. A single topic might span several articles connected together. As with source code, it&#8217;s generally better to write several small modules that connect together rather than a single large source file that tries to do everything.</p>
<p>This method of relatively short articles also allows me to later go back and insert new articles within a given topic as either new things occur to me, in response to feedback about a given topic, or additions as the state of technology changes.</p>
<p><strong><a id="TopicIndex" href="http://blogs.hds.com/datacontent/2011/09/kicking-off-a-blog-series-on-object-stores.html#TopicIndex">Topic Index</a></strong></p>
<p><em>1. What is Structured and Unstructured data, and why do we care about the difference?</em></p>
<p><em>2. What is an Object?</em></p>
<p><em>3. What is an Object Store?</em></p>
<p><em>4. What is a Distributed Object Store?</em></p>
<p><em>5. When would I use an Object Store versus other forms of Storage?</em></p>
<p><em>6. Industry Implications of Object Stores</em></p>
<p><em>a. Traditional Storage Vendors vs. Cloud Vendors Approach to Object Stores</em></p>
<p><em>7. Basic elements of an Object Store ecosystem</em></p>
<p><em>8. Distributed Object Store Blueprint</em></p>
<p><em>9. Architectural Considerations of an Object Store</em></p>
<p><em>10. A Comparison of Object Store Implementations</em></p>
<p><em>11. The Life and Times of an Object</em></p>
<blockquote><p><em>a. The Birth of an Object</em></p>
<p><em>b. Data Ingest</em></p>
<p><em>c. Life inside the Object Store</em></p>
<p><em>d. Where does an Object live?</em></p>
<p><em>e. How is Data Protection achieved?</em></p>
<p><em>f. Object Mobility</em></p>
<p><em>i. Duplication and Replication</em></p>
<p><em>g. Why is Tape Backup not required?</em></p>
<p><em>h. Basics of Data Unavailability and Data Loss (DUDL)</em></p>
<p><em>i. Fundamentals of Self Healing</em></p>
<p><em>j. Read / Write: What makes it fast, or slow?</em></p></blockquote>
<p><em>12. De-duplication and Object Stores</em></p>
<p><em>13. The Road Ahead – The Evolution of Object Stores going forward</em></p>
<p>Please let me know your thoughts along the way.</p>
<img src="http://feeds.feedburner.com/~r/hds/datacontent/~4/rFvz1t-swq0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/datacontent/2011/09/kicking-off-a-blog-series-on-object-stores.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blogs.hds.com/datacontent/2011/09/kicking-off-a-blog-series-on-object-stores.html</feedburner:origLink></item>
		<item>
		<title>Welcome</title>
		<link>http://feedproxy.google.com/~r/hds/datacontent/~3/1TnQDpXqdTA/welcome.html</link>
		<comments>http://blogs.hds.com/datacontent/2011/09/welcome.html#comments</comments>
		<pubDate>Mon, 26 Sep 2011 15:35:54 +0000</pubDate>
		<dc:creator>Frank Wilkinson</dc:creator>
				<category><![CDATA[Best Practices]]></category>
		<category><![CDATA[IT Transformation]]></category>
		<category><![CDATA[Tech Talk]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[atom]]></category>
		<category><![CDATA[backup]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[capacity]]></category>
		<category><![CDATA[content cloud]]></category>
		<category><![CDATA[disk pak]]></category>
		<category><![CDATA[ediscovery]]></category>
		<category><![CDATA[Frank Wilkinson]]></category>
		<category><![CDATA[HDS]]></category>
		<category><![CDATA[Hitachi Data Systems]]></category>
		<category><![CDATA[issue]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[meta-data]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[michael hay]]></category>
		<category><![CDATA[networking]]></category>
		<category><![CDATA[object]]></category>
		<category><![CDATA[open framework]]></category>
		<category><![CDATA[opensearch]]></category>
		<category><![CDATA[recovery]]></category>
		<category><![CDATA[repliation]]></category>
		<category><![CDATA[rss]]></category>
		<category><![CDATA[techno-musings]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/datacontent/?p=17</guid>
		<description><![CDATA[This is my first official blog with Hitachi Data Systems.  I started with HDS on of all days, Valentines Day 2011!  This reminded me of what I love: of course my wife and children, but also what I have chosen as a career.  I mean, who can’t get excited everyday when you have the unique [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><img class="aligncenter size-full wp-image-18" title="welcome" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/09/welcome.jpg" alt="welcome" width="378" height="311" /></p>
<p>This is my first official blog with Hitachi Data Systems.  I started with HDS on of all days, Valentines Day 2011!  This reminded me of what I love: of course my wife and children, but also what I have chosen as a career.  I mean, who can’t get excited everyday when you have the unique opportunity to try and make the world (or at least technology) a better place?<br />
<span id="more-17"></span><br />
<strong>Some background on me:</strong> I have been in the technology industry for over 15 years.  I am a Master Architect, both hardware and software, and have worked in sales, pre-sales and strategy roles for most of my career.</p>
<p>For the past 10+ years my main focus has been around search and eDiscovery, and I have had the pleasure to work with some of the industry’s top thought leaders and developers, who actually created this space for archiving and search in 1999. I have helped create this market, and along with some others, we have watched our “child” grow into what it is today, which is far bigger than most thought it would be: hungry for better, faster, richer search capabilities and faster insight to data. I have worked for some of the largest providers of search and eDiscovery solutions and it has been a joy ride all the way.</p>
<p>Like I said, I started on Valentines Day of this year and I think it is apropos, that I have the fortunate opportunity to work for the best IT Solutions company in the world!  I get to do what I love and that is to create new concepts and technology, and bring new ideas for Content Cloud solutions for HDS.  Who wouldn’t be excited?!</p>
<p>Speaking of exciting…</p>
<p><strong>Big Data Problems</strong></p>
<p>IT is not a mystery in that at some point we were going to run into issues of having too much data which led to too much overly complex architectures, networking latency issues and backup issues—just to name a few.  The reality is this is not a new problem, but one that has always existed; just look back at the first Disk Pak that you bought, 10mb and we thought that was enough capacity. When that space ran out we bought a larger one, and thought that was going to be enough capacity!  We simply added capacity as needed and along with that we grew our complexities for backup, recovery, replication, networking, etc. You get the picture.</p>
<p>As the decades flew by we had grown to the point of an information overload and grew data beyond our wildest imaginations.  While archiving content gave us some relief for email and file data, it also added an extra capability that allowed the ingest data to be indexed, making the data searchable.  This was the beginning of the next era for search and eDiscovery.  <img class="alignright size-full wp-image-22" title="rubix1" src="http://blogs.hds.com/datacontent/wp-content/uploads/2011/09/rubix1.jpg" alt="rubix1" width="272" height="187" /></p>
<p>Fast forward a few years and now we see that there are hundreds of companies who offer search and archiving solutions—perhaps you have one of those solutions in your enterprise today.</p>
<p>I am not telling you something that you don’t already know, but I will tell you that Big Data impacts your ability for greater insight to your data.</p>
<p>Why do we keep data?</p>
<p>Why don’t we expunge that data when its usefulness expires?</p>
<p>Well, we know that data comes in all types and forms, and while some have a high relevance to your business—such as a financial record or contract&#8211;we also keep data for historical reference.</p>
<p><strong>So What’s The Issue Here?</strong></p>
<p>The reality is that we have tired to address the big data issue with options like archiving, data de-duplication, application retirement, consolidation, file planning, etc.  Please don’t get me wrong, we need these technologies in order to hold the Big Data beast at bay, and these technologies also offer the baseline for building the next generation architectures.  Since data has grown exponentially over the past several years, we have tired our best to contain data in its place while trying to adjust our architectures for the next great thing.</p>
<p><strong>So Where Are We Today?</strong></p>
<p>We have complicated and rigid architectures which may not lend themselves to take advantage of Cloud solutions. We have complicated applications and integration issues. We have a ton of meta-data, objects and even some archiving solutions in place—perhaps too many and not easily integrated together. We’ll even throw in some business intelligence solutions.</p>
<p><strong>So What Do We Have?</strong></p>
<p>A big mess!</p>
<p>Maybe it’s just me, and perhaps I see things from  too simplistic a view point, but the data we create should serve the business and allow us to make better business decisions that react to the markets with more agility.  As Michael Hay and my friends over at the <em><span><a href="http://blogs.hds.com/technomusings/">Techno-Musings blog</a></span></em> preach, data should not hold us hostage and cause such pains. To me, that is what is important: how can we make business’ run better and more efficiently while reducing risk and reducing the size complexity of IT, while maximizing new technology to gain better insight to the data?  This is what I love, and it is my passion (just ask anyone who knows me).</p>
<p>I do have a point to all of this, I promise!</p>
<p>The challenge, as I see it, is not giving you more solutions and adding more complexities, as some companies are trying to sell you, but rather the opposite.  How can we look to technology to help solve some basic issues with regards to Big Data?</p>
<p>We have all been promised at one time or another that technology will unlock the value in IT. Well, I am here to tell you that we are almost there, but we still have work to do.</p>
<p>While there are many challenges to address, there are a few which make it to the top of the list:</p>
<ol>
<li><strong>Objects and Meta-data</strong>: In order for greater insight, we know that we need to do a much better job at exploring and expanding meta-data capabilities. And while we are at it, a better way to standardize meta-data abstraction and find relationships between dissociative meta data.</li>
<li><strong>Search</strong>: we need to think bigger than just search.  Analytics need to be more tightly integrated with the data and meta-data. I call it “Content Analytics,” since that is really what we need.  If all the data and its associated meta-data can be made to be more intelligent—and yes we are talking way beyond meta-data tagging here, and yes I am leaving out all the other parts to this like (data dictionaries, indices, BI, etc.) because they are understood, at least for now—then we will be able to provide a much richer set of data and meta-data. This could lend itself to a more refined result, giving us greater insight. But how do we get there?</li>
<li><strong>Open Frameworks and API’s</strong>:  If we really want to think about the impact that big data has today and its strong hold on our data centers and architecture, then we need to think of a better way for data to communicate with business’.  Part of this can be addressed with the adoption and implementation of such open source solutions like <a href="http://www.opensearch.org/Home">OpenSearch</a>, which allows for the sharing of data through a common and structured format for data to be shared and even extended with formats like RSS and Atom just to name a few.</li>
</ol>
<p><strong>I Told You I Had A Point&#8230;..Here It Is!</strong></p>
<p>When I look at the different variants and the many ways in which meta-data is utilized today, and how we are just learning how to maximize its potential, we are still tied to the old architectures, and thus tied to a less spectacular way to gain insight of our data. Being able to leverage that analysis will help us make better business decisions.  I am not talking about Business Intelligence solutions, although I do ponder that perhaps if data and its meta-data were able to leverage a common open source set of API’s, then we may have something here. Perhaps we could then be the well for all intelligent meta-data and be the providers and or helpers who other solutions can leverage for meta-data analytics.  In order to shrink our architectures and squeeze out more benefits from our data, we need to think about a new approach to meta-data management and what we could expect from next generation solutions.</p>
<p>In Michael Hay’s recent blog post <a href="http://blogs.hds.com/technomusings/2011/09/interacting-with-cloud-stores.html"><em>Interacting with Cloud Stores</em></a>, Michael articulates precisely what is needed around a common set of open sourced APIs, which can be leveraged throughout the enterprise across all applications, storage, data objects and meta-data in order to get us to be able to link and share meta-data across not only the enterprise, but with external meta-data from social media networks (and even from public cloud solutions). Meta-data is the Holy Grail, and the more we can leverage, embed unique markers and add intelligence into our meta-data, the more we can begin to reap the benefits of clearer insight to that data.</p>
<p>As you can probably tell, my blog will take on the challenge of looking at Big Data in many different ways, but most importantly around Content Cloud, Search and eDiscovery.</p>
<p>I look forward to your thoughts and comments!</p>
<img src="http://feeds.feedburner.com/~r/hds/datacontent/~4/1TnQDpXqdTA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/datacontent/2011/09/welcome.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blogs.hds.com/datacontent/2011/09/welcome.html</feedburner:origLink></item>
		<item>
		<title>Announcing the Data Content Blog</title>
		<link>http://feedproxy.google.com/~r/hds/datacontent/~3/y7k_vDso34s/announcing-the-data-content-blog.html</link>
		<comments>http://blogs.hds.com/datacontent/2011/09/announcing-the-data-content-blog.html#comments</comments>
		<pubDate>Fri, 23 Sep 2011 11:47:24 +0000</pubDate>
		<dc:creator>Robert Primmer</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/datacontent/?p=5</guid>
		<description><![CDATA[Hello, and welcome to the Data Content Blog. On this blog we will focus on the area of data storage commonly referred to as “unstructured data.” We have all seen the industry charts that show the growth of unstructured data to be dramatic, growing to exabytes by 2015. The storage systems designed for this class [...]]]></description>
			<content:encoded><![CDATA[<p>Hello, and welcome to the Data Content Blog.</p>
<p>On this blog we will focus on the area of data storage commonly referred to as “unstructured data.” We have all seen the industry charts that show the growth of unstructured data to be dramatic, growing to exabytes by 2015. The storage systems designed for this class of data are evolving to meet the challenges associated with trying to manage and keep track of such massive amounts of data.<br />
Perhaps the most important distinction is that these systems are really software applications that use storage, but aren’t intrinsic to the storage subsystem. As a software solution, the storage application itself can <strong>perform functions particular to the data content as there’s a greater knowledge of the nature of the customer data than is possible in a pure disk storage subsystem</strong>. This increased knowledge allows for a greater set of actions to be taken based upon the data content, rather than on more generic qualities such as disk segment boundaries. A simple example is the ability for an application or storage administrator to select which specific objects or files it would like to have replicated, when and where. A more complex example involves performing arbitrarily complex analytics on the data that effectively transforms data to information.</p>
<p><span id="more-5"></span></p>
<p>A second important function for this class of storage software is the<strong> ability to simply keep track of where all the data resides.</strong> It’s a lot easier to store a petabyte than it is to store, catalog, and keep track of a billion objects over their lifetime.  As the time horizon for stored data approaches decades, it’s a given that the associated storage and server hardware will change generations multiple times. The software application that fronts these systems needs to be built to withstand these changes without requiring forklift upgrades. Therefore, it’s important that all hardware elements are sufficiently abstracted in the storage software to accommodate change. Likewise, the ability the test the veracity of stored data is needed as backup for data at this size is impractical.<br />
On this blog we’ll talk about these and other issues particular to unstructured data and how it relates to what’s happening in the industry at large, as well as specific customer segments.<br />
This blog will have four contributors: Cameron Bahar, Frank Wilkinson, Shmuel Shottan, and myself, Robert Primmer. Below are biographies for Cameron, Frank and Shmuel.</p>
<p><strong> Cameron Bahar</strong></p>
<p>Chief Product Strategist, Scale Out Storage Platforms &amp; Big Data Analytics<br />
Cameron Bahar leads the technology direction and strategy on scale-out storage platforms, distributed file systems and Big Data analytics at Hitachi Data Systems, bringing over 20 years of systems software development and deep expertise in distributed operating systems, parallel databases and data center management.<br />
Cameron joined HDS through the acquisition of ParaScale where he was the founder, CTO and VP of engineering. At ParaScale, he developed and released a private cloud storage and computing software platform for the enterprise to address Big Data workloads.<br />
Earlier, Cameron led design, deployment, and operation of Scale8&#8242;s distributed Internet storage service that provided storage for digital content owners including MSN and Viacom/MTV. At the HP Enterprise Systems Technology Lab, he developed system software for disk volume management, data security, and utility data center management. At Teradata, Cameron developed extensions to UNIX to allow the massive parallel processing required by the Teradata database, the world&#8217;s largest and fastest distributed RDBMS. Cameron started his career at Locus Computing, a pioneer in distributed operating systems, single-system image clustering, and distributed file systems.<br />
Cameron holds a BS, summa cum laude, and an MS with honors, in Electrical Engineering from UC Irvine. Cameron holds 4 patents in scalable distributed storage systems, virtual file systems, and high availability systems.</p>
<p><strong>Frank T. Wilkinson</strong></p>
<p>Content Services Strategist, Office of Technology and Planning<br />
As the content strategist in the Office of Technology and Planning, Frank brings over 15 years’ experience in the field of IT and software development and expertise in HDDS and HNAS. He is an industry expert in the areas of knowledge and content management, information management within the financial services, healthcare, media &amp; entertainment, retail and SLED.<br />
He has held many IT certifications, including being a Master Architect. Prior to joining HDS, Frank was the Business Strategist for HP Software, Information Management Division and spent five years developing and deploying information management and content management solutions. Prior to HP, he enjoyed stints at the EMC TSG group’s information management practice and K-Vault Software (Symantec Enterprise Vault). Frank has been an active speaker for industry led panels and discussion on the topics of eDiscovery, information management, content management and next generation solutions facing the enterprise.</p>
<p><strong> Shmuel Shottan</strong></p>
<p>SVP, CTO BlueArc, Part of Hitachi Data Systems<br />
As CTO of BlueArc technology, Shmuel is responsible for developing and advancing BlueArc product innovation. Previously Shmuel served as senior vice president of Product Development of BlueArc. He joined BlueArc in 2001.<br />
Shmuel has over 30 years&#8217; experience in the research and development of hardware and software, and in engineering management for firms ranging from start-ups to Fortune 500 companies. Prior to BlueArc, he was senior vice president of Engineering and chief strategy officer for Snap Appliance. Earlier, Shmuel held executive positions at Quantum Technology Ventures and Parallan Computer. Previously he held senior engineering positions at AST Computers and ICL North America.<br />
Shmuel holds B.S. degrees from the Technion – Israel Institute of Technology in electrical engineering and computer science.</p>
<p>As for me, I am senior technologist and senior director of product management in the file content and services division at HDS where I am responsible for devising technology solutions that will be incorporated into future enterprise and cloud product/service offerings, with in-depth knowledge and expertise with <a href="http://www.hds.com/products/storage-systems/content-platform/" target="_self">Hitachi Content Platform</a> (HCP).<br />
I have 25 years’ experience in technology, working in R&amp;D and Product Management organizations with Cisco Systems, EMC, HDS and several start-ups. I am a member of ACM and IEEE and belong to the IEEE Computer Society.<br />
We all look forward to bringing you insights on this dynamic topic of data content.  If you have particular issues you’d like us to address, please let us know in the comments.</p>
<img src="http://feeds.feedburner.com/~r/hds/datacontent/~4/y7k_vDso34s" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/datacontent/2011/09/announcing-the-data-content-blog.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blogs.hds.com/datacontent/2011/09/announcing-the-data-content-blog.html</feedburner:origLink></item>
	</channel>
</rss>

