<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Michael Hay</title>
	
	<link>http://blogs.hds.com/michael</link>
	<description>Musings and discussions about storage and technology</description>
	<pubDate>Fri, 06 Nov 2009 08:57:35 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<image><link>http://www.hds.com/</link><url>http://www.hds.com/img/logo_hds-93x24.gif</url><title>Hitachi Data Systems</title></image><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/hds/michaelhay" type="application/rss+xml" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
		<title>Interview with Gazopa Personnel</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/W0-ErUfzDoM/interview-with-gazopa-personnel.html</link>
		<comments>http://blogs.hds.com/michael/2009/11/interview-with-gazopa-personnel.html#comments</comments>
		<pubDate>Fri, 06 Nov 2009 06:24:41 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Cloud]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[Forward Thinking]]></category>

		<category><![CDATA[Innovation]]></category>

		<category><![CDATA[Search]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Image]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1351</guid>
		<description><![CDATA[

As I mentioned from my previous short post on Gazopa this post contains an interview from Mr. Hideki Kobayashi at Hitachi Limited in Japan.  Mr. Kobayashi is the Gazopa project leader running a team that spans both the United States and Japan.   With that said, let&#8217;s jump right in.

From your perspective what is the [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://www.gazopa.com"><img class="aligncenter" title="Gazopa" src="http://www.gazopa.com/images/logo_with_catch_beta.png" alt="" width="223" height="83" /></a></p>
<p><span id="more-1351"></span></p>
<p>As I mentioned from my <a title="Gazopa post" href="http://blogs.hds.com/michael/2009/10/gazopa-beta.html" target="_blank">previous short post</a> on <a title="Gazopa image search" href="http://www.gazopa.com/" target="_blank">Gazopa</a> this post contains an interview from Mr. Hideki Kobayashi at Hitachi Limited in Japan.  Mr. Kobayashi is the Gazopa project leader running a team that spans both the United States and Japan.   With that said, let&#8217;s jump right in.</p>
<ol>
<li>From your perspective what is the most exciting thing about Gazopa?<br />
<em><span style="color: #888888;"><br />
The most exciting thing about GazoPa is that it enables users to search for  images by uploading images and through online drawings. With a conventional image search, only a key word could become a search query. So I view the approach of the technology within GazoPa.com as extremely, powerful, natural and highly intuitive.</p>
<p></span></em><em> </em></li>
<li>What kinds of applications developers or companies are interested in the technology behind Gazopa?<br />
<span style="color: #888888;"><em><br />
E-commerce sites and stock photo sellers have an interest in our technology. Game companies are also quite interested in our technology. We&#8217;ve also released a second Facebook application called GazoPa Drawing. It&#8217;s a kind of game, where seers compete with others to create the most similar drawings.</p>
<p></em></p>
<p></span></li>
<li>Is there anything that is coming to the Gazopa application which you would like to hint at?<br />
<em><span style="color: #888888;"><br />
We&#8217;re planning to release a new application that uses our image search technology to solve e-commerce problems.</p>
<p></span></em><em> </em></li>
<li>What kinds of applications do you believe that Gazopa technologies will be used in?<br />
<em><span style="color: #888888;"><br />
E-commerce and stock photo sites and community sites that have lots of photos and images. However we are open to anything and anyone who has a thought an idea on how to use the technologies behind Gazopa.</p>
<p></span></em></li>
<li>Can you provide us some history of Gazopa from initial idea to its current state?<br />
<em><span style="color: #888888;"><br />
We started development of the core engine more than 10 years. However at that time there was no iPhone and YouTube so we could not really express the capabilities of the core engine until now. We started developing GazoPa.com in the spring of 2008, and we were selected as one of finalists of TechCrunch50.  In September, 2008 we started our private beta, and later in 2008 we released our first Facebook application.  Following the Facebook  application, we also released the iPhone application to further illustrate how our technology can be used.  I&#8217;m also proud to say that GazoPa.com has entered open beta this month, so everyone should head on over to Gazopa or download the iPhone application and give it a try. (Note that to get the iPhone application to work you will have to sign-up for an account at Gazopa first.  We soon hope to remove this requirement in a future release of the iPhone for Gazopa application.)</p>
<p></span></em></li>
<li>There have been recent articles and releases by Google as well as others who suggest that they are in the image search business.  Google has even stated that this is a feature and not a product.  What are your response to the industry reports, analysts and competitors?<br />
<em><span style="color: #888888;"><br />
Basically I view Google&#8217;s and other&#8217;s comments on this capability as business domain specific. If you want to become a general image search engine provider, keyword is still a main search query and similar image is a feature. However, if your focus is on similar image searching, one can focus on being a similar image search engine provider with specialized services and technologies suited to that business and technical problem domains.  Further one can also provide the core technologies in an ASP model to other online applications, applications on the iPhone, applications in Facebook, etc.  Hitachi is not out to compete with Google in terms of generalized search, we are looking to bring our technologies to market through other providers and we view Gazopa as an excellent demonstration of what is possible for others to make use of with our technologies.</p>
<p></span></em></li>
<li>A bit of a technical question.  What are the components and languages that the Gazopa application and services are written in?<br />
<em><span style="color: #888888;"><br />
The Gazopa front end is developed in Ruby, and the core engine is developed in JAVA and C.</p>
<p></span></em></li>
<li>Finally if there are established companies, start-ups, open source developers, etc. who are interested in Gazopa who should they contact?<br />
<em><span style="color: #888888;"><br />
Please contact me at: hkoba@gazopa.com</span></em></li>
</ol>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/W0-ErUfzDoM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/11/interview-with-gazopa-personnel.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/11/interview-with-gazopa-personnel.html</feedburner:origLink></item>
		<item>
		<title>The Analog Blogger</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/-oocR9di36o/the-analog-blogger.html</link>
		<comments>http://blogs.hds.com/michael/2009/11/the-analog-blogger.html#comments</comments>
		<pubDate>Thu, 05 Nov 2009 22:42:49 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Forward Thinking]]></category>

		<category><![CDATA[Innovation]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1333</guid>
		<description><![CDATA[In a sense information wants to be free and Alfred Sirleaf in Liberia seems to be an agent of information freedom.  He appears to be the world&#8217;s first and only analog blogger.  I find this interesting, because in my mind this is innovation.  He is using the tools at his disposal to gather the information, and [...]]]></description>
			<content:encoded><![CDATA[<p>In a sense information wants to be free and Alfred Sirleaf in Liberia seems to be an agent of information freedom.  He appears to be the world&#8217;s first and only analog blogger.  I find this interesting, because in my mind this is innovation.  He is using the tools at his disposal to gather the information, and just like an Internet news aggregator site like Google News, but on a chalk board, he publishes his findings free for all to see.  I just finished reading rather short article <a title="The analog blogger" href="http://www.motherboard.tv/2009/10/27/monrovian-analog-blogger" target="_blank">here</a> and watched the video which is embedded in my post at the bottom.  Due to the lack of regular information in the country he is apparently quite successful with &#8220;droves&#8221; of people lining up to get to the information.  My primary reason for believing that this is innovation lies in the fact that Alfred recognized a need of his country and sought to fulfill said need.  This is the basis of innovation.  In fact if we look up the definition of innovation on wiktionary we find the following:</p>
<p><span id="more-1333"></span></p>
<p style="padding-left: 30px;"><em>The act of innovating; the introduction of something new, in customs, rites, etc; A change effected by innovating; a change in customs; something new, and contrary to established customs, manners, or rites; A newly formed shoot, or the annually produced addition to the stems of many mosses. (source: </em><a title="Definition of innovation" href="http://en.wiktionary.org/wiki/innovation" target="_blank"><em>http://en.wiktionary.org/wiki/innovation</em></a><em>)</em></p>
<p>I think that too often in our modern world we are tricked into thinking that innovation can only occur in the domains of sophisticated R&amp;D labs, an the innovations themselves can only be scientific in nature. I believe that Alfred is a great illustration that something simple but new can be extraordinarily innovative.  For the longest time one of my personal driving forces has been to think of one thing that no one has thought of before; I envy Alfred who has thought of this new thing.</p>
<p><script src="http://www.vbs.tv/vbs_player.js?width=500&amp;height=300&amp;ec=pqa214OunaUilVbuf1urN2VQ7QYeefxp&amp;st=undefined&amp;pl=http://www.motherboard.tv/2009/10/27/monrovian-analog-blogger" type="text/javascript"></script></p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/-oocR9di36o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/11/the-analog-blogger.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/11/the-analog-blogger.html</feedburner:origLink></item>
		<item>
		<title>Gazopa Beta</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/jyXULa_cCKM/gazopa-beta.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/gazopa-beta.html#comments</comments>
		<pubDate>Thu, 29 Oct 2009 05:48:32 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Enterprise 2.0]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[Search]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1328</guid>
		<description><![CDATA[If you have not already heard the Hitachi Central Research Lab has their image search engine at the beta stage.  The name of the beta application is Gazopa and there are even iPhone and Facebook applications for the service.


Note that the above Youtube video shows the iPhone application in action.  In a future post I&#8217;ll [...]]]></description>
			<content:encoded><![CDATA[<p>If you have not already heard the Hitachi Central Research Lab has their image search engine at the beta stage.  The name of the beta application is <a title="Gazopa" href="http://www.gazopa.com" target="_blank">Gazopa</a> and there are even iPhone and Facebook applications for the service.</p>
<p><span id="more-1328"></span></p>
<p><object width="350" height="245" data="http://www.youtube.com/v/i21H8HAlnfE&amp;feature" type="application/x-shockwave-flash"><param name="src" value="http://www.youtube.com/v/i21H8HAlnfE&amp;feature" /></object></p>
<p>Note that the above Youtube video shows the <a title="Gazopa for iPhone" href="http://itunes.apple.com/WebObjects/MZStore.woa/wa/viewSoftware?id=307884092&amp;mt=8" target="_blank">iPhone application</a> in action.  In a future post I&#8217;ll be doing an email interview with one of the members of the Gazopa team at Hitachi America.</p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/jyXULa_cCKM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/gazopa-beta.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/gazopa-beta.html</feedburner:origLink></item>
		<item>
		<title>Microsoft Sharepoint Conference</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/uv3ey0JI6x4/microsoft-sharepoint-conference.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/microsoft-sharepoint-conference.html#comments</comments>
		<pubDate>Thu, 22 Oct 2009 22:37:47 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Enterprise 2.0]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[File Tiering]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Keiko Harada]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<category><![CDATA[Sharepoint]]></category>

		<category><![CDATA[Single Instancing]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1300</guid>
		<description><![CDATA[

Hitachi (Hitachi Data Systems and Hitachi Consulting) is at the MS Sharepoint conference displaying our competencies for Sharepoint including enabling products and deployment approaches.  I&#8217;ve talked about the Hitachi Data Discovery for Microsoft Sharepoint (HDD-MS) in the past here and here.  Here&#8217;s a couple of really great pictures taken by my colleague Keiko Harada with respect to HDD-MS. I think that [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-1302" style="margin-top: 3px; margin-bottom: 3px; margin-left: 10px; margin-right: 10px;" title="hdd-ms-sharepoint2" src="http://blogs.hds.com/michael/wp-content/uploads/2009/10/hdd-ms-sharepoint2.jpg" alt="hdd-ms-sharepoint2" width="274" height="205" /><img class="alignright size-full wp-image-1304" style="margin-top: 3px; margin-bottom: 3px; margin-left: 10px; margin-right: 10px;" title="picture-083" src="http://blogs.hds.com/michael/wp-content/uploads/2009/10/picture-083.jpg" alt="picture-083" width="274" height="205" /></p>
<p><span id="more-1300"></span></p>
<p>Hitachi (Hitachi Data Systems and <span style="background-color: #ffffff;">Hitachi Consulting) is at the <a title="MS Sharepoint" href="http://www.mssharepointconference.com/Pages/default.aspx" target="_blank">MS Sharepoint</a> conference displaying our competencies for Sharepoint including enabling products and deployment approaches.  I&#8217;ve talked about the Hitachi Data Discovery for Microsoft Sharepoint (HDD-MS) in the past <a title="HDD-MS part 1" href="http://blogs.hds.com/michael/2009/02/infrastructure-for-enterprise-20-applications-part-2.html" target="_blank">here</a> and <a title="HDD-MS part 2" href="http://blogs.hds.com/michael/2009/07/capacity-optimization-for-hitachi-file-and-content-services.html" target="_blank">here</a>.  Here&#8217;s a couple of really great pictures taken by my colleague Keiko <span style="background-color: #ffffff;">Harada with respect to HDD-MS. I think that the first one is pretty cool and includes several of the Hitachi booth swag as the audience. The second photo is of our entire booth, with HDD-MS being shown integrated to the Hitachi Content Platform (HCP).  Again as I pointed out in my previous post, HDD-MS is a light weight plugin that enables direct from Sharepoint to HCP content storage.  Unlike our competitors&#8217; products which require Sharepoint, content management offerings, potentially data archiving software, and then long term cloud storage &#8212; oops wait as far as I know ATMOS(T) isn&#8217;t integrated to any applications yet &#8212; we&#8217;re pretty skinny in terms of our implementation.  The combination of HDD-MS and a durable backing store assists both with making it easier to backup Sharepoint as well as being able to allow for simple eDiscovery across multiple Sharepoint sites, site collections, farms, etc.  Also with the capacity optimization features within HCP, if end users make a habit of using the sneaker-net consisting of email and USB drives to distribute content outside of Sharepoint leading to duplicates, then HCP can both perform the single instancing as well as compression allowing IT professionals to cope better with sneaker-net content distribution.  With all of this users continue to use their document libraries, Sharepoint permissions, versioning, etc.  We are just moving content under the covers, using REST, without disturbing their behaviors. Anyway kudos to Keiko and the team at the Sharepoint conference.</span></span></p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/uv3ey0JI6x4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/microsoft-sharepoint-conference.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/microsoft-sharepoint-conference.html</feedburner:origLink></item>
		<item>
		<title>The Rainbow Connection</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/VuflqEgOZDY/the-rainbow-connection.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/the-rainbow-connection.html#comments</comments>
		<pubDate>Mon, 19 Oct 2009 02:14:00 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Cloud]]></category>

		<category><![CDATA[Enterprise 2.0]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[Amazon]]></category>

		<category><![CDATA[Apple]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[iTunes]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<category><![CDATA[NetApp]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1224</guid>
		<description><![CDATA[Digressions from the previous post
If you had not guessed already I&#8217;m sticking with the Kermit the Frog theme here.  And just for grins here is the Rainbow Connection with Kermit and Debby Harry singing.

So on my previous post I talked about files being &#8220;the objects&#8221; stored out there in the cloud.  I know, I know [...]]]></description>
			<content:encoded><![CDATA[<h2>Digressions from the previous post</h2>
<p>If you had not guessed already I&#8217;m sticking with the Kermit the Frog theme here.  And just for grins here is the Rainbow Connection with Kermit and Debby Harry singing.<span id="more-1224"></span></p>
<p style="padding-left: 90px;"><object width="340" height="285" data="http://www.youtube.com/v/lRvhRhWWE44" type="application/x-shockwave-flash"><param name="quality" value="autohigh" /><param name="salign" value="r" /><param name="id" value="my_movie" /><param name="name" value="rainbow" /><param name="src" value="http://www.youtube.com/v/lRvhRhWWE44" /><param name="align" value="right" /><param name="bgcolor" value="#640fd0" /><param name="vspace" value="3" /><param name="hspace" value="10" /></object></p>
<p>So on my <a title="Objects are files" href="http://blogs.hds.com/michael/2009/10/froggy-or-cloudy.html" target="_blank">previous post</a> I talked about files being &#8220;the objects&#8221; stored out there in the cloud.  I know, I know that is a really obvious point, but as I mentioned even truly novel ideas are obvious when you look at them on paper.  One of the greatest quotes of learning a horrible yet obvious point is &#8220;Soylent Green is people!&#8221; which is from the 1973 film <a title="...is people" href="http://en.wikipedia.org/wiki/Soylent_Green" target="_blank">Soylent Green</a>.  In this the main character through various adventures shows that the reality behind the food people are eating is well people.  I feel like this was my point at files being objects.</p>
<h2>There&#8217;s a pot of gold at the end of the rainbow?</h2>
<p><img class="alignleft" style="margin-top: 3px; margin-bottom: 3px; margin-left: 10px; margin-right: 10px;" title="beautiful rainbows" src="http://farm4.static.flickr.com/3291/3085643294_eef1b94211.jpg" alt="" width="265" height="350" /></p>
<p>Okay sorry for the digressions, in this post what I want to touch on is scale and infrastructure in the cloud.  The first point that I want to make: for new efforts which have not been done before that do not have commercial software already available of course new technologies emerge.  However there is always a hype cycle which everyone is familiar with and is rather like a gold rush in some sense.  However, hopefully the consequences are no where near the dotcom crash. Speaking of hype, I&#8217;m sure that during the early part of the RDBMS hype cycle databases were going to replace everything in the world including file systems.  Oops that didn&#8217;t happen.  The same with the hype surrounding distributed systems versus the mainframe where the distributed system was going to replace the mainframe and the mainframe was a dinosaur &#8212; yeah a T-Rex!  Well that did not happen either.  So cloud is not the end all be all for everything that the world is hyping it up to be.  Cloud is about a new set of problems that cannot be solved using traditional COTS approaches.  That is why Google, Microsoft, Amazon, Apple and others are using a lot of manpower and a mixture of various commercial and open source stacks to scratch their itch for their businesses.  This is largely my point behind citing good old Kermit&#8217;s song where the lyrics talk about the rainbows being visionary but merely an illusion.  So what I&#8217;m saying is that I agree with <a title="Tony's thoughts on clouds" href="http://blogs.hds.com/tony/2009/10/thoughts-on-it-clouds.html" target="_blank">Tony</a> about the current point in the hype cycle and that the promise of cloud solving everything sounds like a great vision but when you finally get there, like a pot of gold at the end of the rainbow it was a myth.</p>
<h2>Hammer in a nail not a screw</h2>
<p>I think back to several years ago when I talked to a budding SNS company.  They were looking for specialized file systems which were application specific and not general purpose.  A file system that was flat and tuned for this company&#8217;s particular application which required billions of small files and really really random I/O.  Essentially, two of NetApp&#8217;s highest end systems were not cutting the muster with their CPU busy rate pegged to as close to 100% as possible.  The bulk of the company&#8217;s image I/O was handled by their squid web cache, which responded to 95% of the I/O requests from their application and users.  While the NetApp systems were on their knees with only 5% of the remaining workload.  Basically the WAFL file system is not well suited to really small file random I/O, in the long term, without running what is effectively a defrag on the WAFL file system.  (In fact if it was read optimized we would call it the Read Anywhere File Layout or RAFL for short.  Wanna buy a ticket?)  Because the company in question could never take one of the systems down &#8212; the CPUs were pegged at 100% &#8212; they could never run the defrag and therefore this was causing them significant performance availability problems.  I know that in the intervening time NetApp has surely worked to remedy this, but I want to use this as an illustration of the fact that you have to use the right tool for the job.  If there was a specific image repository or store which was optimized for the I/O workload and capacity then I think that the company in question would have been okay.  They were effectively trying to hammer in a screw which is fatal for the screw, NetApp was the wrong tool for their application.  Hopefully you can take the leap here and see that the SNS domain is an emerging effort and therefore requires new technologies to solve related problems.</p>
<h2>Modern Internet geology</h2>
<p><img class="alignleft" style="margin-top: 3px; margin-bottom: 3px; margin-left: 10px; margin-right: 10px;" title="The rock" src="http://farm3.static.flickr.com/2074/2242018402_7533c6d876.jpg" alt="" width="300" height="200" />One of the most famous communities on the Internet is <a title="/." href="http://highscalability.com/slashdot-architecture-how-old-man-internet-learned-scale" target="_blank">Slashdot</a> who largely predates many of today&#8217;s web communities as well as architectures which are now famous &#8212; like Google.  If you take a look at their <a title="/. arch" href="http://news.slashdot.org/article.pl?sid=07/10/18/1641203&amp;tid=124" target="_blank">architecture</a> you can find that they dedicate special purpose systems to storing static content, a common practice for many web applications.  What I see as going on in the cloud segment today, with respect to emerging applications and businesses, is the big infrastructure crunch.  That is to say offerings like the Hitachi Content Platform which serves up static content via HTTP/REST allows these emerging businesses to not worry about the mundane parts of their infrastructure and instead focus on providing value added features to their end users.  The image and model that I have in my mind is of sedimentary rock which starts out as living vibrant things with a lot of action and motion, and eventually falls into a lake or another body of water becoming brown peat moss or middleware.  Give things a few million years and eventually this vibrant stuff turns into well rock that you can build new structures on top of.  The infrastructure crunch is really just like that.  It is about taking things that need scale and performance and sticking them in a layer that can provide these attributes and then well allowing the upper application layers to forget about this stuff.  In the case of the Slashdot architecture HCP could crunch down their architecture and hold their static content and then they could forget about having to manage these systems moving on to supplying more value added functions/features.</p>
<h2>Mature applications</h2>
<p>With all of this talk about the emerging applications what about the mature applications like ERP systems, email systems, groupware systems, RDBMS systems, etc.?  In the race to deploy things in the cloud, these systems need some approach to get them into a cloud like infrastructure which seems to me like a utility approach to IT infrastructure.  In my opinion Hyper-V, XEN, VMWare, Oracle-VM and others can do just that.  While the current driver for virtual machine deployments is really about cost savings &#8212; note that even Green gets people&#8217;s attention because it saves money &#8212; in the long term I believe it will be the way that mature applications make it into the cloud.  The switching cost of changing these applications is super high and the cost of engineering new applications which are functionally equivalent is super high too.  So in my humble opinion I think that we will see architectures much like what Amazon has deployed, but inside of a company as the private cloud.  My colleague <a title="Hu's blog" href="http://blogs.hds.com/hu" target="_blank">Hu</a> has just put up a great post on what he views as an in effective architecture, VMAX for these workloads.  I think Hu is on to something here and I want to augment his point: people who have already made an investment in their SANs and existing infrastructures want to reuse their investments and rightly so.  Throwing these SAN architectures out which have served companies for years to create strong infrastructures is a short sighted approach.</p>
<h2>Conclusion</h2>
<p>So just to summarize a bit.  The objects that everyone is talking about are files.  As we pass through this cloud hype I think we will find that a lot of what we thought it would be was basically the illusion that there is a pot of gold at the end of the rainbow.  You need the right tool for the right job and COTS solutions today don&#8217;t completely solve the infrastructure problems for emerging applications.  Currently we are in a phase I call the infrastructure crunch which seeks to commercialize the portions of these emerging technology architectures for added scale, reliability and performance.  Finally we cannot forget about the emerging applications, heck even Amazon did not do that, and we have to help people and companies maximize the return on their investment they&#8217;ve already made.</p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/VuflqEgOZDY" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/the-rainbow-connection.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/the-rainbow-connection.html</feedburner:origLink></item>
		<item>
		<title>Storage Fusion - StoraFgUeSION - SfTuOsRiAoGnE</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/7nr64hS-cKs/storage-fusion-storafguesion-sftuosriaogne.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/storage-fusion-storafguesion-sftuosriaogne.html#comments</comments>
		<pubDate>Thu, 15 Oct 2009 06:55:44 +0000</pubDate>
		<dc:creator>Ken Wood</dc:creator>
		
		<category><![CDATA[Best Practices]]></category>

		<category><![CDATA[Block Storage]]></category>

		<category><![CDATA[Cloud]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[File Systems]]></category>

		<category><![CDATA[Forward Thinking]]></category>

		<category><![CDATA[Green]]></category>

		<category><![CDATA[SSD]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1177</guid>
		<description><![CDATA[
 
Storage Fusion - StoraFgUeSION - SfTuOsRiAoGnE
&#8220;HybridStorage&#8221; refers to the mix of flash and magnetic storage medias blended together to increase performance and efficiency. This particular effort is part of the HybridStorage Alliance to promote and educate the industry on the benefits and advantages of this approach. While I&#8217;m not personally involved in this organization&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-full wp-image-1181" src="http://blogs.hds.com/michael/wp-content/uploads/2009/10/me-designed-head-no-glasses2.jpg" alt="me-designed-head-no-glasses2" width="500" height="90" /></p>
<p><!--  /* Font Definitions */  @font-face 	{font-family:"MS Mincho"; 	panose-1:2 2 6 9 4 2 5 8 3 4; 	mso-font-alt:"?l?r ??fc"; 	mso-font-charset:128; 	mso-generic-font-family:modern; 	mso-font-pitch:fixed; 	mso-font-signature:-1610612033 1757936891 16 0 131231 0;} @font-face 	{font-family:"\@MS Mincho"; 	panose-1:2 2 6 9 4 2 5 8 3 4; 	mso-font-charset:128; 	mso-generic-font-family:modern; 	mso-font-pitch:fixed; 	mso-font-signature:-1610612033 1757936891 16 0 131231 0;}  /* Style Definitions */  p.MsoNormal, li.MsoNormal, div.MsoNormal 	{mso-style-parent:""; 	margin:0in; 	margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:12.0pt; 	font-family:"Times New Roman"; 	mso-fareast-font-family:"MS Mincho";} @page Section1 	{size:8.5in 11.0in; 	margin:1.0in 1.25in 1.0in 1.25in; 	mso-header-margin:.5in; 	mso-footer-margin:.5in; 	mso-paper-source:0;} div.Section1 	{page:Section1;} --><!--[if gte mso 10]&gt; &lt;!   /* Style Definitions */  table.MsoNormalTable 	{mso-style-name:"Table Normal"; 	mso-tstyle-rowband-size:0; 	mso-tstyle-colband-size:0; 	mso-style-noshow:yes; 	mso-style-parent:""; 	mso-padding-alt:0in 5.4pt 0in 5.4pt; 	mso-para-margin:0in; 	mso-para-margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:10.0pt; 	font-family:"Times New Roman"; 	mso-fareast-font-family:"Times New Roman"; 	mso-ansi-language:#0400; 	mso-fareast-language:#0400; 	mso-bidi-language:#0400;} --> </p>
<h3>Storage Fusion - StoraFgUeSION - SfTuOsRiAoGnE</h3>
<p style="text-align: justify;">&#8220;HybridStorage&#8221; refers to the mix of flash and magnetic storage medias blended togeth<img class="alignright size-full wp-image-1199" src="http://blogs.hds.com/michael/wp-content/uploads/2009/10/next-picture2.jpg" alt="next-picture2" width="185" height="125" />er to increase performance and efficiency. This particular effort is part of the <a href="http://www.hybridstorage.org/index.html">HybridStorage Alliance</a> to promote and educate the industry on the benefits and advantages of this approach. While I&#8217;m not personally involved in this organization&#8217;s efforts, Hitachi is. This bring back memories of the <a href="http://en.wikipedia.org/wiki/Floptical">floptical disk </a>and the <a href="http://en.wikipedia.org/wiki/NeXT">NeXt Machine.</a> However, I&#8217;m not going to discuss these devices here in detail. What I am going to describe is Storage Fusion which is a term I just made up because Hybrid storage was taken. I love thesauruses. Also, this is different than <a href="http://blogs.hds.com/michael/2009/07/the-other-side-of-single-instancing-%E2%80%93-re-instancing.html">Controlled Instancing</a> which places data at the right place, at the right time, for the right reason.<span id="more-1177"></span></p>
<p style="text-align: justify;">Specifically, what I see Hybridstorage proposing is that certain types of data can use or benefit from different types of storage media, not that we storage aficionados didn&#8217;t know this. So Hybridstorage is tiered storage media within a HDD form factor. Similarly, but at a higher level, Storage Fusion proposes a similar premise, certain types a data needs different types of storage or file systems. For example, with the proliferation of internet data and various services, it was noticed a while back that most of the bits on the web are static. While the storage industry calls this stuff &#8220;fixed content&#8221;, reference data, or write once-read many (WORM), these terms tend to be use to describe compliance or archiving types of storage uses and have a less active nature. In reality, there is a lot of bits patterns out there that never change, are highly active (read a lot) and are usually just replaced instead of being modified. I just used the term &#8220;bit patterns&#8221; but could also use ‘byte patterns&#8221; or binaries because most people use the term &#8220;data&#8221; to describe &#8220;their data&#8221; or information. What is an application, or operating system programs, devices drivers or binaries when stored on disk? This too is data, but is rarely referred to as &#8220;data&#8221;.  An interesting difference with this type of data is that there are millions of copies in some cases of this data. Think of your Windows Vista or XP, or a Linux distro. For every installed version there is at least one CD/DVD copy associated with it, site licenses not withstanding. With many Linux distros there&#8217;s the online copy and usually a plethora of mirror sites, your downloaded ISO images and possibly the CD/DVD version you&#8217;ve burned. So protection of this data is usually not an issue, but the inconvenience of rebuilding your systems is always annoying but doable. Useless you make a Ghost image like I do.</p>
<p style="text-align: justify;">So let&#8217;s scale this scenario up to say 500 TB and climbing. There&#8217;s data and there&#8217;s &#8220;data&#8221;. For some reason, when we get to this level of complexity, we end up saying operational data and user generated data. Operational data is the programs, applications, binaries, templates and configuration information that constitute the &#8220;state&#8221; of the system, and user generated data, with &#8220;user&#8221; including another machine, is the data that is stored by users or the results of applications.</p>
<p style="text-align: justify;">Take an online picture storage and sharing web site for example. The web servers, database servers and application servers running the applications of this web service could store and provision the storage centrally. This would allow the compute nodes to essentially be stateless and easily replaceable during failures or upgrades. Likewise, the user generated data would be stored centrally in a content repository like the newly announce <a href="http://www.hds.com/corporate/press-analyst-center/press-releases/2009/gl091013.html?WT.ac=us_inside_pan_cloud_101309">Hitachi Content Platform</a>.  The operational data is relatively small compared to the user generated data, but the activity of this data is highly active with much of it pinned in system memory. Booting nodes, loading application programs, serving up static web content and the assorted graphics that are locally stored are candidates for Storage Fusion where the file system is comprised of different storage types and media for the different types of data being stored. <a href="http://www.hds.com/products/storage-software/hitachi-tiered-storage-manager.html">Hitachi Tiered Storage Manager</a> does just this under the file systems. The operational data would end up on SSD or high speed SAS/FC drives using RAID-5 or RAID-10 protection levels depending on the databases. The bulk of the storage would be magnetic using SATA drive and probably a RAID-6 protection level and would store the user generated content, in this case pictures, that have a much lower activity level and that continues to diminish in activity over time. However, unlike Enterprise data management, this type of data cannot be deleted by the service provider so long as the service is available and the subscriber is paying the fees.</p>
<p style="text-align: justify;">Actually when you think about this scenario and other similar use cases, only a small percentage of any of the data stored is modified. The databases are probably the most active with regards to updating data. The most activity is reading both operational data and user generated data, uploading data and storing it, maybe moving it somewhere after initial load and removing data. In these cases you could craft ways of using optimized file systems and object storage with different media types and RAID levels to obtain the most optimal storage infrastructure to meet performance demands.</p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/7nr64hS-cKs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/storage-fusion-storafguesion-sftuosriaogne.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/storage-fusion-storafguesion-sftuosriaogne.html</feedburner:origLink></item>
		<item>
		<title>Froggy or Cloudy?</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/7gh6Cu4SBL4/froggy-or-cloudy.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/froggy-or-cloudy.html#comments</comments>
		<pubDate>Thu, 15 Oct 2009 01:13:00 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Cloud]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[Apple]]></category>

		<category><![CDATA[File Systems]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1168</guid>
		<description><![CDATA[Hi there folks! Okay so there are two logical arguments that come up again and again and I want to address them, but in two posts and this is the first.  For this post I want to point out the obvious and put a stake in the ground.  For all of the hubris about cloud [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" style="margin-top: 3px; margin-bottom: 3px; margin-left: 10px; margin-right: 10px;" title="Kermit in the winter" src="http://farm4.static.flickr.com/3284/3125429199_de2c9cb7c3.jpg" alt="" width="300" height="200" />Hi there folks! Okay so there are two logical arguments that come up again and again and I want to address them, but in two posts and this is the first.  For this post I want to point out the obvious and put a stake in the ground.  For all of the hubris about cloud object stores I have one simple question what objects are we storing in there?  On the consumer side there is a lot of discussion of digital photos, digital videos, contacts, notes, office productivity documents, etc.  On the enterprise/commercial side largely the biggest use of cloud are things like online backup where companies like Iron Mountain&#8217;s desktop backup service is pretty successful.  Well most of what I just listed are well files or services that collect files and send them to a &#8220;cloud environment.&#8221;  Why is that?  Well the reason is quite simple the devices and applications that all of us use today produce well files, store files, and operate on files.   Here&#8217;s an example of a non-PC system that uses and produces files: smart phones &#8212; like my iPhone &#8212; store pictures as files on a file system and movies as well as files on a file system.  So the simple answer to my question above is that these objects that we are storing in cloud services today are files plain and simple.  With that in mind let&#8217;s do a fancy bulleted list to compare some of the often cited reasons for &#8220;objects&#8221; and see how files rate.<span id="more-1168"></span></p>
<ul>
<li>User permissions - objects must contain enough state such that when the objects are ported to another system the user permission state is persisted.  Well, guess what files can preserve those attributes and that is why things like robo-copy are used to move files from one system to another because ACLs and permissions are preserved</li>
<li>Self describing format - the object should contain enough information to tell the surrounding systems what can and cannot be done with it. Well, guess what DRMed files &#8212; like what is on my AppleTV &#8212; are stored with security information included such as expiration time, permission to execute, etc.  Further systems like Microsoft Office include IRM that allows content authors to specify what users can and cannot do with the content.  Finally there are many applications out there that keep or store files in an XML format so that in the event a company goes out of business then the meaning can be regenerated by another program or even by a human &#8212; although I have hand edited/hacked Postscript files before</li>
<li>Explicit/Implicit metadata - objects should have the ability to associate arbitrary metadata with them and also include a set of inherent metadata.  Well, extended attributes, the ability to create files associated to primary files, or even use directory structures to encode metadata are all relevant ways to add arbitrary metadata for files.  Further every file has implicit metadata like the size of the file, the inode number, etc.</li>
</ul>
<p>I&#8217;m sure that there are more, but the point I&#8217;m trying to get at is that cloud storage should obviously be about files ladies and gentlemen.  We aren&#8217;t storing something new or different, we might be fully making use of the technologies and approaches which were already in place, but underutilized.    So when you next hear the latest cloud guru talking about exotic object storage systems ask a simple question: what are those objects you are talking about?  I suggest that you also follow-up with a healthy dose of: here is my business problem can you solve it at the right, cost, with investment protection (hey this is a nice way of saying that there is a roadmap and you can grow with your vendor), with the right performance and reliability, etc.  After all being a student of the games of billiards one of the key sayings is that a good billiard player can make use of a broomstick to cream someone at the selected game.  This implies that if you use block storage and Hyper-V to solve your problem and you did it in less time than your competition then you&#8217;ve done well.</p>
<p>Finally as to the title, well I started out an introduction with a reference to Kermit the Frog and now I will end one as well.</p>
<p style="padding-left: 30px;"><em><span style="font-family: mceinline;">&#8220;Time&#8217;s fun when you are having </span></em><span style="text-decoration: line-through;"><em><span style="font-family: mceinline;">files</span></em></span><em><span style="font-family: mceinline;">, oops flies.&#8221;</span></em></p>
<p style="padding-left: 30px;">
<p><span style="font-family: mceinline;">P.S.  From my somewhat regular discussions with Intellectual Property attorneys they often point out that patents are obvious and often simple once someone puts their idea on paper.  However before that point many people are baffled.  I think that while this file + cloud concept might be obvious to everyone now that I&#8217;m writing about, even my competitors, it needed to be said.</span></p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/7gh6Cu4SBL4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/froggy-or-cloudy.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/froggy-or-cloudy.html</feedburner:origLink></item>
		<item>
		<title>Chapter 4 - A New Hope</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/w79w-xj4B4o/chapter-4-a-new-hope.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/chapter-4-a-new-hope.html#comments</comments>
		<pubDate>Tue, 13 Oct 2009 13:03:41 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Cloud]]></category>

		<category><![CDATA[Enterprise 2.0]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[Innovation]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[cloud computing]]></category>

		<category><![CDATA[cloud storage]]></category>

		<category><![CDATA[File Systems]]></category>

		<category><![CDATA[HCAP]]></category>

		<category><![CDATA[HCP]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1032</guid>
		<description><![CDATA[My intention with this post is to tie quite a few things together.  The first thing to accomplish is a quick review of the previous chapters.

Chapter 1 - talked about the emotional hang-ups associated to      getting data in the cloud.  So in essence while we can understand     [...]]]></description>
			<content:encoded><![CDATA[<p>My intention with this post is to tie quite a few things together.  The first thing to accomplish is a quick review of the previous chapters.<span id="more-1032"></span></p>
<ol type="1">
<li class="MsoNormal"><a title="Chapter 1" href="http://blogs.hds.com/michael/2009/09/what-is-the-challenge-to-web-applications-in-the-cloud.html" target="_blank">Chapter 1</a><span class="MsoCommentReference"> </span>- talked about the emotional hang-ups associated to      getting data in the cloud.  So in essence while we can understand      intellectually that our data can be protected, until we get it at a      visceral level, we don&#8217;t really &#8220;get it.&#8221;</li>
<li class="MsoNormal"><a title="Chapter 2" href="http://blogs.hds.com/michael/2009/09/chapter-2-the-millennials.html" target="_blank">Chapter 2</a> - essentially      points out that some of the emotional hang-ups are alleviated by the next      generation of workers coming into the US over the next decade, nearly 80      million. It’s interesting to note the rough match in China is nearly 300      million people by some estimates. There is not an estimated number for      this population in India, but we could guesstimate that the number is in      the 100s of millions of people.</li>
<li class="MsoNormal"><a title="Chapter 3" href="http://blogs.hds.com/michael/2009/09/chapter-3-cheap-cloud-storage-but-what-about-tco.html#postcom" target="_blank">Chapter 3</a> -      asked the question about TCO with the all of the rage being the cheap      storage and Backblaze becoming the poster child du jour.  (Note the      folks over at Object Matrix objected to my personal assurances of      Hitachi&#8217;s continued survival.  Nick and I did have a little bit of a      &#8220;loud&#8221; <a title="Debate with Nick" href="http://blogs.hds.com/michael/2009/10/follow-up-to-previous-post.html" target="_blank">debate</a> but his point of storing content in an open manner      is spot on.  This is something that Hitachi has been practicing for      sometime even on the block storage side of the house.)</li>
</ol>
<p>So how does all this come together? Am I making good on my promise that Hitachi can and will help with various cloud activities?</p>
<p><a title="HDS Announces Agile Cloud Solutions" href="http://www.hds.com/corporate/press-analyst-center/press-releases/2009/gl091013.html" target="_self">Today we announced</a> our intentions for cloud and the new<a title="HCP" href="http://www.hds.com/products/storage-systems/content-archive-platform" target="_blank"> Hitachi Content Platform</a> (HCP) – which is slated for release later this calendar year.  We are augmenting HCP, which is already a venerable cloud storage contender, to include more features making it even more compelling for private cloud deployments, and we aren&#8217;t just stopping there.  In the past I&#8217;ve mentioned that when EMC announced their intentions with ATMOS(T) they were really joining a crowd of folks who had already done this before &#8212; not to mention silently replacing Centera (but shhhh, that&#8217;s a secret). What EMC did not do very well, and we are pushing the ball forward on, is direct and indirect application integrations to the cloud.</p>
<p><img class="size-full wp-image-1108 alignleft" title="hcp-v3-1a" src="http://blogs.hds.com/michael/wp-content/uploads/2009/10/hcp-v3-1a.png" alt="hcp-v3-1a" width="245" height="232" /></p>
<p style="text-align: center;"><img class="size-full wp-image-1110 aligncenter" title="hcp-v3-2a" src="http://blogs.hds.com/michael/wp-content/uploads/2009/10/hcp-v3-2a.png" alt="hcp-v3-2a" width="256" height="226" /></p>
<p style="text-align: center;">
<p><span style="background-color: #ffffff;">In the HDS solution, the first integration is via our Sharepoint Plugin, <a title="HDD-MS" href="http://www.hds.com/products/storage-software/data-discovery-suite.html" target="_blank">Hitachi Data Discovery for Microsoft Sharepoint</a>.  Since we first GA&#8217;ed this product almost two years ago, it used ultra-modern REST access to push files into HCP. We&#8217;ve continued pushing the ball forward since then to include persistence of Sharepoint metadata, scheduled retention, etc.  However, if we also look at the <a title="HNAS" href="http://www.hds.com/products/storage-systems/network-attached-storage/hitachi-high-performance-nas-platform.html" target="_blank">Hitachi NAS Platform</a> (HNAS) models&#8217; 3200, 3100, <a title="New HNAS announcement" href="http://www.hds.com/corporate/press-analyst-center/press-releases/2009/gl090908.html?WT.ac=us_hp_rm_hnas_090809" target="_blank">3090 and 3080</a> file content into the cloud, we have what is sometimes referred to as a “cloud on-ramp” which allows applications that have not made the leap to REST to continue using CIFS/NFS, while also taking advantage of getting their data into a private cloud(note that these are topics which I&#8217;ve pointed out with more detail in the past <a title="HDD-MS blog post" href="http://blogs.hds.com/michael/2009/02/infrastructure-for-enterprise-20-applications-part-2.html" target="_blank">here</a>, <a title="Cloud momentum" href="http://blogs.hds.com/michael/2009/04/cloud-and-archive-momentum-hcap-compliance.html" target="_blank">here</a> and <a title="Thoughts on cloud" href="http://blogs.hds.com/michael/2009/08/thoughts-on-applications-using-the-cloud.html" target="_blank">here</a>).  Our systems can be combined so that if customers have made a previous investment in consolidated SANs they can just add another application.  This is not like our competitors who can only offer stove pipes, which has little to no integration.</span></p>
<p>If you want, you can also use <a title="HDP" href="http://www.hds.com/products/storage-software/hitachi-dynamic-provisioning.html" target="_blank">Hitachi Dynamic Provisioning</a> (HDP) to work  with your cloud on-ramp attached to an <a title="AMS" href="http://www.hds.com/products/storage-systems/adaptable-modular-storage-2000-family/index.html?WT.ac=prodams2000" target="_blank">AMS</a> to support an application that uses NFS.  The on-ramp can then send data over REST to a centralized private cloud on HCP that is attached to a <a title="USP-VM" href="http://www.hds.com/products/storage-systems/universal-storage-platform-vm.html" target="_blank">USP-VM</a> and make use of <a title="HTSM" href="http://www.hds.com/products/storage-software/hitachi-tiered-storage-manager.html" target="_blank">Tiered Storage Manager</a>.  Further, this large centralized cloud is replicating to a bunker site which is made up of HCP 300 all running with embedded disk storage. And, we can do all of this today! So for us, cloud is not necessarily about a single product per se,  it is about making sure that applications which are tuned for the cloud can take advantage of it. It is about deploying a private cloud on top of existing assets, and enabling many capacity optimization features including single instancing, compression, HDP, WAN optimized replication, etc.  This is a key tenant of Hitachi&#8217;s strength and where we leave our competitors in the dust!</p>
<p><!--EndFragment--></p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/w79w-xj4B4o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/chapter-4-a-new-hope.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/chapter-4-a-new-hope.html</feedburner:origLink></item>
		<item>
		<title>A Great Post on Cloud</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/4G1asw17e20/a-great-post-on-cloud.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/a-great-post-on-cloud.html#comments</comments>
		<pubDate>Sun, 11 Oct 2009 05:22:44 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Cloud]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[File Systems]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1145</guid>
		<description><![CDATA[Over at the Enterprise Storage Forum there is a great article on data corruption and various media types for both the Enterprise as well a Consumer cloud markets.  Henry Newman basically looks at the error rates per media type &#8212; I find it interesting that tape has a lower error rate than HDDs and it [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" style="margin-top: 3px; margin-bottom: 3px; margin-left: 10px; margin-right: 10px;" title="cloud storage" src="http://farm3.static.flickr.com/2450/3618067392_1954bfe305.jpg" alt="" width="210" height="158" />Over at the <a title="Cloud article at Enterprise Storage Forum" href="http://www.enterprisestorageforum.com/technology/features/article.php/3843151" target="_blank">Enterprise Storage Forum</a> there is a great article on data corruption and various media types for both the Enterprise as well a Consumer cloud markets.  Henry Newman basically looks at the error rates per media type &#8212; I find it interesting that <a title="musings on tape" href="http://blogs.hds.com/michael/2009/05/so-is-tape-really-dead.html" target="_blank">tape</a> has a lower error rate than HDDs and it is one fact that bolsters some governmental organizations that state tape is a more reliable archive media &#8212; and also talks about the bandwidth limitations and hitting the wall meaning that the data sizes of 100s of petabytes versus the standardly available Internet connections are suffering an impedance mismatch.  (Actually Henry has a great table on this showing that it could take 79 days to replicate 100PB worth of data.)</p>
<p><span id="more-1145"></span></p>
<p>When I look at Henry&#8217;s article I see one point that I want to call attention.</p>
<p style="padding-left: 30px;"><em><span style="font-family: mceinline;">Clearly, moving 100PB on even 1TB enterprise drives can potentially cause significant loss of data, especially as many clouds I am familiar with do not use </span></em><a href="http://enterprisestorageforum.webopedia.com/TERM/R/RAID.html"><em><span style="font-family: mceinline;">RAID</span></em></a><em><span style="font-family: mceinline;"> and maintain data protection via mirroring.</span><span style="font-family: mceinline;"> </span></em></p>
<p>This does point out something that I think everyone should be paying attention to: multi-layer data protection!  Combining RAID, data replication, object cloning and policing, checksumming file systems, etc. should all lead to a more reliable system overall.  To prove my point I&#8217;m going to go into the realm of security for a second where <a title="Multi-Layered Security" href="http://identity-manager.hitachi-id.com/security/multi-layered-security.html" target="_blank">multi-layered security</a> is a norm.  Here is a really easy way to understand this concept, because it is more than just associated to the link above.  I know that there are many companies that for their use of firewall deployments intentionally select two vendors.  Now usually this would be for keeping each vendor honest with the entire customer experience however, in this case it has to do with having two different protection paradigms.  You see with one vendor there is one implementation including all features, bugs and vulnerabilities, but when two vendors are added there is another set of features, bugs, and vulnerabilities.  This implies that if a crack for one vendor&#8217;s system is pushed into the wild the other system has a higher chance of not impacted.  So it is the diversity of vendors, when combined, that actually increases the security and reliability of the aggregate system.  Hopefully, you are following me so far as I&#8217;m going to switch back to storage and the cloud now.</p>
<p>Henry&#8217;s point above really hints at this point: namely multiple data protection schemes are required at the media level to ensure that objects and blocks avoid the evilness of corruption.  Combining RAID, read after write on S-ATA, object cloning, remote replication, usage of alternative media types, etc. are all ways to improve the aggregate reliability of the system.  So I agree with Henry&#8217;s statement that we need to be looking into this as a key part of moving into the cloud.  Perhaps it is because of my <a title="Chapter 1 of 3" href="http://blogs.hds.com/michael/2009/09/what-is-the-challenge-to-web-applications-in-the-cloud.html" target="_blank">hangups</a> that I&#8217;m worried about these kinds of things.</p>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/4G1asw17e20" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/a-great-post-on-cloud.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/a-great-post-on-cloud.html</feedburner:origLink></item>
		<item>
		<title>Welcome Symantec to the Scalable NAS Market</title>
		<link>http://feedproxy.google.com/~r/hds/michaelhay/~3/R2hLKMUSa1g/welcome-symantec-to-the-scalable-nas-market.html</link>
		<comments>http://blogs.hds.com/michael/2009/10/welcome-symantec-to-the-scalable-nas-market.html#comments</comments>
		<pubDate>Sat, 10 Oct 2009 07:56:53 +0000</pubDate>
		<dc:creator>Michael Hay</dc:creator>
		
		<category><![CDATA[Competition]]></category>

		<category><![CDATA[File Storage]]></category>

		<category><![CDATA[File Systems]]></category>

		<category><![CDATA[HDS]]></category>

		<category><![CDATA[Hitachi]]></category>

		<category><![CDATA[Hitachi Data Systems]]></category>

		<category><![CDATA[Michael Hay]]></category>

		<category><![CDATA[Oracle]]></category>

		<category><![CDATA[SUN]]></category>

		<category><![CDATA[WFS2]]></category>

		<category><![CDATA[ZFS]]></category>

		<guid isPermaLink="false">http://blogs.hds.com/michael/?p=1066</guid>
		<description><![CDATA[
I &#8216;ve been reviewing Symantec&#8217;s new FileStore and their soon to be released S4 offerings. There are quite a few resources available on FileStore from their slides on the Internet to their product page.  Firstly I want to welcome them to the clustered NAS business much like HP is there already with EXDS, their Polyserve appliance. [...]]]></description>
			<content:encoded><![CDATA[<div>
<p>I &#8216;ve been <a title="Register's view of Symantec's new announcements" href="http://www.theregister.co.uk/2009/10/05/symantec_filestore/" target="_blank">reviewing Symantec&#8217;s new FileStore</a> and their soon to be released S4 offerings. There are quite a few resources available on FileStore from their slides on the Internet to their product page.  Firstly I want to welcome them to the clustered NAS business much like HP is there already with EXDS, their Polyserve appliance.  However it seems to me that Symantec like Oracle.SUN and Hitachi has joint the greater than 200TB file system club.  HNAS with WFS2 can support a file system size of 256TB and Oracle.SUN.  (Note NetApp, EMC and HP do not offer products with filesystems of this size.  Although NetApp will most likely debate that point with their Clustered Namespace available with Ontap V8 in GX mode.)</p>
<p><span id="more-1066"></span></p>
<p>Since there is not a lot of clarity on S4 I&#8217;ll skip it until we can see something with some &#8220;real meat on the bones&#8221; to analyze.  With respect to FileStore I do have some questions which I would like to find out the answer to:</p>
<ol>
<li>In the <a title="Slide materials from Symantec" href="http://www.slideshare.net/symantecnews/symantec-cloud-storage-offering-file-store" target="_blank">slide materials</a> from Symantec there is mention of NFS Spec performance (SFS2008) however the y-axis has no numbers on it, but Symantec claims near linear scalability of performance up to 16 nodes and that they will be faster than the fastest number from NetApp.   My question is where are the draft numbers and why did the presentation even include a performance graph if they weren&#8217;t ready yet?</li>
<li>FileStore is a soft appliance and as such runs on any backend storage including SAN attached.  To me this is a validation of the approach which Hitachi has taken, and in some cases we offer RAIN as well as SAIN modes of operation.  The question that I have is largely related to price here.  The quotes in the press are that for just under $7k USD you can purchase a node pair.  Does that include Storage, nodes, etc.?  Is there a per capacity license, etc.?</li>
<li>Embedded DLM/ILM or data migration policies are something else which are a part of FileStore (SSD to SAS/FC to SATA).  Just like HNAS these features are baked right into the NAS software.  One point of differentiation between the two products appears to be the fact that HNAS can target external file/object storage systems such as the Essential NAS Platform as well as HCAP.  So no question here just merely an observation.</li>
</ol>
<p>So I warmly welcome Symantec to the clustered/scalable NAS business and cannot wait to see how we compete in the market.</p>
<p><img class="aligncenter" title="Blade Symphony" src="http://www.hitachi.co.jp/products/bladesymphony/img_index/top_brnd.jpg" alt="" width="459" height="101" /></div>
<img src="http://feeds.feedburner.com/~r/hds/michaelhay/~4/R2hLKMUSa1g" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blogs.hds.com/michael/2009/10/welcome-symantec-to-the-scalable-nas-market.html/feed</wfw:commentRss>
		<feedburner:origLink>http://blogs.hds.com/michael/2009/10/welcome-symantec-to-the-scalable-nas-market.html</feedburner:origLink></item>
	</channel>
</rss>
