<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:yt="http://gdata.youtube.com/schemas/2007" version="2.0">
   <channel>
      <title>Data Domain Blogs</title>
      <description>Pipes Output</description>
      <link>http://pipes.yahoo.com/pipes/pipe.info?_id=lItVcG3I3RGRCSqbPxJ3AQ</link>
      <pubDate>Wed, 11 Nov 2009 09:05:12 -0800</pubDate>
      <generator>http://pipes.yahoo.com/pipes/</generator>
      <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/DataDomainDedupeMatters" type="application/rss+xml" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
         <title>Driving Down the Costs of Deduplication: "Continuing the Continuum"</title>
         <link>http://www.dedupematters.com/daniels_blog/2009/10/driving-down-the-costs-of-deduplication-continuing-the-continuum.html</link>
         <description>Since shipping its very first deduplication storage system in 2004, each successive generation of Data Domain product has deduped faster inline and offered extra capacity (both raw and logical capacity). This "continuum" of performance and capacity increases has been marked by some true milestones - such as the recently introduced DD880, which eclipsed the single controller ingest and store to disk speeds of all post-process deduplicating VTLs (and even of many VTLs that don't dedupe at all). And yes, along the way, the features and capabilities of Data Domain systems have evolved with new capabilities (cascaded replication, OST, ability to...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/daniels_blog/2009/10/driving-down-the-costs-of-deduplication-continuing-the-continuum.html</guid>
         <pubDate>Tue, 20 Oct 2009 10:05:38 -0700</pubDate>
         <content:encoded><![CDATA[<p>Since shipping its very first deduplication storage system in 2004, each successive generation of Data Domain product has deduped faster inline and offered extra capacity (both raw and logical capacity). This "continuum" of performance and capacity increases has been marked by some true milestones -&#0160;such as&#0160;the recently introduced <a rel="nofollow" target="_blank" href="http://http://www.datadomain.com/news/press_rel_072009.html"><a rel="nofollow" target="_blank" href="http://www.datadomain.com/news/press_rel_072009.html">DD880</a></a>, which eclipsed the single controller ingest and store to disk speeds of all post-process deduplicating VTLs (and even of many VTLs that don't dedupe at all). And yes, along the way, the features and capabilities of Data Domain systems have evolved with new capabilities (<a rel="nofollow" target="_blank" href="http://www.datadomain.com/news/press_rel_092209.html">cascaded replication</a>, <a rel="nofollow" target="_blank" href="http://www.datadomain.com/pdf/DataDomain-OST-Datasheet.pdf">OST</a>, <a rel="nofollow" target="_blank" href="http://www.datadomain.com/news/press_rel_091007.html">ability to address archival workloads</a>, <a rel="nofollow" target="_blank" href="http://www.datadomain.com/news/press_rel_062308.html">retention lock for IT governance</a>, etc.) Putting those enhancements aside for a second and focusing purely on Data Domain's "continuum" of performance and capacity increases, what are we&#0160;really delivering in terms of unique customer value? In short, it's this: driving down costs of deduplication - while keeping customers ahead of the curve in terms of their data growth challenges. </p>
<p>So why is this interesting? I spent all of last week at <a rel="nofollow" target="_blank" href="http://www.snwusa.com/">SNW</a>in Phoenix, AZ, running the SNIA Data Deduplication Hands-On Lab. Not surprisingly, data deduplication was once again a very popular topic throughout the conference. In the concurrent sessions, many of the vendors were busy arguing about technicalities, and this can definitely make for interesting debates, as well as the occasional farce. While some lab participants were getting their first exposure to the technology, many of them were now well versed in the particulars of the technology and their interest and questions were surprisingly singular in nature. They wanted to know how deduplication can save them money in the face of rapid data growth averaging 50% a year or more. </p>
<p>Well, there are la lot of ways to achieve cost savings with inline deduplication technology, ranging from automation of labor intensive DR processes to the whole tape replacement story. But, curiously, I instead found myself telling these users the same story I've been telling customers since I first joined Data Domain in 2006 - namely the "continuum" of performance and capacity increases demonstrated by each successive generation of our systems. How specifically does this drive down costs? Ultimately, it's all due to our <a rel="nofollow" target="_blank" href="http://www.datadomain.com/products/SISL.html">SISL scaling architecture</a>, which leverages the upgraded multi-core CPUs and memory in these new appliances -&#0160;but, for purposes of this blog post, I'd prefer to address the question at the system and real-world implementation level. </p>
<p>Consider the new mid-range and entry-level deduplication storage systems Data Domain <a rel="nofollow" target="_blank" href="http://www.datadomain.com/news/press_rel_101909.html">announced</a>earlier this week (the DD610, DD630 and the DD140 entry-level system). Each of these new systems replaces a previous generation that was similarly targeted for either mid-range enterprise applications/regional data centers or remote office locations. In the case of the new mid-range systems you get 60% more capacity and 100% more throughput performance. For the new entry level remote office system, you get 100% more capacity and 50% better performance. So, cost per Gb goes down (yet again) and price/to performance improves. Users can protect 60%-100% larger datasets and they can write their data 2x faster, enabling them to meet their nightly backup windows. </p>
<p>Based on what these users at the SNW SNIA labs were asking me, I remain convinced that Data Domain's proven track record of ongoing significant performance and capacity increases (while relative costs decrease), may actually be the most important element of the value proposition inline data deduplication offers. </p>
<p>So, I guess we should "continue the continuum?"</p>]]></content:encoded>
      </item>
      <item>
         <title>Replication Without Borders</title>
         <link>http://www.dedupematters.com/richs_blog/2009/10/replication-without-borders.html</link>
         <description>A past conversation with a Data Domain Fortune 1000 customer revealed that, at the primary data center, they were maintaining no less than 15 various types of backup devices and associated software - specifically for the purpose of being able to support the restoration of data in remote sites. They had nearly 700 of these remote sites and a variety of regional data centers. For a number of reasons, including acquisitions, the company's backup infrastructure had evolved over time to become incredibly diverse and difficult to manage. To mitigate, they had recently installed Data Domain systems at their primary and...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/richs_blog/2009/10/replication-without-borders.html</guid>
         <pubDate>Fri, 16 Oct 2009 14:32:22 -0700</pubDate>
         <content:encoded><![CDATA[<p>A past conversation with a&#0160;Data Domain Fortune 1000 customer revealed that, at the primary data center, they were maintaining no less than 15 various types of backup devices and associated software&#0160;- specifically for the purpose of being able to support the restoration of data in remote sites. They had nearly 700 of these remote sites and a variety of regional data centers. For a number of reasons, including acquisitions, the company's backup infrastructure had evolved over time to become incredibly diverse and difficult to manage. To mitigate, they had recently installed Data Domain systems at their primary and regional data centers deployed with a multi-site replication typology. Their longer term vision was to tackle the challenges they faced with their remote offices. </p>
<p>The story underscores the point that the replication needs of large organizations can be virtually boundless and can require tremendous configuration flexibility. Data Domain recently expanded the capabilities of its replicator software in two significant dimensions. </p>
<p>First, replication fan-in for many-to-one topology now supports up to 180 remote sites all replicating into a single system at a central hub site. One might wonder who would ever need that much, but the aforementioned customer is one of many who I have spoken to who have such an environment. </p>
<p>Second, Data Domain systems now support cascaded replication. In other words, you can now replicate data from one location to a secondary location, and then from the secondary location to a third site. While this might be more than some customers require, it's a requested feature that is more commonly asked for than you might think. </p>
<p>Recently I was discussing Data Domain systems and technology at an energy company. Since the bulk of the conversation was about their two data center locations, I assumed that cascaded replication would be of little interest. Turns out I was way off target. The company in question recently experienced a security related 'incident' that exposed a weakness in their dual data center model. The company realized then that they needed to have a stronger segregation of duties and are moving towards adding a third data center with strong isolation from either the first or the second site. In their new model, sites A and B will selectively cross-replicate with each other, and then both replicated data sets will also replicate into a secured and hardened site C. Administrators at sites A and B will not have a physical access or administrative access to equipment at site C. </p>
<p>However, I believe the main adopters of cascaded replication will be organizations that want to create a replication topology that maps precisely to their distributed site model. I see cascading fitting well into organizations with a combination of small remote sites, medium-sized regional hubs, and large global data centers. In this model you will see any number of small, remote sites replicating into regional hubs. These regional hubs will then replicate the remote site data plus their own local data to the larger, global data centers for longer term retention and a minimal amount of tape creation as may be required. </p>
<p>Let me backtrack a little and point out that Data Domain has&#0160;already benefited from an industry-leading replication capability before the recent enhancements. In fact, some of the competition have been shipping a very limited and inflexible replication capability for a few short months, while others are still promising 'Replication 1.0' in the near future. </p>
<p>With Data Domain, IT architects can now even more easily design and implement whatever replication topology their businesses require. With a flexible replication technology, customers can mold their replication strategy around the environment with respect to WAN topology, regional affinities and the desire to massively centralize or eliminate tape automation. </p>]]></content:encoded>
      </item>
      <item>
         <title>Packing Your Bag for VMworld</title>
         <link>http://www.dedupematters.com/richs_blog/2009/08/packing-your-bag-for-vmworld--vmworld-2009-is-drawing-near-and-its-time-to-start-preparing-for-the-trip-im-particular.html</link>
         <description>VMworld 2009 is drawing near, and it’s time to start preparing for the trip. I’m particularly excited about this year’s show. If there’s one IT event you choose to attend all year, VMworld 2009 should be near the top of the list for consideration. VMware has a great website for planning your time at the show. If typing “vmworld” into your browser and pressing ctrl+enter is too much effort, I’ve provided this convenient link: http://www.vmworld.com/community/conferences/2009/ VMworld always has an extensive schedule of events and activities, not to mention a list of sponsors and exhibitors a mile long. This year is...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/richs_blog/2009/08/packing-your-bag-for-vmworld--vmworld-2009-is-drawing-near-and-its-time-to-start-preparing-for-the-trip-im-particular.html</guid>
         <pubDate>Wed, 26 Aug 2009 07:56:00 -0700</pubDate>
         <content:encoded><![CDATA[<p>VMworld 2009 is drawing near, and it’s time to start preparing for the trip. I’m particularly excited about this year’s show.&#0160; If there’s one IT event you choose to attend all year, VMworld 2009 should be near the top of the list for consideration. </p><p>VMware has a great website for planning your time at the show. If typing “vmworld” into your browser and pressing ctrl+enter is too much effort, I’ve provided this convenient link:</p><p><a rel="nofollow" target="_blank" href="http://www.vmworld.com/community/conferences/2009/">http://www.vmworld.com/community/conferences/2009/</a></p><p>VMworld always has an extensive schedule of events and activities, not to mention a list of sponsors and exhibitors a mile long. This year is no exception. So how do you not get overwhelmed by all the possibilities? No worries – just sit back and let me do the planning for you.</p><p>Item 1:&#0160; Make a short list of your one or two biggest virtual wishes in advance. Touring the floor can be a marathon. If you have one or two problems to solve it can really help keep you focused on finding the right vendors to talk to.</p><p>Item 2:&#0160; Select two or three speaking sessions to attend.&#0160; See item 1 for suggestions on topics.&#0160; It’s hard to pick speaking sessions while the show is in progress.&#0160; Reading the list with a tote under one arm and a Caesar salad on a paper plate balanced on the hand opposite while standing in a moving sea of humanity takes more talent than most of us have. Also, since you’re here at dedupematters.com, I’ll go ahead and assume you’ve got some level of interest in the Data Domain story. Therefore, I’d like to be so bold as to suggest a couple of sessions. “Planning for Optimized and Cost-effective Storage Utilizing Deduplication and Virtualization” (Session BC3223) featuring Data Domain customer Jules Thomas, and “The Real Deal on Dedupe and Virtualization” featuring our own Enterprise Application Technologist, Daniel Budiansky.</p><p>Item 3:&#0160; Make extensive use of the hands-on labs. There’s no substitute for the experience of doing it yourself. There are labs covering a vast array of subjects, so there’s sure to be more than one that gives you the opportunity to experience something new. </p><p>Item 4:&#0160; Spend some time with a big vendor.&#0160; I know it sounds obvious, but it’s easy to overlook the big companies because you’re already familiar with what they have to offer.&#0160; Cisco is certainly on my short list.</p><p>Item 5:&#0160; Stop by the Data Domain booth. We’re going to be wearing EMC shirts for the first time. I think a bunch of us are planning on sneaking over to the regular EMC booth and trying to blend in.&#0160; Who knows, if we play our cards right we might even get invited back to <a rel="nofollow" target="_blank" href="http://www.emcworld.com/">EMC World</a> in Boston next May. Yeah, we love that show!</p><p>Most importantly, have a little fun while you’re in town. Maybe it’s time to put on your black leather baseball cap, rent a red Trans-Am, and ride in style to the <a rel="nofollow" target="_blank" href="http://www.foreigneronline.com/">Foreigner</a> show on Wednesday night. I wonder if they still have that big, inflatable jukebox?</p><p>However you pack your bags, and whatever your plans, I look forward to seeing you at the show.</p><p>Oh, and if you’re curious about how Data Domain can help protect your virtual infrastructure, check out our solutions page <a rel="nofollow" target="_blank" href="http://www.datadomain.com/solutions/vmware.html">here</a> . Or better yet, just stop by our booth. Once we’re done with our shenanigans, we’ll be happy to talk shop.</p>]]></content:encoded>
      </item>
      <item>
         <title>Like Peanut Butter and Chocolate</title>
         <link>http://www.dedupematters.com/daniels_blog/2009/08/like-peanut-butter-and-chocolate.html</link>
         <description>I have been excited about the possibilities of using deduplication storage and virtualization togethersince implementing both technologies while working in IT prior to joining Data Domain. At my previous employer, we built a multi-tier application infrastructure with a disaster recovery site 2,000 miles away using VMware ESX Server in both locations. All VMDK and application backups went to a Data Domain system in the primary site and were replicated those 2,000 miles to the DR site. The image backups were recovered on a scheduled basis to keep the underlying virtual servers up-to-date, with only specific applications (MS SQL, Exchange, CIFS...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/daniels_blog/2009/08/like-peanut-butter-and-chocolate.html</guid>
         <pubDate>Tue, 25 Aug 2009 10:50:58 -0700</pubDate>
         <content:encoded><![CDATA[<p>I have been excited about the possibilities of <a rel="nofollow" target="_blank" href="http://www.datadomain.com/solutions/vmware.html">using deduplication storage and virtualization together</a>since implementing both technologies while working in IT prior to joining Data Domain.&#0160;At my previous employer, we built a multi-tier application infrastructure with a disaster recovery site 2,000 miles away using VMware ESX Server in both locations.&#0160;All VMDK and application backups went to a Data Domain system in the primary site and were replicated those 2,000 miles to the DR site.&#0160;The image backups were recovered on a scheduled basis to keep the underlying virtual servers up-to-date,&#0160;with only specific applications (MS SQL, Exchange, CIFS fileshares) receiving agent level protection and recovery.&#0160;As a result, our primary data center was consolidated into five racks and we eliminated the use of tape for protecting our most critical applications. It is the many benefits of achieving these two objectives that our <a rel="nofollow" target="_blank" href="http://www.datadomain.com/resources/case-studies-application.html#vmware">customers</a> continue to enjoy today.</p>
<p>There are three generally accepted techniques for backing up VMs:</p>
<ol>
<li>
<p>Running backup agents, installed inside the guest</p>
<li>
<p>Running proxy-based VMDK backups using VCB, with or without 3rd -party backup application integration</p>
<li>
<p>Running host-based VMDK backups, either from the service console or using a VMware-specific backup application, like Vizioncore vRanger Pro</p></li>
</ol>
<p>Best practices documentation typically references using some combination of these approaches.&#0160;Unlike most other deduplication solutions, Data Domain systems support all three methods and will eliminate the redundancy across all data stored.&#0160;This support allows organizations to customize their data protection policies, and to best leverage existing infrastructure. <p>Because Data Domain systems deduplicate data inline, image backups can be reduced 40-60x or more before being stored, and entire virtual environments can be efficiently replicated using a minimum amount of bandwidth. We support VCB backups natively, and can be a target for most enterprise backup applications that provide VCB integration as well as traditional agents installed in the guest OS. Specialized applications, scripts, and even native VMware features like cloning, storage VMotion and templates can all use Data Domain storage. <p>In today's world, this simplicity and flexibility are critical forces which balance the underlying complexity of new technologies. Data Domain and VMware are like peanut butter and chocolate - both are great on their own, but together, they become greater than the sum of the parts. <p>Come visit us in booth #1302 at VMworld next week - the <a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Reese%27s_Peanut_Butter_Cup">RPBC</a> is on me.</p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>
<p></p>]]></content:encoded>
         <category>Virtualization</category>
      </item>
      <item>
         <title>Do We Really Need Post-Process Anymore?</title>
         <link>http://www.dedupematters.com/daniels_blog/2009/07/every-time-we-come-out-with-a-bigger-faster-system-i-get-excited-about-all-the-new-customers-who-will-benefit-from-deployin.html</link>
         <description>Every time we come out with a bigger, faster system, I get excited about all the new customers who will benefit from deploying our technology. This past Monday, Data Domain announced the DD880, our newest deduplication storage system. The full details of the release can be found here. This system presents about 70TB of usable capacity per controller, twice that of the DD690 and more than doubles the maximum throughput to 5.4TB/hr. A year ago, large enterprise customers were very excited by the DD690, and that system is now successfully deployed in many large datacenters. I imagine the DD880 will...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/daniels_blog/2009/07/every-time-we-come-out-with-a-bigger-faster-system-i-get-excited-about-all-the-new-customers-who-will-benefit-from-deployin.html</guid>
         <pubDate>Fri, 24 Jul 2009 10:31:52 -0700</pubDate>
         <content:encoded><![CDATA[<p>Every time we come out with a bigger, faster system, I get excited about all the new customers who will benefit from deploying our technology.&#0160; This past Monday, Data Domain announced the DD880, our newest deduplication storage system. The full details of the release can be found <a rel="nofollow" target="_blank" href="http://www.datadomain.com/news/press_rel_072009.html">here</a>.&#0160; This system presents about 70TB of usable capacity per controller, twice that of the DD690 and more than doubles the maximum throughput to 5.4TB/hr.&#0160; A year ago, large enterprise customers were very excited by the DD690, and that system is now successfully deployed in many large datacenters.&#0160; I imagine the DD880 will be equally well received.&#0160; </p>
<p>On the other hand, vendors with post-process deduplication solutions might be less enthusiastic.&#0160; After all, the fundamental premise of their marketing has been that post-process systems are faster and bigger.&#0160; In reality, the inline deduplication throughput of a single DD880 exceeds the speed at which they can ingest data in native form to a <a rel="nofollow" target="_blank" href="http://www.dedupematters.com/richs_blog/">2-node post-process system</a>. And the higher physical capacities they claim as a benefit?&#0160; They are but a side effect of the disk i/o and caching requirements of post-process deduplication – you can’t actually protect a larger dataset with all that space.&#0160; It’s kind of like a car salesperson pointing out the massive fuel tank in a gas guzzler as a competitive advantage, while disregarding the mileage it gets.&#0160; </p>
<p>A single DD880 can deduplicate 60TB inline, while concurrently replicating the deduplicated data, all within a 12 hour window.&#0160; As with all of our previous new product releases, performance and capacity increases are derived from upgrades to the processors and associated hardware in our systems.&#0160; This is the power of Data Domain SISL, our CPU-centric approach which enables fast inline deduplication using a minimum amount of disk.&#0160; In comparison to the DD880, for the same 60TB backup a post-process deduplication system would require at least 2-4x more physical disk and significantly more compute power in the form of complex multi-node configurations.&#0160; It just doesn’t make sense why our competitors would expect customers to do this.</p>
<p>In the end, this is about one thing – Data Domain has enabled a paradigm shift in disk storage for data protection and disaster recovery because once our technology is deployed, customers can save money.&#0160; The bottom line value of post-process deduplication, with its need for more of everything, will never match us.&#0160; Good luck to those vendors that decide to keep trying.&#0160;&#0160;&#0160; <br /></p>
<p><br /></p>]]></content:encoded>
      </item>
      <item>
         <title>How Much Speed Does a Large Enterprise Need?</title>
         <link>http://www.dedupematters.com/richs_blog/2009/07/how-much-speed-does-a-large-enterprise-need.html</link>
         <description>The storage industry is perpetually awash with a variety of performance measurements. These statistics, commonly referred to as “speeds and feeds,” are essential to help the enterprise technologist determine which solutions are suitable for a given storage workload. They allow for a semblance of apples-to-apples comparison across competing products, even if one must tweak the numbers to account for varying levels of vendor optimism. Protection storage (i.e. backup disk, tape, or other media) performance is typically measured in data transfer rates. These rates are commonly expressed in MB/sec or TB/hr. Both aggregate speed and single stream performance are important to...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/richs_blog/2009/07/how-much-speed-does-a-large-enterprise-need.html</guid>
         <pubDate>Fri, 24 Jul 2009 10:14:00 -0700</pubDate>
         <content:encoded><![CDATA[<p>The storage industry is perpetually awash with a variety of performance measurements.&#0160; These statistics, commonly referred to as “speeds and feeds,” are essential to help the enterprise technologist determine which solutions are suitable for a given storage workload.&#0160; They allow for a semblance of apples-to-apples comparison across competing products, even if one must tweak the numbers to account for varying levels of vendor optimism.</p>
<p>Protection storage (i.e. backup disk, tape, or other media) performance is typically measured in data transfer rates.&#0160; These rates are commonly expressed in MB/sec or TB/hr.&#0160; Both aggregate speed and single stream performance are important to the backup workload.&#0160; Aggregate speed is indicative of the big picture; given x amount of data to backup, what is the minimum possible time that the storage system will require to complete all backups?&#0160; Single stream performance is an important consideration as well, especially for high-speed backup clients such as large, powerful database servers.&#0160; Ideally, both aggregate speed and individual stream speed will be very fast.&#0160; But how fast is fast enough, especially for the enterprise?<br /></p>
<p>The answer as it turns out is not all that simple.&#0160; Storage vendors who compete on feeds and speeds have a tendency to focus the customer’s attention strictly on the characteristics of their own device, ignoring the fact that protection storage is a downstream element from what is often a very large and complex technology ecosystem.&#0160; Here are a few somewhat imperfect analogies that may help simplify and put things in perspective.<br /></p>
<p>Pitching a Baseball (single-stream):&#0160; When a pitcher throws a baseball it reaches its maximum velocity upon release.&#0160; The baseball never accelerates on the way to home plate.&#0160; The catcher’s mitt doesn’t exert pull on the baseball.&#0160; Therefore, if you equate the storage device with the catcher’s mitt, you clearly need a mitt that can handle a 100mph fastball, but is there a material difference between mitts that can handle 150mph versus 200mph?&#0160; Conversely, if the catcher’s mitt can only handle a 70mph pitch then your catcher will be calling for knuckleballs and changeups all night long.&#0160; In a similar fashion, backup data never speeds up after it leaves the client, yet the backup storage device must be able to handle the fastest streams possible.&#0160; Backup storage devices do not exert pull.<br /></p>
<p>Terminal Velocity or Backup to Null (aggregate):&#0160; Any collection of data residing in a data center has what I think of as a “terminal velocity.”&#0160; That’s the hypothetical transfer rate that would exist if all of the backup clients sent their data directly to a null device.&#0160; In other words, how fast can the existing backup clients (or other protocols and techniques) lift the data off of their provisioned storage and send it into an infinitely fast receptacle?&#0160; The goal here is to hypothetically remove network or fiber channel (i.e. transports), any other intermediary devices (backup servers), and of course the backup storage device itself.<br /></p>
<p>Given these two ideas, it should be fairly clear that a backup storage device cannot be deployed as an accelerator.&#0160; It need only be thought of as a potential bottleneck.&#0160; The goal then is to avoid having the backup storage device slow down the backups, which coincidentally is also the goal of the transports (network or SAN), and the intermediaries (backup servers.)&#0160; The further upstream you push the backup bottleneck, the closer you come to achieving terminal velocity in your backup environment, which is exactly what you want to do.&#0160; The more successful enterprise shops are the ones who intuitively understand these principles and architect complete solutions instead of comparing storage device speeds and feeds in a vacuum.<br /></p>
<p>So I still haven’t answered the question ‘how fast is fast enough for the enterprise?’&#0160; The answer lies in a compound question:&#0160; How fast must a storage solution be to complete backups within the time allotted *and* if you have such a solution, is the rest of the infrastructure up to the task?&#0160; Simply put, your backup storage should be fast enough to not be the bottleneck *unless* you are able to complete your backups comfortably within your backup window anyway, in which case you may not care.&#0160; At the highest level you can say that faster is always better, but only up to a certain point.<br />Now here’s how the performance question starts to get interesting with Data Domain and deduplication.&#0160; For years many vendors have been using speeds and feeds arguments, warranted or not, to position their solutions against Data Domain.&#0160; However, Data Domain Operating System 4.6 broke through an important inflection point in the evolution of deduplication.&#0160; As a result, the DD690 attained an amazing 750 MB/sec of aggregate throughput utilizing a very minimal number of disks.&#0160; This meant that for the first time it was almost as fast to backup straight to Data Domain’s deduplicated storage than it was to an “enterprise” post-process deduplication controller, or even to one of the many, non-deduplicating VTL systems.&#0160; <br /></p>
<p>With the introduction of Data Domain’s new flagship product, the DD880, the ante has doubled.&#0160; The DD880 sports an outstanding 1,500 MB/sec, or 5.4 TB/hr of aggregate throughput, placing it clearly on top of the leader board across all backup storage targets, dedupe and non-dedupe alike.&#0160; By comparison, a single Data Domain DD880 controller ingests data faster than a 2-node NetApp 1400 (post-process dedupe), 2-node HP 9000 (post-process dedupe), or 2-node IBM 7650G (inline dedupe) active-active cluster.<br /></p>
<p>Read this if you read nothing else:&#0160; Data Domain’s single-controller inline ingest speed is now faster than a 2-node active-active clustered post-process ingest speed.<br /></p>
<p>With the introduction of the DD880, speeds and feeds are no longer an inline versus post-process argument.&#0160; In fact, the sole argument that supported post-process deduplication has just been obliterated.&#0160; Speeds and feeds are no longer a dedupe versus non-dedupe argument either.&#0160; The DD880 is the new standard in backup storage performance spanning all categories.&#0160; Now we can get back to the task at hand and select a solution based on what is really important – the business requirements and the other technological realities of the enterprise.&#0160; We can put the speeds and feeds arguments to rest.&#0160; The DD880 wins in an overwhelming landslide.<br /><br /></p>]]></content:encoded>
      </item>
      <item>
         <title>Federal Forum with Data Domain Customers</title>
         <link>http://www.dedupematters.com/tonysblog/2009/06/federal-forum-with-data-domain-customers.html</link>
         <description>Last week I went down to Washington DC to present to a number of Federal customers on various industry topics. I asked them a bunch of questions and here are the highlights: - There was one deployment of FCoE and everyone else was highly interested in FCoE going forward. I mentioned the political challenges of FCoE between the LAN networking guys and the SAN networking guys -- and the guy who actually deployed it said that he found no issue. The lines are still there and remain - the SAN guys still manage FCoE. The others in the room were...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/tonysblog/2009/06/federal-forum-with-data-domain-customers.html</guid>
         <pubDate>Wed, 24 Jun 2009 11:35:36 -0700</pubDate>
         <content:encoded><![CDATA[<p>Last week I went down to Washington DC to present to a number of Federal customers on various industry topics. I asked them a bunch of questions and here are the highlights: </p>
<p>- There was one deployment of FCoE and everyone else was highly interested in FCoE going forward. I mentioned the political challenges of FCoE between the LAN networking guys and the SAN networking guys -- and the guy who actually deployed it said that he found no issue. The lines are still there and remain - the SAN guys still manage FCoE. The others in the room were still unsure how this was going to all play out. Again, no one in this group deployed iSCSI. I am a big proponent of iSCSI but FC still rules the SAN world in larger organizations. </p>
<p>- Everyone was using VMWare and no one had deployed or was overly interested in Hyper-V. This is consistent with other forums I have been involved in recently. </p>
<p>- There was some interest in SSD or EFD for Enterprise storage, but only one person has deployed. His take was that it was a waste of time and money. It did not give him the performance he was looking for. This is an important point. Storage performance is complicated and depending on the I/O profile, SSD/EFD may not be the answer you are looking for - it is not a panacea by any means. It would be great to remove a mechanical device from the data center - after all disk drives do three things: they read, write and break. But for the most part disk drives still provide the best price/performance and even with the advances in SSD/EFD, it still is only right for certain corner cases. </p>
<p>- There was no implementation of active archive storage with this group. There was a level of interest but they agreed that the challenges were identifying what to move, how to move it transparently, what to move it to and ensuring response times if that data is ever needed again. One person piped in saying that there was no single solution that allowed them to do database, email and file archiving without putting agents on their hosts. I suggested to go after the biggest problem first - which is unstructured data. That could save them a ton of money. There were nods in the room but you could tell that they weren't running out the door to make it happen. </p>
<p>- There were no plans for any of these folks to use a public cloud service within the next twelve months. And by the looks of it there wasn't any real interest beyond that timeframe at this point. There was only one company that was building their own private cloud. I wanted to pick their brains more about what they specifically were doing but they had to leave before the meet and greet session (I hope it wasn't my presentation that made them leave). </p>
<p>- Finally, it was good to see that close to half of the IT professionals had Green IT initiatives. Naturally IT folks have been focused on power, cooling and floor space for years and now a number of them were going green. While this is related it is different - having a green mandate goes beyond these three things (although it is a good place to start). </p>]]></content:encoded>
         <category>Data Centers</category>
      </item>
      <item>
         <title>D2D Backup + Active Archive</title>
         <link>http://www.dedupematters.com/tonysblog/2009/06/d2d-backup-active-archive-.html</link>
         <description>In my travels I have discussed the value of storage tiering and it is amazing how few companies have actually implemented this. Part of the challenge is identifying what to move and when. One way to easily address this is to just consider dormant data. Over the last few years I have presented to hundreds of IT professionals and the vast majority believe that more than 70% of their data is dormant 90 days after its creation. If this is the case, then focusing on dormant data after a 90 day period seems like a good place to start -...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/tonysblog/2009/06/d2d-backup-active-archive-.html</guid>
         <pubDate>Fri, 19 Jun 2009 11:21:17 -0700</pubDate>
         <content:encoded><![CDATA[<p>In my travels I have discussed the value of storage tiering and it is amazing how few companies have actually implemented this. Part of the challenge is identifying what to move and when. One way to easily address this is to just consider dormant data. Over the last few years I have presented to hundreds of IT professionals and the vast majority believe that&#0160;more than&#0160;70% of their data is dormant 90 days after its creation. If this is the case, then focusing on dormant data after a 90 day period seems like a good place to start - move this data from your primary storage systems to a lower tier. </p>
<p>The next challenge with storage tiering is identifying what solution you are going to use to move the dormant data from primary to archival storage. This process needs to be transparent to the users and applications. And it would be nice if it was easy for IT professionals to implement and there was a high level of performance. In the NAS world you can use file data movers or virtualization solutions. Data Domain partners with F5 and its Acopia solution. </p>
<p>The third and final macro issue is what platform should be used for the active archival storage. Data Domain is typically seen as a disk-to-disk backup solution but it should also be considered as an active archive as well. Remember that the Data Domain solution is a NAS system and therefore a mount point can be easily created and it can be used to store files via NFS and CIFS protocols. One of the main reasons to do this is to take advantage of its dedupe capabilities. Since this is not a backup data - the dedupe rates will not be as high but you should easily get a 2-to-1 and maybe as high as a 5-to-1 ratio. What makes this even more compelling and valuable is using Data Domain for both&#0160;D2D backup and as an active archive. It is this combination that presents a powerful value proposition. Consolidating these functions into a single Data Domain appliance further reduces cost and complexity. In addition to the direct CAPEX savings there is also the reduction of power, cooling and floor space. </p>
<p>There are companies that have saved significant capital costs by implemented tiered storage environments - even without dedupe. Imagine adding dedupe to the equation. Further,<span id="fck_dom_range_temp_1245431495028_251"></span> leveraging the same applicance that is being used for D2D backup and giving you 20-to-1 data dedupe. Not many people talk about providing a consolidated D2D backup and active archive solution, but it really does make a ton of sense if that single solution can support both with dedupe. </p>]]></content:encoded>
         <category>Deduplication</category>
      </item>
      <item>
         <title>Keeping It Clean</title>
         <link>http://www.dedupematters.com/daniels_blog/2009/06/keeping-it-clean.html</link>
         <description>Recently, we announced the ability to perform NIST/DOD compliant system sanitizing at a file level, as a new feature of our Retention Lock software option. With deduplication storage systems, where underlying data patterns are potentially shared among many logical files, there is a particular challenge around ensuring that contaminated data is appropriately removed. In many cases, an entire storage system would need to be destroyed because of a single piece of inappropriate data, particularly in secure or classified data environments. For these customers, this feature provides tremendous value, as individual files or entire systems can now be sanitized by a...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/daniels_blog/2009/06/keeping-it-clean.html</guid>
         <pubDate>Tue, 09 Jun 2009 15:19:10 -0700</pubDate>
         <content:encoded><![CDATA[<p><span style="FONT-SIZE:13pt;"><a rel="nofollow" target="_blank" href="http://www.youtube.com/watch?v=rC9sQdTmq2k&amp;feature=related" title="blocked::http://www.youtube.com/watch?v=rC9sQdTmq2k&amp;feature=related"></a>
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"><span style=""><font size="3">Recently, we </font><a rel="nofollow" target="_blank" href="http://www.datadomain.com/news/press_rel_051809.html" title="blocked::http://www.datadomain.com/news/press_rel_051809.html"><font size="3">announced</font></a>&#0160;<font size="3">the ability to perform NIST/DOD compliant system sanitizing at a file level, as a new feature of our </font><a rel="nofollow" target="_blank" href="http://www.datadomain.com/products/retention-lock.html" title="blocked::http://www.datadomain.com/products/retention-lock.html"><font size="3">Retention Lock</font></a>&#0160;<font size="3">software option.&#0160; With deduplication storage systems, where underlying data patterns are potentially shared among many logical files, there is a particular challenge around ensuring that contaminated data is appropriately removed.&#0160;In many cases, an entire storage system would need to be destroyed because of a single piece of inappropriate data, particularly in secure or classified data environments.&#0160;For these customers, this feature provides tremendous value, as individual files or entire systems can now be sanitized by a process that meets the standards dictated by the </font><a rel="nofollow" target="_blank" href="http://it.ouhsc.edu/policies/documents/infosecurity/DoD_5220.pdf" title="blocked::http://it.ouhsc.edu/policies/documents/infosecurity/DoD_5220.pdf"><font size="3">DoD Clearing and Sanitization Matrix</font></a><font size="3"> and the </font><a rel="nofollow" target="_blank" href="http://csrc.nist.gov/publications/nistpubs/800-88/NISTSP800-88_rev1.pdf" title="blocked::http://csrc.nist.gov/publications/nistpubs/800-88/NISTSP800-88_rev1.pdf"><font size="3">NIST Guidelines on Media Sanitization</font></a><font size="3">, while the “clean” data remains online and available for access.</font></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"><span style=""><font size="3">&#0160;</font></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"><span style=""><font size="3">Data Domain is the first to deliver secure shredding for an inline deduplication system on a per-file basis.&#0160;It is not glamorous nor does it get the blogosphere all in a-tizzy.&#0160;But blogs don’t sell products or decide what matters - our incentive to deliver this feature is driven directly by customers telling us they need it.Having a system that works well and solves customers’ problems is what makes us the leader.&#0160;&#0160;&#0160; </font></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"><span style=""><font size="3">&#0160;</font></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"><span style=""><font size="3">I have always felt that Data Domain was about much more than deduplication – we have been speaking to our customers about, and delivering, our efficient and intelligent storage platform for years. Let’s set aside the arguments about inline vs. post-process for backup throughput or time-to-DR for a moment. The fact is, by segmenting and identifying every bit of data as soon as it enters the system we have knowledge about that data which enables many innovative capabilities. We started with </font><a rel="nofollow" target="_blank" href="http://www.datadomain.com/get-inline/" title="blocked::http://www.datadomain.com/get-inline/"><font size="3">inline deduplication</font></a><font size="3"> and </font><a rel="nofollow" target="_blank" href="http://www.datadomain.com/products/DIA.html" title="blocked::http://www.datadomain.com/products/DIA.html"><font size="3">data invulnerability</font></a><font size="3"> then added </font><a rel="nofollow" target="_blank" href="http://www.datadomain.com/products/replicator.html" title="blocked::http://www.datadomain.com/products/replicator.html"><font size="3">WAN-efficient replication</font></a><font size="3"> to successfully address the challenges our customers were having with physical tape and traditional disk-based backup systems. Over time, we have followed through on the promise of </font><a rel="nofollow" target="_blank" href="http://www.datadomain.com/products/SISL.html" title="blocked::http://www.datadomain.com/products/SISL.html"><font size="3">SISL</font></a><font size="3">&#0160;to create bigger and faster systems. And we continue to add value for our customers with features like deduped snapshots, enforced retention, and now secure shredding.</font></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"><span style=""><font size="3">&#0160;</font></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"><span style=""><font size="3">To quote Steve Winwood from a favorite Traffic song of mine:</font></span><span style=""><font size="3">&#0160;</font></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 12pt;"><span style=""><a rel="nofollow" target="_blank" href="http://www.youtube.com/watch?v=rC9sQdTmq2k&amp;feature=related" title="blocked::http://www.youtube.com/watch?v=rC9sQdTmq2k&amp;feature=related"><font size="3">“Who knows what tomorrow may bring?”</font></a></span></p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"></p></span> &#0160;
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;">&#0160;</p>]]></content:encoded>
         <category>Data Domain products</category>
      </item>
      <item>
         <title>Technology + Product + Business Excellence</title>
         <link>http://www.dedupematters.com/tonysblog/2009/06/technology-product-business-excellence.html</link>
         <description>There is some discussion around the notion of data dedupe as a feature or a technology and that it will some day be pervasive. People having been saying this for more than five years and it is still not a reality. There are actually only a few vendors that actually have successful solutions in the market supporting this valuable but complex technology. The fact of the matter is that dedupe is complicated - creating dedupe algorithms are only a part of the challenge. Data integrity, performance and scalability are key considerations. If dedupe was easy then it would be pervasive...</description>
         <guid isPermaLink="false">http://www.dedupematters.com/tonysblog/2009/06/technology-product-business-excellence.html</guid>
         <pubDate>Tue, 09 Jun 2009 15:10:47 -0700</pubDate>
         <content:encoded><![CDATA[<p><font face="Times New Roman">
<p style="MARGIN:0in 0in 0pt;"><span style="FONT-SIZE:14px;MARGIN:0in 0in 0pt;FONT-FAMILY:Times New Roman;"></span></p>
<p>
<p style="MARGIN:0in 0in 0pt;"></p></font>
<p>
<p style="MARGIN:0in 0in 0pt;"></p>
<p><font face="Times New Roman" size="3">
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;">There is some discussion around the notion of data dedupe as a feature or a technology and that it will some day be pervasive.&#0160;People having been saying this for&#0160;more than&#0160;five years and it is still not a reality.&#0160;There are actually only a few vendors that actually have successful solutions in the market supporting this valuable but complex technology.&#0160;The fact of the matter is that dedupe is complicated - creating dedupe algorithms are only a part of the challenge. Data integrity, performance and scalability are key considerations.&#0160;If dedupe was easy then it would be pervasive and it is not. &#0160;</p>
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"> &#0160;</p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;">It is then critical to manifest dedupe in a product that has been designed from the ground up to support the specific requirements for its use.&#0160;In the case of Data Domain special attention was paid to the fact that dedupe is CPU-intensive. Additionally, they chose to support NAS for ease of use and transparency.&#0160;They also focused on the file system and adding technology that ensured data integrity.&#0160; There are dozens and perhaps hundreds of big and small considerations to be found in this process.&#0160; There are other things to consider as well that live outside of the appliance. In speaking with a number of Data Domain customers I've heard more than once how good their documentation is.&#0160;I have rarely heard this type of praise for any IT vendor. &#0160;</p>
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"> &#0160;</p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;">Business execution is critical to the process that includes a number of things.&#0160;It involves putting together a strong team, a powerful and consumable message, an effective and scalable go-to-market strategy, a strong sales and support organization, etc, etc.&#0160;It is also important that there be strong business fundamentals - meaning that having a product that makes a really good profit for those who sell it while at the same time saves money for those who buy it.&#0160;And customers should care about this as well because they want healthy vendors to work with that will continue to support, innovate and execute for the long term. &#0160;</p>
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;"> &#0160;</p> 
<p class="MsoNormal" style="MARGIN:0in 0in 0pt;">It is the combination of Technology + Product + Business execution that results in leadership.&#0160;The people that discuss dedupe as just a "feature" or a "technology"&#0160;I believe&#0160;miss the point - as evidenced by the success of Data Domain.</p>
<p style="MARGIN:0in 0in 0pt;"></p></font>
<p style="MARGIN:0in 0in 0pt;"></p>]]></content:encoded>
         <category>Dedupe Market</category>
      </item>
   </channel>
</rss><!-- fe6.pipes.re3.yahoo.com uncompressed/chunked Wed Nov 11 09:05:01 PST 2009 -->
