<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Cluster Connection » Featured</title>
	
	<link>http://www.clusterconnection.com</link>
	<description>Simplify HPC. Share the knowledge.</description>
	<lastBuildDate>Thu, 02 Aug 2012 21:59:21 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.2</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/ClusterConnectionFeatured" /><feedburner:info uri="clusterconnectionfeatured" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>HPC Cluster Architecture in the Cloud</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/BYA2xFXVTJA/</link>
		<comments>http://www.clusterconnection.com/2011/01/hpc-cluster-architecture-in-the-cloud/#comments</comments>
		<pubDate>Mon, 24 Jan 2011 19:09:46 +0000</pubDate>
		<dc:creator>Brock Taylor</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Amazon EC2]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloud bursting]]></category>
		<category><![CDATA[high-performance computing in the cloud]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[HPC Architecture]]></category>
		<category><![CDATA[insideHPC]]></category>
		<category><![CDATA[Intel Cluster Ready]]></category>
		<category><![CDATA[Intel Xeon processors]]></category>
		<category><![CDATA[Nehalem]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/2011/01/hpc-cluster-architecture-in-the-cloud/</guid>
		<description><![CDATA[Intel Cluster Ready architecture applies to the cloud too.]]></description>
			<content:encoded><![CDATA[<p>Just watched the <a href="http://insidehpc.com/2011/01/21/video-building-an-hpc-cluster-on-the-aws-cloud/?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+InsideHPC+%28insideHPC.com%29">training video on insideHPC</a> about the ease of setting up an HPC cluster on Amazon EC2.  The video shows how to get a cluster of servers with Intel "Nehalem" processors up and running in the cloud.</p>
<p><strong>Seamless Transitions...</strong></p>
<p>I'd add that the <a href="http://software.intel.com/en-us/articles/intel-cluster-ready-program-for-customers/" target="_blank">Intel Cluster Ready architecture</a> applies in the cloud as well.  In fact, using cloud images based on Intel Cluster Ready architecture allows more seamless transitions between clusters running in the cloud to physical clusters running locally.  As the video points out, cloud provides an entry point into using HPC or even a try-before-buy approach to HPC clusters.  Cloud bursting benefits as well, since the application that runs on the architecture locally would see the same architecture in the cloud.</p>
<p>The common architecture enables the applications to better understand and execute on top of the computing environment without being tied to the exact details of the solution.  In other words, you can get better mileage with a common architecture in both the cloud and local clusters.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/BYA2xFXVTJA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2011/01/hpc-cluster-architecture-in-the-cloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2011/01/hpc-cluster-architecture-in-the-cloud/</feedburner:origLink></item>
		<item>
		<title>Major Release: Intel® Cluster Checker 1.5</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/dug_UT1KNqM/</link>
		<comments>http://www.clusterconnection.com/2010/04/major-release-intel-cluster-checker-15/#comments</comments>
		<pubDate>Wed, 14 Apr 2010 04:39:10 +0000</pubDate>
		<dc:creator>Cluster Ready</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Press Releases]]></category>
		<category><![CDATA[Xeon]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/2010/04/major-release-intel-cluster-checker-15/</guid>
		<description><![CDATA[Enjoy the power of Intel's latest processors faster and more easily. This major release of Intel Cluster Checker 1.5 supports the latest Intel® products for high-performance computing, including the Intel® Xeon® processor 5600 and 7500 series and takes advantages of the latest versions of Intel® HPC tools. You'll find improvements to speed the design, certification, [...]]]></description>
			<content:encoded><![CDATA[<p>Enjoy the power of Intel's latest processors faster and more easily. This major release of Intel Cluster Checker 1.5 supports the latest Intel® products for high-performance computing, including the Intel® Xeon® processor 5600 and 7500 series and takes advantages of the latest versions of Intel® HPC tools. You'll find improvements to speed the design, certification, and production of copies of certified designs, plus a wide range of enhancements.<a href="http://software.intel.com/en-us/articles/new-features-in-intel-cluster-checker-14/"> Check out the full list of new features</a>, or <a href="http://software.intel.com/en-us/articles/intel-cluster-checker/">learn more about Intel Cluster Checker</a>.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/dug_UT1KNqM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2010/04/major-release-intel-cluster-checker-15/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2010/04/major-release-intel-cluster-checker-15/</feedburner:origLink></item>
		<item>
		<title>Does More Cores Mean Less Nodes?</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/vf-NbNuEE5w/</link>
		<comments>http://www.clusterconnection.com/2009/10/does-more-cores-mean-less-nodes/#comments</comments>
		<pubDate>Fri, 09 Oct 2009 18:19:06 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[Processor Cores]]></category>
		<category><![CDATA[revenue]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1479</guid>
		<description><![CDATA[Packing cores into a node means less servers are needed, but the market is still growing. Ever since the shift to multi-core processors began, I have always had a nagging question - Does more cores mean less nodes? I have wrestled with this question and finally realized that there is no simple answer. I should [...]]]></description>
			<content:encoded><![CDATA[<p><em>Packing cores into a node means less servers are needed, but the market is still growing.</em></p>
<p>Ever since the shift to multi-core processors began, I have always had a nagging question - <em>Does more cores mean less nodes?</em> I have wrestled with this question and finally realized that there is no simple answer. I should preface this post by stating that I am talking about HPC and not the general market place where multi-core and virtualization are all the rage.</p>
<p>To understand why I ask this, consider that back when Linux clustering began, the single core Pentium Pro and Dec Alpha where the two processors of choice. Large clusters were maybe 64 or 128 nodes (often tower cases), which translated into 64 or 128 cores. Today you can easily pack 128 cores into 16 nodes - not even a full rack chassis. Given this trend, is the node count for clusters getting smaller?</p>
<p>From my anecdotal evidence, I seem to notice several trends. First, the HPC market, after a downturn, seems to be growing again and is projected to reach $11.7 billion by 2012 (<a href="http://www.hpcwire.com/topic/systems/IDC-HPC-Will-Resume-Growth-After-Dipping-in-2009-38620187.html">IDC</a>). This revenue is up from a $9.6 billion figure for 2008. Thus, the hunger for nodes is increasing and not deceasing, which begs a further question. <em>Are node counts increasing or are more people buying clusters?</em> (i.e. instead of a few people buying larger clusters, are there a lot of people buying smaller manageable clusters.) For the marketing types out there, maybe you know the answer. If not, that is a good question to ask. Leave a comment and give us a clue.</p>
<p>Second, from my experience there is plenty of HPC work to go around. Whether nodes are in a large data center cluster or in a small local blade system, it seems the cores are busy. Perhaps we are seeing the rise of the <em>Closet Cluster? </em></p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/vf-NbNuEE5w" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/does-more-cores-mean-less-nodes/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/10/does-more-cores-mean-less-nodes/</feedburner:origLink></item>
		<item>
		<title>Clustering in the Cloud</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/HN-9Cd446bA/</link>
		<comments>http://www.clusterconnection.com/2009/10/clustering-in-the-cloud/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 22:45:49 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[grid]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[InfiniBand]]></category>
		<category><![CDATA[Top500]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/2009/10/clustering-in-the-cloud/</guid>
		<description><![CDATA[Are clouds a good place to do build HPC Clusters? The use of virtualization and multi-core processors has made cloud computing an option for many users. The ability to buy cloud time as you need it and not purchase hardware is certainly attractive from a financial standpoint. The concept is not new and has its [...]]]></description>
			<content:encoded><![CDATA[<p><em>Are clouds a good place to do build HPC Clusters?</em></p>
<p>The use of virtualization and multi-core processors has made cloud computing an option for many users. The ability to buy <em>cloud</em> time as you need it and not purchase hardware is certainly attractive from a financial standpoint. The concept is not new and has its roots in time shared mainframes and grid computing. One might assume the the vast amount of computing resources in clouds may make them ideal candidates for HPC clustering. Unfortunately, it is not as simple as collecting cores.</p>
<p>One of the issues facing clouds is I/O. Basically, I/O is often not predictable or repeatable. From a storage standpoint read and write times can be fast, but not always fast. In terms of messages between servers, most clouds do not support high performance interconnects and similarly make no guarantees as to latency or bandwidth consistency.  While grids paid attention to certain HPC performance guarantees in terms of I/O, clouds, in order to offer ease of use, have declined such guarantees. Unless a cloud has been specifically designed for HPC, the user cannot expect consistent and/or high performance. There are two papers which discuss this very idea. The first paper looks at <a href="http://www.usenix.org/publications/login/2008-10/openpdfs/walker.pdf">Benchmarking Amazon EC2 for High-performance Scientific Computing</a> and the second paper asks, <a href="http://www.cs.utexas.edu/users/pauldj/pubs/uchpc09.pdf">Can Cloud Computing Reach The TOP500?</a>. Both papers conclude that the cloud is not mature enough for HPC applications.</p>
<p>The limitations of the cloud become more apparent when one looks a little deeper at HPC applications. First, many applications rely on <em>user space</em> communication (i.e. high performance MPI programs transfer data directly from one node to another without using kernel services.) Such a <em>close to the wire</em> operation runs counter to the virtualization model. Secondly, as reported in the first paper (above), the performance of OpenMP applications was reduced by 7-21% when running in the EC2 cloud.</p>
<p>Recently Penguin Computing began offering POD (Penguin on Demand) for HPC cloud computing. The POD cloud offers both Ethernet and InfiniBand connections between nodes thus providing a dedicated high performance computing environment. This service can be considered a specialized HPC cloud.</p>
<p>There are some other other important issues to consider with cloud computing -- security and reliability. When data leaves your domain over the Internet it is virtually impossible to guarantee 100% security. If your organization can live with this situation, using the cloud may be an option. If on the other hand, you need to keep a tight reign on your data, then you may not want to be injecting it into the cloud. The other issue is reliability. If your day to day operations are based on using a cloud, then a contingency plan is a must. Interruptions in Internet traffic due to congestion or hardware failures can be common in some areas. In addition, the cloud provider may have issues (even go out of business) and thus not meet the service requirements.</p>
<p>I believe the cloud is an interesting model, but it is not a real solution for HPC (in its current form). My issue with clouds is that they are often categorized as "grid like" and then are somehow (incorrectly) considered "HPC like." Cloud offers utility computing like grid promised, but has pushed the application layer further away from the hardware. HPC practitioners spend a lot of time making sure the application is as close to the hardware as possible. At this point in time, HPC in the cloud is more of a curiosity than a solution. When examining HPC benchmarks it becomes clear that clouds are not the best means to provide HPC cycles. Whether efforts like POD can meet the HPC users needs in the cloud is still unknown.</p>
<p>To be fair, there are some HPC applications that lend themselves to clouds quite well. (i.e. those that do not require predictable I/O)  <a href="http://folding.stanford.edu/">Folding@home</a> and <a href="http://setiathome.berkeley.edu/">Seti@home</a> are two good examples. These applications could easily run in a cloud (in a sense they do run in the Internet cloud). Keep in mind they have been designed to work in a robust distributed fashion and are not virtualized. Clouds can be enticing and even enabling for some applications, but remember a collection of servers (in the cloud or in a rack) does not a cluster make.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/HN-9Cd446bA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/clustering-in-the-cloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/10/clustering-in-the-cloud/</feedburner:origLink></item>
		<item>
		<title>Support, Why Do I Need Cluster Support?</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/uyIfNLrlQlo/</link>
		<comments>http://www.clusterconnection.com/2009/10/support-why-do-i-need-cluster-support/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 22:45:30 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Beowulf Mailing List]]></category>
		<category><![CDATA[bug fixes]]></category>
		<category><![CDATA[dependencies]]></category>
		<category><![CDATA[ICR]]></category>
		<category><![CDATA[support]]></category>
		<category><![CDATA[updates]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1536</guid>
		<description><![CDATA[Supporting a successful HPC cluster takes time and money, take your pick Many of the HPC people I know are what you would call rugged individualists. The have been around since the beginning and were responsible for moving the market/community along when commodity HPC was less than fashionable. This group consists mostly of developers, implementers, [...]]]></description>
			<content:encoded><![CDATA[<p><em>Supporting a successful HPC cluster takes time and money, take your pick</em></p>
<p>Many of the HPC people I know are what you would call <em>rugged individualists</em>. The have been around since the beginning and were responsible for moving the market/community along when commodity HPC was less than fashionable. This group consists mostly of developers, implementers, and administrators. Many of these people developed, by way of discussion on the <a href="http://www.beowulf.org/mailman/listinfo/beowulf">Beowulf Mailing List</a>, the best practices used today. The Beowulf Mailing list is a true resource if there ever was one. A <em>newbie</em> can ask a question and get polite (and lengthy) answers by list members. The list holds a large amount of open community knowledge because all the HPC plumbing is open source. Unhindered discussions can take place at any level between any number of people.</p>
<p>There is also the false notion that open source software is "free as in beer." This idea is not quite true because software unlike toasters have a usage cost. Once you install, configure, and study any software you have already made an "investment" in the package. Continued use furthers this investment. The size of the investment is up to you. And, because the software is open, in theory you can fix any problem. Thus, you have the choice to decide how much time you can invest in a particular software package before it becomes "expensive" to you. At some point, the cost (or dilution) of your time may come into play. Spending weeks fine tuning a single application at the expense of other responsibilities is probably not going to play out very well.</p>
<p>In terms of support responsibilities, many clusters have minimal issues once they are configured correctly. There are, of course, hardware failures, but in general, once everything is booted, things often work quite well. There are two areas that need attention, however. The first is software updates. Updates are needed for several reasons including security, bug fix, or feature updates. These types of updates are usually easy to manage unless they have dependencies, which means there may be a whole raft of packages that need updating. If you don't get the dependencies right, then there can be problems with the entire cluster.</p>
<p>The other issue is local integration. This is what I consider the "last mile problem" for clusters. Very often local file system issues need to be worked out and managed in addition to creating job submission policies. There is usually some end-user assistance needed as well as questions on how to compile and submit jobs to the queue. Of course, "Why is my job sitting in the queue?" is probably the question that gets asked the most.</p>
<p>If your job includes time for installing, integrating, and updating software and you happen to be one of those rugged individualist HPC people, then you probably have no interest in professional support. If on the other hand, you are new to clustering (or Linux) and already have many responsibilities, then you may want to consider using professional support services. As with all open source software, the choice is yours. In terms of commercial software, or commercial support of open software, there are many options. In any case, purchasing support for business critical applications is always a good idea. As is the use of a <a href="http://software.intel.com/en-us/cluster-ready/">Intel Cluster Ready</a> (ICR) solution. By adhering to the ICR specification support is much easier -- for you and/or a vendor. That is, a reference platform allows you and your vendors to <em>work from the same page</em>. Without a common framework, support vendors and others in your organization may have to decipher/debug how you configured the cluster.</p>
<p>In conclusion, cluster support can be commercial or it can be institutional. In either case, there is a cost. If you do it on your own, it will cost time and if you hire a consultant or company, it will cost money. To supplement either effort, there is a large amount of information on the web that can be useful when identifying and solving problems. Support is an important part of any successful HPC cluster,  just ask the old-timers. They figured it out, so you don't have to.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/uyIfNLrlQlo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/support-why-do-i-need-cluster-support/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/10/support-why-do-i-need-cluster-support/</feedburner:origLink></item>
		<item>
		<title>The Intel Cluster Checker Experience</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/B103ADu3JGU/</link>
		<comments>http://www.clusterconnection.com/2009/10/the-intel-cluster-checker-experience/#comments</comments>
		<pubDate>Fri, 02 Oct 2009 20:40:51 +0000</pubDate>
		<dc:creator>Thomas Gebert</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[ICR]]></category>
		<category><![CDATA[Intel Cluster Checker]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1587</guid>
		<description><![CDATA[Well yes, this is another "cheers to the Intel Cluster Checker" blog and there must be reasons out there why people are writing about the Intel Cluster Ready program. Yes, there are reasons... I think I do not have to state that setting up an HPC cluster sometimes is an adventure of compiling and installing [...]]]></description>
			<content:encoded><![CDATA[<p>Well yes, this is another "cheers to the Intel Cluster Checker" blog and there must be reasons out there why people are writing about the Intel Cluster Ready program. Yes, there are reasons...</p>
<p>I think I do not have to state that setting up an HPC cluster sometimes is an adventure of compiling and installing different program versions and libraries. Nevertheless, when having all up and running, the Intel Cluster Checker is a really nice tool to verify that all services needed for your HPC cluster are set up correctly and you haven't forgotten one of those tiny details during the installation of your cluster.</p>
<p>My last experience with the Intel Cluster Checker and the newly installed HPC cluster were a bit different from the ones before. The first steps ran smoothly and everything seemed to work fine, with me having learned from my previous first Intel Cluster Checker experience. The tests were successful, I altered the benchmark thresholds and all tests ended with a "passed". But in the end, when checking the dmidecode output I got a "failed" on this test. First I thought this was due to some BIOS specific mismatches, which you can exclude. But when I had a closer look on the output file of the Intel Cluster Checker I saw that there seemed to be installed different RAM modules on the compute nodes.  Hmm...the sizes of the memories looked fine, all were 2GB and I double checked the product numbers of the DIMMs that the Intel Cluster Checker reported. Here we got two different types of memories installed. I was astonished on the one hand as I would not have checked this without the Intel Cluster Checker and on the other hand I hardly could believe that there were two different types of memory DIMMs installed. Well, the servers I had installed were Intel Nehalem based and I remembered the days when AMD started with the CPU built-in memory controller and the problems with memories that arose during those times...</p>
<p>I took a closer look at the compute nodes and opened the chassis of  those which were affected. And indeed, there were different types of memory modules built in, but they nearly looked the same and also had the same product number written down on them. After some investigation with our purchasing department I found out that those memory types have been mixed up. Unfortunately it was not possible to pin down the real cause for this confusion.</p>
<p>The DIMMs were swapped to the correct ones in the end and I ran the Intel Cluster Checker again. This time all tests were passed and the results were sent to Intel to verify the Cluster Ready Certificate, which now proudly resides beside the other ICRs we have scored.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/B103ADu3JGU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/the-intel-cluster-checker-experience/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/10/the-intel-cluster-checker-experience/</feedburner:origLink></item>
		<item>
		<title>The Skinny on Solid State Disks</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/rQkESfrzIMA/</link>
		<comments>http://www.clusterconnection.com/2009/09/the-skinny-on-solid-state-disks/#comments</comments>
		<pubDate>Tue, 22 Sep 2009 22:54:20 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[IOPS]]></category>
		<category><![CDATA[JEDEC]]></category>
		<category><![CDATA[NAND flash]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[Storage]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1475</guid>
		<description><![CDATA[What you need to know about the latest trend in storage hardware The Solid State Drive (SSD) has had a break-out year. Unlike traditional mechanical hard disk drives that use spinning platters with movable read/write heads, SSDs have no moving parts. The SSDs is made entirely out of a special type of flash memory -- [...]]]></description>
			<content:encoded><![CDATA[<p><em>What you need to know about the latest trend in storage hardware</em></p>
<p>The Solid State Drive (SSD) has had a break-out year. Unlike traditional mechanical hard disk drives that use spinning platters with movable read/write heads, SSDs have no moving parts.  The SSDs is made entirely out of a special type of flash memory -- the same kind of NAND flash memory found in thumb-drives and memory sticks. Overall, SSDs are faster, quieter, more energy efficient, but less dense than the traditional spinning platter drive.</p>
<p>The staple of the storage industry has been the mechanical drives, where I/O rates are limited to the mechanical properties of a drive. Unlike other semiconductor trends, the sustained write rate has barely doubled from 50 to 90 MB/second over the past 8-10 years. All this is about to change as the use of flash memory will allow stroage to take advantage of a semiconductor growth curve similar to that of processors and memory. (i.e. mechanical drives are limited by the physical motion of spinning disks).</p>
<p>Perhaps the most important feature offered by SSDs is the read and write performance. The IOPS (I/Os per second) rate for an SSD is usually two to five times that of a traditional mechanical hard drive. When reading, performance is mostly constant because the seek time is virtually instantaneous and does not depend on the physical location of the data on a platter. As a result, file fragmentation has almost no impact on read performance. In addition, because there are no moving parts, SSDs use as little as one-fifth the power of a mechanical drive. Another interesting feature is the SSD failure mode. Most SSD failures tend to happen when writing. In contrast, mechanical drives tend to have most failures when reading. Thus, once data is written, it is more likely it can be read from a failed SSD.</p>
<p>SSDs do suffer from “degradation” over time that results in reduced performance and limited lifetimes (i.e. there are a limited amount of read/write cycles avaliable for NAND memory). Vendors have taken this into account and include <em>wear leveling</em> algorithms in SSDs that spread write access evenly over the entire device.</p>
<p>If you are interested in the exploring SSDs, there are some key points to consider. First, SSDs are not the best solution in every case. Currently, their capacity is much less than that of traditional mechanical drives and as such may not be suitable for some of the large HPC data sets. In addition, the cost is per MB is higher and are more susceptible to data loss from energy and power surges.</p>
<p>Second, the very fast read times offered by SSDs has made them a good candidate for improving OLTP (Online Transaction Processing) systems where frequently read tables and indexes can be accelerated. Check both the read and write IOPS, as there is usually a big difference between these values. Typically, the read speed is 10 times the write speed resulting in asymmetric performance. In terms of clusters, using SSDs for NFS mounts or read-only data may be helpful.</p>
<p>Third, because SSDs are new, questions still remain about how much of that speed they can deliver for the long haul (due to degradation). Typically, an SSD will show an initial decrease in performance and then level off. Even with a performance drop over time, SSD drives are almost always faster than traditional hard drives. The JEDEC standards organization plans to publish two standards by the end of this year for SSD endurance metrics.</p>
<p>Finally, pay particular attention to “write endurance,” this number should pertain to random writes. For instance, an Intel® X25-E Extreme 64 GB SATA Solid-State Drive is rated for 2 petabytes of lifetime random writes.</p>
<p>In terms of software, one difficulty facing the industry are the legacy assumptions built into file systems. These assumptions will need to be challenged in order to take advantage of SSD technology. For instance user applications and file systems will need to account for the asymmetric read/write performance of SSDs. Many computer applications rely on synchronous patterns of read/write operations, wherein a given write or update must be completed and the write confirmed before additional application read requests can be issued. With SSDs this process may need to be reconsidered.</p>
<p>There is no doubt that SSDs are the future of storage. Indeed, SSDs are even changing the way we compute. For example, CAE (Computer Aided Engineering) applications can use the speed advantage of SSDs in their out-of-core algorithms. The power of semiconductor manufacturing technology combined the speed of NAND flash memory are about to make the storage market stand still!</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/rQkESfrzIMA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/09/the-skinny-on-solid-state-disks/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/09/the-skinny-on-solid-state-disks/</feedburner:origLink></item>
		<item>
		<title>Future File Systems: Btrfs and ZFS</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/05jVOPSDAA0/</link>
		<comments>http://www.clusterconnection.com/2009/09/future-file-systems-btrfs-and-zfs/#comments</comments>
		<pubDate>Sat, 19 Sep 2009 02:51:34 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Btrfs]]></category>
		<category><![CDATA[File Systems]]></category>
		<category><![CDATA[linux]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ZFS]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1421</guid>
		<description><![CDATA[The prediction is in. What file system will move us into the future? The thirst for better file system technology is not new to the Unix/Linux world. There is a rich history of trying to optimize the balance between the storage system and the process improvements in a demanding user environment. The efforts to build [...]]]></description>
			<content:encoded><![CDATA[<p><em>The prediction is in. What file system will move us into the future?</em></p>
<p>The thirst for better file system technology is not new to the Unix/Linux world. There is a rich history of trying to optimize the balance between the storage system and the process improvements in a demanding user environment. The efforts to build a better file system are quite numerous and are built on the efforts of many people. For example Kirk McKusicks’ original Berkeley Fast File System improved on the original V7 release. Steven Tweedie’s ext3 took ideas from the database logs and Margo Seltzer’s LFS and added them to Linux’s implementation of UFS -- Ted Tsao’s ext2.  In the mean time, DEC released Megasafe, SGI released XFS and Sun released ZFS all to the wild. And now Oracle has developed Btrfs for Linux.</p>
<p>So why do we as users care?  There better be a good reason to change a file system because a new file system usually means converting to and trusting a new format with your data. Thus, any new format or change must provide a compelling reason or solve a big problem. Otherwise what is "good enough and works" is often better than that which is "new and fancy."</p>
<p>If you follow the details of file systems development, then this quick update may not be of interest to you. For the rest of us, who just choose whatever file system the installer offers, you may want to read further because changes are afoot.</p>
<p>If you are like me, you probably are running Linux with the ext3 file system. There is nothing wrong with ext3 as it is stable, robust, and a standard Linux file systems. And, one other thing, it is old. Even if you are running the newer ext4, you are still running a 30-year old file format that is more than a little short on features.</p>
<p>There are those that believe ext4 is going to be the end of the line and a switch over to Btrfs is very likely. Btrfs (pronounced "butter-F-S") is being developed by Chris Mason at Oracle. It is an open source project that has recently been added to the Linux kernel (as of 2.6.29) as experimental code.</p>
<p>Btrfs is based on several new ideas including b-trees (binary trees, which is where the <em>btr</em> comes from in Btrfs) and "copy-on-write" or COW. While, I won't go into the details, b-trees and COW allow for some new features that would be difficult in the ext* line of file systems. (If you want to learn more about the technical details of Btrfs, see  <a href="http://lwn.net/Articles/342892/">A short history of btrfs</a> on lwn.net.) Some of the new features include file-system snapshots, check-summing, online defragmentation, compression, extents, resizing, and more. In particular, Btrfs allows one thing that has been difficult to achieve in the past -- optimizing both access time and disk space.</p>
<p>The fact that Oracle sponsors Btrfs has lead to some concern. Recently, Oracle purchased Sun Microsystems which has been developing the ZFS file system for many years. ZFS is similar to Btrfs (it uses COW) and provides many of the same features, but it is very different in its internal implementation. ZFS will also "run" under Linux using <a href="http://zfs-on-fuse.blogspot.com/">Fuse</a>. Mason and other have assured the community that Btrfs is important to Oracle and they will continue development. In addition, the open source nature of Btrfs ensures that it cannot be "taken away" now or in the future.</p>
<p>There is plenty more to consider and I suggest reading <a href="http://www.linux-mag.com/id/7308/">Linux Don't Need No Stinkin' ZFS: BTRFS Intro &amp; Benchmarks</a> by my friend Jeff Layton. The consensus seems to be that Btrfs is destined to become the default Linux file systems within two years. ZFS on the other hand must overcome some licensing issues before it can even make it into the Linux kernel for testing. Your next Linux install may offer a new and better (or "btr") file system than in the past.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/05jVOPSDAA0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/09/future-file-systems-btrfs-and-zfs/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/09/future-file-systems-btrfs-and-zfs/</feedburner:origLink></item>
		<item>
		<title>Computing To Compete</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/MA0G9svOhQ0/</link>
		<comments>http://www.clusterconnection.com/2009/09/computing-to-compete/#comments</comments>
		<pubDate>Sat, 19 Sep 2009 02:47:08 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[aerodynamics]]></category>
		<category><![CDATA[Council on Competitiveness]]></category>
		<category><![CDATA[industry]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1522</guid>
		<description><![CDATA[The Optimized Mudflap And Other HPC Success Stories Welcome to the world of industrial HPC. Today, we will consider a small part of those large trucks that spend most of their day crisscrossing our highways. At highway speeds, anything that moves through the air has an aerodynamic cost. Pushing a big box takes more energy [...]]]></description>
			<content:encoded><![CDATA[<p><em>The Optimized Mudflap And Other HPC Success Stories</em></p>
<p>Welcome to the world of industrial HPC. Today, we will consider a small part of those large trucks that spend most of their day crisscrossing our highways.  At highway speeds, anything that moves through the air has an aerodynamic cost. Pushing a big box takes more energy than a round ball, which is why better aerodynamics means less energy and lower costs. Almost all trucks have some kind of mudflaps to prevent road dirt and debris from hitting the truck. Midsized truck maker Kenworth wondered how much it costs to move those mudflaps though the air. To answer the question, they turned to HPC where they were able to determine  trimming and tapering the mudflaps can cut about $400 from a typical trucks annual gas bill. This amount adds up quickly when you have a fleet of 1000 trucks. And, based on the mudflap success, Kenworth has started using HPC to help increase the efficiency of their truck designs, thus saving customers even more money.  You can read more about these efforts in <a href="http://money.cnn.com/2009/02/19/technology/fortt_kenworth.fortune/index.htm">Heavy-duty Computing</a> from Fortune Magazine.</p>
<p>If mudflaps don't pique your interest, but saving money with HPC does, then you may be interested learn more about <a href="http://www.compete.org/">The Council on Competitiveness</a> (CoC). Who or what is the CoC? They are a group of corporate CEOs, university presidents, and labor leaders committed to enhanced U.S. competitiveness in the global economy. One of their main focus areas is HPC. That is correct. Not only can HPC dock bio-molecules, design jets, and find oil, it can also help many companies save money and be more competitive. The CoC is a nonpartisan, nongovernmental organization based in Washington, D.C. The Council shapes the debate on competitiveness by bringing together business, labor, academic and government leaders to evaluate economic challenges and opportunities. <a href="http://www.compete.org/about-us/initiatives/hpc">The High Performance Computing Initiative</a> is intended to stimulate and facilitate wider usage of HPC across the private sector to propel productivity, innovation, and competitiveness. Click the link to find out more how other companies have cashed in on HPC. Yours could be next.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/MA0G9svOhQ0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/09/computing-to-compete/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/09/computing-to-compete/</feedburner:origLink></item>
		<item>
		<title>Software Updates on an Intel® Cluster Ready System</title>
		<link>http://feedproxy.google.com/~r/ClusterConnectionFeatured/~3/SqWoBeCkuKM/</link>
		<comments>http://www.clusterconnection.com/2009/08/software-updates-on-an-intel-cluster-ready-system/#comments</comments>
		<pubDate>Wed, 26 Aug 2009 17:18:04 +0000</pubDate>
		<dc:creator>Brock Taylor</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[certified]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Intel Cluster Checker]]></category>
		<category><![CDATA[Intel Cluster Ready]]></category>
		<category><![CDATA[Intel Cluster Ready Architecture]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/2009/08/software-updates-on-an-intel-cluster-ready-system/</guid>
		<description><![CDATA[My last post concluded that eventually the answer to, "is my cluster's software too old," is yes. Updating software on a cluster is not as simple as updating a single server, but the down side of errors is the same: if updates aren't done properly, clusters, like traditional servers, can mysteriously break or start to [...]]]></description>
			<content:encoded><![CDATA[<p>My last post concluded that eventually the answer to, "is my cluster's software too old," is yes. Updating software on a cluster is not as simple as updating a single server, but the down side of errors is the same: if updates aren't done properly, clusters, like traditional servers, can mysteriously break or start to have problems in the future.  In addition, the chance of making an error during the update is proportional to the number of nodes in the cluster. For Intel® Cluster Ready compliant clusters, Intel has provided a couple steps that can and should be performed after a software update that will help verify the cluster is still compliant and ensure it is still functioning properly.</p>
<p>First, always use the required and supplied "provisioning system" tools to update a cluster.  It may be relatively easy to update all the nodes in a cluster using a couple RPMs and something like pdsh to script the installation of the update - but don't be tempted.   This manual or brute-force method bypasses the software that manages the image on each server.  Provisioning systems may reimage nodes after a crash or maybe a node is replaced or added to the cluster.  If software updates are applied manually (outside of the provisioning system) then the reimaged node will be inconsistent with the rest of the system.  An admin would need to remember all the manual changes and apply them again.  It's much better to let the provisioning software worry about that.  That is one the reasons ICR required a provisioning system!</p>
<p>Once updates are applied, it's a good idea to verify the cluster is working as it did before the update was installed and it remains compliant with the Intel® Cluster Ready architecture.  Many if not most software updates will behave well, but verification helps ensure an update didn't alter or remove a key system component that may lead to application failures.  Intel® Cluster Checker provides an easy way to check the compliance after an update.  By using the command-line --compliance option, the tool will verify the interface defined by the architecture still exists as before.  It's an easy way to check that the update hasn't had any ill effect on the architecture interface used by ICR applications.</p>
<p>Finally, there may be needed updates to the Intel Cluster Checker configuration files to reflect the updated software.  For example, if a newer version of the Intel C compiler is installed, the Intel Cluster Checker configuration file should be updated to utilize the newer version.  Running the tool would then verify the new installation is functioning on all nodes. It's also valuable to update the list of packages that are expected on each node.  The packages test verifies the RPMs installed on each node matches a predetermined list.  Using the --packages command-line option will create new package lists based on the current installation (use it after all updates are complete).  Save the original list file and set the configuration file to use the updated list.  For more information on using the tool, see the Intel Cluster Checker Users Guide.</p>
<img src="http://feeds.feedburner.com/~r/ClusterConnectionFeatured/~4/SqWoBeCkuKM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/08/software-updates-on-an-intel-cluster-ready-system/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.clusterconnection.com/2009/08/software-updates-on-an-intel-cluster-ready-system/</feedburner:origLink></item>
	</channel>
</rss>
