<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>cod-E-mphasis</title>
	<atom:link href="https://codemphasis.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>https://codemphasis.wordpress.com</link>
	<description>sharing knowledge is the best way to increase it..</description>
	<lastBuildDate>Thu, 25 Jul 2013 12:44:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<site xmlns="com-wordpress:feed-additions:1">7124068</site><cloud domain='codemphasis.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>https://s2.wp.com/i/webclip.png</url>
		<title>cod-E-mphasis</title>
		<link>https://codemphasis.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="https://codemphasis.wordpress.com/osd.xml" title="cod-E-mphasis" />
	<atom:link rel='hub' href='https://codemphasis.wordpress.com/?pushpress=hub'/>
	<item>
		<title>Introduction to R</title>
		<link>https://codemphasis.wordpress.com/2013/07/25/introduction-to-r/</link>
					<comments>https://codemphasis.wordpress.com/2013/07/25/introduction-to-r/#respond</comments>
		
		<dc:creator><![CDATA[Rishu S]]></dc:creator>
		<pubDate>Thu, 25 Jul 2013 12:41:49 +0000</pubDate>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Programming Language]]></category>
		<category><![CDATA[Data mining]]></category>
		<category><![CDATA[R]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=437</guid>

					<description><![CDATA[R is not for “Rishu” as I made it out to be when I heard of this data mining tool. Initially, I assumed R to be yet another tool as Pentaho. But my assumptions fall apart when I clicked on http://www.r-project.org/ which says up front its definition: “R is a language and environment for statistical [&#8230;]]]></description>
										<content:encoded><![CDATA[<p style="text-align:justify;">R is not for “Rishu” as I made it out to be when I heard of this data mining tool. Initially, I assumed R to be yet another tool as Pentaho. But my assumptions fall apart when I clicked on <a href="http://www.r-project.org/">http://www.r-project.org/</a> which says up front its definition:</p>
<p style="text-align:justify;">“R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment”</p>
<p style="text-align:justify;">So here it is <b>R </b>is a language. R is more of a data mining tool as it seems to me. Well if you ever worked on MATLAB, the format and syntax would look the same. What makes R special is its ability to handle complex mathematical queries and computation simple and easier. Creating graphs and plots are never too easy. For example, let take the below image <i>(Screen shot from code I wrote)</i>:</p>
<p style="text-align:center;"> <a href="https://codemphasis.wordpress.com/wp-content/uploads/2013/07/r_1.png"><img class="size-full wp-image aligncenter" id="i-438" alt="Image" src="https://codemphasis.wordpress.com/wp-content/uploads/2013/07/r_1.png?w=487" width="487" height="270" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2013/07/r_1.png?w=487 487w, https://codemphasis.wordpress.com/wp-content/uploads/2013/07/r_1.png?w=150 150w, https://codemphasis.wordpress.com/wp-content/uploads/2013/07/r_1.png?w=300 300w, https://codemphasis.wordpress.com/wp-content/uploads/2013/07/r_1.png 749w" sizes="(max-width: 487px) 100vw, 487px" /></a></p>
<p style="text-align:justify;">The code is pretty simple. I have assigned certain values (which are in vector format) into two separate variables – “a” and “‘b”. The value of variable “b” is the square of variable “a”. And as you can see the computation of mathematical function is done by using simple commands. I calculated the “MEAN” and “VARIANCE” of the variable b using two simple commands – mean (b) and var (b). The variable “c_lm” shows the linear regression model of variable b and a.</p>
<p style="text-align:justify;">Well there are loads more. People have gone ahead and created something like “<a title="Google Trends" href="http://www.google.com/trends/">Google Trends</a>”.  Though Google has its own GUI built over R, but nothing is stopping us from creating one either.</p>
<p style="text-align:justify;">Sources: <a href="http://www.r-project.org/">http://www.r-project.org/</a>; Google Trends</p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2013/07/25/introduction-to-r/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">437</post-id>
		<media:content url="https://2.gravatar.com/avatar/588d4aff06333cdef4980467ecddd028ad023b7cf0504164ffca2da188389ed0?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rishushriv</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2013/07/r_1.png?w=487" medium="image">
			<media:title type="html">Image</media:title>
		</media:content>
	</item>
		<item>
		<title>Big Data &#8211; Hadoop HDFS and MapReduce</title>
		<link>https://codemphasis.wordpress.com/2012/09/27/big-data-hadoop-hdfs-and-mapreduce/</link>
					<comments>https://codemphasis.wordpress.com/2012/09/27/big-data-hadoop-hdfs-and-mapreduce/#comments</comments>
		
		<dc:creator><![CDATA[Rishu S]]></dc:creator>
		<pubDate>Thu, 27 Sep 2012 12:20:54 +0000</pubDate>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[Technological Updates]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=431</guid>

					<description><![CDATA[The big data buzz is increasing day by day. So here is a more detailed look at the Hadoop &#8211; HDFS and MapReduce. HDFS or the Hadoop Distributed File System is designed to store a large amount of data in various servers/clusters. The definition of large data needs no explanation (especially when we are talking Big Data).  Data in [&#8230;]]]></description>
										<content:encoded><![CDATA[<div>
<p>The big data buzz is increasing day by day. So here is a more detailed look at the Hadoop &#8211; HDFS and MapReduce.</p>
<div>
<p><span style="color:#800000;"><strong>HDFS</strong> </span>or the <strong><span style="color:#800000;">Hadoop Distributed File System</span></strong> is designed to store a large amount of data in various servers/clusters. The definition of large data needs no explanation<span style="color:#808080;"> (especially when we are talking Big Data)</span>.  Data in a Hadoop cluster is broken down in small blocks <span style="color:#808080;">(default is 64MB)</span> and distributed across the clusters.</p>
<p>The blocks in the cluster are placed based on a block placement algorithm – rack aware. Rack aware algorithm basically determines which block is to be placed in clusters based on the replication factor, which is generally 3x by default.</p>
<p><a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg"><img data-attachment-id="432" data-permalink="https://codemphasis.wordpress.com/2012/09/27/big-data-hadoop-hdfs-and-mapreduce/hdfs-arch/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg" data-orig-size="1153,710" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="hdfs arch" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg?w=595" class="aligncenter size-full wp-image-432" title="hdfs arch" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg?w=595" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg?w=600&amp;h=369 600w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg?w=150&amp;h=92 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg?w=300&amp;h=185 300w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg?w=768&amp;h=473 768w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg?w=1024&amp;h=631 1024w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg 1153w" sizes="(max-width: 600px) 100vw, 600px" /></a>The basic architecture of HDFS cluster consists of two major nodes namely:</p>
<p><span style="color:#808000;">1. Name Node:</span></p>
<p>This is almost like the Master Node in Greenplum database and the “master” as per the master-slave concept.  The name node manages the file system namespace. It maintains the file system tree and the metadata for all the files and directories in the tree. This information is stored persistently on the local disk in the form of two files: the namespace image and the edit log.</p>
<p>Now the question arises what if the single name node crashes down (as we have only one primary name node). So, in order to maintain this data, Hadoop provides a secondary name node or Backup Name node. As primary name node is the Single Point of Failure (SPOF), the secondary name node copies the FsImage and EditLog from the Name Node at a particular time.</p>
<p><span style="color:#808000;">2. Data Node:</span></p>
<p>These are the major working blocks of the HDFS. They store and retrieve blocks when they are told to (by the name node), and they report back to the name node periodically with lists of blocks that they are storing. These data nodes are the places where the majority of the data resides.</p>
<p>&nbsp;</p>
<p><strong><span style="color:#800000;">Map Reduce</span></strong> is the second major portion of Hadoop architecture. Map Reduce is the programming logic or the brain as I would like to say. Map Reduce was created by Google which was based on the parallel processing programming logic, written in Java.</p>
<p>The Map Reduce programming model works on two parts – The Mapping part(done by the <span style="color:#3366ff;">Mapper</span>) and The Reduction part (done by the <span style="color:#3366ff;">Reducer</span>).</p>
<p>The Mapper works on the blocks of data available in the data nodes and tries to get the job done. You can think of Mapper as an individual worker (in the master-slave concept), working to get the data required from the client.</p>
<p>Now the major task remains is to get the aggregate count of the results done by each Mapper. This work is done by the Reducer. The Reducer iterates over the entire result data and sends back a single output value.</p>
<p>Map Reduce programming undergoes through various intermediate stages. Now let’s have a look at the following diagram:</p>
<p><a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg"><img data-attachment-id="433" data-permalink="https://codemphasis.wordpress.com/2012/09/27/big-data-hadoop-hdfs-and-mapreduce/mapreduce/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg" data-orig-size="1153,901" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="mapreduce" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg?w=595" class="aligncenter size-full wp-image-433" title="mapreduce" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg?w=595" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg?w=600&amp;h=469 600w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg?w=150&amp;h=117 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg?w=300&amp;h=234 300w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg?w=768&amp;h=600 768w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg?w=1024&amp;h=800 1024w, https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg 1153w" sizes="(max-width: 600px) 100vw, 600px" /></a></p>
<p>From the diagram above we can see that the user give something as the input. In this case the input is a question and its subsequent answer. These files are stored in the data nodes of the HDFS. The Map-Reduce program looks into given data and breaks the data into an intermediate stage. The intermediate stage consists of a key/value pair, which breaks the file data into many key- value pair data.<em><span style="color:#808080;"> [If you have studied Compiler Design during your college days, then a look at the key-value stage just reminds me of the lexical analysis, semantic analysis, etc.]</span></em>. Now after this stage, the sorting or the shuffling of the data takes place. It’s vague to understand from the diagram, but if you look into the second part of the above picture, you will understand the requirement of the sorting phase. The major reason is the availability of various servers or nodes. The Map Reduce makes sure that the shuffling and sorting of the data takes place using the key. Now come the reducer phase, which accepts the data coming from the sorting / shuffling phase and combines the data into a smaller set of values. This data is sent back to the user/client.</p>
<p>The above entire process is controlled by a <span style="color:#993300;">JobTracker</span>, which coordinates the job run and makes sure everything goes fine. The <span style="color:#993300;">TaskTracker</span> runs the tasks that the job has been split into.</p>
<p>So this is a brief description of the HDFS and the MapReduce. I didn’t go much deep into the core functionality of Map Reduce as it requires a full scale knowledge of the Java Programming Language. So I guess am able to give a short but detailed explanation on Hadoop. Thanks and take care.</p>
</div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2012/09/27/big-data-hadoop-hdfs-and-mapreduce/feed/</wfw:commentRss>
			<slash:comments>4</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">431</post-id>
		<media:content url="https://2.gravatar.com/avatar/588d4aff06333cdef4980467ecddd028ad023b7cf0504164ffca2da188389ed0?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rishushriv</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/hdfs-arch.jpg" medium="image">
			<media:title type="html">hdfs arch</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/09/mapreduce.jpg" medium="image">
			<media:title type="html">mapreduce</media:title>
		</media:content>
	</item>
		<item>
		<title>Big Data : Parallelism and Hadoop:Basics</title>
		<link>https://codemphasis.wordpress.com/2012/08/13/big-data-parallelism-and-hadoopbasics/</link>
					<comments>https://codemphasis.wordpress.com/2012/08/13/big-data-parallelism-and-hadoopbasics/#comments</comments>
		
		<dc:creator><![CDATA[Rishu S]]></dc:creator>
		<pubDate>Mon, 13 Aug 2012 12:41:12 +0000</pubDate>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[OpenSource]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=426</guid>

					<description><![CDATA[&#160; Let me start this blog by putting up two scenarios in front you: Scenario I: You are given a bucket full of mixed fruits. There are 3 different kinds of fruits say apple mango and banana. Now how would you calculate the total number of apple, mango and banana in the bucket? The simplest [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>&nbsp;</p>
<p style="text-align:justify;">
<p style="text-align:justify;">
<p style="text-align:justify;">
<p style="text-align:justify;">Let me start this blog by putting up two scenarios in front you:</p>
<p style="text-align:justify;"><span style="color:#993300;"><strong>Scenario I</strong></span>: You are given a bucket full of mixed fruits. There are 3 different kinds of fruits say apple mango and banana. Now how would you calculate the total number of apple, mango and banana in the bucket?</p>
<p style="text-align:justify;">The simplest answer would be to count the fruits taking one by one and in the end getting the required result.</p>
<p style="text-align:justify;"><strong><span style="color:#993300;">Scenario II</span>:</strong> Now suppose instead of a bucket of fruits, you are given a Truck full of mixed fruits. How would you count the total number of individual fruit this time?</p>
<p style="text-align:justify;"><a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif"><img loading="lazy" data-attachment-id="429" data-permalink="https://codemphasis.wordpress.com/2012/08/13/big-data-parallelism-and-hadoopbasics/counting/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif" data-orig-size="587,359" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="counting" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif?w=587" class="alignleft size-medium wp-image-429" style="margin:20px;" title="counting" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif?w=300&#038;h=183" alt="" width="300" height="183" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif?w=300 300w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif?w=150 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif 587w" sizes="(max-width: 300px) 100vw, 300px" /></a>The most feasible approach would be to divide the work<span style="color:#008080;"><em> (instead of count the entire fruit truck one by one)</em></span>. We would take up one basket each full of fruits <span style="color:#008080;"><em>[mixed up fruits]</em></span> and give it to different people<strong><span style="color:#993300;">[WORKER/SLAVE</span>].</strong> Each people count their own basket<span style="color:#008080;"> <em>(irrespective of any communication between the two)</em></span> and in the end we <strong>[<span style="color:#993300;">MASTER</span>] </strong>sum the results of each basket to get the result. Using this approach we would save time and effort<span style="color:#008080;"> <em>[if you would agree]</em></span>.</p>
<p style="text-align:justify;">Well, if you are still wondering why I started off with this scenario, then I have to say that <span style="color:#993300;"><strong>HADOOP</strong></span> is built on this simple basic principle. The above scenario describes as something in technical terminology called as <span style="color:#008080;">“<em>Parallel processing or distributed system programming</em>”</span>. There is concept of Master – Worker in parallel processing system. Master divides the work and the worker does the allotted work. The work done by each worker is sent back to the Master.</p>
<p style="text-align:justify;">Similar is the situation with BIG DATA. There is plenty of data available<span style="color:#008080;"><em> (just like the truck of fruits)</em> </span>which one cannot handle alone and most importantly the <strong>3-V </strong><span style="color:#008080;"><em>[volume, variety and velocity]</em></span> factor of the BIG DATA. So to handle such a situation Apache came up with <span style="color:#993300;"><strong>HADOOP </strong></span>– a high performance distributed data and processing system that can store any kind of data from any source at a very large scale and can do very sophisticated analysis of the BIG DATA.</p>
<p style="text-align:justify;">Hadoop architecture is mainly based on the following two <strong>components</strong>:</p>
<p style="text-align:justify;"><strong>1.       </strong><strong><span style="color:#008080;"><span style="color:#993300;">HDFS</span> [Hadoop Distributed File System]</span>:</strong></p>
<p style="text-align:justify;">It is more of a storage area for Hadoop. Whenever a data arrives at the cluster<strong>*</strong>, the HDFS software breaks it into pieces and distributes to the participating servers in the cluster.</p>
<p style="text-align:justify;"><strong>2.       </strong><span style="color:#993300;"><strong>MapReduce:</strong></span></p>
<p style="text-align:justify;"> As the data is stored as fragments across various servers, MapReduce uses its programming logic to compute the required job on these server data and later return the result back to the Master Server. The computation happens locally and parallel across all servers in the cluster<span style="color:#008080;"> <em>[Master – Worker concept]</em></span>.</p>
<p style="text-align:justify;"><a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg"><img loading="lazy" data-attachment-id="427" data-permalink="https://codemphasis.wordpress.com/2012/08/13/big-data-parallelism-and-hadoopbasics/hadoop-architecture/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg" data-orig-size="585,367" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="hadoop-architecture" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg?w=585" class="aligncenter size-full wp-image-427" title="hadoop-architecture" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg?w=595" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg 585w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg?w=150&amp;h=94 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg?w=300&amp;h=188 300w" sizes="(max-width: 585px) 100vw, 585px" /></a></p>
<p style="text-align:justify;">The picture above describes the Hadoop Ecosystem, which will be explained in details in my later blogs. I hope I am clear with the parallel distributed concept. This concept will be useful in understanding the architecture of Hadoop.</p>
<p style="text-align:justify;"><strong>[<span style="color:#993300;">A bit of History on Hadoop</span>:</strong> Hadoop was created by<span style="color:#008080;"> <em>Doug Cutting</em></span>, who named it after his son’s elephant toy. Hadoop was derived from<span style="color:#008080;"> <em>Google&#8217;s MapReduce</em> and <em>Google File System (GFS)</em></span> papers. Hadoop is a top-level <span style="color:#008080;"><strong>Apache</strong> </span>project being<a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png"><img loading="lazy" data-attachment-id="428" data-permalink="https://codemphasis.wordpress.com/2012/08/13/big-data-parallelism-and-hadoopbasics/hadoopelephant_rgb1/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png" data-orig-size="2000,473" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="hadoopelephant_rgb1" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=595" class="alignright size-thumbnail wp-image-428" title="hadoopelephant_rgb1" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=150&#038;h=35" alt="" width="150" height="35" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=150 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=300 300w" sizes="(max-width: 150px) 100vw, 150px" /></a> built and used by a global community of contributors, written in the Java programming language. Yahoo! has been the largest contributor to the project, and uses Hadoop extensively across its businesses.<strong>]</strong></p>
<p style="text-align:justify;"><strong><span style="color:#808080;">FAQ:</span></strong></p>
<p style="text-align:justify;"><span style="color:#993300;"><strong>*cluster</strong> &#8211;</span> <em>A computer cluster consists of a set of loosely connected computers that work together so that in many respects they can be viewed as a single system. The components of a cluster are usually connected to each other through fast local area networks, each node (computer used as a server) running its own instance of an operating system.<a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png"><img loading="lazy" data-attachment-id="428" data-permalink="https://codemphasis.wordpress.com/2012/08/13/big-data-parallelism-and-hadoopbasics/hadoopelephant_rgb1/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png" data-orig-size="2000,473" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="hadoopelephant_rgb1" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=595" class="alignright size-thumbnail wp-image-428" title="hadoopelephant_rgb1" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=150" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=150 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=300 300w" sizes="(max-width: 150px) 100vw, 150px" /></a> Computer clusters emerged as a result of convergence of a number of computing trends including the availability of low cost microprocessors, high speed networks, and software for high performance distributed computing</em></p>
<p style="text-align:justify;"><em><span style="color:#808080;">[Source: Wikipedia [Hadoop History] and Google]</span></em></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2012/08/13/big-data-parallelism-and-hadoopbasics/feed/</wfw:commentRss>
			<slash:comments>4</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">426</post-id>
		<media:content url="https://2.gravatar.com/avatar/588d4aff06333cdef4980467ecddd028ad023b7cf0504164ffca2da188389ed0?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rishushriv</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/counting.gif?w=300" medium="image">
			<media:title type="html">counting</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoop-architecture.jpg" medium="image">
			<media:title type="html">hadoop-architecture</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=150" medium="image">
			<media:title type="html">hadoopelephant_rgb1</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/hadoopelephant_rgb1.png?w=150" medium="image">
			<media:title type="html">hadoopelephant_rgb1</media:title>
		</media:content>
	</item>
		<item>
		<title>Big Data : An Introduction</title>
		<link>https://codemphasis.wordpress.com/2012/08/10/big-data-an-introduction/</link>
					<comments>https://codemphasis.wordpress.com/2012/08/10/big-data-an-introduction/#respond</comments>
		
		<dc:creator><![CDATA[Rishu S]]></dc:creator>
		<pubDate>Fri, 10 Aug 2012 12:43:16 +0000</pubDate>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[OpenSource]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Businees Intelligence]]></category>
		<category><![CDATA[Data Ware]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=420</guid>

					<description><![CDATA[Hey guys, I am back to blogging after a pretty long gap. Since my last blog I have been going through data warehousing stuffs. In the midst of my learning data warehousing techniques, I came to know about a bigger issue which is troubling IT companies. It’s called BIG DATA. So I thought to share [&#8230;]]]></description>
										<content:encoded><![CDATA[<p style="text-align:justify;">Hey guys, I am back to blogging after a pretty long gap. Since my last blog I have been going through data warehousing<a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg"><img loading="lazy" data-attachment-id="421" data-permalink="https://codemphasis.wordpress.com/2012/08/10/big-data-an-introduction/iap123611212/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg" data-orig-size="480,247" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="IAP123611212" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg?w=480" class="alignright  wp-image-421" style="margin:20px;" title="IAP123611212" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg?w=280&#038;h=144" alt="" width="280" height="144" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg?w=280&amp;h=144 280w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg?w=150&amp;h=77 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg?w=300&amp;h=154 300w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg 480w" sizes="(max-width: 280px) 100vw, 280px" /></a> stuffs. In the midst of my learning data warehousing techniques, I came to know about a bigger issue which is troubling IT companies. It’s called <span style="color:#993300;"><strong>BIG DATA</strong></span>. So I thought to share my knowledge on this advanced business analytic with you guys.</p>
<p style="text-align:justify;">If you are thinking BIG DATA deals with “<span style="color:#993300;"><em>data which are big</em> <em>in nature</em></span>”, then I have to say you are perfectly correct. But if your brain is limited to the database tables with 1000 rows to 100K rows; then I fear BIG DATA is something bigger and messier than this. Well, a formal definition on BIG DATA would go as:</p>
<p style="text-align:justify;"><span style="color:#008080;">“<em>Big data is a term applied to data sets both </em><em>structured</em><em> and unstructured, whose volume is more than the capacity of commonly used software tools to capture, manage, and process the data with usual </em><em>database</em><em> and </em><em>software</em><em> techniques within an acceptable time.</em>”</span></p>
<p style="text-align:justify;"> Today, companies face a serious issue. They have access to lots and lots of data and they have no idea what to do with those data. An IBM survey shows that over half of the business leaders today realize that they don’t have access to insights they need to do their jobs. These data normally are generated from the log files, IM Chats, Facebook chats, emails, sensors, etc. These data are raw in nature and is something you won’t find in database table (row-column) format. It’s accumulated from the day to day activity from the work of each and every associate. Companies are trying to access these data store to derive some business intelligence and strategies. BIG DATA is not about relational database but of the data which has got no relations to each other.</p>
<p style="text-align:justify;"> BIG DATA can be classified basically into three different categories based on data characteristics:</p>
<p style="text-align:justify;"><span style="color:#993300;"><strong>1.      </strong><strong>VOLUME:</strong></span></p>
<p style="text-align:justify;">There is huge amount of data that are being stored in the world. In the year 2000, there is around<span style="color:#008080;"> 800,000 petabytes</span> <em>(1 PB = 10<sup>15</sup> bytes)</em> of data stored in the world. The volume of data is growing rapidly. Companies have no idea what to do and <a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg"><img loading="lazy" data-attachment-id="422" data-permalink="https://codemphasis.wordpress.com/2012/08/10/big-data-an-introduction/big_data_paper_stack_web/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg" data-orig-size="615,480" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="BIG_DATA_paper_stack_WEB" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg?w=595" class="alignleft  wp-image-422" style="margin:20px;" title="BIG_DATA_paper_stack_WEB" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg?w=216&#038;h=169" alt="" width="216" height="169" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg?w=216&amp;h=169 216w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg?w=432&amp;h=337 432w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg?w=150&amp;h=117 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg?w=300&amp;h=234 300w" sizes="(max-width: 216px) 100vw, 216px" /></a>how to process these data. <span style="color:#008080;">Twitter</span> alone generates more than<span style="color:#008080;"> 7 petabytes of data</span> everyday and <span style="color:#008080;">Facebook</span> generates around <span style="color:#008080;">10PB</span> of data alone. This value is growing exponentially companies. Some Enterprises generate terabytes of data every hour of every day of the year. It won’t be wrong to think that we are drowning deep in the ocean of data. By 2020, it is expected to reach <span style="color:#008080;">35 zettabytes</span> <em>(1 ZB= 10<sup>21</sup> bytes).</em></p>
<p style="text-align:justify;"><span style="color:#993300;"> <strong>2.       </strong><strong>VARIETY:</strong></span></p>
<p style="text-align:justify;">With huge volume of data comes another problem i.e. Variety. With the onset of rapid technology usage, data is not only limited to just relational database, but it has grown to the<span style="color:#008080;"><strong> <em>raw un-structured</em> </strong></span>and <span style="color:#008080;"><strong><em>semi-structured</em></strong></span> data mainly coming from web pages, log files, emails, chats, etc. Traditional systems struggle to store and perform required analytics to gain intelligence because most of the information generated doesn’t lend itself to traditional database technologies.</p>
<p style="text-align:justify;"><span style="color:#993300;"> <strong>3.       </strong><strong>VELOCITY:</strong></span></p>
<p style="text-align:justify;">Velocity is one characteristic of BIG DATA that deals with how fast a data is being stored and used for analytics. In BIG DATA terminology, we are looking at a volume and variety aspect also. So, thinking on the rate of arrival of data along with the volume and variety, is something a traditional database technology could hardly handle. As per the survey is concerned, around <span style="color:#008080;">2.9 million of emails</span> are sent every second, <span style="color:#008080;">20 hrs of video</span> is uploaded every minute in YouTube and around <span style="color:#008080;">50 million tweets</span> per day in Twitter. So I think you can imagine the velocity of data come at you.</p>
<p style="text-align:justify;">There is also another characteristic of BIG DATA, which is <span style="color:#993300;"><strong>VALUE</strong></span>. A value aspect of big data is something all companies are looking forward to. Unless you are able to derive some business intelligence and value of these data present, then there is no use of such data. In simple terms, Value deals with what the present unstructured raw data can get a meaningful statistics so that it can be useful in taking proper business decisions.</p>
<p style="text-align:justify;">Companies are trying to extract all the information possible and derive better intelligence out of it and to gain a better<a href="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg"><img loading="lazy" data-attachment-id="423" data-permalink="https://codemphasis.wordpress.com/2012/08/10/big-data-an-introduction/big-data-teaser-graphic/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg" data-orig-size="660,330" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="big-data-teaser-graphic" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg?w=595" class="alignright  wp-image-423" style="margin:20px;" title="big-data-teaser-graphic" src="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg?w=216&#038;h=108" alt="" width="216" height="108" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg?w=216&amp;h=108 216w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg?w=432&amp;h=216 432w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg?w=150&amp;h=75 150w, https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg?w=300&amp;h=150 300w" sizes="(max-width: 216px) 100vw, 216px" /></a> understanding of the customers, marketplace and the business. Few technical solutions like <strong><span style="color:#993300;">HADOOP</span></strong> <em>(which I will explain in my next blog)</em>, NoSQL, DKVS databases, etc. are combating BIG DATA problems.</p>
<p style="text-align:justify;">For now all I could conclude is that the right use of BIG DATA will allow analysts to spot trends and give niche insights that help create value and innovation much faster than the conventional methods. It would also help in better meeting consumer demand and facilitating growth.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2012/08/10/big-data-an-introduction/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">420</post-id>
		<media:content url="https://2.gravatar.com/avatar/588d4aff06333cdef4980467ecddd028ad023b7cf0504164ffca2da188389ed0?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rishushriv</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/iap123611212.jpg" medium="image">
			<media:title type="html">IAP123611212</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big_data_paper_stack_web.jpg" medium="image">
			<media:title type="html">BIG_DATA_paper_stack_WEB</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2012/08/big-data-teaser-graphic.jpg" medium="image">
			<media:title type="html">big-data-teaser-graphic</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud Computing : Architecture</title>
		<link>https://codemphasis.wordpress.com/2011/11/11/cloud-computing-architecture/</link>
					<comments>https://codemphasis.wordpress.com/2011/11/11/cloud-computing-architecture/#respond</comments>
		
		<dc:creator><![CDATA[Rishu S]]></dc:creator>
		<pubDate>Fri, 11 Nov 2011 10:14:46 +0000</pubDate>
				<category><![CDATA[Networking & Related]]></category>
		<category><![CDATA[OpenSource]]></category>
		<category><![CDATA[Technological Updates]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=413</guid>

					<description><![CDATA[Hey guys !!! i hope everyone is clear with the overview on cloud computing ,which i had already discussed in my previous blog. Our entire discussion on cloud computing will not end until and unless we discuss about the architectures and the technical side of this system. So, without wasting much time on &#8220;bakwasss&#8221; lets [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>Hey guys !!! i hope everyone is clear with the overview on cloud computing ,which i had already discussed in my previous blog. Our entire discussion on cloud computing will not end until and unless we discuss about the architectures and the technical side of this system. So, without wasting much time on &#8220;bakwasss&#8221; lets begin our discussion on the architecture of cloud computing.</p>
<p>Cloud architecture, the systems architecture of the software systems involved in the delivery of cloud computing, typically involves multiple cloud components communicating with each other over a loose coupling mechanism such as a messaging queue. When talking about a cloud computing system, it&#8217;s helpful to divide it into two sections:</p>
<p><strong>1. The Front End or the Intercloud:</strong><br />
The front end includes the client&#8217;s computer (or computer network) and the application required to access the cloud computing system. Not all cloud computing systems have the same user interface. Services like Web-based e-mail programs leverage existing Web browsers like Internet Explorer or Firefox. Other systems have unique applications that provide network access to clients.<br />
<div data-shortcode="caption" id="attachment_415" style="width: 335px" class="wp-caption alignright"><a href="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png"><img aria-describedby="caption-attachment-415" loading="lazy" data-attachment-id="415" data-permalink="https://codemphasis.wordpress.com/2011/11/11/cloud-computing-architecture/325px-cloudcomputingsamplearchitecture-svg-2/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png" data-orig-size="325,222" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="Cloud Computing Sample Architecture" data-image-description="" data-image-caption="&lt;p&gt;Cloud Computing Architecture&lt;/p&gt;
" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png?w=325" src="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png?w=595" alt="Cloud Computing Architecture" title="Cloud Computing Sample Architecture"   class="size-full wp-image-415" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png 325w, https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png?w=150&amp;h=102 150w, https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png?w=300&amp;h=205 300w" sizes="(max-width: 325px) 100vw, 325px" /></a><p id="caption-attachment-415" class="wp-caption-text">Cloud Computing Architecture</p></div></p>
<p><strong>2. The Back End or The Cloud Engineering :</strong><br />
On the back end of the system are the various computers, servers and data storage systems that create the &#8220;cloud&#8221; of computing services. In theory, a cloud computing system could include practically any computer program you can imagine, from data processing to video games. Usually, each application will have its own dedicated server.</p>
<p>[<em><strong>N.B</strong></em>: Cloud engineering is the application of engineering disciplines to cloud computing. It brings a systematic approach to the high level concerns of commercialisation, standardisation, and governance in conceiving, developing, operating and maintaining cloud computing systems. It is a multidisciplinary method encompassing contributions from diverse areas such as systems, software, web, performance, information, security, platform, risk, and quality engineering.]</p>
<p>If a cloud computing company has a lot of clients, there&#8217;s likely to be a high demand for a lot of storage space. Some companies require hundreds of digital storage devices. Cloud computing systems need at least twice the number of storage devices it requires to keep all its clients&#8217; information stored. That&#8217;s because these devices, like all computers, occasionally break down. A cloud computing system must make a copy of all its clients&#8217; information and store it on other devices. The copies enable the central server to access backup machines to retrieve data that otherwise would be unreachable. Making copies of data as a backup is called <strong>Redundancy.</strong></p>
<p>The architecture of cloud is evolving rapidly. Hopefully in the upcoming future of computing we can say &#8220;we build our home in the cloud&#8221;. There are also many issues such as privacy, data maintenance, etc, but still there are loads of advantages too. We will discuss it in the later blogs. Stay tuned for more !!! </p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2011/11/11/cloud-computing-architecture/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">413</post-id>
		<media:content url="https://2.gravatar.com/avatar/588d4aff06333cdef4980467ecddd028ad023b7cf0504164ffca2da188389ed0?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rishushriv</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/325px-cloudcomputingsamplearchitecture-svg1.png" medium="image">
			<media:title type="html">Cloud Computing Sample Architecture</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud Computing : Overview</title>
		<link>https://codemphasis.wordpress.com/2011/11/07/cloud-computing-overview/</link>
					<comments>https://codemphasis.wordpress.com/2011/11/07/cloud-computing-overview/#respond</comments>
		
		<dc:creator><![CDATA[Rishu S]]></dc:creator>
		<pubDate>Mon, 07 Nov 2011 10:10:10 +0000</pubDate>
				<category><![CDATA[1]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=405</guid>

					<description><![CDATA[I guess everyone is now all aware of the cloud computing. Its been on news everywhere in all the IT sectors of the world. Its been in huge demand these days and is also said to change the entire computer industry. So, now the question still stays &#8211; What is Cloud computing ?? [only for [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>I guess everyone is now all aware of the cloud computing. Its been on news everywhere in all the IT sectors of the world. Its been in huge demand these days and is also said to change the entire computer industry. So, now the question still stays &#8211; What is Cloud computing ?? [only for those who dont know about it]. Lets me state the basic overview about cloud computing !!!</p>
<p>The term &#8220;cloud&#8221; is used as a metaphor for the Internet, based on the cloud drawing used in the past to represent the telephone network, and later to depict the Internet in computer network diagrams as an abstraction of the underlying infrastructure it represents.Cloud computing is a natural evolution of the widespread adoption of virtualisation, service-oriented architecture, autonomic, and utility computing. Details are abstracted from end-users, who no longer have need for expertise in, or control over, the technology infrastructure &#8220;in the cloud&#8221; that supports them.<br />
<a href="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png"><img loading="lazy" data-attachment-id="408" data-permalink="https://codemphasis.wordpress.com/2011/11/07/cloud-computing-overview/400px-cloud_computing-svg/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png" data-orig-size="400,362" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="Cloud Computing : Overview Diagram" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png?w=400" src="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png?w=595" alt="" title="Cloud Computing : Overview Diagram"   class="alignleft size-full wp-image-408" srcset="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png 400w, https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png?w=150&amp;h=136 150w, https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png?w=300&amp;h=272 300w" sizes="(max-width: 400px) 100vw, 400px" /></a><br />
Let&#8217;s say you&#8217;re an executive at a large corporation. Your particular responsibilities include making sure that all of your employees have the right hardware and software they need to do their jobs. Buying computers for everyone isn&#8217;t enough &#8212; you also have to purchase software or software licenses to give employees the tools they require. Whenever you have a new hire, you have to buy more software or make sure your current software license allows another user. It&#8217;s so stressful that you find it difficult to go to sleep on your huge pile of money every night. And this is where the concept of Cloud computing come into play.<br />
Now all you need to do is just to load only one application instead of installing a suite of software for each computer. That application would allow workers to log into a Web-based service which hosts all the programs the user would need for his or her job. Remote machines owned by another company would run everything from e-mail to word processing to complex data analysis programs. This is what <strong>CLOUD COMPUTING</strong> is all about.</p>
<p>In a cloud computing system, there&#8217;s a significant workload shift. Local computers no longer have to do all the heavy lifting when it comes to running applications. The network of computers that make up the cloud handles them instead. Hardware and software demands on the user&#8217;s side decrease. The only thing the user&#8217;s computer needs to be able to run is the cloud computing system&#8217;s interface software, which can be as simple as a Web browser, and the cloud&#8217;s network takes care of the rest.</p>
<p>If you guys are still wondering, just take another simple example of Gmail accounts that is provided by Google. Instead of running an e-mail program on your computer, you log in to a Web e-mail account remotely. The software and storage for your account doesn&#8217;t exist on your computer &#8212; it&#8217;s on the service&#8217;s computer cloud. </p>
<p>Cloud computing is all the rage. &#8220;It&#8217;s become the phrase du jour,&#8221; says Gartner senior analyst Ben Pring. </p>
<p>Hence my small overview about cloud computing is over. I guess soon i will be able to provide you with more details about this new phenomenon.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2011/11/07/cloud-computing-overview/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">405</post-id>
		<media:content url="https://2.gravatar.com/avatar/588d4aff06333cdef4980467ecddd028ad023b7cf0504164ffca2da188389ed0?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rishushriv</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2011/11/400px-cloud_computing-svg.png" medium="image">
			<media:title type="html">Cloud Computing : Overview Diagram</media:title>
		</media:content>
	</item>
		<item>
		<title>Automation testing contd</title>
		<link>https://codemphasis.wordpress.com/2010/09/16/automation-testing-contd/</link>
					<comments>https://codemphasis.wordpress.com/2010/09/16/automation-testing-contd/#respond</comments>
		
		<dc:creator><![CDATA[JD]]></dc:creator>
		<pubDate>Thu, 16 Sep 2010 17:16:13 +0000</pubDate>
				<category><![CDATA[automation testing]]></category>
		<category><![CDATA[Testing]]></category>
		<category><![CDATA[Integration testing]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Regression testing]]></category>
		<category><![CDATA[Unit testing]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=396</guid>

					<description><![CDATA[Before going further lets understand the testing steps: Unit testing —————– This type of testing tests individual application objects or methods in an isolated environment. It verifies the smallest unit of the application to ensure the correct structure and the defined operations. Unit testing is the most efficient and effective means to detect defects or [&#8230;]]]></description>
										<content:encoded><![CDATA[<div dir="ltr">Before going further lets understand the testing steps:</div>
<div dir="ltr"><strong>Unit testing</strong><br />
—————–<br />
This type of testing tests individual application objects or methods in   an isolated environment. It verifies the smallest unit of the   application to ensure the correct structure and the defined operations.   Unit testing is the most efficient and effective means  to detect  defects or bugs. The testing tools are capable of creating  unit test  scripts.</div>
<div dir="ltr"></div>
<div dir="ltr"><strong><a class="zem_slink" title="Integration testing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Integration_testing">Integration</a> testing</strong><br />
——————-<br />
This testing is to evaluate proper functioning of the integrated   modules (objects, methods) that make up a subsystem. The focus of   integration testing is on cross-functional tests rather than on <a class="zem_slink" title="Unit testing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Unit_testing">unit   tests</a> within one module. Available testing tools usually  provide  gateways to create stubs and mock objects for this test.</div>
<div dir="ltr"></div>
<div dir="ltr"><strong>System testing</strong><br />
——————<br />
System testing should be executed as soon as an integrated set of   modules has been assembled to form the application. System testing   verifies the product by testing the application in the integrated system   environment.</div>
<div dir="ltr"></div>
<div dir="ltr"><strong><a class="zem_slink" title="Regression testing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Regression_testing">Regression</a> testing</strong><br />
—————–<br />
Regression testing ensures that code modification, bug correction, and   any postproduction activities have not introduced any additional bugs   into the previously tested code. This test often reuses the test scripts   created for unit and integration testing. <a class="zem_slink" title="Software testing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Software_testing">Software  testing</a> tools  offer harnesses to manage these test scripts and schedule  the  regression testing.</div>
<div dir="ltr"><strong><br />
</strong></div>
<div dir="ltr"><strong><a class="zem_slink" title="Usability" rel="wikipedia" href="http://en.wikipedia.org/wiki/Usability">Usability</a> testing</strong><br />
—————–<br />
Usability testing ensures that the presentation, data flow, and general   ergonomics of the application meet the requirements of the intended   users. This testing phase is critical to attract and keep customers.   Usually, manual testing methods are inevitable for  this purpose.</div>
<div dir="ltr"></div>
<div dir="ltr"><strong>Stress testing</strong><br />
————–<br />
Stress testing makes sure that the features of the software and hardware   continue to function correctly under a predesigned set and volume of   test scenarios. The purpose of stress testing is to ensure that the   system can hold and operate efficiently under different  load  conditions. Thus, the possible hardware platforms, operating  systems,  and other applications used by the customers should be  considered for  this testing phase.</div>
<div dir="ltr"></div>
<div dir="ltr"><strong><a class="zem_slink" title="Performance testing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Performance_testing">Performance</a> testing</strong><br />
—————–<br />
Performance testing measures the response times of the systems to   complete a task and the efficiency of the algorithms under varied   conditions. Therefore, performance testing also takes into consideration   the possible hardware platforms, operating systems,  and other  applications used by the customers.</div>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2010/09/16/automation-testing-contd/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">396</post-id>
		<media:content url="https://1.gravatar.com/avatar/409128dd10b110d43fe2114cd472fc8b18b12c896270414d17fb9c3af6a3434a?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jaydip13</media:title>
		</media:content>
	</item>
		<item>
		<title>MicroStrategy Architecture</title>
		<link>https://codemphasis.wordpress.com/2010/09/13/microstrategy-architecture-2/</link>
					<comments>https://codemphasis.wordpress.com/2010/09/13/microstrategy-architecture-2/#respond</comments>
		
		<dc:creator><![CDATA[Arnab Guha]]></dc:creator>
		<pubDate>Mon, 13 Sep 2010 19:21:48 +0000</pubDate>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[MicroStrategy]]></category>
		<category><![CDATA[Reporting]]></category>
		<category><![CDATA[Data warehouse]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=388</guid>

					<description><![CDATA[MicroStrategy has the following 3 types of architecture: 2-Tier Architecture In 2-tier architecture, the MicroStrategy Desktop itself queries against the Data warehouse and the Metadata without the Intermediate tier of the Intelligence server. 3-Tier Architecture The 3-Tier architecture comprises an Intelligence server between MicroStrategy Desktop and the data Warehouse and the Metadata. 4-Tier Architecture The [&#8230;]]]></description>
										<content:encoded><![CDATA[<p><a class="zem_slink" title="Microstrategy" rel="homepage" href="http://www.microstrategy.com/" rel="nofollow">MicroStrategy</a> has the following 3 types of architecture:</p>
<ul>
<li><strong>2-Tier Architecture</strong></li>
</ul>
<p style="padding-left:30px;">In <a class="zem_slink" title="Client–server model" rel="wikipedia" href="http://en.wikipedia.org/wiki/Client%E2%80%93server_model">2-tier architecture</a>, the MicroStrategy Desktop itself queries against the <a class="zem_slink" title="Data warehouse" rel="wikipedia" href="http://en.wikipedia.org/wiki/Data_warehouse">Data warehouse</a> and the Metadata without the Intermediate tier of the Intelligence server.</p>
<p style="padding-left:30px;text-align:center;"><a href="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/2-tier-arch_mstr.jpg"><img loading="lazy" data-attachment-id="377" data-permalink="https://codemphasis.wordpress.com/2-tier-arch_mstr/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/2-tier-arch_mstr.jpg" data-orig-size="241,120" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="2 tier arch_mstr" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/2-tier-arch_mstr.jpg?w=241" class="size-full wp-image-377 aligncenter" title="2 tier arch_mstr" src="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/2-tier-arch_mstr.jpg?w=595" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/2-tier-arch_mstr.jpg 241w, https://codemphasis.wordpress.com/wp-content/uploads/2010/09/2-tier-arch_mstr.jpg?w=150&amp;h=75 150w" sizes="(max-width: 241px) 100vw, 241px" /></a></p>
<ul>
<li><strong>3-Tier Architecture</strong></li>
</ul>
<p style="padding-left:30px;">The 3-Tier architecture comprises an Intelligence server between MicroStrategy Desktop and the data Warehouse and the Metadata.</p>
<p style="padding-left:30px;text-align:center;"><a href="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/3-tier-arch_mstr.jpg"><img loading="lazy" data-attachment-id="382" data-permalink="https://codemphasis.wordpress.com/3-tier-arch_mstr/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/3-tier-arch_mstr.jpg" data-orig-size="260,195" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="3 tier arch_mstr" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/3-tier-arch_mstr.jpg?w=260" class="size-full wp-image-382 aligncenter" title="3 tier arch_mstr" src="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/3-tier-arch_mstr.jpg?w=595" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/3-tier-arch_mstr.jpg 260w, https://codemphasis.wordpress.com/wp-content/uploads/2010/09/3-tier-arch_mstr.jpg?w=150&amp;h=112 150w" sizes="(max-width: 260px) 100vw, 260px" /></a></p>
<ul>
<li><strong>4-Tier Architecture</strong></li>
</ul>
<p style="padding-left:30px;">The 4-tier architecture is same as 3-tier except it has an additional component of MicroStrategy Web.</p>
<p style="padding-left:30px;text-align:center;"><a href="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/4-tier-arch_mstr.jpg"><img loading="lazy" data-attachment-id="383" data-permalink="https://codemphasis.wordpress.com/4-tier-arch_mstr/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/4-tier-arch_mstr.jpg" data-orig-size="276,247" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="4 tier arch_mstr" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/4-tier-arch_mstr.jpg?w=276" class="size-full wp-image-383 aligncenter" title="4 tier arch_mstr" src="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/4-tier-arch_mstr.jpg?w=595" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/4-tier-arch_mstr.jpg 276w, https://codemphasis.wordpress.com/wp-content/uploads/2010/09/4-tier-arch_mstr.jpg?w=150&amp;h=134 150w" sizes="(max-width: 276px) 100vw, 276px" /></a></p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2010/09/13/microstrategy-architecture-2/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">388</post-id>
		<media:content url="https://0.gravatar.com/avatar/c9d46c6f775249f0c152e4bc4b1feb247c1ae0ef36b0432302a707e18716f681?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">arnab86</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/2-tier-arch_mstr.jpg" medium="image">
			<media:title type="html">2 tier arch_mstr</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/3-tier-arch_mstr.jpg" medium="image">
			<media:title type="html">3 tier arch_mstr</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/4-tier-arch_mstr.jpg" medium="image">
			<media:title type="html">4 tier arch_mstr</media:title>
		</media:content>
	</item>
		<item>
		<title>MicroStrategy Intelligence Server</title>
		<link>https://codemphasis.wordpress.com/2010/09/12/microstrategy-intelligence-server/</link>
					<comments>https://codemphasis.wordpress.com/2010/09/12/microstrategy-intelligence-server/#respond</comments>
		
		<dc:creator><![CDATA[Arnab Guha]]></dc:creator>
		<pubDate>Sun, 12 Sep 2010 15:01:56 +0000</pubDate>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[MicroStrategy]]></category>
		<category><![CDATA[Reporting]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[MicroStrategy Intelligence Server]]></category>
		<category><![CDATA[Online analytical processing]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/2010/09/12/microstrategy-intelligence-server/</guid>

					<description><![CDATA[Before we get into the bits and pieces of MicroStrategy Architecture, we need to know a little bit of Intelligence Server. MicroStrategy Intelligence Server™ is an analytical server that is optimized for enterprise querying and reporting as well as OLAP analysis. It processes report requests from all users of the MicroStrategy Business Intelligence platform through [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>Before we get into the bits and pieces of MicroStrategy Architecture, we need to know a little bit of <strong>Intelligence Server</strong>. MicroStrategy Intelligence Server<img src="https://s0.wp.com/wp-content/mu-plugins/wpcom-smileys/twemoji/2/72x72/2122.png" alt="™" class="wp-smiley" style="height: 1em; max-height: 1em;" /> is an analytical server that is optimized for enterprise querying and reporting as well as OLAP analysis. It processes report requests from all users of the MicroStrategy Business Intelligence platform through windows, web, and wireless interfaces. These reports range from simple performance indicators such as quarterly sales by product, to sophisticated hypothesis testing using a chi-square test. The results are then returned to the users, who can further interact with the data and run more reports. Folloiwng are the benefits of the Intelligence Server:
</p>
<p><strong>Features:<br />
</strong></p>
<p><strong>Dynamic SQL Generation:</strong> MicroStrategy Intelligence Server stores information about the database tables in metadata. MicroStrategy Intelligence Server uses this metadata to generate optimized SQL for the database. Because the metadata is schema independent, these reports, queries and analyses are generated from your current physical schema without any modifications.
</p>
<p><strong>Advanced Caching:</strong> MicroStrategy Intelligence Server caches all user requests. Not only are reports cached, but the individual report pages requested by users are also cached. As a result, no redundant processing occurs on the MicroStrategy Intelligence Server or on the database.
</p>
<p><strong>Built-in Software-level Clustering and Failover:</strong> MicroStrategy Intelligence Server lets you cluster many different individual servers together without any additional software or hardware components. Built-in failover support ensures that if a server experiences a hardware failure, the remaining MicroStrategy Intelligence Servers will pick up failed jobs.
</p>
<p><strong>Integrated Aggregations, OLAP, Financial and Statistical Analysis:</strong> MicroStrategy Intelligence Server provides simple analysis such as basic performance indicators, as well as more sophisticated analyses such as market basket, churn, retention and deciling analyses. Other analyses include hypothesis testing, regressions, extrapolations and bond calculations.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2010/09/12/microstrategy-intelligence-server/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">361</post-id>
		<media:content url="https://0.gravatar.com/avatar/c9d46c6f775249f0c152e4bc4b1feb247c1ae0ef36b0432302a707e18716f681?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">arnab86</media:title>
		</media:content>
	</item>
		<item>
		<title>Business Intelligence Architecture</title>
		<link>https://codemphasis.wordpress.com/2010/09/08/business-intelligence-architecture/</link>
					<comments>https://codemphasis.wordpress.com/2010/09/08/business-intelligence-architecture/#respond</comments>
		
		<dc:creator><![CDATA[Arnab Guha]]></dc:creator>
		<pubDate>Wed, 08 Sep 2010 11:19:14 +0000</pubDate>
				<category><![CDATA[Data warehousing]]></category>
		<category><![CDATA[ETL]]></category>
		<category><![CDATA[MicroStrategy]]></category>
		<category><![CDATA[Reporting]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Online transaction processing]]></category>
		<guid isPermaLink="false">http://codemphasis.wordpress.com/?p=348</guid>

					<description><![CDATA[A business intelligence architecture using MicroStrategy is shown in the following diagram: The Architecture has the following components: Source System (OLTP): Source systems are typically databases or mainframes that store transaction processing data. As such, they are an Online Transaction Processing System (OLTP). Transaction Processing involves simple recording of transactions like sales, inventory, withdrawals, deposits [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>A <a class="zem_slink" title="Business intelligence" rel="wikipedia" href="http://en.wikipedia.org/wiki/Business_intelligence">business intelligence</a> architecture using <a class="zem_slink" title="Microstrategy" rel="homepage" href="http://www.microstrategy.com/" rel="nofollow">MicroStrategy</a> is shown in the following diagram:</p>
<p><a href="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg"><img loading="lazy" data-attachment-id="353" data-permalink="https://codemphasis.wordpress.com/2010/09/08/business-intelligence-architecture/bi-arch/" data-orig-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg" data-orig-size="474,340" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}" data-image-title="Business Intelligence Architecture" data-image-description="" data-image-caption="" data-large-file="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg?w=474" class="aligncenter size-full wp-image-353" title="Business Intelligence Architecture" src="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg?w=595" alt=""   srcset="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg 474w, https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg?w=150&amp;h=108 150w, https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg?w=300&amp;h=215 300w" sizes="(max-width: 474px) 100vw, 474px" /></a></p>
<p>The Architecture has the following components:</p>
<ul>
<li><strong>Source System (OLTP):</strong></li>
</ul>
<p style="padding-left:30px;">Source systems are typically databases or mainframes that store <a class="zem_slink" title="Transaction processing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Transaction_processing">transaction processing</a> data. As such, they are an <a class="zem_slink" title="Online transaction processing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Online_transaction_processing">Online Transaction Processing</a> System (OLTP). Transaction Processing involves simple recording of transactions like sales, inventory, withdrawals, deposits and so forth.</p>
<ul>
<li><strong><a class="zem_slink" title="Data warehouse" rel="wikipedia" href="http://en.wikipedia.org/wiki/Data_warehouse">Data Warehouse</a> (<a class="zem_slink" title="Online analytical processing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Online_analytical_processing">OLAP</a>):</strong></li>
</ul>
<p style="padding-left:30px;">A well designed and robust data warehouse lies at the heart of the business intelligence system and enables its users to leverage the competitive advantage that business intelligence provides. A data warehouse is an example of Online Analytical Processing System (OLAP).</p>
<p style="padding-left:30px;">Analytical Processing involves manipulating transactional records to calculate sales trends, growth patterns, percent to total contributions, trend reporting, profit analysis etc.</p>
<ul>
<li><strong>ETL Processes:</strong></li>
</ul>
<p style="padding-left:30px;">The <a class="zem_slink" title="Extract, transform, load" rel="wikipedia" href="http://en.wikipedia.org/wiki/Extract%2C_transform%2C_load">extraction, transformation and loading</a> (ETL) process contains information that facilitates the transfer of the data from the source systems to the data warehouse. We have discussed about this in details in my previous post.</p>
<ul>
<li><strong><a class="zem_slink" title="Metadata" rel="wikipedia" href="http://en.wikipedia.org/wiki/Metadata">Metadata</a> Database:</strong></li>
</ul>
<p style="padding-left:30px;">The metadata database contains information that facilitates the retrieval of data from the data warehouse when using MicroStrategy applications. It stores MicroStrategy object definitions and information about the data warehouse in proprietary format and maps MicroStrategy objects to the data warehouse structures and content.</p>
<ul>
<li><strong>MicroStrategy Application:</strong></li>
</ul>
<p style="padding-left:30px;">The MicroStrategy applications allow you to interact with the business intelligence system. They allow you to logically organize data hierarchically to quickly and easily create, calculate, and analyze complex data relationships. They also provide the ability to look at the data from different perspective.</p>
<p style="padding-left:30px;">A variety of grid and graph formats are available for superior report presentation. You can even build documents, which enable you to combine multiple reports with text and graphics.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://codemphasis.wordpress.com/2010/09/08/business-intelligence-architecture/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">348</post-id>
		<media:content url="https://0.gravatar.com/avatar/c9d46c6f775249f0c152e4bc4b1feb247c1ae0ef36b0432302a707e18716f681?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">arnab86</media:title>
		</media:content>

		<media:content url="https://codemphasis.wordpress.com/wp-content/uploads/2010/09/bi-arch.jpeg" medium="image">
			<media:title type="html">Business Intelligence Architecture</media:title>
		</media:content>
	</item>
	</channel>
</rss>
