
THE HADOOP BASICS:

Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation.

Hadoop was originally conceived on the basis of Google's MapReduce, in which an application is broken down into numerous small parts. Any of these parts (also called fragments or blocks) can be run on any node in the cluster. Hadoop makes it possible to run applications on systems with thousands of nodes involving thousands of terabytes. A distributed file system (DFS) facilitates rapid data transfer rates among nodes and allows the system to continue operating uninterrupted in case of a node failure.

The risk of catastrophic system failure is low, even if a significant number of nodes become inoperative.

The Hadoop framework is used by major players including Google, Yahoo and IBM, largely for applications involving search engines and advertising. The preferred operating systems are Windows and Linux but Hadoop can also work with BSD and OS X. Hadoop was originally the name of a stuffed toy elephant belonging to a child of the framework's creator, Doug Cutting.


I’m sure you’ve heard about Big Data.

The best-known technology used for Big Data is Hadoop. Hadoop is used by Yahoo, eBay, LinkedIn and Facebook. It was inspired by Google's publications on MapReduce, GoogleFS and BigTable. Because Hadoop can be hosted on commodity hardware (usually Intel PCs running Linux with one or two CPUs and a few TB of HDD, without any RAID replication technology), it allows these companies to store huge quantities of data (petabytes or even more) at very low cost compared to SAN bay systems.

Hadoop is an open source suite maintained under the Apache Software Foundation: http://hadoop.apache.org/

The Hadoop “brand” contains many different tools. Two of them are core parts of Hadoop:

  • Hadoop Distributed File System (HDFS) is a virtual file system that looks like any other file system, except that when you move a file onto HDFS, the file is split into many small pieces, and each piece is replicated and stored on several servers (usually 3, though this can be customized) for fault tolerance.
  • Hadoop MapReduce is a way to split every request into smaller requests which are sent to many small servers, allowing a truly scalable use of CPU power (describing MapReduce in full would be worth a dedicated post).
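
The splitting described in the MapReduce bullet can be illustrated with a small word count. This is only a sketch in plain Python, not Hadoop's Java API: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical, and everything runs in one process, whereas on a real cluster each fragment would be mapped on a different node.

```python
from __future__ import annotations

from collections import defaultdict
from typing import Iterable, Iterator


def map_phase(fragment: str) -> Iterator[tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input fragment."""
    for word in fragment.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[tuple[str, int]]) -> dict[str, list[int]]:
    """Shuffle step: group all emitted values by their key."""
    grouped: dict[str, list[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: list[int]) -> tuple[str, int]:
    """Reduce step: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(fragments: list[str]) -> dict[str, int]:
    """Run map, shuffle and reduce over the fragments, one after another."""
    # Assumption: sequential processing stands in for parallel map tasks.
    emitted = [pair for fragment in fragments for pair in map_phase(fragment)]
    return dict(reduce_phase(k, v) for k, v in shuffle(emitted).items())
```

Calling word_count(["big data", "big cluster"]) counts "big" twice and the other two words once. The point of the structure is that map_phase never needs to see more than one fragment, which is what lets the work be spread over many servers.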

Some other components are often installed on Hadoop solutions:

  • HBase is inspired by Google’s BigTable. HBase is a non-relational, scalable, and fault-tolerant database that is layered on top of HDFS. HBase is written in Java. Each row is identified by a key and consists of an arbitrary number of columns that can be grouped into column families.
  • ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. ZooKeeper is used by HBase, and can be used by MapReduce programs.
  • Solr / Lucene serves as the search engine. This query engine library has been developed under Apache for more than 10 years.
  • Languages. Two languages are identified as original Hadoop languages: Pig and Hive. For instance, you can use them to develop MapReduce processes at a higher level than hand-written MapReduce procedures. Other languages may be used, like C, Java or JAQL. SQL can be used too, through JDBC or ODBC connectors or directly in the languages.

Hadoop Internal Software Architecture

Although the best-known Hadoop suite is provided by a highly specialized vendor named Cloudera, big vendors are also positioning themselves on Hadoop:

  • IBM has BigInsights (the Cloudera Hadoop distribution plus IBM's own custom version of Hadoop) and has recently acquired many niche players in the analytics and big data market (like Platform Computing, which has a product enhancing the capabilities and performance of MapReduce).
  • Oracle has launched its BigData machine. Based on Cloudera Hadoop, this server is dedicated to the storage and use of non-structured content (structured content stays on Exadata).
  • Informatica has a tool called HParser that complements PowerCenter. This tool is built to launch Informatica processes in MapReduce mode, distributed across the Hadoop servers.
  • Microsoft has a Hadoop version for Microsoft Windows and for Azure, its cloud solution, with extensive native integration with SQL Server 2012.
  • Some very large database solutions like EMC Greenplum (partnering with MapR), HP Vertica, Teradata Aster Data (partnering with Hortonworks) or SAP Sybase IQ are able to connect directly to Hadoop storage.


++++++++++++++
Hadoop software market to hit $812.8 million in 2016, says IDC

Summary: IDC put the Hadoop-MapReduce ecosystem market at $77 million in 2011. That’ll change in a hurry.

The market for Hadoop and MapReduce related software will grow at a compound annual growth rate of more than 60 percent through 2016, according to IDC data.

IDC put the Hadoop-MapReduce ecosystem market at $77 million in 2011. That sum sounds small given the focus on big data and the headlines that go with it. But the financials will catch up with the attention. IDC is projecting a Hadoop-MapReduce market of $812.8 million in 2016.
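
The growth rate quoted above can be checked against the two IDC figures. The short sketch below applies the standard compound annual growth rate formula to the $77 million (2011) and $812.8 million (2016) values from the text; the function name cagr is just an illustrative choice.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate: (end / start) ** (1 / years) - 1."""
    return (end_value / start_value) ** (1 / years) - 1


# Figures taken from the IDC numbers quoted in the text (millions of dollars).
rate = cagr(77.0, 812.8, 2016 - 2011)
print(f"Implied CAGR: {rate:.1%}")
```

The result comes out just above 60 percent, which is consistent with the "more than 60 percent" growth rate attributed to IDC.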

Big data has received a lot of focus as companies aim to crunch structured and unstructured data to see around corners. The growth of big data software could also pose a threat to database incumbents such as Oracle.

IDC expects the Hadoop-MapReduce market to develop like Linux did. Linux began with a lot of attention and a small market and then grew to be commonplace in most data centers.

The one wild card for big data growth will be talent to crunch the figures as well as analyze them.
++++++++++++++


Hadoop is a set of open source technologies that supports reliable and cost-efficient ways of dealing with large amounts of data. Given the vast amounts of business-critical data that companies must gather and retain (e.g. under Sarbanes–Oxley (SOX) or the EU Data Retention Directive), Hadoop is becoming increasingly relevant.

Hadoop Technologies

Several Hadoop technologies are inspired by Google’s infrastructure.

1. Processing and Storage

1.1 Processing – MapReduce
MapReduce can be used to process and extract knowledge from arbitrary amounts of data, e.g. web data, measurement data or financial transactions. Visa reduced its processing time for transactional statistics from 1 month to 13 minutes with Hadoop. In order to use MapReduce, developers need to parallelize their problem and program against an API; machine learning is one example of a workload that can be expressed this way. Hadoop’s MapReduce is inspired by the paper "MapReduce: Simplified Data Processing on Large Clusters".

1.2 File Storage – HDFS
HDFS is a scalable, distributed file system. It supports a configurable degree of replication for reliable storage even when running on cheap hardware. HDFS is inspired by the paper "The Google File System".
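
To make the idea of block splitting and configurable replication concrete, here is a minimal sketch in plain Python. It does not use any HDFS API. The helper names (split_into_blocks, place_replicas, readable_after_failure) are hypothetical, and the round-robin placement is an assumption chosen for simplicity; real HDFS decides placement centrally and takes rack layout into account.

```python
from __future__ import annotations


def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Cut a file's contents into fixed-size blocks (the last may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str],
                   replication: int = 3) -> dict[int, list[str]]:
    """Assign every block to `replication` distinct nodes, round-robin.

    Assumption: round-robin is a stand-in for a real placement policy.
    """
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    return {
        block: [nodes[(block + r) % len(nodes)] for r in range(replication)]
        for block in range(num_blocks)
    }


def readable_after_failure(placement: dict[int, list[str]],
                           failed: set[str]) -> bool:
    """The file stays readable if every block keeps one replica on a live node."""
    return all(
        any(node not in failed for node in replicas)
        for replicas in placement.values()
    )
```

With four nodes and three replicas per block, losing any two nodes still leaves at least one copy of every block, which is the property that lets the file system keep running on cheap, failure-prone hardware.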

1.3 Database – HBase
HBase is a distributed database that runs on top of HDFS and supports storing billions of rows with millions of columns. HBase can replace traditional databases if they run into scaling problems or become too expensive licence-wise. HBase is inspired by the paper "Bigtable: A Distributed Storage System for Structured Data".
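
The row-key and column-family model described earlier for HBase can be pictured as nested maps. The sketch below is an in-memory toy in plain Python, not the HBase client API: the class SketchTable and its methods are hypothetical names, and cell versions and timestamps are left out.

```python
from __future__ import annotations

from typing import Any, Iterator


class SketchTable:
    """Toy model: row key -> column family -> column qualifier -> value."""

    def __init__(self, column_families: list[str]) -> None:
        # Column families are fixed when the table is created.
        self.column_families = set(column_families)
        self.rows: dict[str, dict[str, dict[str, Any]]] = {}

    def put(self, row_key: str, family: str, qualifier: str, value: Any) -> None:
        """Store one cell; columns within a family can be added freely."""
        if family not in self.column_families:
            raise KeyError(f"unknown column family: {family}")
        self.rows.setdefault(row_key, {}).setdefault(family, {})[qualifier] = value

    def get(self, row_key: str, family: str) -> dict[str, Any]:
        """Return all columns of one family for one row (empty if absent)."""
        return self.rows.get(row_key, {}).get(family, {})

    def scan(self, start_key: str, stop_key: str) -> Iterator[tuple[str, dict]]:
        """Yield rows in key order for start_key <= key < stop_key."""
        for key in sorted(self.rows):
            if start_key <= key < stop_key:
                yield key, self.rows[key]
```

A table created with SketchTable(["info", "stats"]) accepts any number of columns inside those two families, so different rows can carry different columns, which is how a table can grow to millions of columns without a fixed schema for each one.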

2. Data Analysis

MapReduce can be used to analyze all kinds of data (e.g. text, multimedia, numerical data) and is highly flexible, but for more structured data the following Hadoop technologies can be used:

2.1 Pig
SQL-like language/system running on top of MapReduce. Pig is developed by Yahoo and inspired by the paper "Interpreting the Data: Parallel Analysis with Sawzall".

2.2 Hive
Data warehouse running on top of Hadoop, developed by Facebook. Its query language is very similar to SQL.

3. Distributed Systems Development

3.1 Avro
Avro is used for efficient serialization of data and communication between services. It is in several ways similar to Google’s Protocol Buffers and Facebook’s Thrift.

3.2 Zookeeper
Coordination between distributed processes. It is inspired by the paper "The Chubby lock service for loosely-coupled distributed systems".

3.3 Chukwa
Monitoring of distributed systems.
