Download Print this page
Dell PowerEdge C5220 Test Report
Dell PowerEdge C5220 Test Report

Dell PowerEdge C5220 Test Report

Hide thumbs Also See for PowerEdge C5220:

Advertisement

DELL POWEREDGE C5220: HADOOP MAPREDUCE PERFORMANCE
Every year, the amount of data that businesses must process grows
enormously. The ability to sort, filter, and analyze this data is becoming more
and more vital to many businesses in analyzing their customers and their market
segment. Additionally, businesses need an infrastructure that is powerful and
flexible, but also compact and scale-friendly. The Dell PowerEdge C5220 server is
an ideal solution to pair with Apache Hadoop, a powerful multi-node data
analysis application. With the PowerEdge C5220, organizations can scale out to
their data processing requirements and successfully handle these ever-increasing
data volumes, finding new value in their big data.
To test the Hadoop performance capabilities of the Dell PowerEdge
C5220, we configured eight Dell PowerEdge C5220 servers into a Hadoop cluster
and ran the MapReduce benchmark (mrbench) on the platform. We found that
eight Dell PowerEdge C5220 servers, all contained within the single shared
infrastructure design of the Dell PowerEdge C5000 chassis, could run our
mrbench tests of varying sizes, map processes, and reduce processes, in times
averaging just 15.9 to 25.6 seconds, making this platform ideal for scale-out
data-analysis application workloads.
A PRINCIPLED TECHNOLOGIES TEST REPORT
Commissioned by Dell Inc.; April 2012

Advertisement

loading

Summary of Contents for Dell PowerEdge C5220

  • Page 1 Additionally, businesses need an infrastructure that is powerful and flexible, but also compact and scale-friendly. The Dell PowerEdge C5220 server is an ideal solution to pair with Apache Hadoop, a powerful multi-node data analysis application.
  • Page 2 They require a reliable and powerful hardware platform, along with a reliable and fast software platform on which to process this big data. The Dell PowerEdge C series servers provide this solid infrastructure for companies to offer their data processing capabilities.
  • Page 3 C5220 microserver an optimal choice to deploy for extremely dense compute fabrics handling big data deployments, large software as a service (SaaS) environments, and cloud deployments. Figure 2 presents a view of the Dell PowerEdge C5220. Figure 2: The PowerEdge C5220 microserver.
  • Page 4 Designed with power-efficiency and maintainability in, the Dell PowerEdge C5220 maximizes operating efficiency with a shared-infrastructure design. To learn more about the Dell PowerEdge C5220 and the entire Dell PowerEdge C Series, visit http://www.dell.com/us/enterprise/p/poweredge-cloud-servers. WHAT WE TESTED...
  • Page 5 Hadoop analysis, quickly and efficiently. Selecting the right server for your underlying hardware infrastructure is critical at hyperscale. In our tests, an eight-node Dell PowerEdge C5220 Hadoop cluster was able to efficiently process multiple MapReduce scenarios. While each scenario and each...
  • Page 6: Appendix A - Server Configuration Information

    Chip organization Double-sided Rank Dual Operating system Name CentOS 6.2, x86_64 File system ext4 Kernel 2.6.32-220.13.1.el6.x86_64 Language English Updates All as of 4/12/2012 Graphics Vendor and model number AST2050 A Principled Technologies test report 6 Dell PowerEdge C5220: Hadoop MapReduce Performance...
  • Page 7 Intel 82580DB Gigabit Network Connection Type Integrated Driver Intel(R) Gigabit Ethernet Network Driver; igb, 3.0.6-k USB ports Number 1 internal Type Figure 5: Configuration details for the two test servers. A Principled Technologies test report 7 Dell PowerEdge C5220: Hadoop MapReduce Performance...
  • Page 8: Appendix B - How We Tested

    For example: 192.168.1.10 had01-ctrl01 192.168.1.31 cl01n01 192.168.1.32 cl01n02 192.168.1.33 cl01n03 192.168.1.34 cl01n04 192.168.1.35 cl01n05 192.168.1.36 cl01n06 192.168.1.37 cl01n07 192.168.1.38 cl01n08 13. Start the DNS server: chkconfig dnsmasq on A Principled Technologies test report 8 Dell PowerEdge C5220: Hadoop MapReduce Performance...
  • Page 9 5. On the Cloudera Manager (Free Edition) License screen, review the license, and select Next. 6. On the next screen, select yes to accept this license. 7. On the Oracle Binary Code License Agreement screen, review the license, and select Next. A Principled Technologies test report 9 Dell PowerEdge C5220: Hadoop MapReduce Performance...
  • Page 10 $(seq 31 38); do ssh 192.168.1.$i mkdir -p '/dfs/{d1,d2,n1,n2,s1,s2}' '/mapred/{local,jt}' ssh 192.168.1.$i mount '/dfs/d{1,2}' done for i in $(seq 31 38); do ssh 192.168.1.$i chmod 700 '/dfs/{d1,d2,n1,n2,s1,s2}' \; chmod 755 '/mapred/{local,jt}' A Principled Technologies test report 10 Dell PowerEdge C5220: Hadoop MapReduce Performance...
  • Page 11 16. Enter the root password in the Root Password and Confirm fields, and click Next. 17. At the Partition selection screen, select Replace Existing Linux System(s), and click Next. 18. If a warning appears, click Write changes to disk. A Principled Technologies test report 11 Dell PowerEdge C5220: Hadoop MapReduce Performance...
  • Page 12 23. Disable these unused services by running the following command-line script: CHK_OFFs="auditd autofs cups ip6tables iptables nfslock netfs portreserve postfix\ qpidd rhnsd rhsmcertd rpcgssd rpcidmapd rpcbind" for i in ${CHK_OFFs}; do chkconfig $i off service $i stop done A Principled Technologies test report 12 Dell PowerEdge C5220: Hadoop MapReduce Performance...
  • Page 13: About Principled Technologies

    CONNECTION WITH ITS TESTING, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. IN NO EVENT SHALL PRINCIPLED TECHNOLOGIES, INC.’S LIABILITY, INCLUDING FOR DIRECT DAMAGES, EXCEED THE AMOUNTS PAID IN CONNECTION WITH PRINCIPLED TECHNOLOGIES, INC.’S TESTING. CUSTOMER’S SOLE AND EXCLUSIVE REMEDIES ARE AS SET FORTH HEREIN. A Principled Technologies test report 13 Dell PowerEdge C5220: Hadoop MapReduce Performance...