Saturday, September 27, 2014

Hadoop Administration 9 : Pre and Post System Validation

System Validation -Pre Install:

Test Case                               Expected Output
Disk Check                             All the disks should pass the test
Memory                                All the nodes should pass memory test
RPC Test                                All the nodes should pass RPC Test
Network Test                        All the nodes should pass network test
Cluster Audit Script              All the nodes should report same configuration
CPU Test

Memory Check:
The free command allows you to display the amount of free and used memory on the system. To do so, type the following at a shell prompt:

The free command provides information about both the physical memory (Mem) and swap space (Swap). It displays the total amount of memory (total), as well as the amount of memory that is in use (used), free (free), shared (shared), in kernel buffers (buffers), and cached (cached). For example:

-sh-4.1$ free -m
                     total       used       free     shared    buffers     cached
Mem:         96709      90716       5993          0        742      85233
-/+ buffers/cache:       4740      91968
Swap:        32767        420      32347
-sh-4.1$



Post Validation:
Test Case                                Expected Output
DFSIO                                    Puts IO load on the cluster and completes successfully
TeraGen                                 Puts load on the cluster and jobs run successfully
TeraSort                                  Puts load on the cluster and jobs run successfully
Cloudera/MapR Audit
More Benchmarking


Hardware Failures:
Ensure that "No Interruption to services" when you perform below activity:

Power off Data Node –Pulling the Plug
Stop Network Interface for Data Node
Disk controller Failure for Data Node – Pull the hard disk out of the box.
Disk Controller Failure for CLDB Node – Pull the hard disk out of the box.
Disk Controller Failure for MySQL Node – Pull the hard disk out of the box.
Network Interface Failure for Hive Metastore Node
Power off Hive Metastore Node - – Pulling the Plug
Power off MySQL Node – Pulling the Plug
Power off entire RACK
Network Switch Failure – We need to engage Network team to figure out how to do this and if it is doable.

No comments:

Post a Comment