Hi,
All I have been reading
http://wiki.lustre.org/images/1/1b/Hadoop_wp_v0.4.2.pdfFile link where
it talks about how lustre can be more efficient for BigMapOutput kind
of application.
I just thought to try my hands comparing HDFS Vs Lustre.
To Test it, I have 1 MDS, 2 OSS/OST and 2 Lustre Client running. My
Plan is to install Hadoop on 2 Lustre Clients.
I am going to run wordcount example with this setup
Versus
I have 1 Namenode and 4 DataNode and general Hadoop + HDFS setup.
I wonder how am I going to compare HDFS and Lustre through what number
of nodes to actually equilibrium the setup?
If I take 1 MDS, 2 OSS/OST and 2 Lustre Client vs 1 Namenode and 4
DataNode. Will this setup be balanced comparison.
Please suggest.