From Matplotlib
Matplotlib contains a number of tools that can graphically model HDFS data after being fed a data frame From Pandas.
Using PyPlot
Before any Matplotlib tool, such as pyplot, can be used, it must be imported:from matplotlib import pyplot as plt
Once a Pandas data frame is obtained, it can be used to create a plot visualizing HDFS data. For example, the following plot generates and displays a bar graph relating FileId and Length values:
df.plot(kind="bar", x="FileId", y=["Length"]) plt.show()