I first encountered a mention of R some months ago, in a Zed Shaw rant about programmers and statistics. Sounded interesting but of no immediate use to me. Until last week.
I had some CPU load data to analyze. Plotting them as a time-series with Excel was easy enough but what my data would look like as an histogram ? How could I display the distribution of my samples over their range ? Too much for my Excel level !
Then I remembered about R. I asked Google, installed the package, skimmed the manual and within less than half-an-hour was able to display and analyze my data.
Let’s load the data from my CSV file :
>data <- read.csv(“cpu.csv”,header=T)
Let’s plot the distribution of the LOAD column :
To compute the statiscal properties of a series :
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.2466 0.2804 0.2992 0.3015 0.3201 0.3710
Cherry on the pie, produced graphs can be as easily saved as PDF or JPG document.
R can be downloaded on The R-Project site.