From my perspective, the most important event at useR! 2014 was that I got to meet the 0xdata team and now, long story short, here I am introducing the latest version of H2O, labeled Lagrange (126.96.36.199), to the R and greater data science communities. Before joining 0xdata, I was working at a competitor on a rival project and was repeatedly asked why my generalized linear model analytic didn’t run as fast as H2O’s GLM. The answer then, as now, is the same: H2O has a cutting-edge distributed in-memory parallel computing architecture. The difference is that I no longer receive an electric shock every time I say so.
For those hearing about H2O for the first time, it is an open-source distributed in-memory data analysis tool designed for extremely large data sets, and the H2O Lagrange (188.8.131.52) release provides scalable solutions for the following analysis techniques:
- Generalized Linear Model
- Random Forest
- Principal Components Analysis
- Gradient Boosted Regression and Classification
- Naive Bayes
- Deep Learning
In my first blog post at 0xdata, I wanted to keep it simple and make sure R users know how to get the h2o package, which is cross-referenced on the High-Performance and Parallel Computing and Machine and Statistical Learning CRAN Task Views, up and running on their computers. To do so, open an R console of your choice and type:
    # Download, install, and initialize the H2O package
    install.packages("h2o",
                     repos = c("http://h2o-release.s3.amazonaws.com/h2o/rel-lagrange/11/R",
                               getOption("repos")))
    library(h2o)
    localH2O <- h2o.init()

    # List and run some demos to see H2O at work
    demo(package = "h2o")
    demo(h2o.glm)
    demo(h2o.deeplearning)
After you are done experimenting with the demos in R, you can open a web browser to http://localhost:54321/ to give the H2O web interface a once-over and then hop over to 0xdata’s YouTube channel for some in-depth talks.
Over the coming weeks, we at 0xdata will continue to blog about how to use H2O through R and other interfaces. If there is a particular use case you would like to see addressed, join our h2ostream Google Groups conversation or e-mail us at email@example.com. Until then, happy analyzing.