Running analysis on the right data!

All in the day:

Anqi Fu, our wickedly smart Math & Data Science hacker-intern from Stanford this summer, was characterizing GLMNet in R on sparse data and comparing with other tools. We were using a data sets predicting Two Bedroom median rent based on neighborhoods from huduser.org.

DATA: http://www.huduser.org/portal/datasets/fmr/CensusRentData/index.html

She found the analysis brisk and surprisingly fast.. Until we got around to checking the data matrix and the factor
call. Most of the data was missing! So she exclaimed:

bart-simpson-generator-GLM

[Credits to Addletters.org & Matt Groenig for the Simpsons]

Results of her work “Characterizing GLMNet on Sparse Matrices”, will have to wait for a future post!

Published by

wpengine

This is the "wpengine" admin user that our staff uses to gain access to your admin area to provide support and troubleshooting. It can only be accessed by a button in our secure log that auto generates a password and dumps that password after the staff member has logged in. We have taken extreme measures to ensure that our own user is not going to be misused to harm any of our clients sites.