Zillow aggregates some very interesting data, especially if you're interested aspects of home prices, demographics such as income, education, etc. -- not to mention all of this information is provided with latitude and longitude coordinates to boot. This light-weight module takes advantage of that gives you the ability
- Take the 'n' largest cities in the US (data scraped from Wikipedia)
- Go to Zillow's API, & extract regionid's (of which there are several hundred for any metropolitan city) along with some interesting Zillow Index Data
- Join that data with demographic data, such as median house prices, cost per square foot, median income, etc. all provided at a "neighborhood", "city", & "state" level.
- Put it all into Jesus' favorite data structure... pandas, to do some more interesting data analysis
$ git clone email@example.com:benjaminmgross/api-scrapin.git #assuming ssh install $ cd api-scrapin $ python setup.py install
I know what you're thinking, "why can't I pip install it?" Stop whining! ... fine, I haven't figured out how to do that yet with packages, but I'm working on it...
Up and Running in 5 Steps
Step 1: Get Yourself a Zillow API Token
- Go to Zillow's Registration Page where you will be prompted to create a login.
- After you create a login, go to the Zillow API Overview Page
- Click on the get a ZWSID
- Fill out the information, click all of the check boxes of different APIs you might wannt, and then get ready to receive your Zillow API key in your inbox!
Step 2: Install the Package
Step 3: Let 'er Rip
The crux of what makes this package special is the ability to merge what are called "region-id" and cities.
For instance, there are 267 region-id's around the New York City area, and for each one of those region-id's, there's extensive demographic information (such as income, commute times, etc), but this information is never provided "together" -- as in, here's the city, all of it's region-id's, and extensive demographic data about those region-id's / cities.
You can try to figure out out how to join all that data from disparate Zillow API's... or you can just use this package.
Step 4: Do some cool analysis
You got this one covered...
Step 5: Write me an email and tell me you love me
2 months, 2 weeks ago failed
.. image:: http://readthedocs.org/projects/api-scrapin/badge/?version=latest :target: http://api-scrapin.readthedocs.io/en/latest/?badge=latest :alt: Documentation Status
<a href='http://api-scrapin.readthedocs.io/en/latest/?badge=latest'> <img src='http://readthedocs.org/projects/api-scrapin/badge/?version=latest' alt='Documentation Status' /> </a>
Project Privacy Level