Week 2 Status Report
- srujansaurabh18
- Feb 17, 2018
The following describes what each of our teammates did during the week of 12th February.
Bhargav Arisetty
What's Cooking?
This week, I found another interesting dataset containing air quality index data. Extending my work on the data discovery fragment, I worked out ways to integrate the datasets found during the last 2 weeks with those available from the city of Chicago, census, and datausa, and wrote up the details about them. I also worked with the team to consolidate these chunks of information into a report.
Upcoming
Next week, I'll be working on scraping restaurant review/rating data from Yelp.
Hiccups?
No impediments.
Srujan Belde
What's Cooking?
As a part of data discovery, I found a few interesting datasets, such as a homicide table, which I think would be a good complement to the datasets on the cityofChicago and census websites from a data science perspective. I found a few interesting results, which I have mentioned in our report. I also got another dataset with various information about Chicago, broken down by the zip codes of areas. Health atlas was a very good set of information, which I found extremely useful as it has data spanning many years in many fields, such as medicine. As a result, I put together a report with our team.
Upcoming
In the data discovery phase I found that it's really hard to find data, so I want to continue my hunt for more data, as I want to know to what extent we can get data from the web. I also want to start the data extraction part. In parallel, I would like to learn the most efficient data scraping methods so as to get the best data out.
Hiccups?
The hardest part was finding the data itself, as there were very limited resources: 2 of every 3 datasets we find are sourced from cityofchicago or census. Integration was challenging, as it was hard to see how 2 datasets combined would result in a better dataset.
Pranay Rasulury
What's Cooking?
For the data discovery phase, I found 3 datasets (Airbnb, Glassdoor, and Regional Transportation Authority Mapping & Statistics), which were 3 of those finalized for the report. I also contributed to writing about the datasets and how they can be integrated with data from the cityofchicago.org portal.
Upcoming
I will be working with the team to scrape the data from Yelp required for the Data Extraction phase.
Hiccups?
No impediments faced.
Kavyath Basani
What's Cooking?
This week, I worked on the data discovery phase. I discovered datasets from the Chicago Public School system and Wunderland, which are incorporated into the final amalgamation (that is, the report) of the datasets from the data discovery phase. I also worked on drafting, editing, and proofreading the final report.
Upcoming
Next week, I will be working on scraping the data for data extraction.
Hiccups?
I had a little difficulty finding reliable datasets that could be used efficiently.