Dirty Data: How to Fact Check Your Data and Clean It
Here are exercises for this chapter and a database for the chapter. This database of small business loans can be used for cleaning data (correcting city names) in a spreadsheet or database manager.
Suggested exercises include:
1. Download the SBA database and import into a spreadsheet and clean (standardize) the city name of Arlington Heights.
2. Import the same database into DB Browser for SQLite and clean (standardize) the city name of Arlington Heights.
3. Using the Giver table from earlier chapters, standardize the names of occupations in either a spreadsheet or database manager or both.
© Brant Houston