Data for Journalists Chapter Ten

Chapter Ten

Dirty Data: How to Fact Check Your Data and Clean It

Here are exercises for this chapter and a database for the chapter. This database of small business loans can be used for cleaning data (correcting city names) in a spreadsheet or database manager.

  • Exercises

Suggested exercises include:

1. Download the SBA database and import into a spreadsheet and clean (standardize) the city name of Arlington Heights.

2. Import the same database into DB Browser for SQLite and clean (standardize) the city name of Arlington Heights.

3. Using the Giver table from earlier chapters, standardize the names of occupations in either a spreadsheet or database manager or both.

  • Data

Spreadsheet for Chapter 10 data: SBA LOAN database

 

© Brant Houston