Modify standardization & use of crash data to permit crashes by month rather than week #162

Closed
terryf82 opened this issue Aug 13, 2018 · 2 comments

@terryf82
Collaborator

No description provided.

@terryf82
Collaborator Author

Certain cities (Philadelphia and Brisbane at least) provide crash data with no day of month property, for privacy reasons.

For these cities to work we need to look at upgrading at least two processes:

  1. the standardize_crashes script needs to understand when the raw crash data has no day of month property and record the available date accordingly
  2. the modeling process needs to know the city uses per-month crash data (by reading the config?) and generate predictions of the same nature.
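A minimal sketch of what point 1 could look like, assuming the raw data exposes separate year, month and (optionally) day values. The helper name `record_crash_date` and the `precision` label are hypothetical and are not taken from the existing standardize_crashes code; the idea is only that the stored date carries a marker the modeling step could later read.

```python
from datetime import date
from typing import Optional, Tuple


def record_crash_date(year: int, month: int,
                      day: Optional[int] = None) -> Tuple[str, str]:
    """Return (date_string, precision) for one raw crash record.

    Hypothetical helper for illustration only; it is not part of
    standardize_crashes.
    """
    if day is None:
        # Build the first of the month purely to validate year/month,
        # then keep only the year-month part.
        first = date(year, month, 1)
        return first.strftime("%Y-%m"), "month"
    # A full date is available, so record it at day precision.
    return date(year, month, day).isoformat(), "day"


print(record_crash_date(2018, 8))      # month-precision record
print(record_crash_date(2018, 8, 13))  # day-precision record
```

With a marker like this, the modeling process in point 2 could check the precision of a city's crashes (or an equivalent config setting) before deciding whether per-week predictions make sense.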

terryf82 self-assigned this Aug 30, 2018
terryf82 mentioned this issue Aug 30, 2018
@terryf82
Collaborator Author

PR 181 implements something of a workaround for this issue, so that we can continue onboarding cities that choose not to supply the day of month for crashes (Pittsburgh, Brisbane and Philadelphia so far).

The initialize_city script now provides additional options for specifying the date format: either a single date_complete field, or a combination of date_year, date_month and date_day. Of these, date_day is optional, because some cities like those mentioned above withhold the day of month to anonymize the crash data. If date_day is left blank, standardize_crashes will pick a random day of the month and assign it.
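The random-day fallback can be sketched roughly as follows. This is an illustration under assumptions, not the code from PR 181: the function name `fill_crash_date` and its `rng` parameter are hypothetical, and the actual standardize_crashes logic may differ.

```python
import calendar
import random
from datetime import date


def fill_crash_date(year, month, day=None, rng=None):
    """Return a full date, choosing a random day when none is supplied.

    Hypothetical sketch; not the implementation in standardize_crashes.
    """
    rng = rng or random
    if day is None:
        # monthrange returns (weekday_of_first_day, days_in_month),
        # so leap years and short months are handled for us.
        days_in_month = calendar.monthrange(year, month)[1]
        day = rng.randint(1, days_in_month)
    return date(year, month, day)


# A seeded generator keeps the assigned day reproducible between runs.
print(fill_crash_date(2018, 2, rng=random.Random(0)))
print(fill_crash_date(2018, 8, 13))
```

Because the assigned day is synthetic, any aggregation finer than a month (by week, for example) is only meaningful to month precision for these cities.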

I don't know that there's any point in taking this further until we decide on the value of temporal predictions (by week, month or any other period) versus simply making predictions at the point of execution.
