Skip to content

Latest commit

 

History

History
26 lines (13 loc) · 803 Bytes

File metadata and controls

26 lines (13 loc) · 803 Bytes

Generic CSV Data Paraser using Python Generators:

  • FileReader(self, filename, column_to_track,*, date_column=None)

  • Second row data will automatically determine the data types for .csv file: float, string, integer, date

  • Date restricted only to one column via date_column

  • Return the frequency distribution of data per Column Header via column_to_track

    • Make sure you replace white space with "_" when passing in header names to column_to_track

Processing "nyc_parking_tickets_extract.csv":

Here are the highest frequency of citations sorted by...

  • Vehicle Make: ('TOYOT', 112)

  • Vehicle Body Type: ('SUBN', 352)

  • Violation Description: ('PHTO SCHOOL ZN SPEED VIOLATION', 140)

  • Registration State: ('NY', 779)

  • Issue Date: (datetime.date(2016, 11, 14), 10)