Skip to content

dhirajmahato/Cheatsheet

Repository files navigation

Cheatsheet

Libraries/tools

SOP

  • Data Preparation: preparing data for ingestion into a data processing stream.
    • df['Identifier'].is_unique
    • df.set_index('Identifier', inplace=True)
    • df.get_dtype_counts()
    • regular expression to extract our cleaned values
  • Normalizing data sets, which generally means scaling the data to values.
    • Convert to numerics

Data FLow

Releases

No releases published

Packages

No packages published