This repository presents a set of Jupyter Notebooks where I practice SQL on the Olist E-commerce database. All the queries are merely to retrieve data and calculate some important KPIs, not to analyze or to address any problem for the business.
There are 7 practice notebooks, a data folder, and a photo folder. The Olist database is stored in my local desktop and connected to Jupyter Notebook.
The database has 11 datasets which contain different information of customers, sellers, orders procedure, etc. This repository, however, does not use all of these data; each practice notebook has its own topic and related datasets.
Olist is neither an E-commerce company nor a marketplace itself. Olist provides a platform for merchants to sell their products online within a marketplace. So, in this database there are three stakeholders: Olist, merchants or sellers, and the final customers or consumers. Here, merchants are Olist direct customers from where it makes profits.
Some confusing points in the data:
- One order can have different reviews at different time even though the order has yet to be completed (delivered to the customer).
- Customers state is actually their shipping address state (and one customer can have different ones), which is not explicit in the dataset.
- Discrepancies between order value and payment amount.