ddelange
💥
["translatio", "imitatio", "aemulatio"]
Stars

etl

Extract-Transform-Load, Data Wrangling, Data Mining, ...
249 repositories

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 7,930 767 Updated Nov 12, 2024

A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning

583 189 Updated Oct 26, 2018

🌏🌍🌎Translators🌎🌍🌏 is a library that aims to bring free, multiple, enjoyable translations to individuals and students in Python.

Python 1,710 193 Updated Sep 28, 2024

Access other storage backends via the S3 API

Java 1,776 231 Updated Nov 11, 2024

Library for exploring and validating machine learning data

Python 764 174 Updated Nov 1, 2024

Python library of 60+ commonly-used validator functions

Python 126 12 Updated Dec 8, 2022

A Python library to provide functions to handle, parse and validate standard numbers.

Python 504 211 Updated Oct 13, 2024

Python Data Validation for Humans™.

Python 980 155 Updated Oct 7, 2024

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

Python 504 43 Updated Oct 24, 2024

🚤 Label data at scale. Fun and precision included.

Python 323 19 Updated Nov 12, 2024

Mirror of the Xapian repository. You're welcome to open pull requests on GitHub (they'll just get merged indirectly).

C++ 805 281 Updated Oct 24, 2024

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 17,304 1,637 Updated Nov 12, 2024

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

Java 12,857 1,113 Updated Nov 12, 2024

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 9,952 218 Updated Nov 11, 2024

A Python Library for Graph Outlier Detection (Anomaly Detection)

Python 1,332 128 Updated Sep 20, 2024

Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.

Rust 1,286 121 Updated Nov 12, 2024

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 13,717 1,435 Updated Nov 10, 2024

Small, dependency-free, fast Python package to infer binary file types checking the magic numbers signature

Python 660 113 Updated Sep 9, 2024

Truly universal encoding detector in pure Python

Python 582 51 Updated Nov 12, 2024

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Python 250 37 Updated Sep 25, 2024

A library of Reversible Data Transforms

Python 121 26 Updated Nov 8, 2024

Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators

Python 5,405 1,052 Updated Jul 28, 2024

An S3 datastore implementation

Go 240 66 Updated May 1, 2024

ExifLooter finds geolocation data in all image URLs and directories, and also integrates with OpenStreetMap

Go 427 24 Updated Jul 14, 2024

PostgreSQL change data capture software in Rust to enable real-time WebSockets

Rust 13 1 Updated Sep 24, 2024

Apache DataFusion SQL Query Engine

Rust 6,276 1,188 Updated Nov 12, 2024

Rasterio reads and writes geospatial raster datasets

Python 2,273 534 Updated Nov 10, 2024

A powerful and user-friendly binary analysis platform!

Python 7,582 1,084 Updated Nov 12, 2024

A scalable, event-driven durable execution platform. Run stateful step functions deployed to serverless, servers, or the edge.

TypeScript 2,076 80 Updated Nov 12, 2024

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,567 1,368 Updated Nov 12, 2024