1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
Updated
Jan 20, 2025 - Python
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Always know what to expect from your data.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Automatically visualize your pandas dataframe via a single print! 📊 💡
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Visualize and compare datasets, target values and associations, with one line of code.
Beautiful visualizations of how language differs among document types.
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Interactively explore unstructured datasets from your dataframe.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Automatically find issues in image datasets and practice data-centric computer vision.
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Developer-first embedded analytics
Build 12 Data Apps in Python with Streamlit
Ways of doing Data Science Engineering and Machine Learning in R and Python
Kernel Density Estimation in Python
Complete-Life-Cycle-of-a-Data-Science-Project
Code review for data in dbt
Preliminary Exploratory Visualisation of Data
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Add a description, image, and links to the exploratory-data-analysis topic page so that developers can more easily learn about it.
To associate your repository with the exploratory-data-analysis topic, visit your repo's landing page and select "manage topics."