ydata-profiling (previously pandas-profiling) is an open-source package for running data quality checks and profiling on both pandas DataFrames and Spark DataFrames. It enables users to generate data profiling reports quickly and simply, with a single line of code.
Download this research paper to learn more about:
The importance of standardized data quality profiling for the success of AI development
The benefits of adopting an automated data quality profiling solution like ydata-profiling
How ydata-profiling compares to other data profiling solutions