YData was recognized as the best synthetic data vendor! Read the complete benchmark.
Advanced EDA Made Simple Using Pandas Profiling

Advanced EDA Made Simple Using Pandas Profiling

Digging beyond the standard data profiling Pandas Profiling was always my goto-secret tool to understand the data and uncover meaningful insights, in a few minutes, under a few lines of code. Whenever I was given a new dataset, I would...

Gonçalo Martins Ribeiro and Fabiana Clemente founders of YData.

Startup portuguesa YData e co-fundadora ganham prémios internacionais

Gonçalo Martins Ribeiro, sócio fundador e CEO da YData, e Fabiana Clemente, sócia fundadora e Chief Data Officer da YData. A startup portuguesa YData, da área de Inteligência Artificial, foi eleita “Best Newcomer” nos South Europe Startup...

GANs for Synthetic Data Generation

GANs for Synthetic Data Generation

A practical guide to generating synthetic data using open-sourced GAN implementations The advancements in technology have paved the way for generating millions of gigabytes of real-world data in a single minute, which would be great for...

Data-Centric paradigm of AI development

Why adopting the Data-Centric paradigm of AI development?

Data-centric AI and the reshape of the tooling space The end-to-end development of Data Science solutions can be broadly described as the process of analysis, planning, development and operationalization of a business problem that can be...

Data has a better idea

How to handle a real dataset

A guide to go a step beyond with your data Lately, there has been a lot of discussion about data quality and its impacts on model performance. Mainly due to this presentation which highlighted this topic — model-centric vs data-centric,...

Why do we need a Data-Centric AI Community

Why do we need a Data-Centric AI Community?

A place to discuss data quality for data science According to Alation’s State of Data Culture Report, 87% of employees attribute poor data quality to why most organizations fail to adopt AI meaningfully. Based on a 2020 study by McKinsey,...

AI Infrastructure Alliance

The AI Infrastructure Alliance Launches With 25 Members

Today, the AI Infrastructure Alliance (AIIA), a non-profit organization with 25 global members officially launched with the mission to create a robust collaboration environment for companies and communities in the artificial intelligence...

validate your synthetic data quality

How to validate your synthetic data quality

A tutorial on how you can combine ydata-synthetic with Great Expectations With the rapid evolution of machine learning algorithms and coding frameworks, the lack of high-quality data is the real bottleneck in the AI industry. Transform...

AI insurance

Will Insurance be impacted by AI?

The answer is pretty obvious, right? Let’s take a deeper look at the P&C business. Like any other business nowadays, artificial intelligence also became a vital aspect of modern Insurance. Insurance companies seat on a gold mine of data,...

YData secures 2.33 million in funding

AI startup YData secures €2.33 million to fast-track expansion

YData, the Lisbon-based startup that created the first data preparation platform to accelerate the development of AI solutions, has successfully closed a Seed funding round worth €2.33 million to fast-track its expansion across Europe and...

AI industry with real-world data

A Data Scientist’s Guide to Identify and Resolve Data Quality Issues

Doing this early for your next project will save you weeks of effort and stress If you've worked in the AI industry with real-world data, you’d understand the pain. No matter how streamlined the data collection process is, the data we’re...

Measure Data Quality

How Can I Measure Data Quality?

Introducing YData Quality: An open-source package for comprehensive Data Quality. Flag all your data quality issues by priority in a few lines of code “Everyone wants to do the model work, not the data work” — Google Research According to...

Subscribe our newsletter for latest updates