Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

How to use easyeda, an exploratory data analysis tool

2025-04-02 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >

Share

Shulou(Shulou.com)06/01 Report--

This article will explain in detail how to use the exploratory data analysis tool easyeda. The content of the article is of high quality, so the editor will share it with you for reference. I hope you will have a certain understanding of the relevant knowledge after reading this article.

Exploratory data analysis (Exploratory Data Analysis) is a common task in the daily work of algorithm engineers. By analyzing the missing data, distribution, and correlation with tags, data EDA can help algorithm engineers evaluate the quality of data, understand the characteristics of data, provide direction for feature engineering, and form preliminary expectations of the upper limit of effect that can be achieved by subsequent models.

I encapsulated the set of scripts I often use for data EDA into a library easyeda and opened it up in PyPI and github. This is the first open source Python package I have released personally.

First, brief introduction to easyeda

Easyeda is a simple but practical exploratory data analysis tool.

Easyeda can carry out exploratory data analysis on common binary classification problems, multi-classification problems, and regression problems.

Easyeda supports exploratory data analysis of all common numeric, string, and Bool data attributes.

Easyeda supports common missing value analysis, data distribution analysis, data and label correlation analysis, training set and test set data co-distribution analysis.

Second, use examples

First, you can install easyeda using pip.

Pip install easyeda

It can then be called with the following example code.

From easyeda import eda

Import pandas as pd

From sklearn import datasets

From sklearn.model_selection import train_test_split

Boston = datasets.load_boston ()

Df = pd.DataFrame (boston.data,columns = boston.feature_names)

Df ["label"] = boston.target

Dftrain,dftest = train_test_split (df,test_size = 0.3)

Dfeda = eda (dftrain,dftest,language= "Chinese")

There are only two lines of core code:

From easyeda import eda

Dfeda = eda (dftrain,dftest,language= "Chinese")

On the exploratory data analysis tool easyeda how to share here, I hope that the above content can be of some help to you, can learn more knowledge. If you think the article is good, you can share it for more people to see.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 291

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Internet Technology

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report