How to use pandas get_dummies() and factorize()

2025-04-03 Update From: SLTechnology News&Howtos

This article explains how to use pandas get_dummies() and factorize(), two functions for encoding categorical variables. It covers each function's signature, parameters, and return values, with a short example.

1. get_dummies()

pandas.get_dummies(data, prefix=None, prefix_sep='_', dummy_na=False, columns=None, sparse=False, drop_first=False): Convert categorical variable into dummy/indicator variables

>>> import pandas as pd
>>> s = pd.Series(list('abca'))
>>> pd.get_dummies(s)
   a  b  c
0  1  0  0
1  0  1  0
2  0  0  1
3  1  0  0

2. pd.factorize()

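pandas.factorize(values, sort=False, order=None, na_sentinel=-1, size_hint=None): Encode input values as an enumerated type or categorical variable

Series.factorize(sort=False, na_sentinel=-1): Encode the object as an enumerated type or categorical variable

These signatures come from older pandas releases. In pandas 2.0 and later, order and na_sentinel are no longer accepted; the boolean use_na_sentinel (default True) takes the place of na_sentinel.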

factorize() represents a categorical variable with numbers: it assigns an integer ID to each distinct category. Unlike get_dummies(), which produces one indicator column per category, factorize() produces a single feature.
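To make the difference concrete, here is a minimal sketch that encodes the same Series both ways. It reuses the 'abca' Series from the get_dummies() example; the variable names are illustrative only.

```python
import pandas as pd

s = pd.Series(list("abca"))

# factorize: one integer code per row, plus the lookup of unique values
codes, uniques = pd.factorize(s)

# get_dummies: one indicator column per distinct category
dummies = pd.get_dummies(s)

print(codes.tolist())   # [0, 1, 2, 0]
print(list(uniques))    # ['a', 'b', 'c']
print(dummies.shape)    # (4, 3): four rows, three indicator columns
```

The integer codes imply an ordering (a < b < c) that may not exist in the data, which is one common reason to prefer indicator columns for models that treat inputs as numeric magnitudes.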

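Parameters:

sort: bool, default False

Whether to sort the unique values. The codes are remapped so that each one still points to the right value.

na_sentinel: int, default -1

Value used in the codes to mark missing ("not found") values.

Returns:

labels: an integer array of codes, one per input value. Each code is the position of that value in uniques.

uniques: the distinct values that were encoded, in the order the codes refer to.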
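The sketch below illustrates sort and the handling of missing values. The sample data is made up, and it relies only on the default behavior (missing values coded as -1) so that it does not depend on the na_sentinel argument, which newer pandas releases no longer accept.

```python
import numpy as np
import pandas as pd

values = pd.Series(["b", "a", np.nan, "b", "c"])

# Default: codes follow the order of first appearance; NaN is coded as -1
codes, uniques = pd.factorize(values)
print(codes.tolist())          # [0, 1, -1, 0, 2]
print(list(uniques))           # ['b', 'a', 'c']

# sort=True: uniques are sorted and the codes are remapped to match
sorted_codes, sorted_uniques = pd.factorize(values, sort=True)
print(sorted_codes.tolist())   # [1, 0, -1, 1, 2]
print(list(sorted_uniques))    # ['a', 'b', 'c']

# The codes index into uniques, so non-missing values can be recovered
valid = codes >= 0
recovered = list(uniques.take(codes[valid]))
print(recovered)               # ['b', 'a', 'b', 'c']
```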

Supplement: the use and meaning of pandas.get_dummies

get_dummies is how pandas implements one-hot encoding.

The get_dummies parameters are as follows:

pandas.get_dummies(data, prefix=None, prefix_sep='_', dummy_na=False, columns=None, sparse=False, drop_first=False, dtype=None)

data: array-like, Series, or DataFrame

prefix: string, list of strings, or dict of strings, default None

String to prepend to the generated column names. When calling get_dummies on a DataFrame, pass a list with a length equal to the number of columns being encoded. Alternatively, prefix can be a dictionary that maps column names to prefixes.

prefix_sep: string, default '_'

The separator to place between the prefix and the category name. As with prefix, a list or dictionary can be passed instead.

dummy_na: bool, default False

If True, add a column to indicate NaN. If False, NaN values are ignored.

columns: list-like, default None

The names of the columns in the DataFrame to encode. If columns is None, all columns with object or category dtype are converted.

sparse: bool, default False

Whether the dummy-encoded columns should be backed by a SparseArray (True) or a regular NumPy array (False).

drop_first: bool, default False

Whether to get k-1 dummies out of k categorical levels by removing the first level.

New in version 0.18.0.

dtype: dtype, default np.uint8 (bool in pandas 2.0 and later)

The data type of the new columns. Only a single dtype is allowed.

New in version 0.23.0.
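The parameters above can be tried out on a small DataFrame. This is a sketch only: the DataFrame, its column names, and the prefix "c" are invented for illustration, and dtype=int is passed so the output is 0/1 whatever the installed version's default dtype is.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data, not from the article
df = pd.DataFrame({
    "color": ["red", "green", np.nan, "red"],
    "size": ["S", "M", "L", "S"],
    "price": [1.0, 2.5, 3.0, 1.5],
})

# columns / prefix / prefix_sep: encode only "color" with a custom prefix
encoded = pd.get_dummies(df, columns=["color"], prefix="c",
                         prefix_sep="-", dtype=int)
print(list(encoded.columns))   # ['size', 'price', 'c-green', 'c-red']

# dummy_na=True: an extra last column flags the missing value in row 2
with_na = pd.get_dummies(df["color"], dummy_na=True, dtype=int)
print(with_na.iloc[2].tolist())   # [0, 0, 1]

# drop_first=True: k-1 columns; the first level ("L") is dropped
reduced = pd.get_dummies(df["size"], drop_first=True, dtype=int)
print(list(reduced.columns))   # ['M', 'S']
```

Columns that are not encoded ("size" and "price" in the first call) are passed through unchanged and appear before the generated indicator columns.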
