'머신러닝/데이터 분석' 카테고리의 글 목록

머신러닝/데이터 분석

[python pandas] cut, qcut 차이점 정리해드립니다! 2021.05.26

[python pandas] cut, qcut 차이점 정리해드립니다!

2021. 5. 26. 16:50

728x90

pandas의 cut과 qcut은 간단히 설명하면 절대평가와 상대평가의 의미와 비슷합니다. 절대평가는 특정 점수 이상일 경우에 성적을 부여하는 것이고 상대평가는 인원수에 따라 상위 몇 %인지에 따라 성적이 부여됩니다.

cut의 경우 특정 점수가 부여되고 그 점수를 경계로 성적이 부여되고, qcut의 경우 상위 퍼센트가 부여되고 그 퍼센트를 경계로 성적이 부여됩니다.

예시를 통해 알아보겠습니다.

CUT

import pandas as pd
import numpy as np

df=pd.read_csv('StudentsPerformance.csv')
df.head()

df.columns

df = df.drop(columns=['race/ethnicity', 'parental level of education', 'lunch', 'test preparation course'])
df.head()

pd.cut(df['math score'],bins=4)

pd.cut(df['math score'],bins=4).value_counts()

grades = ['C','B-', 'B', 'A-', 'A']
cut_bins = [0, 40, 55, 65, 85, 100]

df['grades'] = pd.cut(df['math score'], bins = cut_bins, labels = grades)
df.head()

qcut

pd.qcut(df['math score'],q=4)

pd.qcut(df['math score'], q=4).value_counts

df['math score'].describe()

[참조]

https://www.youtube.com/watch?v=cgpiFkpBLyY

728x90

저작자표시

PREV 1 NEXT

디너코딩

머신러닝/데이터 분석

[python pandas] cut, qcut 차이점 정리해드립니다!

+ Recent posts

티스토리툴바