알고리즘 성능 평가와 시각화(작성중)

*회귀분석 성능 평가를 위해 train, test set을 나눌 필요가 있나? classification은 꼭 나눠야 한다고 생각.

*회귀분석을 통해 미래를 예측하나?

->이 땐 test set 확보를 통해 모델이 특정 데이터셋에 overfitted 되어있지 않음을 확인(그럼에도..예를 들어 18~20년 train set 21년 test set을 통해 확인하면서 hyperparameter 최적화를 한다면 21년 set에 너무 적합한거 아님? 22년 예측에 잘 활용될 수 있을까?)

*회귀분석에 iteration 필요? 그냥 한 번 돌릴 땐 몇 번의 iteration을 돌린거지

#Step 1 - create the evaluation metrics function

eval_metrics = function(model, df, predictions, target){
    resids = df[,target] - predictions
    resids2 = resids**2
    N = length(predictions)
    r2 = as.character(round(summary(model)$r.squared, 2))
    adj_r2 = as.character(round(summary(model)$adj.r.squared, 2))
    print(adj_r2) #Adjusted R-squared
    print(as.character(round(sqrt(sum(resids2)/N), 2))) #RMSE
}

# Step 2 - predicting and evaluating the model on train data
predictions = predict(lr, newdata = train)
eval_metrics(lr, train, predictions, target = 'unemploy')

# Step 3 - predicting and evaluating the model on test data
predictions = predict(lr, newdata = test)
eval_metrics(lr, test, predictions, target = 'unemploy')

1. 회귀 분석의 성능 평가

https://inistory.tistory.com/111

[회귀모델 성능평가지표] MAE, MSE, RMSE, R2 Score

분류 모델의 성능을 평가하기 위해서 주로 Accuracy, Precision, Recall 을 사용합니다. 하지만 회귀모델의 경우 어떻게 성능을 평가할 수 있을까요? 대표적인 회귀 모델(Regression) 의 성능 평가 지표에

inistory.tistory.com

https://medium.com/@parag.jain164/performance-matrix-for-regression-model-c9ba0d2cae33

Performance Matrix for Regression Model

List of Metrics covered in this blog :

medium.com

시각화

scatter plot 그리고 regression plot 그려서 어느 형태가 실제 y를 더 잘 나타내는지 확인

2. 분류 분석의 성능 평가

시각화

저작자표시 비영리 동일조건 (새창열림)

'쫌쫌따리 통계+데이터+AI' 카테고리의 다른 글

신뢰구간(Confidence Interval) (0)	2022.06.10
베이지안 정리 (0)	2022.06.10
Regression Algorithms (1)	2022.06.08
EDA(Exploratory Data Analysis) vs 전처리 (3)	2022.06.08
AWS Machine Learning Specialty 10시간 호다닥 합격 (1)	2022.04.17

하늘 아래 내 방

알고리즘 성능 평가와 시각화(작성중)

'쫌쫌따리 통계+데이터+AI' 카테고리의 다른 글

댓글

티스토리툴바

알고리즘 성능 평가와 시각화(작성중)

'쫌쫌따리 통계+데이터+AI' 카테고리의 다른 글

관련글

댓글

티스토리툴바