Prediction and the influencing factor study of colorectal cancer hospitalization costs in China based on machine learning-random forest and support vector regression: a retrospective study

AimsAs people's standard of living improves, the incidence of colorectal cancer is increasing, and colorectal cancer hospitalization costs are relatively high. Therefore, predicting the cost of hospitalization for colorectal cancer patients can provide guidance for controlling healthcare costs...

Full description

Saved in:
Bibliographic Details
Main Authors: Jun Gao (Author), Yan Liu (Author)
Format: Book
Published: Frontiers Media S.A., 2024-02-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_ddafef3e5b2844269f76e8b658b9a1ab
042 |a dc 
100 1 0 |a Jun Gao  |e author 
700 1 0 |a Jun Gao  |e author 
700 1 0 |a Yan Liu  |e author 
245 0 0 |a Prediction and the influencing factor study of colorectal cancer hospitalization costs in China based on machine learning-random forest and support vector regression: a retrospective study 
260 |b Frontiers Media S.A.,   |c 2024-02-01T00:00:00Z. 
500 |a 2296-2565 
500 |a 10.3389/fpubh.2024.1211220 
520 |a AimsAs people's standard of living improves, the incidence of colorectal cancer is increasing, and colorectal cancer hospitalization costs are relatively high. Therefore, predicting the cost of hospitalization for colorectal cancer patients can provide guidance for controlling healthcare costs and for the development of related policies.MethodsThis study used the first page of medical record data on colorectal cancer inpatient cases of a tertiary first-class hospital in Shenzhen from 2018 to 2022. The impacting factors of hospitalization costs for colorectal cancer were analyzed. Random forest and support vector regression models were used to establish predictive models of the cost of hospitalization for colorectal cancer patients and to compare and evaluate.ResultsIn colorectal cancer inpatients, major procedures, length of stay, level of procedure, Charlson comorbidity index, age, and medical payment method were the important influencing factors. In terms of the test set, the R2 of the Random forest model was 0.833, the R2 of the Support vector regression model was 0.824; the root mean square error (RMSE) of the Random forest model was 0.029, and the RMSE of the Support vector regression model was 0.032. In the Random Forest model, the weight of the major procedure was the highest (0.286).ConclusionMajor procedures and length of stay have the greatest impacts on hospital costs for colorectal cancer patients. The random forest model is a better method to predict the hospitalization costs for colorectal cancer patients than the support vector regression. 
546 |a EN 
690 |a colorectal cancer 
690 |a hospitalization costs 
690 |a influencing factors 
690 |a random forest 
690 |a support vector regression 
690 |a Public aspects of medicine 
690 |a RA1-1270 
655 7 |a article  |2 local 
786 0 |n Frontiers in Public Health, Vol 12 (2024) 
787 0 |n https://www.frontiersin.org/articles/10.3389/fpubh.2024.1211220/full 
787 0 |n https://doaj.org/toc/2296-2565 
856 4 1 |u https://doaj.org/article/ddafef3e5b2844269f76e8b658b9a1ab  |z Connect to this object online.