This study aims to predict return-to-work outcomes for workers injured in industrial accidents using a TabNet-RUSBoost hybrid model. The study analyzed data from 1,383 workers who had completed recuperation. Key predictors identified include length of recuperation, disability grade, occupation activity, self-efficacy, and socioeconomic status. The model effectively addresses class imbalance and demonstrates superior predictive performance. These findings underscore the importance of a holistic approach, incorporating both medical and psychosocial factors.
Abstract Handling imbalanced datasets in binary classification, especially in employment big data, is challenging. Traditional methods like oversampling and undersampling have limitations. This paper integrates TabNet and Generative Adversarial Networks (GANs) to address class imbalance. The generator creates synthetic samples for the minority class, and the discriminator, using TabNet, ensures authenticity. Evaluations on benchmark datasets show significant improvements in accuracy, precision, recall, and F1-score for the minority class, outperforming traditional methods. This integration offers a robust solution for imbalanced datasets in employment big data, leading to fairer and more effective predictive models.