Text this: Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering