Would it be possible to include automatic generation of a randomly permuted dataset (generated by randomly permuting the class identities) to have these models run in parallel to further validate predictive performance as is done in the following paper (https://academic.oup.com/braincomms/article/3/2/fcab084/6237484?login=true)? Another paper (https://pubmed.ncbi.nlm.nih.gov/25596422/) demonstrates that when sample sizes are small (which is common in biological contexts), prediction accuracy by chance alone can approach 70% or higher.