Inverse folding

Inverse folding

Training protein structure-based neural networks exclusively on predicted protein structures worsens performance on experimental structures due to the training data's idealized local geometry

Predicted protein structures, particularly monomeric structures, have become ubiquitous thanks to the release of the AlphaFold Database[1] and its successors[2]. Yet training structure-based neural networks exclusively on these synthetic structures has now been widely shown to worsen performance on experimental structures. Hsu et al., who trained the structure-based