Latest

Training protein structure-based neural networks exclusively on predicted protein structures worsens performance on experimental structures due to the training data's idealized local geometry

Predicted protein structures, particularly monomeric structures, have become ubiquitous thanks to the release of the AlphaFold Database[1] and its successors[2]. Yet training structure-based neural networks exclusively on these synthetic structures has now been widely shown to worsen performance on experimental structures. Hsu et al., who trained the structure-based

Diffusion-based structure prediction can be guided by backpropagating to the conditioning embeddings rather than the atomic coordinates directly, and such embeddings can be re-refined in subsequent iterations

Diffusion-based biomolecular structure prediction, which is used in latest-generation methods like AlphaFold3[1] and BioEmu[2], can be guided or steered into specific conformations by backpropagating to the conditioning representations rather than the atomic coordinates being diffused [3][4]. This was recently shown by two methods, EmbedOpt and IT-Optimization. There