DisProtEdit is a controllable protein editing framework that disentangles structural and functional representations via dual-channel natural language supervision. Each protein is annotated with structure and function texts, derived automatically using GPT-4o, forming the SwissProtDis dataset (540k entries). Our model uses alignment and uniformity objectives for modality fusion and introduces a novel angular MMD loss for disentanglement. Editing is performed by modifying text prompts and interpolating in latent space, supporting modular control. We evaluate on a new multi-attribute editing benchmark and TAPE tasks, showing strong accuracy (up to 61.7% both-hit edit success) and competitive downstream performance, with improved interpretability and controllability.
This method still far from perfect, as protein editing is a very challenging task. But we uncovered several key findings. First, alignment and uniformity objectives effectively integrate protein and text modalities. Second, angular MMD loss is essential for separating structure/function semantics. Lastly, DisProtEdit supports fine-grained control over edits, including hard cases like increasing helices and boosting stability, though these remain biologically challenging tasks.
@misc{ku2025disproteditexploringdisentangledrepresentations,
title={DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing},
author={Max Ku and Sun Sun and Hongyu Guo and Wenhu Chen},
year={2025},
booktitle={ICML Workshop on Generative AI and Biology},
eprint={2506.14853},
archivePrefix={arXiv},
primaryClass={q-bio.QM},
url={https://arxiv.org/abs/2506.14853},
}