AA compositional bias #17

Open
AlexWindels opened this issue Dec 12, 2023 · 1 comment


@AlexWindels

Hi all,

I am currently exploring the protein generator on the HuggingFace Space. I am trying out the AA compositional bias conditioning and ran the example 'W0.2,E0.1' with 40 diffusion steps and a protein length of 250 residues. This resulted in the following protein sequence:

AAPPPAAAVAAAAAAAPPAPAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAAAAAAAAAAAAAPAAAALAAAAPAPAAAAAAAPAAAVAAAAAAAAAAAAAAAAAAAAAAAPAAAPAAAAAAAAAAAAAVAAAAAAAAAAAAPAAVPAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAPAAAAPAAAAAAAAAAAAPAAAAAAAAAALAAAAAAAAAVA

As you can see, the sequence consists almost exclusively of alanines, and no tryptophans or glutamic acids occur, even though the run was explicitly conditioned on these residues. When I change the residues and/or the bias values, the results are similar: I never obtain a sequence that comes close to the requested composition.
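
The mismatch can be quantified by tallying the residue composition of the output and comparing it with the requested bias. The following is a minimal sketch, assuming the bias string is a comma-separated list of a one-letter residue code followed by a target fraction (inferred from the 'W0.2,E0.1' example above). The helpers `parse_bias`, `composition`, and `compare` are hypothetical names for illustration and are not part of protein generator.

```python
from collections import Counter

# Sequence copied from the run described above.
SEQ = "AAPPPAAAVAAAAAAAPPAPAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAAAAAAAAAAAAAPAAAALAAAAPAPAAAAAAAPAAAVAAAAAAAAAAAAAAAAAAAAAAAPAAAPAAAAAAAAAAAAAVAAAAAAAAAAAAPAAVPAAAAAAAAAAAAAAAAAAAAAPAAAAAAAAAAAPAAAAPAAAAAAAAAAAAPAAAAAAAAAALAAAAAAAAAVA"


def parse_bias(spec):
    """Parse a string like 'W0.2,E0.1' into {'W': 0.2, 'E': 0.1}.

    Assumption: each item is one residue letter followed by a target fraction.
    """
    targets = {}
    for item in spec.split(","):
        item = item.strip()
        targets[item[0]] = float(item[1:])
    return targets


def composition(seq):
    """Return the fraction of the sequence made up by each residue type."""
    counts = Counter(seq)
    return {aa: n / len(seq) for aa, n in counts.items()}


def compare(seq, spec):
    """Map each biased residue to a (target fraction, observed fraction) pair."""
    observed = composition(seq)
    return {aa: (target, observed.get(aa, 0.0))
            for aa, target in parse_bias(spec).items()}


if __name__ == "__main__":
    for aa, (target, obs) in compare(SEQ, "W0.2,E0.1").items():
        print(f"{aa}: target {target:.2f}, observed {obs:.2f}")
```

For the sequence above, both W and E have an observed fraction of 0.00, since neither residue appears in the output.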

Can you verify something is going wrong here?

Best,

Alex

@0merle0
Member

0merle0 commented Dec 24, 2023

Hey Alex,

I would try a shorter protein (100 aa) or more steps (100 steps). The network often struggles to generate cohesive sequence and structure pairs at larger lengths. If there's a more specific application you are going for here, let me know and I am happy to discuss more!
