Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Evolla model #36231

Open
2 tasks done
zhoubay opened this issue Feb 17, 2025 · 0 comments
Open
2 tasks done

Add Evolla model #36231

zhoubay opened this issue Feb 17, 2025 · 0 comments

Comments

@zhoubay
Copy link

zhoubay commented Feb 17, 2025

Model description

Model Name: Evolla

Model Specifications

  • Model Type: Protein-language generative model
  • Parameters: 80 billion
  • Training Data: AI-generated dataset with 546 million protein question-answer pairs and 150 billion word tokens

Architecture

Multimodal model integrating a protein language model (PLM) as the encoder, a large language model (LLM) as the decoder, and a sequence compressor/aligner module.

Key Features

  • Decodes the molecular language of proteins through natural language dialogue
  • Generates precise, contextually nuanced insights into protein function
  • Trained on extensive data to capture protein complexity and functional diversity

Applications

  • Protein Function Annotation: Provides detailed functional insights for proteins
  • Enzyme Commission (EC) Number Prediction: Assists in classifying enzymatic activities
  • Gene Ontology (GO) Annotation: Helps in understanding protein roles in biological processes
  • Subcellular Localization Prediction: Predicts where proteins are located within a cell
  • Disease Association Analysis: Identifies potential links between proteins and diseases
  • Other Protein Function Characterization Tasks: Supports various research needs in proteomics and functional genomics

Performance

Demonstrates expert-level insights, advancing research in proteomics and functional genomics.

Availability

License

MIT License

Contact

For inquiries, contact the corresponding author(s) via email (e.g., [email protected]).

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant