GitHub - SIAT-code/ProtChat

ProtChat: An AI Multi-Agent for Automated Protein Analysis Leveraging GPT-4 and Protein Multimodal Model

Website

we have developed a website that allows users from any computer background to use it without the need to pay for a GPT-4 API Key. Future maintenance and updates follow this standard.

Setup

Clone the repository and navigate into the directory:

git clone [email protected]:SIAT-code/ProtChat.git
cd ProtChat

Installation

To use this project, do as the following to install the dependencies.

Create a new environment using: conda create --name protchat python=3.9.
Activate the environment you have just created: conda activate protchat.
Finally, run the following command to install the required packages inside the conda environment:

pip install -r requirements.txt

Dataset

You can download the datasets from this link, which provides some samples. The directory of data files like the following:

| -- downstream_task
|    |-- stability
|    |-- fluorescence
|    |-- remote_homology
|    |-- secondary_structure
|    |-- antigen_binding
|    |-- pdbbind
|    |-- kinase
|    |-- skempi

All the datasets that we used in the project can be found in the datasets folder. They are preprocessed and ready to use.You can use them directly.

Checkpoints

We have prepared model checkpoints for direct inference. Please refer to this link. The directory of the checkpoint files is as follows:

| -- checkpoints
|    |-- stability
|    |-- fluorescence
|    |-- remote_homology
|    |-- secondary_structure
|    |-- antigen_binding
|    |-- pdbbind
|    |-- kinase
|    |-- skempi
|        |-- fold1
|        |-- fold2
          ...
|        |-- fold10

Inference

Please refer to AutoGen-MASSA.ipynb for the inference code, the dialogue template for the agent is as follows:

user_proxy.initiate_chat(manager, message="""Can you analyze on kinase task which belongs to protein-drug interaction prediction benchmark, where the data file and format are as follows, and evaluate the predictions based on the task, finally visualize the evaluation results?

downstream_task/kinase/samples_seq_mole_go.txt

Start the work now.
""")

Fine-tune

If you wish to perform fine-tuning, please refer to the AutoGen-MASSA.ipynb as well, the dialogue template for the agent is as follows:

user_proxy.initiate_chat(manager, message="""Can you fine-tune and analyze on antigen binding task which belongs to protein property prediction benchmark, where the data file and format are as follows, and evaluate the predictions based on the task, finally visualize the evaluation results?

downstream_task/antigen_binding/sequence_go.txt

Start the work now.
""")

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
checkpoints		checkpoints
downstream_task		downstream_task
figures		figures
scripts		scripts
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ProtChat: An AI Multi-Agent for Automated Protein Analysis Leveraging GPT-4 and Protein Multimodal Model

Website

Setup

Installation

Dataset

Checkpoints

Inference

Fine-tune

About

Releases

Packages

Languages

SIAT-code/ProtChat

Folders and files

Latest commit

History

Repository files navigation

ProtChat: An AI Multi-Agent for Automated Protein Analysis Leveraging GPT-4 and Protein Multimodal Model

Website

Setup

Installation

Dataset

Checkpoints

Inference

Fine-tune

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages