The Antibody Annotator can add annotations that are not part of the immunoglobulin scaffold. This article explains how to create a Feature Database for annotating features such as fusion proteins and other large sequence features, allowing a degree of sequence mismatch if desired.
If you would like to identify short, exact motifs such as His-tags and signal peptides, see this article to learn how to specify your own Liabilities and Assets.
Creating a feature database
To use this function, first create a Feature database containing annotated sequences that carry the additional annotations you would like to add. Go to the Organization Databases section in the left navigation bar in Biologics and hover over Reference Sequences. Three dots will appear to the left; click them and select New database.
Name your database and make sure to create a "Feature" type database.
Note: Annotated Germline databases are used instead for annotating your antibody sequences. Please refer to the following article to learn how to create your own reference database.
Following this, upload your reference sequences into this database. You can add as many sequences as required, but note that:
- We currently only support nucleotide sequences
- You can have reference sequences containing ambiguous bases but those sequences must have an unambiguous stretch of at least 10 nucleotides
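Before uploading, it can help to check your reference sequences against the two constraints above. The following is a minimal sketch, not part of Biologics itself: the function names `longest_unambiguous_run` and `check_reference` are hypothetical, and it assumes that "unambiguous" means the bases A, C, G and T and that ambiguous bases use the standard IUPAC nucleotide codes.

```python
# Hypothetical pre-upload check; these helpers are illustrative and are not
# part of the Biologics application.
UNAMBIGUOUS = set("ACGT")               # assumption: unambiguous DNA bases
IUPAC_DNA = set("ACGTRYSWKMBDHVN")      # standard IUPAC nucleotide codes


def longest_unambiguous_run(seq: str) -> int:
    """Return the length of the longest stretch made only of A, C, G, T."""
    longest = current = 0
    for base in seq.upper():
        if base in UNAMBIGUOUS:
            current += 1
            longest = max(longest, current)
        else:
            current = 0
    return longest


def check_reference(seq: str, min_run: int = 10) -> list[str]:
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    invalid = sorted(set(seq.upper()) - IUPAC_DNA)
    if invalid:
        # Characters outside the nucleotide alphabet suggest a protein sequence.
        problems.append("non-nucleotide characters: " + "".join(invalid))
    if longest_unambiguous_run(seq) < min_run:
        problems.append(f"no unambiguous stretch of at least {min_run} nucleotides")
    return problems


if __name__ == "__main__":
    for name, seq in [("ok", "ACGTACGTACGTNNRY"), ("too_ambiguous", "ACGTNNACGT")]:
        print(name, check_reference(seq) or "passes")
```

A sequence that returns an empty list satisfies both constraints as interpreted here; anything else is worth fixing before upload.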
Annotating your sequences with a Feature Database
To annotate these additional features, select the Additional Features option in the Antibody Annotator window and select your Feature database from the dropdown.
You can specify the percentage of mismatches allowed between your query (input sequence) and the reference sequence by entering the appropriate number in the Mismatches % field. You can specify the Gap Size in the same manner.
Below is an example of a database containing a pIII and a PelB leader sequence involved in phage display.
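To get a feel for what a given Mismatches % value means, the sketch below computes a simple per-position mismatch percentage between two equal-length, ungapped sequences. This is only an illustration: how the Antibody Annotator calculates the percentage internally, and how it interacts with Gap Size, is not described here, and the function names `mismatch_percent` and `within_threshold` are hypothetical.

```python
# Illustrative only: a simple ungapped, per-position mismatch percentage.
# This is an assumption about what "Mismatches %" could mean, not the
# Antibody Annotator's actual algorithm.


def mismatch_percent(query: str, reference: str) -> float:
    """Percentage of positions that differ between two equal-length sequences."""
    if not reference or len(query) != len(reference):
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(q != r for q, r in zip(query.upper(), reference.upper()))
    return 100.0 * mismatches / len(reference)


def within_threshold(query: str, reference: str, max_percent: float) -> bool:
    """True if the query differs from the reference by at most max_percent."""
    return mismatch_percent(query, reference) <= max_percent


if __name__ == "__main__":
    reference = "ACGTACGTAC"
    query = "ACGTACGTAA"  # one difference in ten positions
    print(mismatch_percent(query, reference))
    print(within_threshold(query, reference, max_percent=5))
```

Under this reading, a single difference across ten positions is a 10% mismatch, so a threshold of 5 would reject the match and a threshold of 10 would accept it.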
Antibody-GFP fusion protein
Below is an example of an antibody sequence fused to a GFP protein, annotated using the Antibody Annotator and a custom Feature database containing the GFP sequence.