Protein sequence viewer¶
A common element on several InterPro website pages is the protein sequence viewer (in the sequence search result, on the protein and structure pages). It summarises the InterPro entries (IPR) (top coloured bar) and member database signatures matches to the protein or structure being looked at, represented by the grey bar at the top of the viewer, categorised by InterPro entry types.
The Representative Domains track is displayed in the protein sequence viewer in the protein page. This representation is generated automatically using the type of the member databases models, which might differ from the InterPro entries types.
Various options, make it easy to work with (as illustrated in the figure above):
Clicking on the Full screen button at the top of the viewer will switch to full screen view.
The viewer can be zoomed in and out by:
Clicking the two buttons (+ and -) at the top right corner.
Dragging the grey scale at the top to the desired positions on both left and right sides
Pressing the [Ctrl] key and scroll through the viewer
More options that customise the viewer are grouped under Options dropdown.
Colour By allows to change the colours in which the InterPro entries and signatures bars based on accession, member database or domain relationship.
The labels on the right side of the viewer can be customised. The Accession labels are shown by default. To see names and/or short names along with accession, the name/short name checkboxes should be ticked or if the user prefers to see the names/short names alone, the respective options should be selected.
Snapshot has two options: Save as image allows to take a snapshot of the viewer and is saved as an image (.png). Print allows the user to print the viewer, thus supporting the download in PDF format.
Collapse All allows to collapse all the signatures bars displayed in the viewer at once to only display the InterPro entries bars.
The tooltips are shown when hovering over each bar. They can be disabled by unchecking the Tooltip Active option.
Residues annotations are provided by the CDD, SFLD and PIRSR databases.
Clicking on the header of a category (say Unintegrated) hides the bars for the entire category.
When zoomed in, panning can be achieved by either dragging the scale at the top or by dragging any bar in the desired direction (see figure below).
For some proteins, additional information are provided by resources other than the member database consortium, they are displayed under the Other features category of the viewer. Available data include:
Disordered regions from MobiDB
Coiled regions from COILS
Cytoplasmic/non-cytoplasmic domains from Phobius
Signal peptide regions from SignalP and/or Phobius
Spurious protein from AntiFam
CATH-FunFams is an automatically generated profile HMM database, with FunFams entries segregated by an entropy-based approach that distinguishes different patterns of conserved residues, corresponding to differences in functional determinants
Pfam-N annotations result from a deep learning methodology developed by the Google Research team led by Dr Lucy Colwell to increase the Pfam coverage of protein sequences
Eukaryotic linear motifs from ELM
For some proteins, we also have annotations that are fetched directly from the resource API. These annotations are displayed under the External Sources category of the viewer. Note: by default this category is collapsed. Available data include: