The model

GPCRHMM

A GPCR detection method


Logo
Detection TM segments Instructions Contact

Instructions

This server predicts G protein-coupled receptors from amino acid sequence.

GPCRHMM is described in

Markus Wistrand*, Lukas Käll* and Erik L.L. Sonnhammer.
A general model of G protein-coupled receptor sequences and its application to detect remote homologs.
Protein Science, 15 (3):509-21, Mars 2006.
*These authors contributed equally to this work.

Input

The program takes proteins in FASTA format. It recognizes the 20 amino acids and B, Z, and X, which are all treated equally as unknown. Any other character is changed to X, so please make sure the sequences are sensible proteins.

This is an example (one protein):

>EBI2_HUMAN P32249 [Homo sapiens (human)] EBV-induced G protein-coupled receptor 2 (EBI2).
MDIQMANNFTPPSATPQGNDCDLYAHHSTARIVMPLHYSLVFIIGLVGNLLALVVIVQNR
KKINSTTLYSTNLVISDILFTTALPTRIAYYAMGFDWRIGDALCRITALVFYINTYAGVN
FMTCLSIDRFIAVVHPLRYNKIKRIEHAKGVCIFVWILVFAQTLPLLINPMSKQEAERIT
CMEYPNFEETKSLPWILLGACFIGYVLPLIIILICYSQICCKLFRTAKQNPLTEKSGVNK
KALNTIILIIVVFVLCFTPYHVAIIQHMIKKLRFSNFLECSQRHSFQISLHFTVCLMNFN
CCMDPFIYFFACKGYKRKVMRMLKRQVSVSISSAVKSAPEENSREMTETQMMIHSKSSNG
K

GPCR detection

The main usage of the server is to detect putative GPCRs in from amino acid sequence.
Either give the name of the local file in which you have the proteins in the, or paste the sequence(s) into the input form. If you paste; make sure that the first line of each entry (the identifier) is not broken.
There is a global and a local score (explained below). The local score is only calculated for the proteins that have a global score higher than a threshold of 0. To only use global scoring is faster, but tend to accumulate false positives.

Global/Local score

GPCRHMM is based on a hidden Markov model that mimics the common topology of GPCRs: they all span the membrane sevenfold. Global scoring takes the sequence through the entire model, including the N- and C-terminal sections. The reported score is calculated using the forward algorithm.

Local scoring takes the subsequence that spans the 1st-7th TM-helix (as predicted by the 1-best algorithm using the global model). The subsequence is scored to a core model that only have compartments corresponding to the 1st-7th TM-helix (forward algorithm). The local score can detect false positives that don't fit the model well, but which have long N- or C-terminal regions that accumulate score because of biased sequence composition.

Output

Output is global score, local score and a prediction (GPCR/no). All reported scores are log-odds scores related to a null model. The null model reflects the amino acid composition in SwissProt.

Prediction of GPCR transmembrane segment localization

An alternative usage of the server is to localize the transmembrane segments of known GPCRs. This will be done by restricting the predictions to a GPCR model. The prediction will hence allways contain seven transmembrane segments.