; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G09807 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G09807
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag-pol polyprotein
Genome locationClcChr02:14412589..14418892
RNA-Seq ExpressionClc02G09807
SyntenyClc02G09807
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]2.4e-14735.61Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDGSNY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DAW+IL + HEGTSKVK+SRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+I+ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT
                           KN A+V   +         +DE +  ++ LL KQF KVL R DKR      ++ P    K        + + S  K I+C 
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT

Query:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLMED
         CEGYGH  AE P  LK+  K  S  +C +D + E  +DSD ++ AL G            S+I  D      R+  +KS+     +  L+++I  L  +
Subjt:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLMED

Query:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------
               I ELK E+     +LE MTKSI+MLN  +D ++++L  GK                                                     
Subjt:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS------------------------------------
                     V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                     
Subjt:  -----------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS------------------------------------

Query:  ----IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTL
            I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T VKFIREKS TF V + L
Subjt:  ----IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTL

Query:  CCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQ
          +LQRE+                  +  +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR GT  
Subjt:  CCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQ

Query:  TNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKD
        T Y +W+ RKP++K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD   + AR +D       D     D  
Subjt:  TNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKD

Query:  KNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD
         +     +++E+S+    ES  +Q   + S++I+K HP E IIGD N GVTTR ++
Subjt:  KNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD

AAO73523.1 gag-pol polyprotein [Glycine max]4.4e-14635.54Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDGSNY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DA +IL   HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+I+ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT
                           KN A+V   +        ++DE +  ++ LL KQF KVL R DKR      ++ P    K        + + S  K I+C 
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT

Query:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED
         CEGYGH  AE P  LK+  K  S  +C +D + E  +DSD ++ AL+G            S+I  D   A  R   +KS+     +  L+++I  L  +
Subjt:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED

Query:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------
               I ELK E+     +LE MTKSI+MLN  +D ++++L  GK                                                     
Subjt:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------
                       V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                   
Subjt:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------

Query:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ
              + +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS TF V +
Subjt:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ

Query:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT
         L  +LQRE+                  +F +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR GT
Subjt:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT

Query:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD
          T Y +W+ RKP++K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD   + AR +D       D     D
Subjt:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD

Query:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD
           +     +++E+S+    ES  +Q   + S++I+K HP E IIGD N GVTTR ++
Subjt:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD

AAO73525.1 gag-pol polyprotein [Glycine max]8.7e-14235.19Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDED------------------------------
        +EGG    PP+LDG+NY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++ED                              
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDED------------------------------

Query:  ----AWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK---
              +IL   HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+++ E+LVRK+LR LPKRFDMKVT IEEA  I  M+   
Subjt:  ----AWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK---

Query:  -----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRC
                            KN A+V   +        ++DE +  ++ LL KQF KVL R D+R      ++ P    K    H   + + S  K I+C
Subjt:  -----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRC

Query:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLME
          CEGYGH +AE P  LK++ K  S    D D + E  +DSD ++ AL G     S + +   +I  D      R+  +KS+     +  L+++I  L  
Subjt:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLME

Query:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK----------------------------------------------------
        +       I ELK E+     +LE MTKSI+MLN  +D ++++L  GK                                                    
Subjt:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------
                        V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                  
Subjt:  --------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------

Query:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC
               I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS+TF V 
Subjt:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC

Query:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG
        + L  +LQRE+                  KF +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+  R MLH K L    WAEA+NTAC++HNR++LR G
Subjt:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG

Query:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA
        T  T Y +W+ RKP +K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD     AR +D           +A
Subjt:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA

Query:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD
        D  K+    +K S+S+  +P  +  D+     +I+K  P E IIGD N GVTTR ++
Subjt:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD

AAO73527.1 gag-pol polyprotein [Glycine max]3.6e-14835.73Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDGSNY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DAW+IL + HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+I+ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT
                           KN A+V   +        ++DE +  ++ LL KQF KVL R DKR      ++ P    K        + + S  K I+C 
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT

Query:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED
         CEGYGH  AE P  LK+  K  S  +C +D + E  +DSD ++ AL G            S+I  D   A  R   +KS+     +  L+++I  L  +
Subjt:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED

Query:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------
               I ELK E+     +LE M KSI+MLN  +D ++++L  GK                                                     
Subjt:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------
                       V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                   
Subjt:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------

Query:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ
              I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS TF V +
Subjt:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ

Query:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT
         L  +LQRE+                  +F +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR GT
Subjt:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT

Query:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD
          T Y +W+ RKP++K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD   + AR +D       D   L D
Subjt:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD

Query:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD
           +     +++E+S+    ES  +Q   + S++I+K HP E IIGD N GVTTR ++
Subjt:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD

AAO73529.1 gag-pol polyprotein [Glycine max]6.4e-14535.29Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDG+NY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DAW+IL   HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+++ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQE-SGDKKIRC
                           KN A+V   +        ++DE +  ++  L KQF KVL R D+R      ++  + D +  + +   + ++ S  K I+C
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQE-SGDKKIRC

Query:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLME
          CEGYGH +AE P  LK++ K  S    D D + E  +DSD ++ AL G            S+I  D    F R   +KS+     +  L+++I  L  
Subjt:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLME

Query:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGKQ---------------------------------------------------
        +       I +LK E+     +LE MTKSI+MLN  +D ++++L  GK+                                                   
Subjt:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGKQ---------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------DVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------
                        V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                  
Subjt:  ---------------DVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------

Query:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC
               I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS+TF V 
Subjt:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC

Query:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG
        + L  +LQRE+                  KF +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR G
Subjt:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG

Query:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA
        T  T Y +W+ RKP +K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD     AR +D           +A
Subjt:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA

Query:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD
        D  K+    + +S+S+  +P  +  D+ + S +I+K HP E IIGD N GVTTR ++
Subjt:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD

TrEMBL top hitse value%identityAlignment
Q84VH6 Gag-pol polyprotein3.1e-14535.29Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDG+NY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DAW+IL   HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+++ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQE-SGDKKIRC
                           KN A+V   +        ++DE +  ++  L KQF KVL R D+R      ++  + D +  + +   + ++ S  K I+C
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQE-SGDKKIRC

Query:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLME
          CEGYGH +AE P  LK++ K  S    D D + E  +DSD ++ AL G            S+I  D    F R   +KS+     +  L+++I  L  
Subjt:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLME

Query:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGKQ---------------------------------------------------
        +       I +LK E+     +LE MTKSI+MLN  +D ++++L  GK+                                                   
Subjt:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGKQ---------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------DVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------
                        V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                  
Subjt:  ---------------DVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------

Query:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC
               I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS+TF V 
Subjt:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC

Query:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG
        + L  +LQRE+                  KF +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR G
Subjt:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG

Query:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA
        T  T Y +W+ RKP +K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD     AR +D           +A
Subjt:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA

Query:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD
        D  K+    + +S+S+  +P  +  D+ + S +I+K HP E IIGD N GVTTR ++
Subjt:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD

Q84VH8 Gag-pol polyprotein1.8e-14835.73Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDGSNY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DAW+IL + HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+I+ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT
                           KN A+V   +        ++DE +  ++ LL KQF KVL R DKR      ++ P    K        + + S  K I+C 
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT

Query:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED
         CEGYGH  AE P  LK+  K  S  +C +D + E  +DSD ++ AL G            S+I  D   A  R   +KS+     +  L+++I  L  +
Subjt:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED

Query:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------
               I ELK E+     +LE M KSI+MLN  +D ++++L  GK                                                     
Subjt:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------
                       V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                   
Subjt:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------

Query:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ
              I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS TF V +
Subjt:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ

Query:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT
         L  +LQRE+                  +F +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR GT
Subjt:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT

Query:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD
          T Y +W+ RKP++K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD   + AR +D       D   L D
Subjt:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD

Query:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD
           +     +++E+S+    ES  +Q   + S++I+K HP E IIGD N GVTTR ++
Subjt:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD

Q84VI0 Gag-pol polyprotein4.2e-14235.19Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDED------------------------------
        +EGG    PP+LDG+NY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++ED                              
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDED------------------------------

Query:  ----AWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK---
              +IL   HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+++ E+LVRK+LR LPKRFDMKVT IEEA  I  M+   
Subjt:  ----AWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK---

Query:  -----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRC
                            KN A+V   +        ++DE +  ++ LL KQF KVL R D+R      ++ P    K    H   + + S  K I+C
Subjt:  -----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRC

Query:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLME
          CEGYGH +AE P  LK++ K  S    D D + E  +DSD ++ AL G     S + +   +I  D      R+  +KS+     +  L+++I  L  
Subjt:  TECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLME

Query:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK----------------------------------------------------
        +       I ELK E+     +LE MTKSI+MLN  +D ++++L  GK                                                    
Subjt:  DNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------
                        V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                  
Subjt:  --------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS---------------------------------

Query:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC
               I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS+TF V 
Subjt:  -------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC

Query:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG
        + L  +LQRE+                  KF +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+  R MLH K L    WAEA+NTAC++HNR++LR G
Subjt:  QTLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG

Query:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA
        T  T Y +W+ RKP +K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD     AR +D           +A
Subjt:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLA

Query:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD
        D  K+    +K S+S+  +P  +  D+     +I+K  P E IIGD N GVTTR ++
Subjt:  DKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENIIGDLNSGVTTRRKD

Q84VI2 Gag-pol polyprotein2.1e-14635.54Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDGSNY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DA +IL   HEGTSKVKMSRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+I+ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT
                           KN A+V   +        ++DE +  ++ LL KQF KVL R DKR      ++ P    K        + + S  K I+C 
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT

Query:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED
         CEGYGH  AE P  LK+  K  S  +C +D + E  +DSD ++ AL+G            S+I  D   A  R   +KS+     +  L+++I  L  +
Subjt:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTD--VAFERTREVKSD---HHQLPLEEIIEKLMED

Query:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------
               I ELK E+     +LE MTKSI+MLN  +D ++++L  GK                                                     
Subjt:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------
                       V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                   
Subjt:  -------------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS----------------------------------

Query:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ
              + +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T V FIREKS TF V +
Subjt:  ------IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQ

Query:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT
         L  +LQRE+                  +F +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR GT
Subjt:  TLCCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGT

Query:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD
          T Y +W+ RKP++K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD   + AR +D       D     D
Subjt:  TQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLAD

Query:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD
           +     +++E+S+    ES  +Q   + S++I+K HP E IIGD N GVTTR ++
Subjt:  KDKNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD

Q84VI4 Gag-pol polyprotein1.1e-14735.61Show/hide
Query:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------
        +EGG    PP+LDGSNY YWKARM  FLKS+D++TWK VI  W  P++ D EGK   EL  E+DWT++E                               
Subjt:  REGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPEL--EKDWTEDE-------------------------------

Query:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----
          DAW+IL + HEGTSKVK+SRLQLL  KFE+LKM EEE I +F+  +L++ N   +L E+I+ E+LVRK+LR LPKRFDMKVT IEEA  I  M+    
Subjt:  --DAWDILVVAHEGTSKVKMSRLQLLTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMK----

Query:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT
                           KN A+V   +         +DE +  ++ LL KQF KVL R DKR      ++ P    K        + + S  K I+C 
Subjt:  ----------------STYKNDAYVVKIK--------ESDENVAQSISLLTKQFGKVLRRWDKRGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCT

Query:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLMED
         CEGYGH  AE P  LK+  K  S  +C +D + E  +DSD ++ AL G            S+I  D      R+  +KS+     +  L+++I  L  +
Subjt:  ECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERTRE--VKSD---HHQLPLEEIIEKLMED

Query:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------
               I ELK E+     +LE MTKSI+MLN  +D ++++L  GK                                                     
Subjt:  NHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGK-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS------------------------------------
                     V+ VKGLT NLI+ISQLCD+G +V+F+K +C+VT+  + ++M  +RS D+                                     
Subjt:  -----------QDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDS------------------------------------

Query:  ----IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTL
            I +  V  I  L      +C +CQ+GKQV++SH++    TT  VLELLH+DLMG MQVESL GK+YA V VDDF+R+T VKFIREKS TF V + L
Subjt:  ----IAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTL

Query:  CCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQ
          +LQRE+                  +  +FC  EGI HE+S++I+PQ+NG+VERK  TLQ+ AR MLH K L    WAEA+NTAC++HNR++LR GT  
Subjt:  CCQLQREQ------------------KFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQ

Query:  TNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKD
        T Y +W+ RKP++K+FHIF S C ILADRE R+K D KSD GIFLGYS+NSRAYRVFN+RT++ MESINVV+DD   + AR +D       D     D  
Subjt:  TNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKD

Query:  KNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD
         +     +++E+S+    ES  +Q   + S++I+K HP E IIGD N GVTTR ++
Subjt:  KNLVPVDKDSESSNLDPKESVSDQ--IKCSSKIKKNHPLENIIGDLNSGVTTRRKD

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-2125.63Show/hide
Query:  VCDDCQVGKQVRVSHKRALVCT-TKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTLCCQLQR---------------
        +C+ C  GKQ R+  K+    T  K  L ++H D+ G +   +L+ K Y ++ VD F  Y     I+ KS+ F + Q    + +                
Subjt:  VCDDCQVGKQVRVSHKRALVCT-TKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTLCCQLQR---------------

Query:  ---EQKFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRP--GTTQTNYGLWRRRKPNIKYFHIF
             +   FC  +GI +  +   +PQ NGV ER I T+ + AR M+ G  L   FW EA+ TA ++ NRI  R    +++T Y +W  +KP +K+  +F
Subjt:  ---EQKFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRP--GTTQTNYGLWRRRKPNIKYFHIF

Query:  ESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKD-----KNLVPVDKDSESSN
         +   +   +  + K+D KS   IF+GY  N   +++++   +  + + +VV+D+ +   +R      +F  D+    +K+     + ++  +  +ES  
Subjt:  ESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKD-----KNLVPVDKDSESSN

Query:  LDPKESVSDQIKCSSK
         D  + + D  +  +K
Subjt:  LDPKESVSDQIKCSSK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-2324.22Show/hide
Query:  FNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMKSTYKNDAYVV--KIKESDENVAQSISLLTKQFGKVLRRWDKRGGYRD
        FN  +  + N    L  KI +E+    +L  LP  +D   TTI   HG  T++      A ++  K+++  EN  Q  +L+T+  G+  +R        +
Subjt:  FNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMKSTYKNDAYVV--KIKESDENVAQSISLLTKQFGKVLRRWDKRGGYRD

Query:  NHVPPNQDSKSPNSHSNLNSQESGDKKIRCTECEGYGHYQAEWPNFLK-------RKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQV--TF
        N+       KS N         S  +   C  C   GH++ + PN  K       +KN   +A +   +++V    + +EE   L G   P S  V  T 
Subjt:  NHVPPNQDSKSPNSHSNLNSQESGDKKIRCTECEGYGHYQAEWPNFLK-------RKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQV--TF

Query:  PSDIVTDVAFERTREVKSDHHQLPL-EEIIEKLMEDNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDD-----MNKILSCGKQDVIPVKGLTTN
         S   T V     R V  D   + +      K+       + T       LK  +   +     I  +    D       N+     K  ++  KG+   
Subjt:  PSDIVTDVAFERTREVKSDHHQLPL-EEIIEKLMEDNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDD-----MNKILSCGKQDVIPVKGLTTN

Query:  LI--NISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDSIAEVVVLEILPLSENYDSV--CDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGL
         +    +++C   L+   ++D+  V      +   R     +   +++  + L       +V  CD C  GKQ RVS + +      ++L+L++ D+ G 
Subjt:  LI--NISQLCDQGLSVSFSKDKCIVTDTDNTIVMTRTRSFDDSIAEVVVLEILPLSENYDSV--CDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGL

Query:  MQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTLCCQLQRE------------------QKFEDFCNLEGIMHEYSSSISPQRNGVVERKICT
        M++ES+ G KY +  +DD +R   V  ++ K   F V Q     ++RE                  ++FE++C+  GI HE +   +PQ NGV ER   T
Subjt:  MQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTLCCQLQRE------------------QKFEDFCNLEGIMHEYSSSISPQRNGVVERKICT

Query:  LQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNN
        + +  R+ML    L   FW EA+ TAC++ NR    P   +    +W  ++ +  +  +F         +E R K D KS   IF+GY      YR+++ 
Subjt:  LQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNN

Query:  RTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKDKN-LVP--VDKDSESSNLDPKESVSDQI
          +  + S +VV          +R+S +    D   +++K KN ++P  V   S S+N    ES +D++
Subjt:  RTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKDKN-LVP--VDKDSESSNLDPKESVSDQI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-0821.43Show/hide
Query:  STDDMNKILSCGKQDVIPVKGLTTNLINISQLCD-QGLSVSFSKDKCIVTDTDNTIVMTRTRSFDD----------------------------------
        ST    K       +++ V  +  NLI++ +LC+  G+SV F      V D +  + + + ++ D+                                  
Subjt:  STDDMNKILSCGKQDVIPVKGLTTNLINISQLCD-QGLSVSFSKDKCIVTDTDNTIVMTRTRSFDD----------------------------------

Query:  --SIAEVVV----LEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC
          SI   V+    L +L  S  + S C DC + K  +V   ++ + +T+  LE ++ D+     + S +  +Y ++ VD F RYT +  +++KS      
Subjt:  --SIAEVVV----LEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVC

Query:  QTLCCQLQREQKFE------------------DFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG
         T    L  E +F+                  ++ +  GI H  S   +P+ NG+ ERK   + +    +L   ++   +W  A   A ++ NR+     
Subjt:  QTLCCQLQREQKFE------------------DFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPG

Query:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDH
          ++ +       PN     +F   C        + K D KS   +FLGYS    AY   + +T     S +V  D++
Subjt:  TTQTNYGLWRRRKPNIKYFHIFESICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.7e-1124.02Show/hide
Query:  CDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKS---NTFHVCQTLCCQLQREQKFE---------
        C DC + K  +V    + + ++K  LE ++ D+     + S++  +Y ++ VD F RYT +  +++KS   +TF + ++L      E +F+         
Subjt:  CDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKS---NTFHVCQTLCCQLQREQKFE---------

Query:  ---------DFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQTNYGLWRRRKPNIKYFHIFE
                 D+ +  GI H  S   +P+ NG+ ERK   + +M   +L   ++   +W  A + A ++ NR+       Q+ +     + PN +   +F 
Subjt:  ---------DFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQTNYGLWRRRKPNIKYFHIFE

Query:  SICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDD
          C        R K + KS    F+GYS    AY   +  T     S +V  D+
Subjt:  SICQILADREYRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGACTGATTTTCCCTCATTTTCAAGCTCAAACCGACTCCATCTTTCAGCTGAGCTTCAAGAGGATATAGAAAAAGATATTTGGGGCAAATTTTTACAGAAGCGAAG
ACAGAAGCTGTGGGAGACGTGTTTCGCTGGAAAATGCAACGACATTGAAGGCGTAAGGGTGAGCAGCGAGAAGAGTCTCGCTGGTGTCCCTGGATACGGAGAAGGCAATG
TGGGTAGCCAGGATATAGAAATGGTTCTCAATGCTGAAGGACTCAGAGAAGGCATGTGGCCTAAATCTCGTTTATGGAACCCGGTATTGAAGAACTTGAGAAGGTATATG
GTTAACCTAGGATATGATATCTATGTGCACATAGAGGTAGCGAGCGAGTCCGAGGTGCTTAGCCTACACAGCAACTCGGCTGGGCTCACCATCAGTGTCTTGAATTTTGA
GAATTGTAAATATTTGATCTTAAACCTATGTTGTACATATAGAGGATTAAGGGACAGAAGTGTTGGTCAAGGTATAAGGTTGACAGTGGATGCCGTAAGAGGGTCGGTGT
CTGCTGTCCTCACATTTTCTCCTAGGTTAAGAGGGGAAAGCAACTTTTATCTCTTTGTTAGAACTCTAATCAGAAAGCTTGGTTCTGAACGTACTAAGAAAAAGGATTTT
GTGTTACTCGACTGCACCTGGTTGGCTTTCTATACATTGAGATATTTAATGGAGCCTATCAGGGAAGGTGGATTAACAACTCATCCTCCCGTACTCGATGGTTCAAACTA
TTCGTATTGGAAGGCTAGGATGACAACATTCTTGAAATCCATTGACAACAAGACTTGGAAGGTCGTCATCTCTAGATGGAGTCTGCCTCAAGTCACCGACATAGAAGGAA
AGGTTTGTCCTGAGCTTGAGAAAGATTGGACAGAGGATGAAGATGCATGGGATATTCTTGTTGTTGCACATGAAGGGACCTCCAAAGTAAAAATGTCTAGACTGCAACTC
TTGACCATAAAATTTGAATCACTGAAGATGCTGGAAGAAGAGACTATAATAGAGTTTAATGATCGGCTCTTGGATGTTGGAAATGAATCATTTTCTCTTCGCGAGAAGAT
ATCAAAAGAAGAGTTAGTGCGTAAGGTTCTTCGCTTGCTTCCAAAAAGATTTGATATGAAAGTCACAACTATTGAGGAGGCTCATGGCATTGCCACTATGAAATCAACCT
ATAAAAATGATGCTTATGTTGTCAAAATTAAGGAATCTGATGAGAACGTGGCTCAATCGATATCGCTTCTCACCAAACAGTTTGGAAAGGTGCTCAGAAGATGGGACAAA
CGTGGAGGATATCGGGATAACCATGTGCCTCCTAATCAAGATAGCAAGAGTCCCAATAGTCACTCGAACCTGAATAGTCAAGAATCGGGAGACAAAAAAATTAGATGTAC
AGAATGTGAAGGATATGGCCACTACCAAGCTGAATGGCCGAATTTCTTAAAAAGGAAGAACAAGAGTTATTCTGCCACATTATGTGATGCTGATGAGGATGTTGAATCTA
GCAACGACTCTGATGAAGAAATTCGTGCTTTAATGGGTTGTTTGTCTCCTATAAGTTCCCAAGTGACTTTTCCTTCAGACATCGTGACGGACGTGGCTTTTGAGAGGACT
CGAGAAGTTAAGTCTGATCATCACCAACTCCCATTAGAAGAGATAATCGAAAAATTAATGGAAGACAATCACCCTCTCTTGAGTACTATATTCGAATTGAAGAAGGAACT
GAAATCCTCTAAAGTCGAGCTTGAAGCGATGACCAAGTCAATCCGCATGTTGAACTCCAGCACCGATGATATGAATAAGATTTTGTCTTGTGGAAAGCAAGATGTCATTC
CCGTAAAAGGGTTAACTACAAACTTGATCAACATTAGTCAACTATGCGACCAAGGTCTCTCAGTGAGCTTCTCTAAGGATAAGTGTATTGTGACTGACACAGATAATACT
ATTGTCATGACAAGGACTCGCTCTTTTGACGACTCTATTGCGGAAGTAGTTGTCCTCGAAATTCTTCCGCTCTCAGAAAACTATGACAGTGTGTGCGATGACTGCCAAGT
AGGAAAACAAGTTAGAGTCTCTCACAAGAGAGCTCTGGTCTGTACTACCAAACATGTTCTTGAACTACTTCACATTGACTTGATGGGTCTAATGCAAGTGGAAAGTTTGG
AAGGAAAAAAATATGCTCTTGTCTGTGTCGATGATTTCGCACGATATACCCGGGTGAAGTTTATAAGAGAGAAATCAAATACCTTTCATGTTTGTCAAACTCTTTGTTGC
CAACTTCAGAGAGAACAAAAGTTTGAAGATTTTTGTAATCTGGAAGGGATTATGCATGAATATTCATCTTCTATCAGTCCTCAACGGAATGGAGTTGTAGAAAGGAAAAT
TTGCACTTTGCAAAAGATGGCAAGGGCAATGCTGCATGGGAAGAATTTGCTCTTACTTTTCTGGGCAGAAGCGCTTAATACAGCTTGTCACATGCATAATCGAATCTCTC
TACGCCCTGGTACTACTCAGACGAATTATGGACTATGGAGAAGGAGGAAACCCAACATCAAATACTTCCACATCTTCGAAAGCATTTGTCAGATTCTTGCAGATAGAGAG
TATAGAAAAAAATGGGATTCCAAGTCAGATGTTGGAATCTTTCTGGGATACTCTTCAAACAGTAGGGCCTACAGAGTATTTAATAACCGTACTCAATCTTCCATGGAATC
AATCAATGTAGTTATAGATGATCACCACCGAAATTGTGCCAGATGGAGAGACAGTATGTTGATGTTCCCATATGATACACATGTTCTTGCTGATAAAGATAAGAACCTAG
TGCCAGTTGATAAAGACAGTGAGAGTAGTAATTTAGACCCTAAAGAATCTGTATCTGACCAGATAAAGTGTTCTAGCAAAATTAAGAAAAATCATCCTTTGGAGAACATT
ATAGGTGACTTAAACTCTGGAGTCACAACTCGAAGGAAGGATAAAGGTTGGGGAAAAGCTATCATCTTTCTGGCGAATAAAACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCGACTGATTTTCCCTCATTTTCAAGCTCAAACCGACTCCATCTTTCAGCTGAGCTTCAAGAGGATATAGAAAAAGATATTTGGGGCAAATTTTTACAGAAGCGAAG
ACAGAAGCTGTGGGAGACGTGTTTCGCTGGAAAATGCAACGACATTGAAGGCGTAAGGGTGAGCAGCGAGAAGAGTCTCGCTGGTGTCCCTGGATACGGAGAAGGCAATG
TGGGTAGCCAGGATATAGAAATGGTTCTCAATGCTGAAGGACTCAGAGAAGGCATGTGGCCTAAATCTCGTTTATGGAACCCGGTATTGAAGAACTTGAGAAGGTATATG
GTTAACCTAGGATATGATATCTATGTGCACATAGAGGTAGCGAGCGAGTCCGAGGTGCTTAGCCTACACAGCAACTCGGCTGGGCTCACCATCAGTGTCTTGAATTTTGA
GAATTGTAAATATTTGATCTTAAACCTATGTTGTACATATAGAGGATTAAGGGACAGAAGTGTTGGTCAAGGTATAAGGTTGACAGTGGATGCCGTAAGAGGGTCGGTGT
CTGCTGTCCTCACATTTTCTCCTAGGTTAAGAGGGGAAAGCAACTTTTATCTCTTTGTTAGAACTCTAATCAGAAAGCTTGGTTCTGAACGTACTAAGAAAAAGGATTTT
GTGTTACTCGACTGCACCTGGTTGGCTTTCTATACATTGAGATATTTAATGGAGCCTATCAGGGAAGGTGGATTAACAACTCATCCTCCCGTACTCGATGGTTCAAACTA
TTCGTATTGGAAGGCTAGGATGACAACATTCTTGAAATCCATTGACAACAAGACTTGGAAGGTCGTCATCTCTAGATGGAGTCTGCCTCAAGTCACCGACATAGAAGGAA
AGGTTTGTCCTGAGCTTGAGAAAGATTGGACAGAGGATGAAGATGCATGGGATATTCTTGTTGTTGCACATGAAGGGACCTCCAAAGTAAAAATGTCTAGACTGCAACTC
TTGACCATAAAATTTGAATCACTGAAGATGCTGGAAGAAGAGACTATAATAGAGTTTAATGATCGGCTCTTGGATGTTGGAAATGAATCATTTTCTCTTCGCGAGAAGAT
ATCAAAAGAAGAGTTAGTGCGTAAGGTTCTTCGCTTGCTTCCAAAAAGATTTGATATGAAAGTCACAACTATTGAGGAGGCTCATGGCATTGCCACTATGAAATCAACCT
ATAAAAATGATGCTTATGTTGTCAAAATTAAGGAATCTGATGAGAACGTGGCTCAATCGATATCGCTTCTCACCAAACAGTTTGGAAAGGTGCTCAGAAGATGGGACAAA
CGTGGAGGATATCGGGATAACCATGTGCCTCCTAATCAAGATAGCAAGAGTCCCAATAGTCACTCGAACCTGAATAGTCAAGAATCGGGAGACAAAAAAATTAGATGTAC
AGAATGTGAAGGATATGGCCACTACCAAGCTGAATGGCCGAATTTCTTAAAAAGGAAGAACAAGAGTTATTCTGCCACATTATGTGATGCTGATGAGGATGTTGAATCTA
GCAACGACTCTGATGAAGAAATTCGTGCTTTAATGGGTTGTTTGTCTCCTATAAGTTCCCAAGTGACTTTTCCTTCAGACATCGTGACGGACGTGGCTTTTGAGAGGACT
CGAGAAGTTAAGTCTGATCATCACCAACTCCCATTAGAAGAGATAATCGAAAAATTAATGGAAGACAATCACCCTCTCTTGAGTACTATATTCGAATTGAAGAAGGAACT
GAAATCCTCTAAAGTCGAGCTTGAAGCGATGACCAAGTCAATCCGCATGTTGAACTCCAGCACCGATGATATGAATAAGATTTTGTCTTGTGGAAAGCAAGATGTCATTC
CCGTAAAAGGGTTAACTACAAACTTGATCAACATTAGTCAACTATGCGACCAAGGTCTCTCAGTGAGCTTCTCTAAGGATAAGTGTATTGTGACTGACACAGATAATACT
ATTGTCATGACAAGGACTCGCTCTTTTGACGACTCTATTGCGGAAGTAGTTGTCCTCGAAATTCTTCCGCTCTCAGAAAACTATGACAGTGTGTGCGATGACTGCCAAGT
AGGAAAACAAGTTAGAGTCTCTCACAAGAGAGCTCTGGTCTGTACTACCAAACATGTTCTTGAACTACTTCACATTGACTTGATGGGTCTAATGCAAGTGGAAAGTTTGG
AAGGAAAAAAATATGCTCTTGTCTGTGTCGATGATTTCGCACGATATACCCGGGTGAAGTTTATAAGAGAGAAATCAAATACCTTTCATGTTTGTCAAACTCTTTGTTGC
CAACTTCAGAGAGAACAAAAGTTTGAAGATTTTTGTAATCTGGAAGGGATTATGCATGAATATTCATCTTCTATCAGTCCTCAACGGAATGGAGTTGTAGAAAGGAAAAT
TTGCACTTTGCAAAAGATGGCAAGGGCAATGCTGCATGGGAAGAATTTGCTCTTACTTTTCTGGGCAGAAGCGCTTAATACAGCTTGTCACATGCATAATCGAATCTCTC
TACGCCCTGGTACTACTCAGACGAATTATGGACTATGGAGAAGGAGGAAACCCAACATCAAATACTTCCACATCTTCGAAAGCATTTGTCAGATTCTTGCAGATAGAGAG
TATAGAAAAAAATGGGATTCCAAGTCAGATGTTGGAATCTTTCTGGGATACTCTTCAAACAGTAGGGCCTACAGAGTATTTAATAACCGTACTCAATCTTCCATGGAATC
AATCAATGTAGTTATAGATGATCACCACCGAAATTGTGCCAGATGGAGAGACAGTATGTTGATGTTCCCATATGATACACATGTTCTTGCTGATAAAGATAAGAACCTAG
TGCCAGTTGATAAAGACAGTGAGAGTAGTAATTTAGACCCTAAAGAATCTGTATCTGACCAGATAAAGTGTTCTAGCAAAATTAAGAAAAATCATCCTTTGGAGAACATT
ATAGGTGACTTAAACTCTGGAGTCACAACTCGAAGGAAGGATAAAGGTTGGGGAAAAGCTATCATCTTTCTGGCGAATAAAACTTAG
Protein sequenceShow/hide protein sequence
MPTDFPSFSSSNRLHLSAELQEDIEKDIWGKFLQKRRQKLWETCFAGKCNDIEGVRVSSEKSLAGVPGYGEGNVGSQDIEMVLNAEGLREGMWPKSRLWNPVLKNLRRYM
VNLGYDIYVHIEVASESEVLSLHSNSAGLTISVLNFENCKYLILNLCCTYRGLRDRSVGQGIRLTVDAVRGSVSAVLTFSPRLRGESNFYLFVRTLIRKLGSERTKKKDF
VLLDCTWLAFYTLRYLMEPIREGGLTTHPPVLDGSNYSYWKARMTTFLKSIDNKTWKVVISRWSLPQVTDIEGKVCPELEKDWTEDEDAWDILVVAHEGTSKVKMSRLQL
LTIKFESLKMLEEETIIEFNDRLLDVGNESFSLREKISKEELVRKVLRLLPKRFDMKVTTIEEAHGIATMKSTYKNDAYVVKIKESDENVAQSISLLTKQFGKVLRRWDK
RGGYRDNHVPPNQDSKSPNSHSNLNSQESGDKKIRCTECEGYGHYQAEWPNFLKRKNKSYSATLCDADEDVESSNDSDEEIRALMGCLSPISSQVTFPSDIVTDVAFERT
REVKSDHHQLPLEEIIEKLMEDNHPLLSTIFELKKELKSSKVELEAMTKSIRMLNSSTDDMNKILSCGKQDVIPVKGLTTNLINISQLCDQGLSVSFSKDKCIVTDTDNT
IVMTRTRSFDDSIAEVVVLEILPLSENYDSVCDDCQVGKQVRVSHKRALVCTTKHVLELLHIDLMGLMQVESLEGKKYALVCVDDFARYTRVKFIREKSNTFHVCQTLCC
QLQREQKFEDFCNLEGIMHEYSSSISPQRNGVVERKICTLQKMARAMLHGKNLLLLFWAEALNTACHMHNRISLRPGTTQTNYGLWRRRKPNIKYFHIFESICQILADRE
YRKKWDSKSDVGIFLGYSSNSRAYRVFNNRTQSSMESINVVIDDHHRNCARWRDSMLMFPYDTHVLADKDKNLVPVDKDSESSNLDPKESVSDQIKCSSKIKKNHPLENI
IGDLNSGVTTRRKDKGWGKAIIFLANKT