CuGenDBv2

Tan0022367 (gene) of Snake gourd v1 genome

Gene ID: Tan0022367
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG03:4774811..4775776
RNA-Seq Expression: Tan0022367
Synteny: Tan0022367
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
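
The Genome location field above packs the linkage group and the coordinates into a single string. A minimal Python sketch for splitting it is given here, assuming the coordinates are 1-based and inclusive (the page does not say so). `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'SEQ:START..END' string into (seq_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


seq_id, start, end = parse_location("LG03:4774811..4775776")
print(seq_id, start, end, span_length(start, end))
```

Under that assumption the gene spans 966 bp on LG03.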


Homology
GenBank top hits | e value | % identity
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | 1.6e-120 | 68.54
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        +PRRSG VV QP+RY+ L ETQ++IPD+  EDPLTY QAM D D+D+W+ AM+ EMESMYFNSVW LVD P  VK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+Y+YE+WQ+DVK AFLNG LEE+IYM QP+GFI   QEQKVC+L++SIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGF+QN DEPCVYKKI+NS +AFLILYVDDILLIGND  YLT++K+WL TQFQMKD G+AQ++LGIQIVRN KNKTL +SQASYIDKVLSR+KMQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KG LPFRH IHLSKEQCPKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | 9.1e-121 | 68.85
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+AQ+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KGLLPFRH +HLSKEQ PKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] | 1.3e-119 | 68.22
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IA +YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+ Q+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KGLLPFRH +HLSKEQ PKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | 9.1e-121 | 68.85
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+AQ+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KGLLPFRH +HLSKEQ PKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa] | 1.0e-119 | 68.22
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+AQ+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        K LLPF+H  HLS+EQCPKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

TrEMBL top hits | e value | % identity
A0A5A7T2V9 Gag/pol protein | 6.4e-120 | 68.22
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IA +YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+ Q+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KGLLPFRH +HLSKEQ PKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

A0A5A7TZD0 Gag/pol protein | 4.4e-121 | 68.85
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+AQ+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KGLLPFRH +HLSKEQ PKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

A0A5A7UYE8 Gag/pol protein | 4.4e-121 | 68.85
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+AQ+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KGLLPFRH +HLSKEQ PKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

A0A5D3CYF4 Gag/pol protein | 4.9e-120 | 68.22
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        MPRRSG VV QP+RY+ L ETQVVIPD+  EDPL+Y QAM D DKD+WV AMD EMESMYFNSVW+LVD P+GVK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+YDYE+WQ+DVK AFLNG LEE+I+M QP+GFI  GQEQKVC+L RSIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGFDQN DEPCVYKKI    +AFL+LYVDDILLIGND GYLT++K WL  QFQMKD G+AQ+VLGIQI+R+ KNKTL LSQA+YIDK+L R+ MQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        K LLPF+H  HLS+EQCPKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

E2GK51 Gag/pol protein (Fragment) | 7.5e-121 | 68.54
Query:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------
        +PRRSG VV QP+RY+ L ETQ++IPD+  EDPLTY QAM D D+D+W+ AM+ EMESMYFNSVW LVD P  VK IGCKWIYKRKR             
Subjt:  MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI
                 GVD           KSI ILL+IAT+Y+YE+WQ+DVK AFLNG LEE+IYM QP+GFI   QEQKVC+L++SIYGLKQASRSWNIRFD  I
Subjt:  ---------GVDG----------KSICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVI

Query:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
        +SYGF+QN DEPCVYKKI+NS +AFLILYVDDILLIGND  YLT++K+WL TQFQMKD G+AQ++LGIQIVRN KNKTL +SQASYIDKVLSR+KMQ+SK
Subjt:  RSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Query:  KGLLPFRHEIHLSKEQCPKTP
        KG LPFRH IHLSKEQCPKTP
Subjt:  KGLLPFRHEIHLSKEQCPKTP

SwissProt top hits | e value | % identity
P04146 Copia protein | 6.9e-31 | 28.42
Query:  PLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRK----------------RGVDGK----------------SICILLAI
        P ++++     DK  W  A++ E+ +   N+ W +  +P+    +  +W++  K                RG   K                S   +L++
Subjt:  PLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRK----------------RGVDGK----------------SICILLAI

Query:  ATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVY---KKIINSSIAFLILY
           Y+ +V Q+DVK AFLNG L+E IYM  P+G         VC+L ++IYGLKQA+R W   F++ ++   F  +  + C+Y   K  IN +I +++LY
Subjt:  ATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVY---KKIINSSIAFLILY

Query:  VDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLP----FRHEIHLSKEQC
        VDD+++   D   + N K +L  +F+M D  + +  +GI+I    +   + LSQ++Y+ K+LS+F M++      P      +E+  S E C
Subjt:  VDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLP----FRHEIHLSKEQC

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.0e-53 | 36.99
Query:  RRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKRGVDGK----------
        RRS    ++  RY S     V+I D+   +P +  + +   +K++ + AM +EMES+  N  + LV+ P G + + CKW++K K+  D K          
Subjt:  RRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKRGVDGK----------

Query:  ----------------------SICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRS
                              SI  +L++A   D EV Q+DVK AFL+G LEE IYM+QP+GF   G++  VC+L +S+YGLKQA R W ++FD  ++S
Subjt:  ----------------------SICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRS

Query:  YGFDQNDDEPCVY-KKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKK
          + +   +PCVY K+   ++   L+LYVDD+L++G D+G +  +K  L+  F MKD G AQ +LG++IVR   ++ L LSQ  YI++VL RF M+++K 
Subjt:  YGFDQNDDEPCVY-KKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKK

Query:  GLLPFRHEIHLSKEQCPKT
           P    + LSK+ CP T
Subjt:  GLLPFRHEIHLSKEQCPKT

P25600 Putative transposon Ty5-1 protein YCL074W | 8.8e-18 | 32.28
Query:  IDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGY
        +DV  AFLN  ++E IY+ QP GF+       V  L   +YGLKQA   WN   +  ++  GF +++ E  +Y +  +    ++ +YVDD+L+       
Subjt:  IDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGY

Query:  LTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK
           +K+ LT  + MKD G     LG+ I     N  +TLS   YI K  S  ++   K
Subjt:  LTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 7.4e-33 | 31.7
Query:  VIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDG-VKHIGCKWIYKRKRGVDGK---------------
        +I+P+   SLA   V +  E   +P T  QA+ D   ++W  AM  E+ +   N  WDLV  P   V  +GC+WI+ +K   DG                
Subjt:  VIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDG-VKHIGCKWIYKRKRGVDGK---------------

Query:  -----------------SICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQ
                         SI I+L +A    + + Q+DV  AFL G L + +YM QP GFI   +   VC+L++++YGLKQA R+W +     + + GF  
Subjt:  -----------------SICILLAIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQ

Query:  NDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFR
        +  +  ++      SI ++++YVDDIL+ GND   L N  + L+ +F +KD  +  + LGI+  R      L LSQ  YI  +L+R  M  +K    P  
Subjt:  NDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFR

Query:  HEIHLS
            LS
Subjt:  HEIHLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 7.4e-33 | 31.62
Query:  DPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLV-DKPDGVKHIGCKWIYKRKRGVDGK--------------------------------SICILL
        +P T  QAM D   D+W  AM  E+ +   N  WDLV   P  V  +GC+WI+ +K   DG                                 SI I+L
Subjt:  DPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLV-DKPDGVKHIGCKWIYKRKRGVDGK--------------------------------SICILL

Query:  AIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVYKKIINSSIAFLILYV
         +A    + + Q+DV  AFL G L + +YM QP GF+   +   VCRL+++IYGLKQA R+W +     + + GF  +  +  ++      SI ++++YV
Subjt:  AIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVYKKIINSSIAFLILYV

Query:  DDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFRHEIHLSKEQCPKTP
        DDIL+ GND   L +  + L+ +F +K+  D  + LGI+  R  +   L LSQ  Y   +L+R  M  +K    P      L+     K P
Subjt:  DDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFRHEIHLSKEQCPKTP

Arabidopsis top hits | e value | % identity
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 2.1e-35 | 30.31
Query:  EDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKRGVDG--------------------------------KSICILL
        ++P TYN+A   K+   W  AMD E+ +M     W++   P   K IGCKW+YK K   DG                                 S+ ++L
Subjt:  EDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKRGVDG--------------------------------KSICILL

Query:  AIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGF-IKPGQE---QKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVYKKIINSSIAFL
        AI+  Y++ + Q+D+  AFLNG L+E IYM  P G+  + G       VC LK+SIYGLKQASR W ++F   +  +GF Q+  +   + KI  +    +
Subjt:  AIATYYDYEVWQIDVKIAFLNGKLEETIYMDQPKGF-IKPGQE---QKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVYKKIINSSIAFL

Query:  ILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFRHEIHLS
        ++YVDDI++  N++  +  +K  L + F+++D G  ++ LG++I R+     + + Q  Y   +L    +   K   +P    +  S
Subjt:  ILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFRHEIHLS

ATMG00810.1 DNA/RNA polymerases superfamily protein | 1.1e-07 | 38.2
Query:  FLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFRHEIHLS
        +L+LYVDDILL G+    L  +   L++ F MKD G   + LGIQI + H +  L LSQ  Y +++L+   M D K    P   +++ S
Subjt:  FLILYVDDILLIGNDEGYLTNIKEWLTTQFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFRHEIHLS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 2.6e-04 | 38.64
Query:  WVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKRGVDG
        W  AM +E++++  N  W LV  P     +GCKW++K K   DG
Subjt:  WVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKRGVDG


Sequences
CDS sequence
ATGCCTCGACGTAGTGGGTGGGTTGTGATACAGCCTGACCGTTACATGAGTTTAGCTGAAACCCAAGTTGTTATACCAGATGAACTCTGCGAGGATCCATTGACTTATAA
TCAAGCAATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTCGATGTACTTCAATTCTGTTTGGGATCTTGTAGATAAGCCTGATGGGG
TTAAACATATAGGATGTAAATGGATCTACAAGAGAAAACGTGGTGTTGATGGGAAGTCTATCTGTATCCTTCTTGCCATTGCCACATATTATGACTATGAGGTATGGCAA
ATAGATGTTAAGATAGCCTTTTTAAATGGAAAACTTGAGGAAACCATCTACATGGACCAACCCAAAGGGTTCATTAAACCAGGACAAGAGCAAAAAGTCTGCAGGCTTAA
AAGGTCGATTTATGGACTGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTGATGAGGTTATCAGATCTTATGGTTTTGACCAGAATGATGATGAACCTTGTGTCTACA
AGAAAATCATTAATAGTTCTATCGCTTTCCTAATTCTCTATGTGGATGATATCCTACTCATTGGGAATGATGAAGGTTATCTCACTAACATCAAGGAATGGCTAACTACG
CAATTCCAAATGAAAGATTTCGGTGATGCGCAGTTTGTTCTTGGGATCCAGATTGTCCGTAACCACAAGAATAAAACACTAACCTTGTCTCAAGCATCATACATAGACAA
AGTGTTGTCAAGGTTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCCTTTTAGACATGAAATCCATTTGTCTAAGGAACAGTGTCCTAAGACACCTTAA
mRNA sequence
ATGCCTCGACGTAGTGGGTGGGTTGTGATACAGCCTGACCGTTACATGAGTTTAGCTGAAACCCAAGTTGTTATACCAGATGAACTCTGCGAGGATCCATTGACTTATAA
TCAAGCAATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTCGATGTACTTCAATTCTGTTTGGGATCTTGTAGATAAGCCTGATGGGG
TTAAACATATAGGATGTAAATGGATCTACAAGAGAAAACGTGGTGTTGATGGGAAGTCTATCTGTATCCTTCTTGCCATTGCCACATATTATGACTATGAGGTATGGCAA
ATAGATGTTAAGATAGCCTTTTTAAATGGAAAACTTGAGGAAACCATCTACATGGACCAACCCAAAGGGTTCATTAAACCAGGACAAGAGCAAAAAGTCTGCAGGCTTAA
AAGGTCGATTTATGGACTGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTGATGAGGTTATCAGATCTTATGGTTTTGACCAGAATGATGATGAACCTTGTGTCTACA
AGAAAATCATTAATAGTTCTATCGCTTTCCTAATTCTCTATGTGGATGATATCCTACTCATTGGGAATGATGAAGGTTATCTCACTAACATCAAGGAATGGCTAACTACG
CAATTCCAAATGAAAGATTTCGGTGATGCGCAGTTTGTTCTTGGGATCCAGATTGTCCGTAACCACAAGAATAAAACACTAACCTTGTCTCAAGCATCATACATAGACAA
AGTGTTGTCAAGGTTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCCTTTTAGACATGAAATCCATTTGTCTAAGGAACAGTGTCCTAAGACACCTTAA
Protein sequence
MPRRSGWVVIQPDRYMSLAETQVVIPDELCEDPLTYNQAMVDKDKDKWVIAMDQEMESMYFNSVWDLVDKPDGVKHIGCKWIYKRKRGVDGKSICILLAIATYYDYEVWQ
IDVKIAFLNGKLEETIYMDQPKGFIKPGQEQKVCRLKRSIYGLKQASRSWNIRFDEVIRSYGFDQNDDEPCVYKKIINSSIAFLILYVDDILLIGNDEGYLTNIKEWLTT
QFQMKDFGDAQFVLGIQIVRNHKNKTLTLSQASYIDKVLSRFKMQDSKKGLLPFRHEIHLSKEQCPKTP
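
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal Python sketch, assuming the standard genetic code (the page does not state which translation table was used). `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used for brevity.

```python
BASES = "TCAG"
# Standard genetic code with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGCCTCGACGTAGTGGGTGG"))  # first seven codons of the CDS
print(translate("AAGACACCTTAA"))           # last four codons, including the stop
```

The two fragments translate to MPRRSGW and KTP, which match the start and end of the protein sequence listed above. Passing the full CDS to the same function would allow a complete comparison.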