CuGenDBv2

Tan0011962 (gene) of Snake gourd v1 genome

Gene ID: Tan0011962
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG05:33258742..33259755
RNA-Seq Expression: Tan0011962
Synteny: Tan0011962
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
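The genome location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention). The function names `parse_location` and `span_length` are illustrative and not part of any CuGenDB API.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

def span_length(location: str) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1

print(parse_location("LG05:33258742..33259755"))  # ('LG05', 33258742, 33259755)
print(span_length("LG05:33258742..33259755"))     # 1014
```

Under that assumption the gene spans 1014 bp on LG05.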


Homology
GenBank top hits    e value    % identity    Alignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]    2.0e-90    53.51
Query:  RDHQPHSRIVLSEIFREATDK---STKVVDQAG------PSQVLRMPRHSGR------------------------------DAMNDIDRDEWIKAMDLE
        R+HQP S+IVL E+F+ ATDK   STKVVD+A        SQ LR+PR SGR                               AMND+DRD+WIKAM+LE
Subjt:  RDHQPHSRIVLSEIFREATDK---STKVVDQAG------PSQVLRMPRHSGR------------------------------DAMNDIDRDEWIKAMDLE

Query:  MESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGN
        MESMYFNSVW LVD P   +PIGCKWIYKRKRDQA GKVQTFKARL+AKGYTQ+EGVDYEETFSP AMLKSIRILLSIATFY+YEIWQMDVKT  F NGN
Subjt:  MESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGN

Query:  LEESIYMDQPR-----------------------------------VHSSG-------------------------------------------------
        LEESIYM QP                                    + S G                                                 
Subjt:  LEESIYMDQPR-----------------------------------VHSSG-------------------------------------------------

Query:  --SRAKGEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
           +  GEAQ++LGIQIVRN KNKTLA+S ASYIDK+L RYKM+NSKKG LPFRHGIHLSKEQ PKTPQEVEDMR IPY+SA  S
Subjt:  --SRAKGEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]    3.6e-92    52.55
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW
        MR+H+P S++VLS    EATD+ST+VVD+ G               PSQ LRMPR SGR                               AMND+D+D+W
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW

Query:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT
        +KAMDLEMESMYFNSVW+LVD P+G +PIGCKWIYKRKRD A GKVQTFKARL+AKGYTQREGVDYEETFSP AMLKSIRILLSIATFYDYEIWQMDVKT
Subjt:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT

Query:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------
          F NGNLEESI+M                                                 D+P V+   ++ K                        
Subjt:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------

Query:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
                     GEAQ+VLGIQI+R+ KNKTLALS A+YIDK+LVRY M+NSKKG LPFRHG+HLSKEQSPKTPQEVEDMRRIPYASA  S
Subjt:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

KAA0056349.1 gag/pol protein [Cucumis melo var. makuwa]    7.5e-90    56.68
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG
        MRDH+P S++VL+    EAT  ST+VVD+ GP               SQ LRMPR  GR AMND+D+D+W+KAMDLEMESMYFNSVW+LVD P+  +PIG
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG

Query:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------
        CKWIYKRKRD A GKVQTFKARL+AKGYTQRE VDYEETFS  AM KSIRI+LSIATFYDYEIWQMD KT  F NG+LEESI+M                
Subjt:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------

Query:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK
                D+P V+   ++ K                                     GEAQ+VLGIQI+R+ KNK LALS A+YIDK+LVRY M+NSKK
Subjt:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK

Query:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
        G LPFRHG+HLSK+Q PK PQE+EDMRRIPY SA  S
Subjt:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]    3.6e-92    52.55
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW
        MR+H+P S++VLS    EATD+ST+VVD+ G               PSQ LRMPR SGR                               AMND+D+D+W
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW

Query:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT
        +KAMDLEMESMYFNSVW+LVD P+G +PIGCKWIYKRKRD A GKVQTFKARL+AKGYTQREGVDYEETFSP AMLKSIRILLSIATFYDYEIWQMDVKT
Subjt:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT

Query:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------
          F NGNLEESI+M                                                 D+P V+   ++ K                        
Subjt:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------

Query:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
                     GEAQ+VLGIQI+R+ KNKTLALS A+YIDK+LVRY M+NSKKG LPFRHG+HLSKEQSPKTPQEVEDMRRIPYASA  S
Subjt:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

TYK29165.1 gag/pol protein [Cucumis melo var. makuwa]    7.5e-90    56.68
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG
        MRDH+P S++VL+    EAT  ST+VVD+ GP               SQ LRMPR  GR AMND+D+D+W+KAMDLEMESMYFNSVW+LVD P+  +PIG
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG

Query:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------
        CKWIYKRKRD A GKVQTFKARL+AKGYTQRE VDYEETFS  AM KSIRI+LSIATFYDYEIWQMD KT  F NG+LEESI+M                
Subjt:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------

Query:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK
                D+P V+   ++ K                                     GEAQ+VLGIQI+R+ KNK LALS A+YIDK+LVRY M+NSKK
Subjt:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK

Query:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
        G LPFRHG+HLSK+Q PK PQE+EDMRRIPY SA  S
Subjt:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

TrEMBL top hits    e value    % identity    Alignment
A0A5A7TZD0 Gag/pol protein    1.7e-92    52.55
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW
        MR+H+P S++VLS    EATD+ST+VVD+ G               PSQ LRMPR SGR                               AMND+D+D+W
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW

Query:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT
        +KAMDLEMESMYFNSVW+LVD P+G +PIGCKWIYKRKRD A GKVQTFKARL+AKGYTQREGVDYEETFSP AMLKSIRILLSIATFYDYEIWQMDVKT
Subjt:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT

Query:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------
          F NGNLEESI+M                                                 D+P V+   ++ K                        
Subjt:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------

Query:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
                     GEAQ+VLGIQI+R+ KNKTLALS A+YIDK+LVRY M+NSKKG LPFRHG+HLSKEQSPKTPQEVEDMRRIPYASA  S
Subjt:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

A0A5A7US54 Gag/pol protein    3.6e-90    56.68
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG
        MRDH+P S++VL+    EAT  ST+VVD+ GP               SQ LRMPR  GR AMND+D+D+W+KAMDLEMESMYFNSVW+LVD P+  +PIG
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG

Query:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------
        CKWIYKRKRD A GKVQTFKARL+AKGYTQRE VDYEETFS  AM KSIRI+LSIATFYDYEIWQMD KT  F NG+LEESI+M                
Subjt:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------

Query:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK
                D+P V+   ++ K                                     GEAQ+VLGIQI+R+ KNK LALS A+YIDK+LVRY M+NSKK
Subjt:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK

Query:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
        G LPFRHG+HLSK+Q PK PQE+EDMRRIPY SA  S
Subjt:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

A0A5A7UYE8 Gag/pol protein    1.7e-92    52.55
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW
        MR+H+P S++VLS    EATD+ST+VVD+ G               PSQ LRMPR SGR                               AMND+D+D+W
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAG---------------PSQVLRMPRHSGR------------------------------DAMNDIDRDEW

Query:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT
        +KAMDLEMESMYFNSVW+LVD P+G +PIGCKWIYKRKRD A GKVQTFKARL+AKGYTQREGVDYEETFSP AMLKSIRILLSIATFYDYEIWQMDVKT
Subjt:  IKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKT

Query:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------
          F NGNLEESI+M                                                 D+P V+   ++ K                        
Subjt:  RPFSNGNLEESIYM-------------------------------------------------DQPRVHSSGSRAK------------------------

Query:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
                     GEAQ+VLGIQI+R+ KNKTLALS A+YIDK+LVRY M+NSKKG LPFRHG+HLSKEQSPKTPQEVEDMRRIPYASA  S
Subjt:  -------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

A0A5D3E133 Gag/pol protein    3.6e-90    56.68
Query:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG
        MRDH+P S++VL+    EAT  ST+VVD+ GP               SQ LRMPR  GR AMND+D+D+W+KAMDLEMESMYFNSVW+LVD P+  +PIG
Subjt:  MRDHQPHSRIVLSEIFREATDKSTKVVDQAGP---------------SQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIG

Query:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------
        CKWIYKRKRD A GKVQTFKARL+AKGYTQRE VDYEETFS  AM KSIRI+LSIATFYDYEIWQMD KT  F NG+LEESI+M                
Subjt:  CKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYM----------------

Query:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK
                D+P V+   ++ K                                     GEAQ+VLGIQI+R+ KNK LALS A+YIDK+LVRY M+NSKK
Subjt:  --------DQPRVHSSGSRAK-------------------------------------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKK

Query:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
        G LPFRHG+HLSK+Q PK PQE+EDMRRIPY SA  S
Subjt:  GSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

E2GK51 Gag/pol protein (Fragment)    9.6e-91    53.51
Query:  RDHQPHSRIVLSEIFREATDK---STKVVDQAG------PSQVLRMPRHSGR------------------------------DAMNDIDRDEWIKAMDLE
        R+HQP S+IVL E+F+ ATDK   STKVVD+A        SQ LR+PR SGR                               AMND+DRD+WIKAM+LE
Subjt:  RDHQPHSRIVLSEIFREATDK---STKVVDQAG------PSQVLRMPRHSGR------------------------------DAMNDIDRDEWIKAMDLE

Query:  MESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGN
        MESMYFNSVW LVD P   +PIGCKWIYKRKRDQA GKVQTFKARL+AKGYTQ+EGVDYEETFSP AMLKSIRILLSIATFY+YEIWQMDVKT  F NGN
Subjt:  MESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGN

Query:  LEESIYMDQPR-----------------------------------VHSSG-------------------------------------------------
        LEESIYM QP                                    + S G                                                 
Subjt:  LEESIYMDQPR-----------------------------------VHSSG-------------------------------------------------

Query:  --SRAKGEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
           +  GEAQ++LGIQIVRN KNKTLA+S ASYIDK+L RYKM+NSKKG LPFRHGIHLSKEQ PKTPQEVEDMR IPY+SA  S
Subjt:  --SRAKGEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS

SwissProt top hits    e value    % identity    Alignment
P04146 Copia protein    1.3e-20    36.43
Query:  DRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQ
        D+  W +A++ E+ +   N+ W +  +P+ +  +  +W++  K ++    ++ +KARL+A+G+TQ+  +DYEETF+P A + S R +LS+   Y+ ++ Q
Subjt:  DRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQ

Query:  MDVKTRPFSNGNLEESIYMDQPRVHSSGS
        MDVKT  F NG L+E IYM  P+  S  S
Subjt:  MDVKTRPFSNGNLEESIYMDQPRVHSSGS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    1.8e-33    30.49
Query:  RDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATF
        ++ ++  ++++ +KAM  EMES+  N  +KLV+ P G+RP+ CKW++K K+D    K+  +KARL+ KG+ Q++G+D++E FSP   + SIR +LS+A  
Subjt:  RDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATF

Query:  YDYEIWQMDVKTRPFSNGNLEESIYMDQPR----------------------------------------------------------------------
         D E+ Q+DVKT  F +G+LEE IYM+QP                                                                       
Subjt:  YDYEIWQMDVKTRPFSNGNLEESIYMDQPR----------------------------------------------------------------------

Query:  ----------VHSSGSRAK-------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYA
                      G  +K       G AQ +LG++IVR   ++ L LS   YI+++L R+ MKN+K  S P    + LSK+  P T +E  +M ++PY+
Subjt:  ----------VHSSGSRAK-------GEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMKNSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYA

Query:  SAAAS
        SA  S
Subjt:  SAAAS

P92520 Uncharacterized mitochondrial protein AtMg00820    5.9e-13    39.08
Query:  WIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P  +  +GCKW++K K   + G +   KARL+AKG+ Q EG+ + ET+SP     +IR +L++A
Subjt:  WIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    1.4e-19    40.77
Query:  RDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPI-GCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIAT
        R A+  +  + W  AM  E+ +   N  W LV  P     I GC+WI+ +K + + G +  +KARL+AKGY QR G+DY ETFSP     SIRI+L +A 
Subjt:  RDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPI-GCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIAT

Query:  FYDYEIWQMDVKTRPFSNGNLEESIYMDQP
           + I Q+DV    F  G L + +YM QP
Subjt:  FYDYEIWQMDVKTRPFSNGNLEESIYMDQP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.1e-19    41.54
Query:  RDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPI-GCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIAT
        R A+  +  D W +AM  E+ +   N  W LV  P     I GC+WI+ +K + + G +  +KARL+AKGY QR G+DY ETFSP     SIRI+L +A 
Subjt:  RDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPI-GCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIAT

Query:  FYDYEIWQMDVKTRPFSNGNLEESIYMDQP
           + I Q+DV    F  G L + +YM QP
Subjt:  FYDYEIWQMDVKTRPFSNGNLEESIYMDQP

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    2.1e-26    44.07
Query:  WIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVK
        W  AMD E+ +M     W++   P  ++PIGCKW+YK K + + G ++ +KARL+AKGYTQ+EG+D+ ETFSP   L S++++L+I+  Y++ + Q+D+ 
Subjt:  WIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVK

Query:  TRPFSNGNLEESIYMDQP
        +  F NG+L+E IYM  P
Subjt:  TRPFSNGNLEESIYMDQP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    4.2e-14    39.08
Query:  WIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P  +  +GCKW++K K   + G +   KARL+AKG+ Q EG+ + ET+SP     +IR +L++A
Subjt:  WIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLAKGYTQREGVDYEETFSPFAMLKSIRILLSIA


Sequences
CDS sequence
ATGAGAGATCATCAGCCTCATAGCAGGATTGTCTTAAGTGAAATTTTCAGGGAAGCTACAGATAAATCAACAAAAGTTGTTGATCAAGCTGGTCCTTCTCAAGTGTTGAG
AATGCCTCGACATAGTGGGAGGGATGCAATGAATGATATAGATAGGGACGAGTGGATTAAAGCCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGAAAC
TTGTAGATCAACCAGATGGTGAAAGACCTATCGGTTGCAAGTGGATCTACAAGAGGAAACGAGATCAAGCTTCTGGTAAGGTGCAAACCTTTAAGGCTCGACTTTTGGCA
AAGGGTTATACCCAAAGGGAAGGGGTGGACTATGAAGAAACCTTCTCCCCATTTGCCATGCTTAAGTCCATTAGAATACTCTTGTCCATTGCCACATTTTATGACTATGA
AATTTGGCAAATGGATGTCAAGACACGACCTTTCTCGAATGGCAATCTTGAAGAGAGTATCTATATGGATCAACCGAGGGTTCATAGTTCAGGGTCAAGAGCAAAAGGAG
AAGCTCAGTTTGTTCTTGGAATCCAAATTGTTCGGAATCACAAGAACAAAACGCTAGCACTGTCTCCGGCATCTTATATCGATAAGATGTTGGTTAGATATAAGATGAAG
AATTCCAAAAAGGGTTCATTACCTTTCAGGCATGGAATTCATTTGTCTAAGGAACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGAATTCCCTATGCATC
CGCTGCTGCTTCGTAG
mRNA sequence
ATGAGAGATCATCAGCCTCATAGCAGGATTGTCTTAAGTGAAATTTTCAGGGAAGCTACAGATAAATCAACAAAAGTTGTTGATCAAGCTGGTCCTTCTCAAGTGTTGAG
AATGCCTCGACATAGTGGGAGGGATGCAATGAATGATATAGATAGGGACGAGTGGATTAAAGCCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGAAAC
TTGTAGATCAACCAGATGGTGAAAGACCTATCGGTTGCAAGTGGATCTACAAGAGGAAACGAGATCAAGCTTCTGGTAAGGTGCAAACCTTTAAGGCTCGACTTTTGGCA
AAGGGTTATACCCAAAGGGAAGGGGTGGACTATGAAGAAACCTTCTCCCCATTTGCCATGCTTAAGTCCATTAGAATACTCTTGTCCATTGCCACATTTTATGACTATGA
AATTTGGCAAATGGATGTCAAGACACGACCTTTCTCGAATGGCAATCTTGAAGAGAGTATCTATATGGATCAACCGAGGGTTCATAGTTCAGGGTCAAGAGCAAAAGGAG
AAGCTCAGTTTGTTCTTGGAATCCAAATTGTTCGGAATCACAAGAACAAAACGCTAGCACTGTCTCCGGCATCTTATATCGATAAGATGTTGGTTAGATATAAGATGAAG
AATTCCAAAAAGGGTTCATTACCTTTCAGGCATGGAATTCATTTGTCTAAGGAACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGAATTCCCTATGCATC
CGCTGCTGCTTCGTAG
Protein sequence
MRDHQPHSRIVLSEIFREATDKSTKVVDQAGPSQVLRMPRHSGRDAMNDIDRDEWIKAMDLEMESMYFNSVWKLVDQPDGERPIGCKWIYKRKRDQASGKVQTFKARLLA
KGYTQREGVDYEETFSPFAMLKSIRILLSIATFYDYEIWQMDVKTRPFSNGNLEESIYMDQPRVHSSGSRAKGEAQFVLGIQIVRNHKNKTLALSPASYIDKMLVRYKMK
NSKKGSLPFRHGIHLSKEQSPKTPQEVEDMRRIPYASAAAS
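
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and are not taken from the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGAGAGATCATCAGCCTCATAGC"))  # MRDHQPHS
```

Passing the full CDS, with its line breaks left in, should reproduce the listed protein sequence, since whitespace is removed before translation.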