; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010934 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010934
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:1604245..1606116
RNA-Seq ExpressionPay0010934
SyntenyPay0010934
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045828.1 DUF4219 domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]2.4e-15964.11Show/hide
Query:  ILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIINY
        ++KQGYADPDDKGKLRENKKKDSKALMIIQQAV+DIVFSRIAAATTSKQAWLILQNAFQGDLRVLV+     R D+ +                    +Y
Subjt:  ILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIINY

Query:  SGEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQ------VKDVVPKY-----------------------------
        SGEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQ         VVP +                             
Subjt:  SGEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQ------VKDVVPKY-----------------------------

Query:  ------------------------NDSDH----------------------VMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF
                                +D +                       V TRGRGRGGYRG GRGTRKRCNRNEEQRQFGVQSSNK+NIQCYHCKKF
Subjt:  ------------------------NDSDH----------------------VMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF

Query:  -----------------------------------------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGN
                                                                   GLK VFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGN
Subjt:  -----------------------------------------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGN

Query:  RILTNVQYVPDIGYNLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQKRGSR
        RILTNVQYVPDIGYNLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQKRGSR
Subjt:  RILTNVQYVPDIGYNLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQKRGSR

Query:  EEESNCRGDGKKHVANERPFE
        EEESNCRGDGKKHVANERPFE
Subjt:  EEESNCRGDGKKHVANERPFE

KAA0055915.1 copia protein [Cucumis melo var. makuwa]1.1e-14564.27Show/hide
Query:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV
        MGT QPLIPIFKGEGYEFWSI              ++QGY DPDD+GKL+EN++KD KAL+I+QQAVHD VFSRIAAATTSKQAWLILQ AFQGD RVLV
Subjt:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV

Query:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE
        VKLQSL+RDFETLMMKNGESIADFLSRATTII+     GE          VLRSLTPKFDHVV AIEESKDLST+TFIELM SLQA+E RIN SME+N+E
Subjt:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE

Query:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF----------------------------------
        KAF+VKDVVPKYNDSD VMT+G+G GGYR RGRGT K CN+NEEQRQFGVQSSNKANIQCYHCKKF                                  
Subjt:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF----------------------------------

Query:  -------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDI-GYNLLSVGQL
                                 GLKPVFKELNEGEKLKVEL N KELQVEGK  +GIETH+GNRILTNVQ            V ++  + L +    
Subjt:  -------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDI-GYNLLSVGQL

Query:  IESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
             S L+   S+S+TFEKFKHFKAKV+KQSG+FIKSLRSDRGG+FL NNFNHFCEEHGIHRELTTPYT EQ
Subjt:  IESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

TYK12002.1 UBN2 domain-containing protein [Cucumis melo var. makuwa]2.0e-13472.18Show/hide
Query:  SILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIIN
        ++++QGYADPDD+GKLR NKKKDSK L+IIQQAVHD VFS+I  ATTSKQAWLILQ  FQGD RVLVVKLQSLRRDFETLMMKNGESIADFLSRATTII+
Subjt:  SILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIIN

Query:  ----YS---------GEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGR
            YS          +VLRSLTPKFDHVV  IEESKDLST+TFIELM SLQA+E RINRSMERNEEKAFQVKDVV KYNDSD V TRGRGRGGYRGRG 
Subjt:  ----YS---------GEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGR

Query:  GTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG----------LKPVFKELNEGEKL--KVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY
        G  K CN+NEEQRQFGVQSSNKANIQCYH KKFG           +   ++  E +++  +VEL NGKELQVEGKGTVGIETHHGNRILT          
Subjt:  GTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG----------LKPVFKELNEGEKL--KVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY

Query:  NLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
                       +F  +SKS+TFEKFKHFKAKV+KQSG+FIKSLRSDRGGEFL NNFNHFCEEHGIHRELTTPYTPEQ
Subjt:  NLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

TYK27735.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]4.3e-14554.75Show/hide
Query:  EFWSILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATT
        + W +++QGY DPDD+GKLREN+KKDSKAL+IIQQAVHD VFSRIA ATTSKQAWLILQ AFQGD RVL+VKLQSLRRDFETLMMKNGESIADFLSRATT
Subjt:  EFWSILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATT

Query:  IINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRG
        II+     GE          VLRSLTPKFDHVV AIEESK+L T+TFIELM SL+A+E RINRSMERNEEKAFQVKD VPKYNDSD VMTRGRGRGGYRG
Subjt:  IINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRG

Query:  RGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF-----------------------------------------------------------GLKPV
        RG GT K CNRNE QRQFGVQSSNKANIQCYHCKKF                                                           GLKPV
Subjt:  RGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF-----------------------------------------------------------GLKPV

Query:  FKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-------------------------------
        FKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRILTNVQYVPDIGYNLLSVGQL+ESGYSILFDD                               
Subjt:  FKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------ESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
                         ESKS+TFEKFKHFKAKV+KQSG+FIKSLRSDRG EFL NNFNHFC+EHGIHRELTTPYTPEQ
Subjt:  -----------------ESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

TYK28117.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]1.6e-13956.15Show/hide
Query:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV
        MGTAQPLIPIFKGEGYEFWSI              ++QGY DPDD+GKL+EN++KDSKAL+IIQQAVHD VFSRIAAATT                    
Subjt:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV

Query:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE
               RDFETLMMKNGESIADFLSRATTII+     GE          VLRSLTPKFDHVVVAIEESKDLST+TFIELM SLQA+E RIN SME+NEE
Subjt:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE

Query:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG---------------------------------
        KAF+VKDVVPKYNDSD VMT+G+G GGYR RGRGT K CN+NEEQRQFGVQSSNKANIQCYHCKKFG                                 
Subjt:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG---------------------------------

Query:  --------------------------LKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-
                                  LKPVFKELNEGEKLKVEL NGKELQVEGK T+GIETH+GNRILTNVQYVPDIGYNLLSVGQL+ESG+SILFDD 
Subjt:  --------------------------LKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-

Query:  -----------------------------------------------------------------------------------ESKSKTFEKFKHFKAKV
                                                                                           +S+S+TFEKFKHFKAKV
Subjt:  -----------------------------------------------------------------------------------ESKSKTFEKFKHFKAKV

Query:  KKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
        +KQSG+FIKS RSDRGG+FL NNFNHFCEEHGIHRELTTPYT EQ
Subjt:  KKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

TrEMBL top hitse value%identityAlignment
A0A5A7TSJ0 DUF4219 domain-containing protein/UBN2 domain-containing protein1.1e-15964.11Show/hide
Query:  ILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIINY
        ++KQGYADPDDKGKLRENKKKDSKALMIIQQAV+DIVFSRIAAATTSKQAWLILQNAFQGDLRVLV+     R D+ +                    +Y
Subjt:  ILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIINY

Query:  SGEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQ------VKDVVPKY-----------------------------
        SGEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQ         VVP +                             
Subjt:  SGEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQ------VKDVVPKY-----------------------------

Query:  ------------------------NDSDH----------------------VMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF
                                +D +                       V TRGRGRGGYRG GRGTRKRCNRNEEQRQFGVQSSNK+NIQCYHCKKF
Subjt:  ------------------------NDSDH----------------------VMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF

Query:  -----------------------------------------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGN
                                                                   GLK VFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGN
Subjt:  -----------------------------------------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGN

Query:  RILTNVQYVPDIGYNLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQKRGSR
        RILTNVQYVPDIGYNLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQKRGSR
Subjt:  RILTNVQYVPDIGYNLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQKRGSR

Query:  EEESNCRGDGKKHVANERPFE
        EEESNCRGDGKKHVANERPFE
Subjt:  EEESNCRGDGKKHVANERPFE

A0A5A7UQM0 Copia protein5.5e-14664.27Show/hide
Query:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV
        MGT QPLIPIFKGEGYEFWSI              ++QGY DPDD+GKL+EN++KD KAL+I+QQAVHD VFSRIAAATTSKQAWLILQ AFQGD RVLV
Subjt:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV

Query:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE
        VKLQSL+RDFETLMMKNGESIADFLSRATTII+     GE          VLRSLTPKFDHVV AIEESKDLST+TFIELM SLQA+E RIN SME+N+E
Subjt:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE

Query:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF----------------------------------
        KAF+VKDVVPKYNDSD VMT+G+G GGYR RGRGT K CN+NEEQRQFGVQSSNKANIQCYHCKKF                                  
Subjt:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF----------------------------------

Query:  -------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDI-GYNLLSVGQL
                                 GLKPVFKELNEGEKLKVEL N KELQVEGK  +GIETH+GNRILTNVQ            V ++  + L +    
Subjt:  -------------------------GLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDI-GYNLLSVGQL

Query:  IESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
             S L+   S+S+TFEKFKHFKAKV+KQSG+FIKSLRSDRGG+FL NNFNHFCEEHGIHRELTTPYT EQ
Subjt:  IESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

A0A5D3CL10 UBN2 domain-containing protein9.7e-13572.18Show/hide
Query:  SILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIIN
        ++++QGYADPDD+GKLR NKKKDSK L+IIQQAVHD VFS+I  ATTSKQAWLILQ  FQGD RVLVVKLQSLRRDFETLMMKNGESIADFLSRATTII+
Subjt:  SILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIIN

Query:  ----YS---------GEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGR
            YS          +VLRSLTPKFDHVV  IEESKDLST+TFIELM SLQA+E RINRSMERNEEKAFQVKDVV KYNDSD V TRGRGRGGYRGRG 
Subjt:  ----YS---------GEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGR

Query:  GTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG----------LKPVFKELNEGEKL--KVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY
        G  K CN+NEEQRQFGVQSSNKANIQCYH KKFG           +   ++  E +++  +VEL NGKELQVEGKGTVGIETHHGNRILT          
Subjt:  GTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG----------LKPVFKELNEGEKL--KVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY

Query:  NLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
                       +F  +SKS+TFEKFKHFKAKV+KQSG+FIKSLRSDRGGEFL NNFNHFCEEHGIHRELTTPYTPEQ
Subjt:  NLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

A0A5D3DWC7 Putative gag-pol polyprotein, identical7.7e-14056.15Show/hide
Query:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV
        MGTAQPLIPIFKGEGYEFWSI              ++QGY DPDD+GKL+EN++KDSKAL+IIQQAVHD VFSRIAAATT                    
Subjt:  MGTAQPLIPIFKGEGYEFWSI--------------LKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLV

Query:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE
               RDFETLMMKNGESIADFLSRATTII+     GE          VLRSLTPKFDHVVVAIEESKDLST+TFIELM SLQA+E RIN SME+NEE
Subjt:  VKLQSLRRDFETLMMKNGESIADFLSRATTIINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEE

Query:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG---------------------------------
        KAF+VKDVVPKYNDSD VMT+G+G GGYR RGRGT K CN+NEEQRQFGVQSSNKANIQCYHCKKFG                                 
Subjt:  KAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKFG---------------------------------

Query:  --------------------------LKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-
                                  LKPVFKELNEGEKLKVEL NGKELQVEGK T+GIETH+GNRILTNVQYVPDIGYNLLSVGQL+ESG+SILFDD 
Subjt:  --------------------------LKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-

Query:  -----------------------------------------------------------------------------------ESKSKTFEKFKHFKAKV
                                                                                           +S+S+TFEKFKHFKAKV
Subjt:  -----------------------------------------------------------------------------------ESKSKTFEKFKHFKAKV

Query:  KKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
        +KQSG+FIKS RSDRGG+FL NNFNHFCEEHGIHRELTTPYT EQ
Subjt:  KKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

A0A5D3DWP2 Putative gag-pol polyprotein, identical2.1e-14554.75Show/hide
Query:  EFWSILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATT
        + W +++QGY DPDD+GKLREN+KKDSKAL+IIQQAVHD VFSRIA ATTSKQAWLILQ AFQGD RVL+VKLQSLRRDFETLMMKNGESIADFLSRATT
Subjt:  EFWSILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADFLSRATT

Query:  IINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRG
        II+     GE          VLRSLTPKFDHVV AIEESK+L T+TFIELM SL+A+E RINRSMERNEEKAFQVKD VPKYNDSD VMTRGRGRGGYRG
Subjt:  IINYS---GE----------VLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRG

Query:  RGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF-----------------------------------------------------------GLKPV
        RG GT K CNRNE QRQFGVQSSNKANIQCYHCKKF                                                           GLKPV
Subjt:  RGRGTRKRCNRNEEQRQFGVQSSNKANIQCYHCKKF-----------------------------------------------------------GLKPV

Query:  FKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-------------------------------
        FKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRILTNVQYVPDIGYNLLSVGQL+ESGYSILFDD                               
Subjt:  FKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDD-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------ESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ
                         ESKS+TFEKFKHFKAKV+KQSG+FIKSLRSDRG EFL NNFNHFC+EHGIHRELTTPYTPEQ
Subjt:  -----------------ESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQ

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-0740.98Show/hide
Query:  ESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPE
        ++K + F+ F+ F A V++++G  +K LRSD GGE+    F  +C  HGI  E T P TP+
Subjt:  ESKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPE

Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein3.5e-0429.51Show/hide
Query:  YEFWSILKQGYADPDDKGK--------LRENKKKDSKALMIIQQAVHDIVFSRIAAATTSK
        ++ W I+++G+ +P+++G         LR+++K+D KAL +I Q + +  F ++  AT++K
Subjt:  YEFWSILKQGYADPDDKGK--------LRENKKKDSKALMIIQQAVHDIVFSRIAAATTSK

AT3G21000.1 Gag-Pol-related retrotransposon family protein1.8e-0825.15Show/hide
Query:  PDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAW-LILQNAFQGDLRVL-VVKLQSLRRDFETLMMKNGESIADFLSRATTIINYSG----
        P++  K R+   KD+KAL I+Q ++ D VF +  +A+++K  W L+ +   Q  +R L  V ++ L +  E L M + ES + +L +A  I+   G    
Subjt:  PDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAW-LILQNAFQGDLRVL-VVKLQSLRRDFETLMMKNGESIADFLSRATTIINYSG----

Query:  ---------EVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAF-QVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRC
                  V  +L+  FD +   +EE  D+   T   L+E    +  R++ S    EE  F  +KD+  K                   +  G   + 
Subjt:  ---------EVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAF-QVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRC

Query:  NRNEEQRQFGVQSSNK-------ANIQCYHCKKFGLK-------------PV--------FKELNEGEKLKVELENGKELQVEGKGTVGIETHHG-NRIL
        N N+E  +F + +  +        + +       G K             P+        F  L+   K  V   +G  L VEGKG V I    G  + +
Subjt:  NRNEEQRQFGVQSSNK-------ANIQCYHCKKFGLK-------------PV--------FKELNEGEKLKVELENGKELQVEGKGTVGIETHHG-NRIL

Query:  TNVQYVPDIGYNLLSVGQLIESGYSI
         NV +VP +  N+LS G+++   YSI
Subjt:  TNVQYVPDIGYNLLSVGQLIESGYSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTACAGCACAACCACTCATTCCAATCTTCAAAGGAGAAGGCTACGAGTTTTGGAGTATTCTAAAACAAGGTTATGCGGATCCTGACGACAAAGGCAAGTTGCGGGA
GAACAAGAAGAAAGACTCGAAGGCGTTGATGATTATTCAACAAGCAGTCCATGACATTGTTTTTTCCCGGATTGCTGCTGCAACAACGTCAAAACAAGCGTGGCTGATTT
TGCAAAATGCATTTCAAGGAGATTTAAGAGTACTTGTGGTTAAATTGCAATCACTTAGACGAGACTTTGAGACCTTAATGATGAAAAATGGAGAATCAATTGCTGATTTT
TTATCACGGGCAACAACAATTATTAACTATAGTGGAGAAGTATTGAGAAGTTTGACTCCAAAGTTTGATCATGTTGTGGTTGCAATAGAAGAATCAAAGGATCTGTCCAC
TTACACATTTATTGAATTAATGGAATCTCTTCAAGCATATGAGTTGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAA
AGTATAATGACAGTGATCATGTCATGACTCGAGGCCGAGGAAGAGGAGGATATCGTGGTCGAGGTCGTGGTACCAGAAAAAGATGTAATCGAAATGAAGAACAAAGGCAG
TTCGGAGTGCAATCAAGCAACAAAGCTAATATTCAATGCTACCATTGCAAGAAGTTCGGTTTGAAGCCTGTATTCAAGGAGCTTAACGAAGGAGAAAAGTTGAAGGTGGA
GCTCGAAAACGGCAAAGAACTACAAGTAGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCCGATATTGGAT
ATAATTTGCTGAGTGTTGGACAACTAATAGAGAGTGGGTATTCTATCTTGTTTGACGATGAAAGCAAATCAAAAACATTTGAGAAGTTCAAGCATTTCAAGGCAAAGGTA
AAAAAGCAAAGTGGCGTGTTCATCAAATCTCTTCGTAGTGATAGAGGTGGAGAATTTTTGTTCAACAACTTCAACCATTTTTGTGAGGAACATGGCATCCATAGGGAGTT
GACAACACCTTACACTCCGGAGCAAAAAAGGGGTAGCCGAGAGGAAGAATCAAACTGTCGTGGAGATGGCAAGAAGCATGTTGCAAATGAAAGGCCTTTCGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTACAGCACAACCACTCATTCCAATCTTCAAAGGAGAAGGCTACGAGTTTTGGAGTATTCTAAAACAAGGTTATGCGGATCCTGACGACAAAGGCAAGTTGCGGGA
GAACAAGAAGAAAGACTCGAAGGCGTTGATGATTATTCAACAAGCAGTCCATGACATTGTTTTTTCCCGGATTGCTGCTGCAACAACGTCAAAACAAGCGTGGCTGATTT
TGCAAAATGCATTTCAAGGAGATTTAAGAGTACTTGTGGTTAAATTGCAATCACTTAGACGAGACTTTGAGACCTTAATGATGAAAAATGGAGAATCAATTGCTGATTTT
TTATCACGGGCAACAACAATTATTAACTATAGTGGAGAAGTATTGAGAAGTTTGACTCCAAAGTTTGATCATGTTGTGGTTGCAATAGAAGAATCAAAGGATCTGTCCAC
TTACACATTTATTGAATTAATGGAATCTCTTCAAGCATATGAGTTGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAA
AGTATAATGACAGTGATCATGTCATGACTCGAGGCCGAGGAAGAGGAGGATATCGTGGTCGAGGTCGTGGTACCAGAAAAAGATGTAATCGAAATGAAGAACAAAGGCAG
TTCGGAGTGCAATCAAGCAACAAAGCTAATATTCAATGCTACCATTGCAAGAAGTTCGGTTTGAAGCCTGTATTCAAGGAGCTTAACGAAGGAGAAAAGTTGAAGGTGGA
GCTCGAAAACGGCAAAGAACTACAAGTAGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCCGATATTGGAT
ATAATTTGCTGAGTGTTGGACAACTAATAGAGAGTGGGTATTCTATCTTGTTTGACGATGAAAGCAAATCAAAAACATTTGAGAAGTTCAAGCATTTCAAGGCAAAGGTA
AAAAAGCAAAGTGGCGTGTTCATCAAATCTCTTCGTAGTGATAGAGGTGGAGAATTTTTGTTCAACAACTTCAACCATTTTTGTGAGGAACATGGCATCCATAGGGAGTT
GACAACACCTTACACTCCGGAGCAAAAAAGGGGTAGCCGAGAGGAAGAATCAAACTGTCGTGGAGATGGCAAGAAGCATGTTGCAAATGAAAGGCCTTTCGAATGA
Protein sequenceShow/hide protein sequence
MGTAQPLIPIFKGEGYEFWSILKQGYADPDDKGKLRENKKKDSKALMIIQQAVHDIVFSRIAAATTSKQAWLILQNAFQGDLRVLVVKLQSLRRDFETLMMKNGESIADF
LSRATTIINYSGEVLRSLTPKFDHVVVAIEESKDLSTYTFIELMESLQAYELRINRSMERNEEKAFQVKDVVPKYNDSDHVMTRGRGRGGYRGRGRGTRKRCNRNEEQRQ
FGVQSSNKANIQCYHCKKFGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDDESKSKTFEKFKHFKAKV
KKQSGVFIKSLRSDRGGEFLFNNFNHFCEEHGIHRELTTPYTPEQKRGSREEESNCRGDGKKHVANERPFE