; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0226761 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0226761
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-Pol
Genome locationCMiso1.1chr08:19155120..19155878
RNA-Seq ExpressionCmc08g0226761
SyntenyCmc08g0226761
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:1901576 - organic substance biosynthetic process (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0050896 - response to stimulus (biological process)
GO:0046488 - phosphatidylinositol metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016829 - lyase activity (molecular function)
GO:0046914 - transition metal ion binding (molecular function)
GO:0043168 - anion binding (molecular function)
GO:0000166 - nucleotide binding (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
GO:0016307 - phosphatidylinositol phosphate kinase activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0004672 - protein kinase activity (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAD34493.1 Gag-Pol [Ipomoea batatas]5.1e-9871.05Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF
        ME NVAYCLLTEDGEP T ++A+ SSD +QW  AMQEEIEALHKN TW+LV LPQGRKPIGNK                   LV+KGYAQK+GID+NEIF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF

Query:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK
        SPVVRLTTVR+VLA+CATF+LHLEQL VKTAFLHGDLEEEIYMLQ EGFE K    LVC+LNKSLYGLKQA RCWYK              + DP AYFK
Subjt:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK

Query:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        RFG+ +F++LLLYVDDMLV G NKDHI+ELKA LAREFEMKDLG ANKILGMQIHRD  NRKIWLS
Subjt:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

KAA0060044.1 Gag-Pol [Cucumis melo var. makuwa]1.2e-10787.98Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF
        MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK                            IVLAICATF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF

Query:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPYAYFKRFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAH
        DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPYAYFKRFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAH
Subjt:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPYAYFKRFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAH

Query:  LAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        LAREFEMKDLGLANKILGMQIHRDINNRKIWLS
Subjt:  LAREFEMKDLGLANKILGMQIHRDINNRKIWLS

KAE8725385.1 Desiccation-related protein PCC13-62 [Hibiscus syriacus]5.7e-9770.3Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF
        +E N+AYCLLTEDGEP T ++A+ SSDA+ WM AMQEEIEALHKN TW+LV LPQGRKPIGNK                   LV+KGYAQK+GID+NEIF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF

Query:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK
        SPVVRLTTVR++LA+CAT +LHLEQL VKTAFLHG+LEEEIYMLQ EGFE K K  LVC+LNKSLYGLKQA RCWYK              + DP AYFK
Subjt:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK

Query:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        R G+ DF++LLLYVDDMLV G NKDHIEELKA LAREFEMKDLG ANKILGMQIHRD +NRKIWLS
Subjt:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

KAE8731597.1 hypothetical protein F3Y22_tig00002793pilonHSYRG00074 [Hibiscus syriacus]1.6e-9669.55Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF
        +E N+AYCLLTEDGEP T ++A+ SSDA+ WM AMQEEIEALHKN TW+LV LPQGRKPIGNK                   LV+KGYAQK+GID+NEIF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF

Query:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK
        SPVVRLTTVR+VLA+CATF+LHLEQL VKT FLHG+LEE+IYMLQ EGFE   K  LVC+LNKSLYGLKQA RCWYK              + DP AYFK
Subjt:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK

Query:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        R G+ DF++L+LYVDDMLV G NKDHIEELKA LAREFEMKDLG ANKILGMQIHRD +NRKIWLS
Subjt:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

TYK21895.1 polyprotein [Cucumis melo var. makuwa]9.6e-12190.69Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF
        MESNVAY LL EDGEPLTLKDAMTSSD AQWMTAMQEEIEALHKNKTWELVMLP+GRKPIGNKLVIKGYAQK+GIDYNEIFSPVVRLTTVRIVLAICATF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF

Query:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFKRFGEKDFIVLLLYVDDMLV
        DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFE KGKDKLVCKLNKSLYGLK ALRCWYK              ++DPYAYFKRFGEKDFIVLLLYVDDMLV
Subjt:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFKRFGEKDFIVLLLYVDDMLV

Query:  VGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        VGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
Subjt:  VGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

TrEMBL top hitse value%identityAlignment
A0A5A7UYA7 Gag-Pol5.9e-10887.98Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF
        MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK                            IVLAICATF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF

Query:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPYAYFKRFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAH
        DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPYAYFKRFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAH
Subjt:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPYAYFKRFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAH

Query:  LAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        LAREFEMKDLGLANKILGMQIHRDINNRKIWLS
Subjt:  LAREFEMKDLGLANKILGMQIHRDINNRKIWLS

A0A5D3DEC2 Polyprotein4.6e-12190.69Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF
        MESNVAY LL EDGEPLTLKDAMTSSD AQWMTAMQEEIEALHKNKTWELVMLP+GRKPIGNKLVIKGYAQK+GIDYNEIFSPVVRLTTVRIVLAICATF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATF

Query:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFKRFGEKDFIVLLLYVDDMLV
        DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFE KGKDKLVCKLNKSLYGLK ALRCWYK              ++DPYAYFKRFGEKDFIVLLLYVDDMLV
Subjt:  DLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFKRFGEKDFIVLLLYVDDMLV

Query:  VGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        VGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
Subjt:  VGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

A0A6A3C800 Desiccation-related protein PCC13-622.7e-9770.3Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF
        +E N+AYCLLTEDGEP T ++A+ SSDA+ WM AMQEEIEALHKN TW+LV LPQGRKPIGNK                   LV+KGYAQK+GID+NEIF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF

Query:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK
        SPVVRLTTVR++LA+CAT +LHLEQL VKTAFLHG+LEEEIYMLQ EGFE K K  LVC+LNKSLYGLKQA RCWYK              + DP AYFK
Subjt:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK

Query:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        R G+ DF++LLLYVDDMLV G NKDHIEELKA LAREFEMKDLG ANKILGMQIHRD +NRKIWLS
Subjt:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

A0A6A3CSP3 Uncharacterized protein8.0e-9769.55Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF
        +E N+AYCLLTEDGEP T ++A+ SSDA+ WM AMQEEIEALHKN TW+LV LPQGRKPIGNK                   LV+KGYAQK+GID+NEIF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF

Query:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK
        SPVVRLTTVR+VLA+CATF+LHLEQL VKT FLHG+LEE+IYMLQ EGFE   K  LVC+LNKSLYGLKQA RCWYK              + DP AYFK
Subjt:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK

Query:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        R G+ DF++L+LYVDDMLV G NKDHIEELKA LAREFEMKDLG ANKILGMQIHRD +NRKIWLS
Subjt:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

Q6BCY1 Gag-Pol2.5e-9871.05Show/hide
Query:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF
        ME NVAYCLLTEDGEP T ++A+ SSD +QW  AMQEEIEALHKN TW+LV LPQGRKPIGNK                   LV+KGYAQK+GID+NEIF
Subjt:  MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIF

Query:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK
        SPVVRLTTVR+VLA+CATF+LHLEQL VKTAFLHGDLEEEIYMLQ EGFE K    LVC+LNKSLYGLKQA RCWYK              + DP AYFK
Subjt:  SPVVRLTTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFK

Query:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        RFG+ +F++LLLYVDDMLV G NKDHI+ELKA LAREFEMKDLG ANKILGMQIHRD  NRKIWLS
Subjt:  RFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.8e-2932.14Show/hide
Query:  PLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGR-------------KPIGN------KLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAI
        P +  +     D + W  A+  E+ A   N TW +   P+ +               +GN      +LV +G+ QK  IDY E F+PV R+++ R +L++
Subjt:  PLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGR-------------KPIGN------KLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAI

Query:  CATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFKRFGE-KDFIVLLLYV
           ++L + Q+ VKTAFL+G L+EEIYM   +G      +  VCKLNK++YGLKQA RCW++              S+D   Y    G   + I +LLYV
Subjt:  CATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK--------------SLDPYAYFKRFGE-KDFIVLLLYV

Query:  DDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        DD+++   +   +   K +L  +F M DL      +G++I  ++   KI+LS
Subjt:  DDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.8e-6852.31Show/hide
Query:  YCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIFSPVVRL
        Y L+++D EP +LK+ ++  +  Q M AMQEE+E+L KN T++LV LP+G++P+  K                   LV+KG+ QKKGID++EIFSPVV++
Subjt:  YCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIFSPVVRL

Query:  TTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSL--------------DPYAYFKRFGEKD
        T++R +L++ A+ DL +EQL VKTAFLHGDLEEEIYM Q EGFE  GK  +VCKLNKSLYGLKQA R WY                 DP  YFKRF E +
Subjt:  TTVRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSL--------------DPYAYFKRFGEKD

Query:  FIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS
        FI+LLLYVDDML+VG +K  I +LK  L++ F+MKDLG A +ILGM+I R+  +RK+WLS
Subjt:  FIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHRDINNRKIWLS

P25600 Putative transposon Ty5-1 protein YCL074W1.4e-1032.82Show/hide
Query:  VKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK----SLDPYAYFKRFGEKDF---------IVLLLYVDDMLVVGSNKDHIE
        V TAFL+  ++E IY+ Q  GF  +     V +L   +YGLKQA   W +    +L    + +  GE            I + +YVDD+LV   +    +
Subjt:  VKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYK----SLDPYAYFKRFGEKDF---------IVLLLYVDDMLVVGSNKDHIE

Query:  ELKAHLAREFEMKDLGLANKILGMQIHRDIN
         +K  L + + MKDLG  +K LG+ IH+  N
Subjt:  ELKAHLAREFEMKDLGLANKILGMQIHRDIN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.2e-2934.01Show/hide
Query:  LTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGN--------------------KLVIKGYAQKKGIDYNEIFSPVVRLTT
        L  + EP T   A+      +W  AM  EI A   N TW+LV  P     I                      +LV KGY Q+ G+DY E FSPV++ T+
Subjt:  LTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGN--------------------KLVIKGYAQKKGIDYNEIFSPVVRLTT

Query:  VRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPY-------------AYFKRFGEKDFIV
        +RIVL +       + QL V  AFL G L +++YM Q  GF  K +   VCKL K+LYGLKQA R WY  L  Y             + F     K  + 
Subjt:  VRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPY-------------AYFKRFGEKDFIV

Query:  LLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHR
        +L+YVDD+L+ G++   +     +L++ F +KD    +  LG++  R
Subjt:  LLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-2733.6Show/hide
Query:  LTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGN--------------------KLVIKGYAQKKGIDYNEIFSPVVRLTT
        L  + EP T   AM      +W  AM  EI A   N TW+LV  P     I                      +LV KGY Q+ G+DY E FSPV++ T+
Subjt:  LTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGN--------------------KLVIKGYAQKKGIDYNEIFSPVVRLTT

Query:  VRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPY-------------AYFKRFGEKDFIV
        +RIVL +       + QL V  AFL G L +E+YM Q  GF  K +   VC+L K++YGLKQA R WY  L  Y             + F     +  I 
Subjt:  VRIVLAICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPY-------------AYFKRFGEKDFIV

Query:  LLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHR
        +L+YVDD+L+ G++   ++     L++ F +K+    +  LG++  R
Subjt:  LLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.2e-3838.37Show/hide
Query:  EPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLA
        EP T  +A    +   W  AM +EI A+    TWE+  LP  +KPIG K                   LV KGY Q++GID+ E FSPV +LT+V+++LA
Subjt:  EPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLA

Query:  ICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKL----VCKLNKSLYGLKQALRCWY-------------KSLDPYAYFKRFGEKDFIVLL
        I A ++  L QL +  AFL+GDL+EEIYM    G+ A+  D L    VC L KS+YGLKQA R W+             +S   + YF +     F+ +L
Subjt:  ICATFDLHLEQLAVKTAFLHGDLEEEIYMLQLEGFEAKGKDKL----VCKLNKSLYGLKQALRCWY-------------KSLDPYAYFKRFGEKDFIVLL

Query:  LYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHR
        +YVDD+++  +N   ++ELK+ L   F+++DLG     LG++I R
Subjt:  LYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQIHR

ATMG00810.1 DNA/RNA polymerases superfamily protein6.0e-0448.89Show/hide
Query:  LLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQI
        LLLYVDD+L+ GS+   +  L   L+  F MKDLG  +  LG+QI
Subjt:  LLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.4e-0836.26Show/hide
Query:  WMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATFDL
        W  AMQEE++AL +NKTW LV  P  +  +G K                   LV KG+ Q++GI + E +SPVVR  T+R +L +    ++
Subjt:  WMTAMQEEIEALHKNKTWELVMLPQGRKPIGNK-------------------LVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATFDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGCAATGTTGCATACTGTCTTCTAACCGAGGATGGAGAGCCATTAACTCTAAAAGATGCAATGACCAGTTCAGATGCTGCTCAGTGGATGACAGCTATGCAGGA
AGAAATAGAAGCTCTTCATAAGAACAAGACTTGGGAACTCGTAATGCTACCACAAGGGAGAAAGCCCATTGGGAACAAATTGGTGATAAAGGGATATGCTCAGAAAAAAG
GCATTGACTACAATGAAATATTTTCTCCAGTAGTTCGACTTACTACTGTTAGAATAGTTTTAGCAATATGTGCTACATTTGACTTACACTTGGAGCAGTTAGCTGTAAAA
ACTGCATTTCTCCATGGAGATCTTGAAGAAGAAATATATATGCTTCAACTAGAAGGCTTTGAAGCAAAAGGAAAAGATAAATTGGTTTGCAAGTTAAACAAGTCTCTATA
CGGTCTCAAACAGGCGTTAAGGTGTTGGTACAAGTCTCTAGACCCTTATGCATATTTTAAGAGGTTTGGAGAAAAAGACTTCATTGTCTTATTGTTGTACGTAGACGACA
TGTTGGTAGTAGGCTCTAACAAAGATCATATTGAAGAATTGAAGGCTCATTTGGCTAGGGAATTTGAAATGAAAGACTTGGGACTAGCAAACAAGATTCTAGGGATGCAA
ATTCACCGAGACATAAATAATAGGAAGATTTGGCTATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGCAATGTTGCATACTGTCTTCTAACCGAGGATGGAGAGCCATTAACTCTAAAAGATGCAATGACCAGTTCAGATGCTGCTCAGTGGATGACAGCTATGCAGGA
AGAAATAGAAGCTCTTCATAAGAACAAGACTTGGGAACTCGTAATGCTACCACAAGGGAGAAAGCCCATTGGGAACAAATTGGTGATAAAGGGATATGCTCAGAAAAAAG
GCATTGACTACAATGAAATATTTTCTCCAGTAGTTCGACTTACTACTGTTAGAATAGTTTTAGCAATATGTGCTACATTTGACTTACACTTGGAGCAGTTAGCTGTAAAA
ACTGCATTTCTCCATGGAGATCTTGAAGAAGAAATATATATGCTTCAACTAGAAGGCTTTGAAGCAAAAGGAAAAGATAAATTGGTTTGCAAGTTAAACAAGTCTCTATA
CGGTCTCAAACAGGCGTTAAGGTGTTGGTACAAGTCTCTAGACCCTTATGCATATTTTAAGAGGTTTGGAGAAAAAGACTTCATTGTCTTATTGTTGTACGTAGACGACA
TGTTGGTAGTAGGCTCTAACAAAGATCATATTGAAGAATTGAAGGCTCATTTGGCTAGGGAATTTGAAATGAAAGACTTGGGACTAGCAAACAAGATTCTAGGGATGCAA
ATTCACCGAGACATAAATAATAGGAAGATTTGGCTATCTTAG
Protein sequenceShow/hide protein sequence
MESNVAYCLLTEDGEPLTLKDAMTSSDAAQWMTAMQEEIEALHKNKTWELVMLPQGRKPIGNKLVIKGYAQKKGIDYNEIFSPVVRLTTVRIVLAICATFDLHLEQLAVK
TAFLHGDLEEEIYMLQLEGFEAKGKDKLVCKLNKSLYGLKQALRCWYKSLDPYAYFKRFGEKDFIVLLLYVDDMLVVGSNKDHIEELKAHLAREFEMKDLGLANKILGMQ
IHRDINNRKIWLS