CuGenDBv2

Cmc03g0069951 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0069951
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr03:15525680..15526531
RNA-Seq Expression: Cmc03g0069951
Synteny: Cmc03g0069951
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
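
The genome location field can be split into a sequence ID and a coordinate span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for annotation coordinates, not stated on this page); `Location` and `parse_location` are hypothetical helper names introduced only for illustration.

```python
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split 'seqid:start..end' into its three parts."""
    seqid, _, span = text.rpartition(":")
    start, _, end = span.partition("..")
    return Location(seqid, int(start), int(end))


loc = parse_location("CMiso1.1chr03:15525680..15526531")
print(loc.seqid, loc.length)  # CMiso1.1chr03 852
```

Under that assumption the locus spans 852 bases.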


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]; e value 1.7e-132; identity 81.98%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MD +FQ+YLIE GIQSQL APSTPQQNGV ERRNRTLLDMVRSMMS+AQLPDSFWGYALET I+ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCP +
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR
        VLVQNPKKLE RSKLCLFVGYPKES+GG+FY PQENKVF+STNATFLEEDH RNHQ  SK+VL+E+FKN+ D+PSSSTKVVDK     Q+H  QEL  PR
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR

Query:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
        RSGRVV Q +RYLGL ETQIII DDG+EDPLTYKQAMNDVD DQWIKAM+L+MESMY NSVWTLVD P+D+KPIGCKWIYKRK
Subjt:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.6e-120; identity 73.43%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MDL+FQ+Y+IEHGIQSQL AP TPQQNGV ERRNRTLLDMVRSMMS+AQLP SFWGYA+ET ++ILNNVPSKSV ETP+ELW+GRK SL HFRIWGCP +
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNS---IDRPSSSTKVVDKTRNIGQTHPFQELG
        VLV NPKKLE RS+LC FVGYPKE++GG+F+DPQEN+VF+STNATFLEEDH+RNH+  SKLVL E    S   +D    S++ VD+T   GQ+HP Q L 
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNS---IDRPSSSTKVVDKTRNIGQTHPFQELG

Query:  EPRRSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
         PRRSGRVV Q +RYLGL+ETQ++I DDG+EDPL+YKQAMNDVD DQW+KAMDL+MESMY NSVW LVD P  +KPIGCKWIYKRK
Subjt:  EPRRSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

KAA0062742.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.8e-130; identity 85.71%
Query:  EHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLE
        EHGIQSQL APSTPQQNGV ERRNRTLLDMVRSM+SFAQL DSF GYALETTIYILNNVPSKS  E P+ELWKGRKGSL HFRIWGC  +VLVQNPKKLE
Subjt:  EHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLE

Query:  SRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPRRSGRVVRQSD
          SKLCLFVGYPKESKGG+FYDPQENKVF+STNATFLEEDHIRNHQT SKLVLEEI   S DRPS  TKVVDKTRNIGQTH  QEL EPRRSGRVVRQ D
Subjt:  SRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPRRSGRVVRQSD

Query:  RYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
        RYL LSE QIII +DGIEDPLTYKQAMNDVD DQWIKAMDL+MES+Y NSVWT VDQPND+KPIG KWIYKRK
Subjt:  RYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]; e value 6.1e-146; identity 89.75%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MDL+FQ+YLIEHGIQSQ  APS PQQNGVL+RRNR LLDMVRSMMSF QLPDSFW YALETTIYILNNVPSKSV ETPYELWKGRKGSLRHFRIWGCP +
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR
        V VQNPKKLE RSKLCLFVGYPKESKGG+FYDPQENKVF+STNATFLEEDHIRNHQT SKLVLEEI KN+ DRPSS TKVVDKTRNIGQTH FQELG+PR
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR

Query:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
        RSGRVVRQSDRYLGLSE QIII DDGIEDPLTYK AMNDVD DQWIKAMDL+MESMYSNSVWTLVDQPND+KPIGCKWIYKRK
Subjt:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

TYK30724.1 gag/pol protein [Cucumis melo var. makuwa]; e value 4.0e-129; identity 86.47%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MDL+FQ+YLIEH IQSQL AP+TPQQNGV ERRNR LLDMVRSM SFAQLPDSFWGYALETTIYILNNVPSKSV ETPYELWKGRKGSLRHFRIWGCP++
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR
        VLVQNP KLE RSKLCLFVGYPKESKGG+FYDPQENKVF+S NATFLEE HIRNHQT +KLVLEEI KN  DRPSS TKVVDKTRNIGQTHP QELGE R
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR

Query:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVD
        RS RVVRQ +RYLGLSE +III +DGIEDPLTYKQAMNDVD DQWIK MDL+MESMYSNS+WTLVD
Subjt:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVD

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TZD0 Gag/pol protein; e value 1.3e-120; identity 73.43%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MDL+FQ+Y+IEHGIQSQL AP TPQQNGV ERRNRTLLDMVRSMMS+AQLP SFWGYA+ET ++ILNNVPSKSV ETP+ELW+GRK SL HFRIWGCP +
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNS---IDRPSSSTKVVDKTRNIGQTHPFQELG
        VLV NPKKLE RS+LC FVGYPKE++GG+F+DPQEN+VF+STNATFLEEDH+RNH+  SKLVL E    S   +D    S++ VD+T   GQ+HP Q L 
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNS---IDRPSSSTKVVDKTRNIGQTHPFQELG

Query:  EPRRSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
         PRRSGRVV Q +RYLGL+ETQ++I DDG+EDPL+YKQAMNDVD DQW+KAMDL+MESMY NSVW LVD P  +KPIGCKWIYKRK
Subjt:  EPRRSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

A0A5D3BX45 Gag/pol protein; e value 3.0e-146; identity 89.75%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MDL+FQ+YLIEHGIQSQ  APS PQQNGVL+RRNR LLDMVRSMMSF QLPDSFW YALETTIYILNNVPSKSV ETPYELWKGRKGSLRHFRIWGCP +
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR
        V VQNPKKLE RSKLCLFVGYPKESKGG+FYDPQENKVF+STNATFLEEDHIRNHQT SKLVLEEI KN+ DRPSS TKVVDKTRNIGQTH FQELG+PR
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR

Query:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
        RSGRVVRQSDRYLGLSE QIII DDGIEDPLTYK AMNDVD DQWIKAMDL+MESMYSNSVWTLVDQPND+KPIGCKWIYKRK
Subjt:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

A0A5D3DFR5 Gag/pol protein; e value 1.3e-130; identity 85.71%
Query:  EHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLE
        EHGIQSQL APSTPQQNGV ERRNRTLLDMVRSM+SFAQL DSF GYALETTIYILNNVPSKS  E P+ELWKGRKGSL HFRIWGC  +VLVQNPKKLE
Subjt:  EHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLE

Query:  SRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPRRSGRVVRQSD
          SKLCLFVGYPKESKGG+FYDPQENKVF+STNATFLEEDHIRNHQT SKLVLEEI   S DRPS  TKVVDKTRNIGQTH  QEL EPRRSGRVVRQ D
Subjt:  SRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPRRSGRVVRQSD

Query:  RYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
        RYL LSE QIII +DGIEDPLTYKQAMNDVD DQWIKAMDL+MES+Y NSVWT VDQPND+KPIG KWIYKRK
Subjt:  RYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

A0A5D3E496 Gag/pol protein; e value 1.9e-129; identity 86.47%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MDL+FQ+YLIEH IQSQL AP+TPQQNGV ERRNR LLDMVRSM SFAQLPDSFWGYALETTIYILNNVPSKSV ETPYELWKGRKGSLRHFRIWGCP++
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR
        VLVQNP KLE RSKLCLFVGYPKESKGG+FYDPQENKVF+S NATFLEE HIRNHQT +KLVLEEI KN  DRPSS TKVVDKTRNIGQTHP QELGE R
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR

Query:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVD
        RS RVVRQ +RYLGLSE +III +DGIEDPLTYKQAMNDVD DQWIK MDL+MESMYSNS+WTLVD
Subjt:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVD

E2GK51 Gag/pol protein (Fragment); e value 8.4e-133; identity 81.98%
Query:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY
        MD +FQ+YLIE GIQSQL APSTPQQNGV ERRNRTLLDMVRSMMS+AQLPDSFWGYALET I+ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCP +
Subjt:  MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR
        VLVQNPKKLE RSKLCLFVGYPKES+GG+FY PQENKVF+STNATFLEEDH RNHQ  SK+VL+E+FKN+ D+PSSSTKVVDK     Q+H  QEL  PR
Subjt:  VLVQNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPR

Query:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
        RSGRVV Q +RYLGL ETQIII DDG+EDPLTYKQAMNDVD DQWIKAM+L+MESMY NSVWTLVD P+D+KPIGCKWIYKRK
Subjt:  RSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value 3.1e-23; identity 32.07%
Query:  QFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFE---TPYELWKGRKGSLRHFRIWGCPTY
        + +++ ++ GI   L  P TPQ NGV ER  RT+ +  R+M+S A+L  SFWG A+ T  Y++N +PS+++ +   TPYE+W  +K  L+H R++G   Y
Subjt:  QFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFE---TPYELWKGRKGSLRHFRIWGCPTY

Query:  VLVQNPK-KLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDR--PSSSTKVV
        V ++N + K + +S   +FVGY  E  G   +D    K  ++ +    E + + +     + V  +  K S ++  P+ S K++
Subjt:  VLVQNPK-KLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDR--PSSSTKVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 5.6e-41; identity 32.8%
Query:  QFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVL
        +F+EY   HGI+ +   P TPQ NGV ER NRT+++ VRSM+  A+LP SFWG A++T  Y++N  PS  + FE P  +W  ++ S  H +++GC  +  
Subjt:  QFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVL

Query:  V--QNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSST--------------------KV
        V  +   KL+ +S  C+F+GY  E  G   +DP + KV  S +  F  E  +R     S+ V   I  N +  PS+S                     +V
Subjt:  V--QNPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSST--------------------KV

Query:  VDKTRNIGQ-----THPFQ--ELGEP-RRSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLK
        +++   + +      HP Q  E  +P RRS R   +S RY     T+ +++ D   +P + K+ ++  + +Q +KAM  +MES+  N  + LV+ P   +
Subjt:  VDKTRNIGQ-----THPFQ--ELGEP-RRSGRVVRQSDRYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLK

Query:  PIGCKWIYKRK
        P+ CKW++K K
Subjt:  PIGCKWIYKRK

P92512 Uncharacterized mitochondrial protein AtMg00710; e value 3.4e-06; identity 34.15%
Query:  NRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLESRSK
        NRT+++ VRSM+    LP +F   A  T ++I+N  PS ++ F  P E+W     +  + R +GC  Y+   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLESRSK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 8.6e-18; identity 28.28%
Query:  EYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQ-
        EY  +HGI      P TP+ NG+ ER++R +++   +++S A +P ++W YA    +Y++N +P+  +  E+P++   G   +    R++GC  Y  ++ 
Subjt:  EYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQ-

Query:  -NPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLE
         N  KL+ +S+ C+F+GY       +    Q +++++S +  F E
Subjt:  -NPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 2.5e-17; identity 28.08%
Query:  QEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQ
        ++YL +HGI      P TP+ NG+ ER++R +++M  +++S A +P ++W YA    +Y++N +P+  +  ++P++   G+  +    +++GC  Y  ++
Subjt:  QEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQ

Query:  --NPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLE
          N  KLE +SK C F+GY       +       +++ S +  F E
Subjt:  --NPKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLE

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value 1.0e-05; identity 39.29%
Query:  EDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
        ++P TY +A    +F  W  AMD ++ +M +   W +   P + KPIGCKW+YK K
Subjt:  EDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e value 2.4e-07; identity 34.15%
Query:  NRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLESRSK
        NRT+++ VRSM+    LP +F   A  T ++I+N  PS ++ F  P E+W     +  + R +GC  Y+   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSV-FETPYELWKGRKGSLRHFRIWGCPTYVLVQNPKKLESRSK


Sequences
CDS sequence
ATGGACTTGCAATTCCAAGAATATTTGATAGAACATGGAATCCAATCACAACTCTATGCACCTAGTACGCCTCAGCAGAACGGTGTATTAGAAAGAAGAAACCGA
ACTTTGTTAGACATGGTTCGCTCCATGATGAGTTTTGCTCAGTTGCCAGATTCTTTTTGGGGATATGCTTTAGAAACAACTATCTATATTTTGAACAACGTTCCC
TCTAAAAGTGTTTTTGAAACACCTTATGAGCTGTGGAAAGGGCGTAAAGGTAGTTTACGTCACTTTAGGATTTGGGGATGTCCAACATACGTGTTGGTGCAAAAC
CCTAAAAAATTGGAAAGTCGTTCAAAATTATGCCTATTTGTAGGTTATCCAAAAGAATCAAAAGGTGGTATATTTTATGACCCTCAAGAAAATAAAGTATTTTTA
TCGACAAATGCTACGTTCTTAGAGGAAGACCACATAAGAAATCATCAAACTCACAGTAAACTAGTATTAGAAGAGATTTTCAAGAATTCTATAGATAGACCTAGT
TCATCTACTAAAGTAGTTGATAAAACTAGGAATATTGGTCAAACACATCCTTTTCAAGAGTTGGGAGAGCCTCGTCGTAGTGGAAGGGTTGTACGACAGTCTGAT
CGGTATTTGGGTTTAAGTGAAACTCAAATCATCATACTTGATGATGGGATAGAGGATCCATTAACCTATAAACAGGCAATGAATGATGTGGACTTTGACCAATGG
ATTAAAGCCATGGATCTAAAAATGGAATCTATGTATTCCAATTCTGTTTGGACTCTAGTAGATCAACCAAATGACTTAAAACCTATTGGTTGTAAATGGATCTAC
AAGAGAAAATGA
mRNA sequence
ATGGACTTGCAATTCCAAGAATATTTGATAGAACATGGAATCCAATCACAACTCTATGCACCTAGTACGCCTCAGCAGAACGGTGTATTAGAAAGAAGAAACCGA
ACTTTGTTAGACATGGTTCGCTCCATGATGAGTTTTGCTCAGTTGCCAGATTCTTTTTGGGGATATGCTTTAGAAACAACTATCTATATTTTGAACAACGTTCCC
TCTAAAAGTGTTTTTGAAACACCTTATGAGCTGTGGAAAGGGCGTAAAGGTAGTTTACGTCACTTTAGGATTTGGGGATGTCCAACATACGTGTTGGTGCAAAAC
CCTAAAAAATTGGAAAGTCGTTCAAAATTATGCCTATTTGTAGGTTATCCAAAAGAATCAAAAGGTGGTATATTTTATGACCCTCAAGAAAATAAAGTATTTTTA
TCGACAAATGCTACGTTCTTAGAGGAAGACCACATAAGAAATCATCAAACTCACAGTAAACTAGTATTAGAAGAGATTTTCAAGAATTCTATAGATAGACCTAGT
TCATCTACTAAAGTAGTTGATAAAACTAGGAATATTGGTCAAACACATCCTTTTCAAGAGTTGGGAGAGCCTCGTCGTAGTGGAAGGGTTGTACGACAGTCTGAT
CGGTATTTGGGTTTAAGTGAAACTCAAATCATCATACTTGATGATGGGATAGAGGATCCATTAACCTATAAACAGGCAATGAATGATGTGGACTTTGACCAATGG
ATTAAAGCCATGGATCTAAAAATGGAATCTATGTATTCCAATTCTGTTTGGACTCTAGTAGATCAACCAAATGACTTAAAACCTATTGGTTGTAAATGGATCTAC
AAGAGAAAATGA
Protein sequence
MDLQFQEYLIEHGIQSQLYAPSTPQQNGVLERRNRTLLDMVRSMMSFAQLPDSFWGYALETTIYILNNVPSKSVFETPYELWKGRKGSLRHFRIWGCPTYVLVQN
PKKLESRSKLCLFVGYPKESKGGIFYDPQENKVFLSTNATFLEEDHIRNHQTHSKLVLEEIFKNSIDRPSSSTKVVDKTRNIGQTHPFQELGEPRRSGRVVRQSD
RYLGLSETQIIILDDGIEDPLTYKQAMNDVDFDQWIKAMDLKMESMYSNSVWTLVDQPNDLKPIGCKWIYKRK
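
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and the final four codons of the CDS listed above.
print(translate("ATGGACTTGCAATTCCAAGAATATTTGATA"))  # MDLQFQEYLI
print(translate("AAGAGAAAATGA"))                    # KRK
```

Passing the full CDS (all lines joined) to `translate` should reproduce the listed protein sequence if the standard code applies.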