CuGenDBv2

Cmc02g0049731 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0049731
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr02:15610412..15611308
RNA-Seq Expression: Cmc02g0049731
Synteny: Cmc02g0049731
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
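The genome location field packs a sequence identifier and a coordinate range into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this, and `parse_location` is a hypothetical helper name), the span can be recovered like this:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("CMiso1.1chr02:15610412..15611308")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, span)  # CMiso1.1chr02 897
```

Under that assumption the span is 897 bp, the same length as the CDS listed on this page, which would be consistent with a single-exon gene model.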


Homology
GenBank top hits (e-value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e-value: 2.9e-138; identity: 82.89%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMMS+AQL  SFWGYALET I+ILNNVPSKSV ETPYELWKGRK SL +FRIWGCPAH+LVQN KKLE  SKLCLFVGYPKES+GGLFY PQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDH RNHQ RSK+VL+E+ KNATD+PSSSTKVVDK     Q+H SQEL  PRRSGRVV QP+RYLG  E QIIIPDDG+EDPLT KQAMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         D DQWIKAM+LEMESMYFN VWTLVD PS+V+PIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEG+DYEETFSPVAM+KSIRILLSIATFY+YEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 1.5e-126; identity: 74.75%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMMS+AQL  SFWGYA+ET ++ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH+LV N KKLE  S+LC FVGYPKE++GGLF+DPQEN+VF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNAT---DRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQA
        VSTNA FLEEDH+RNH+ RSKLVL E +  +T   D    S++ VD+    GQ+HPSQ L  PRRSGRVV QP+RYLG +E Q++IPDDG+EDPL+ KQA
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNAT---DRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQA

Query:  MNDADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYE
        MND D DQW+KAMDLEMESMYFN VW LVD P  V+PIGCKWIYKRKRD  GKVQTFKARLVAKGY Q+EG+DYEETFSPVAM+KSIRILLSIATFYDYE
Subjt:  MNDADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYE

Query:  I
        I
Subjt:  I

KAA0053406.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 2.2e-146; identity: 88.93%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMM+FAQL  SFWGYALET IYILNNVPSK+VSETPYELWKGRKGSL +FRIWGCPAH+LVQN KKLEH SKLCLFVGYPKESKG LFYDPQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDHIRNHQ  SKLVLEEIS NAT+RPSSSTKV+DK RNIGQTHPSQEL EP RSGRVVRQ DRYLG SEAQIIIPDDGIEDPLT KQAMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         DCDQWIKAM+LEMESMY N V  LVDQPSEVRPIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEGIDYEET SPVAMIK IRILLSIATFYDYEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 3.8e-146; identity: 88.93%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMMSF QL  SFW YALETTIYILNNVPSKSVSETPYELWKGRKGSL HFRIWGCPAH+ VQN KKLE  SKLCLFVGYPKESKGGLFYDPQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDHIRNHQTRSKLVLEEISKN TDRPSS TKVVDK RNIGQTH  QELG+PRRSGRVVRQ DRYLG SEAQIIIPDDGIEDPLT K AMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         D DQWIKAMDLEMESMY N VWTLVDQP++V+PIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEGIDYEE FS  AMIKSIRILLSIATFYDYEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

TYK15887.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 3.4e-147; identity: 88.93%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMM+FAQ   SFWGYALET IYILNNVPSK+VSETPYELWKGRKGSL +FRIWGCPAH+LVQN KKLEH SKLCLFVGYPKESKG LFYDPQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDHIRNHQ  SKLVLEEIS NAT++PSSSTKV+DK RNIGQTHPSQELGEP RSGRVVRQ DRYLG SEAQIIIPDDGIEDPLT KQAMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         DCDQWIKAM+LEMESMY N V  LVDQPSEVRPIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEGIDYEETFSPVAMIK IRILLSIATFYDYEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TZD0 Gag/pol protein (e-value: 7.2e-127; identity: 74.75%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMMS+AQL  SFWGYA+ET ++ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH+LV N KKLE  S+LC FVGYPKE++GGLF+DPQEN+VF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNAT---DRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQA
        VSTNA FLEEDH+RNH+ RSKLVL E +  +T   D    S++ VD+    GQ+HPSQ L  PRRSGRVV QP+RYLG +E Q++IPDDG+EDPL+ KQA
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNAT---DRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQA

Query:  MNDADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYE
        MND D DQW+KAMDLEMESMYFN VW LVD P  V+PIGCKWIYKRKRD  GKVQTFKARLVAKGY Q+EG+DYEETFSPVAM+KSIRILLSIATFYDYE
Subjt:  MNDADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYE

Query:  I
        I
Subjt:  I

A0A5A7UGR3 Gag/pol protein (e-value: 1.1e-146; identity: 88.93%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMM+FAQL  SFWGYALET IYILNNVPSK+VSETPYELWKGRKGSL +FRIWGCPAH+LVQN KKLEH SKLCLFVGYPKESKG LFYDPQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDHIRNHQ  SKLVLEEIS NAT+RPSSSTKV+DK RNIGQTHPSQEL EP RSGRVVRQ DRYLG SEAQIIIPDDGIEDPLT KQAMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         DCDQWIKAM+LEMESMY N V  LVDQPSEVRPIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEGIDYEET SPVAMIK IRILLSIATFYDYEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

A0A5D3BX45 Gag/pol protein (e-value: 1.8e-146; identity: 88.93%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMMSF QL  SFW YALETTIYILNNVPSKSVSETPYELWKGRKGSL HFRIWGCPAH+ VQN KKLE  SKLCLFVGYPKESKGGLFYDPQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDHIRNHQTRSKLVLEEISKN TDRPSS TKVVDK RNIGQTH  QELG+PRRSGRVVRQ DRYLG SEAQIIIPDDGIEDPLT K AMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         D DQWIKAMDLEMESMY N VWTLVDQP++V+PIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEGIDYEE FS  AMIKSIRILLSIATFYDYEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

A0A5D3CVD7 Gag/pol protein (e-value: 1.7e-147; identity: 88.93%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMM+FAQ   SFWGYALET IYILNNVPSK+VSETPYELWKGRKGSL +FRIWGCPAH+LVQN KKLEH SKLCLFVGYPKESKG LFYDPQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDHIRNHQ  SKLVLEEIS NAT++PSSSTKV+DK RNIGQTHPSQELGEP RSGRVVRQ DRYLG SEAQIIIPDDGIEDPLT KQAMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         DCDQWIKAM+LEMESMY N V  LVDQPSEVRPIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEGIDYEETFSPVAMIK IRILLSIATFYDYEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

E2GK51 Gag/pol protein (Fragment) (e-value: 1.4e-138; identity: 82.89%)
Query:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        MVRSMMS+AQL  SFWGYALET I+ILNNVPSKSV ETPYELWKGRK SL +FRIWGCPAH+LVQN KKLE  SKLCLFVGYPKES+GGLFY PQENKVF
Subjt:  MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND
        VSTNA FLEEDH RNHQ RSK+VL+E+ KNATD+PSSSTKVVDK     Q+H SQEL  PRRSGRVV QP+RYLG  E QIIIPDDG+EDPLT KQAMND
Subjt:  VSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMND

Query:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
         D DQWIKAM+LEMESMYFN VWTLVD PS+V+PIGCKWIYKRKRDQ GKVQTFKARLVAKGY QKEG+DYEETFSPVAM+KSIRILLSIATFY+YEI
Subjt:  ADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 3.1e-26; identity: 25%)
Query:  RSMMSFAQLLGSFWGYALETTIYILNNVPSKSV---SETPYELWKGRKGSLHHFRIWGCPAHMLVQNHK-KLEHCSKLCLFVGYPKESKGGLFYDPQENK
        R+M+S A+L  SFWG A+ T  Y++N +PS+++   S+TPYE+W  +K  L H R++G   ++ ++N + K +  S   +FVGY  E  G   +D    K
Subjt:  RSMMSFAQLLGSFWGYALETTIYILNNVPSKSV---SETPYELWKGRKGSLHHFRIWGCPAHMLVQNHK-KLEHCSKLCLFVGYPKESKGGLFYDPQENK

Query:  VFVSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDR--PSSSTKVV-----------DKIRNIGQTHPSQELGEPRRSGRVV------------------
          V+ + +  E + + +   + + V  + SK + ++  P+ S K++           D I+ +  +  S+    P  S +++                  
Subjt:  VFVSTNAMFLEEDHIRNHQTRSKLVLEEISKNATDR--PSSSTKVV-----------DKIRNIGQTHPSQELGEPRRSGRVV------------------

Query:  ---------------RQPDRYLGSS-------------------EAQIIIP--DDGI--------------------EDPLTNKQAMN------------
                       R+ D +L  S                   E  I  P  +DGI                    ED   NK  +N            
Subjt:  ---------------RQPDRYLGSS-------------------EAQIIIP--DDGI--------------------EDPLTNKQAMN------------

Query:  -----DADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFY
               D   W +A++ E+ +   N  WT+  +P     +  +W++  K ++ G    +KARLVA+G+ QK  IDYEETF+PVA I S R +LS+   Y
Subjt:  -----DADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFY

Query:  DYEI
        + ++
Subjt:  DYEI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.8e-42; identity: 33.94%)
Query:  VRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVS-ETPYELWKGRKGSLHHFRIWGCP--AHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENK
        VRSM+  A+L  SFWG A++T  Y++N  PS  ++ E P  +W  ++ S  H +++GC   AH+  +   KL+  S  C+F+GY  E  G   +DP + K
Subjt:  VRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVS-ETPYELWKGRKGSLHHFRIWGCP--AHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENK

Query:  VFVSTNAMFLEEDHIRNHQTRSKLVLEEISKN------ATDRPSSSTKVVDKIRNIGQ-------------------THPSQ--ELGEPRRSGRVVRQPD
        V  S + +F  E  +R     S+ V   I  N       ++ P+S+    D++   G+                    HP+Q  E  +P R     R   
Subjt:  VFVSTNAMFLEEDHIRNHQTRSKLVLEEISKN------ATDRPSSSTKVVDKIRNIGQ-------------------THPSQ--ELGEPRRSGRVVRQPD

Query:  RYLGSSEAQIIIPDDGIEDPLTNKQAMNDADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDY
        R   S+E  ++I DD   +P + K+ ++  + +Q +KAM  EMES+  N  + LV+ P   RP+ CKW++K K+D   K+  +KARLV KG++QK+GID+
Subjt:  RYLGSSEAQIIIPDDGIEDPLTNKQAMNDADCDQWIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDY

Query:  EETFSPVAMIKSIRILLSIATFYDYEI
        +E FSPV  + SIR +LS+A   D E+
Subjt:  EETFSPVAMIKSIRILLSIATFYDYEI

P92520 Uncharacterized mitochondrial protein AtMg00820 (e-value: 2.1e-14; identity: 43.02%)
Query:  WIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIA
        W +AM  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q+EGI + ET+SPV    +IR +L++A
Subjt:  WIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 5.7e-20; identity: 22.86%)
Query:  SMMSFAQLLGSFWGYALETTIYILNNVPSKSVS-ETPYELWKGRKGSLHHFRIWGCPAHMLVQ--NHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF
        +++S A +  ++W YA    +Y++N +P+  +  E+P++   G   +    R++GC  +  ++  N  KL+  S+ C+F+GY       L    Q ++++
Subjt:  SMMSFAQLLGSFWGYALETTIYILNNVPSKSVS-ETPYELWKGRKGSLHHFRIWGCPAHMLVQ--NHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVF

Query:  VSTNAMFLE---------------------------------------------EDH------------IRNHQTRSKLVLEEISKN-------------
        +S +  F E                                             + H             RN Q  S  +    S +             
Subjt:  VSTNAMFLE---------------------------------------------EDH------------IRNHQTRSKLVLEEISKN-------------

Query:  ---ATDRPSSSTKVVDKIRNIGQTHP--------SQELGEPRRSGRVVRQPDRYLGSS--------------------------------------EAQI
            T +P+ +       +N  Q +P        +Q L  P +S      P     SS                                      +A I
Subjt:  ---ATDRPSSSTKVVDKIRNIGQTHP--------SQELGEPRRSGRVVRQPDRYLGSS--------------------------------------EAQI

Query:  IIPDD---------GIEDPLTNKQAMNDADCDQWIKAMDLEMESMYFNFVWTLV-DQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDY
        I P+             +P T  QA+ D   ++W  AM  E+ +   N  W LV   PS V  +GC+WI+ +K +  G +  +KARLVAKGY Q+ G+DY
Subjt:  IIPDD---------GIEDPLTNKQAMNDADCDQWIKAMDLEMESMYFNFVWTLV-DQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDY

Query:  EETFSPVAMIKSIRILLSIA
         ETFSPV    SIRI+L +A
Subjt:  EETFSPVAMIKSIRILLSIA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 9.1e-18; identity: 46.6%)
Query:  DPLTNKQAMNDADCDQWIKAMDLEMESMYFNFVWTLV-DQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILL
        +P T  QAM D   D+W +AM  E+ +   N  W LV   P  V  +GC+WI+ +K +  G +  +KARLVAKGY Q+ G+DY ETFSPV    SIRI+L
Subjt:  DPLTNKQAMNDADCDQWIKAMDLEMESMYFNFVWTLV-DQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILL

Query:  SIA
         +A
Subjt:  SIA

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 1.2e-20; identity: 45.16%)
Query:  WIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
        W  AMD E+ +M     W +   P   +PIGCKW+YK K +  G ++ +KARLVAKGY Q+EGID+ ETFSPV  + S++++L+I+  Y++ +
Subjt:  WIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e-value: 1.5e-15; identity: 43.02%)
Query:  WIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIA
        W +AM  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q+EGI + ET+SPV    +IR +L++A
Subjt:  WIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIA
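The % identity reported for each hit can be reproduced from the alignment itself. In the middle line of each block, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below is an illustration under two assumptions: identity is taken as identical positions divided by the aligned query length (gap characters included), and `percent_identity` is a hypothetical helper name, not part of any tool used by the database. It uses the ATMG00820.1 alignment above as input.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over the aligned length."""
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


# Query and midline copied from the ATMG00820.1 hit.
query = (
    "WIKAMDLEMESMYFNFVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQ"
    "KEGIDYEETFSPVAMIKSIRILLSIA"
)
midline = (
    "W +AM  E++++  N  W LV  P     +GCKW++K K    G +   KARLVAKG+ Q"
    "+EGI + ET+SPV    +IR +L++A"
)

identical_count = sum(ch.isalpha() for ch in midline)
pct = percent_identity(query, midline)
print(identical_count, len(query), f"{pct:.2f}")  # 37 86 43.02
```

The result, 37 identical residues over 86 aligned positions, matches the 43.02 listed for this hit.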


Sequences
CDS sequence
ATGGTTCGCTCTATGATGAGTTTTGCTCAGTTGCTAGGTTCTTTTTGGGGATATGCTTTAGAGACAACTATCTATATTTTGAACAACGTTCCCTCTAAAAGTGTTTCTGA
AACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACATCACTTTAGGATTTGGGGATGTCCAGCACACATGTTGGTACAAAACCATAAAAAATTGGAACATTGTT
CAAAATTATGCCTATTTGTAGGTTATCCAAAAGAATCAAAAGGTGGTTTATTTTATGATCCTCAAGAAAATAAAGTATTTGTATCAACAAATGCTATGTTCTTAGAGGAA
GACCACATTAGAAATCATCAAACTCGCAGTAAACTAGTATTAGAAGAAATTTCCAAAAATGCTACAGATAGACCTAGTTCATCTACTAAAGTAGTAGATAAAATTAGGAA
TATTGGTCAAACACATCCTTCTCAAGAGTTAGGAGAGCCTCGTCGTAGTGGGAGGGTTGTACGACAGCCTGATCGCTATTTGGGTTCAAGTGAAGCTCAAATCATCATAC
CTGATGATGGCATAGAGGATCCATTGACCAACAAACAGGCAATGAATGATGCGGATTGTGACCAATGGATCAAAGCCATGGACCTTGAAATGGAATCTATGTATTTCAAT
TTTGTCTGGACTCTAGTAGATCAACCAAGTGAGGTAAGACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAACTGGTAAAGTACAGACTTTCAAAGCTCG
ACTTGTGGCAAAAGGTTATAAACAAAAGGAGGGAATAGATTATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTT
ATGATTATGAAATTTGA
mRNA sequence
ATGGTTCGCTCTATGATGAGTTTTGCTCAGTTGCTAGGTTCTTTTTGGGGATATGCTTTAGAGACAACTATCTATATTTTGAACAACGTTCCCTCTAAAAGTGTTTCTGA
AACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACATCACTTTAGGATTTGGGGATGTCCAGCACACATGTTGGTACAAAACCATAAAAAATTGGAACATTGTT
CAAAATTATGCCTATTTGTAGGTTATCCAAAAGAATCAAAAGGTGGTTTATTTTATGATCCTCAAGAAAATAAAGTATTTGTATCAACAAATGCTATGTTCTTAGAGGAA
GACCACATTAGAAATCATCAAACTCGCAGTAAACTAGTATTAGAAGAAATTTCCAAAAATGCTACAGATAGACCTAGTTCATCTACTAAAGTAGTAGATAAAATTAGGAA
TATTGGTCAAACACATCCTTCTCAAGAGTTAGGAGAGCCTCGTCGTAGTGGGAGGGTTGTACGACAGCCTGATCGCTATTTGGGTTCAAGTGAAGCTCAAATCATCATAC
CTGATGATGGCATAGAGGATCCATTGACCAACAAACAGGCAATGAATGATGCGGATTGTGACCAATGGATCAAAGCCATGGACCTTGAAATGGAATCTATGTATTTCAAT
TTTGTCTGGACTCTAGTAGATCAACCAAGTGAGGTAAGACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAACTGGTAAAGTACAGACTTTCAAAGCTCG
ACTTGTGGCAAAAGGTTATAAACAAAAGGAGGGAATAGATTATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTT
ATGATTATGAAATTTGA
Protein sequence
MVRSMMSFAQLLGSFWGYALETTIYILNNVPSKSVSETPYELWKGRKGSLHHFRIWGCPAHMLVQNHKKLEHCSKLCLFVGYPKESKGGLFYDPQENKVFVSTNAMFLEE
DHIRNHQTRSKLVLEEISKNATDRPSSSTKVVDKIRNIGQTHPSQELGEPRRSGRVVRQPDRYLGSSEAQIIIPDDGIEDPLTNKQAMNDADCDQWIKAMDLEMESMYFN
FVWTLVDQPSEVRPIGCKWIYKRKRDQTGKVQTFKARLVAKGYKQKEGIDYEETFSPVAMIKSIRILLSIATFYDYEI
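The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following sketch checks this on the first 30 and last 21 nucleotides of the CDS. It assumes the standard genetic code (the page does not name a translation table), and `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGTTCGCTCTATGATGAGTTTTGCTCAG"  # first 30 nt of the CDS
cds_end = "TTTTATGATTATGAAATTTGA"            # last 21 nt, ending in the stop codon
print(translate(cds_start))  # MVRSMMSFAQ
print(translate(cds_end))    # FYDYEI
```

Both fragments agree with the start (`MVRSMMSFAQ`) and end (`FYDYEI`) of the listed protein, and the final `TGA` is read as the stop codon.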