; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc11g0301281 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc11g0301281
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr11:21505252..21506070
RNA-Seq ExpressionCmc11g0301281
SyntenyCmc11g0301281
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037650.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.7e-13590.81Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM
        METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEM KG+SQ DSAKTDS+ITDEVINNETVLVPSA+VKKNHPSS IIGDPSAGITT+RKEKVDY 
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM

Query:  KLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVAR
        K+IADLCYVSAIEPTSVENA KDEYWINAMQEELLQFKRNNVWT+VPKP+GANVIGTKWIFKNKTDE GS+IRNKARL+AQ YAQVEGVDFDETFA VAR
Subjt:  KLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVAR

Query:  LKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE
        L+AI LLL ISCF+KFKLF+MDVKS FLNGYLNEEVY AQPKGFVDSEFPQYVYKLNKALYGLKQAPRA YE
Subjt:  LKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE

KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.5e-12074.29Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKV---
        METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEM KG+SQ  SAKTDS+ITDEVINNET+LVPSAHVKKNHPSS II DPSAGITTRRKE V   
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKV---

Query:  -------------------------------------------DYMKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANV
                                                   + ++   DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKP+GAN+
Subjt:  -------------------------------------------DYMKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANV

Query:  IGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVY
        IGTKWIFKNKTDE GS+IRNKARLVAQ Y QVEGVD DETFAPVARL+AI LLL ISCFQKFKLFQMDVKS FLNGYLNEEV  A+PKGF+DSEFPQYVY
Subjt:  IGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVY

Query:  KLNKALYGLKQAPRA
        KLNKALYGLKQAPRA
Subjt:  KLNKALYGLKQAPRA

KAA0060315.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.8e-11781.68Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTD-SNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDY
        METINVVVNDFESN NQFNIEDDET V  +V +TPL EM K DSQPDS KT+   ITDE +N+ETVLVPSAHVKKNHP S IIGD SAGITTRRKEKVDY
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTD-SNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDY

Query:  MKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVA
         K+IADLCYVS IEPTSVENALK+EYWINAMQEELLQFKRN+VWTLVPKP+  NVIGTKWIFKNKTDE  +V +NKARLVAQ YAQVEGVDFDETFAPVA
Subjt:  MKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVA

Query:  RLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE
        +L+AI LLL    FQKFK++QMDVKS FLNGYLNEEVY AQPK F+DSEFPQYVYKLNKALYGLKQA RA YE
Subjt:  RLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE

KAA0067336.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.0e-12299.55Show/hide
Query:  THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENALKDE
        THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENALKDE
Subjt:  THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENALKDE

Query:  YWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVK
        YWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKF LFQMDVK
Subjt:  YWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVK

Query:  STFLNGYLNEEVYEAQPKGFVDS
        STFLNGYLNEEVYEAQPKGFVDS
Subjt:  STFLNGYLNEEVYEAQPKGFVDS

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.4e-12381.25Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM
        METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEM KG+SQ  SAKTDS+ITDEVINNET+LVPSAHVKKNHPSS II DPSAGITTRRKE ++  
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM

Query:  KLIA-------------------DLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQ
          I+                   DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKP+GAN+IGTKWIFKNKTDE GS+IRNKARLVAQ
Subjt:  KLIA-------------------DLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQ

Query:  DYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRA
         Y QVEGVD DETFAPVARL+AI LLL ISCFQKFKLFQMDVKS FLNGYLNEEV  A+PKGF+DSEFPQYVYKLNKALYGLKQAPRA
Subjt:  DYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRA

TrEMBL top hitse value%identityAlignment
A0A5A7T2Q0 Gag-pol polyprotein1.3e-13590.81Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM
        METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEM KG+SQ DSAKTDS+ITDEVINNETVLVPSA+VKKNHPSS IIGDPSAGITT+RKEKVDY 
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM

Query:  KLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVAR
        K+IADLCYVSAIEPTSVENA KDEYWINAMQEELLQFKRNNVWT+VPKP+GANVIGTKWIFKNKTDE GS+IRNKARL+AQ YAQVEGVDFDETFA VAR
Subjt:  KLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVAR

Query:  LKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE
        L+AI LLL ISCF+KFKLF+MDVKS FLNGYLNEEVY AQPKGFVDSEFPQYVYKLNKALYGLKQAPRA YE
Subjt:  LKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE

A0A5A7U931 Gag-pol polyprotein7.1e-12174.29Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKV---
        METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEM KG+SQ  SAKTDS+ITDEVINNET+LVPSAHVKKNHPSS II DPSAGITTRRKE V   
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKV---

Query:  -------------------------------------------DYMKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANV
                                                   + ++   DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKP+GAN+
Subjt:  -------------------------------------------DYMKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANV

Query:  IGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVY
        IGTKWIFKNKTDE GS+IRNKARLVAQ Y QVEGVD DETFAPVARL+AI LLL ISCFQKFKLFQMDVKS FLNGYLNEEV  A+PKGF+DSEFPQYVY
Subjt:  IGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVY

Query:  KLNKALYGLKQAPRA
        KLNKALYGLKQAPRA
Subjt:  KLNKALYGLKQAPRA

A0A5A7V365 Gag-pol polyprotein4.7e-11781.68Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTD-SNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDY
        METINVVVNDFESN NQFNIEDDET V  +V +TPL EM K DSQPDS KT+   ITDE +N+ETVLVPSAHVKKNHP S IIGD SAGITTRRKEKVDY
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTD-SNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDY

Query:  MKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVA
         K+IADLCYVS IEPTSVENALK+EYWINAMQEELLQFKRN+VWTLVPKP+  NVIGTKWIFKNKTDE  +V +NKARLVAQ YAQVEGVDFDETFAPVA
Subjt:  MKLIADLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVA

Query:  RLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE
        +L+AI LLL    FQKFK++QMDVKS FLNGYLNEEVY AQPK F+DSEFPQYVYKLNKALYGLKQA RA YE
Subjt:  RLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE

A0A5A7VLJ5 Gag-pol polyprotein9.9e-12399.55Show/hide
Query:  THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENALKDE
        THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENALKDE
Subjt:  THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENALKDE

Query:  YWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVK
        YWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKF LFQMDVK
Subjt:  YWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVK

Query:  STFLNGYLNEEVYEAQPKGFVDS
        STFLNGYLNEEVYEAQPKGFVDS
Subjt:  STFLNGYLNEEVYEAQPKGFVDS

A0A5D3DCZ8 Gag-pol polyprotein6.8e-12481.25Show/hide
Query:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM
        METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEM KG+SQ  SAKTDS+ITDEVINNET+LVPSAHVKKNHPSS II DPSAGITTRRKE ++  
Subjt:  METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYM

Query:  KLIA-------------------DLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQ
          I+                   DLCYVSAIEPTSVEN+LKDEYWI  MQEE LQFKRNNVWTLVPKP+GAN+IGTKWIFKNKTDE GS+IRNKARLVAQ
Subjt:  KLIA-------------------DLCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQ

Query:  DYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRA
         Y QVEGVD DETFAPVARL+AI LLL ISCFQKFKLFQMDVKS FLNGYLNEEV  A+PKGF+DSEFPQYVYKLNKALYGLKQAPRA
Subjt:  DYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRA

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.1e-2944.9Show/hide
Query:  WINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKS
        W  A+  EL   K NN WT+  +P   N++ ++W+F  K +E G+ IR KARLVA+ + Q   +D++ETFAPVAR+ +   +L +      K+ QMDVK+
Subjt:  WINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQMDVKS

Query:  TFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE
         FLNG L EE+Y   P+G   S     V KLNKA+YGLKQA R  +E
Subjt:  TFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-2933.6Show/hide
Query:  THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENAL---
        T   P    +  DE+S+   QP          DE +  E V  P+   +++ P   +       + +RR    +Y+ +  D       EP S++  L   
Subjt:  THVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVSAIEPTSVENAL---

Query:  KDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQM
        +    + AMQEE+   ++N  + LV  P G   +  KW+FK K D    ++R KARLV + + Q +G+DFDE F+PV ++ +I  +L ++     ++ Q+
Subjt:  KDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQM

Query:  DVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALY
        DVK+ FL+G L EE+Y  QP+GF  +     V KLNK+LYGLKQAPR  Y
Subjt:  DVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALY

P92520 Uncharacterized mitochondrial protein AtMg008207.5e-1943.09Show/hide
Query:  TRRKEKVDYMKLIADLCYVSAI--EPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEG
        TR K  ++ +     L   + I  EP SV  ALKD  W  AMQEEL    RN  W LVP P   N++G KW+FK K    G++ R KARLVA+ + Q EG
Subjt:  TRRKEKVDYMKLIADLCYVSAI--EPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEG

Query:  VDFDETFAPVARLKAICLLLCIS
        + F ET++PV R   I  +L ++
Subjt:  VDFDETFAPVARLKAICLLLCIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.1e-3333.58Show/hide
Query:  ESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLV----PSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLC
        +++ +Q   +++ T+ +P   +  L   ++  S   S  T ++ +       ++L+    P A +  N+ +   +   S G   +        K    + 
Subjt:  ESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLV----PSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLC

Query:  YVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLV-PKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICL
          +  EP +   ALKDE W NAM  E+     N+ W LV P P+   ++G +WIF  K +  GS+ R KARLVA+ Y Q  G+D+ ETF+PV +  +I +
Subjt:  YVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLV-PKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICL

Query:  LLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALY
        +L ++  + + + Q+DV + FL G L ++VY +QP GF+D + P YV KL KALYGLKQAPRA Y
Subjt:  LLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-3245Show/hide
Query:  EPTSVENALKDEYWINAMQEELLQFKRNNVWTLV-PKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCIS
        EP +   A+KD+ W  AM  E+     N+ W LV P P    ++G +WIF  K +  GS+ R KARLVA+ Y Q  G+D+ ETF+PV +  +I ++L ++
Subjt:  EPTSVENALKDEYWINAMQEELLQFKRNNVWTLV-PKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCIS

Query:  CFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALY
          + + + Q+DV + FL G L +EVY +QP GFVD + P YV +L KA+YGLKQAPRA Y
Subjt:  CFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.8e-3140.72Show/hide
Query:  LCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAIC
        +C   A EP++   A +   W  AM +E+   +  + W +   P     IG KW++K K +  G++ R KARLVA+ Y Q EG+DF ETF+PV +L ++ 
Subjt:  LCYVSAIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAIC

Query:  LLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFV----DSEFPQYVYKLNKALYGLKQAPR
        L+L IS    F L Q+D+ + FLNG L+EE+Y   P G+     DS  P  V  L K++YGLKQA R
Subjt:  LLLCISCFQKFKLFQMDVKSTFLNGYLNEEVYEAQPKGFV----DSEFPQYVYKLNKALYGLKQAPR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.3e-2043.09Show/hide
Query:  TRRKEKVDYMKLIADLCYVSAI--EPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEG
        TR K  ++ +     L   + I  EP SV  ALKD  W  AMQEEL    RN  W LVP P   N++G KW+FK K    G++ R KARLVA+ + Q EG
Subjt:  TRRKEKVDYMKLIADLCYVSAI--EPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEG

Query:  VDFDETFAPVARLKAICLLLCIS
        + F ET++PV R   I  +L ++
Subjt:  VDFDETFAPVARLKAICLLLCIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACAATCAATGTCGTGGTTAATGATTTTGAGTCTAATGTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTTACACCTGAAGTTACTTCTACTCCCCTTGA
TGAAATGTCTAAAGGTGATTCGCAGCCAGACAGTGCTAAGACCGATTCAAACATAACTGATGAGGTCATAAACAATGAAACTGTGCTTGTCCCCTCTGCACATGTGAAAA
AGAATCATCCATCAAGTTTCATAATAGGTGATCCGTCAGCTGGAATTACTACCAGAAGAAAAGAAAAGGTAGATTACATGAAATTGATTGCTGATTTATGCTATGTATCA
GCAATAGAACCCACATCTGTTGAAAATGCTCTCAAGGATGAATACTGGATAAATGCCATGCAAGAAGAGTTATTACAGTTCAAGCGTAACAACGTTTGGACTTTGGTTCC
TAAACCTAATGGGGCGAACGTCATAGGAACTAAGTGGATATTTAAAAATAAAACTGATGAATATGGAAGTGTAATAAGAAACAAGGCCCGTTTGGTAGCTCAAGATTATG
CACAGGTAGAAGGTGTGGATTTTGATGAAACTTTTGCACCTGTGGCTAGACTTAAAGCTATTTGCCTCTTGCTCTGTATATCATGTTTCCAAAAATTTAAATTGTTTCAA
ATGGACGTTAAAAGTACCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGAAGCACAACCTAAAGGGTTTGTTGATTCTGAATTTCCTCAGTATGTCTACAAGCTGAA
TAAAGCTCTATATGGGTTAAAACAAGCTCCTCGGGCTTTGTATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACAATCAATGTCGTGGTTAATGATTTTGAGTCTAATGTCAATCAGTTTAATATTGAGGATGATGAGACCCATGTTACACCTGAAGTTACTTCTACTCCCCTTGA
TGAAATGTCTAAAGGTGATTCGCAGCCAGACAGTGCTAAGACCGATTCAAACATAACTGATGAGGTCATAAACAATGAAACTGTGCTTGTCCCCTCTGCACATGTGAAAA
AGAATCATCCATCAAGTTTCATAATAGGTGATCCGTCAGCTGGAATTACTACCAGAAGAAAAGAAAAGGTAGATTACATGAAATTGATTGCTGATTTATGCTATGTATCA
GCAATAGAACCCACATCTGTTGAAAATGCTCTCAAGGATGAATACTGGATAAATGCCATGCAAGAAGAGTTATTACAGTTCAAGCGTAACAACGTTTGGACTTTGGTTCC
TAAACCTAATGGGGCGAACGTCATAGGAACTAAGTGGATATTTAAAAATAAAACTGATGAATATGGAAGTGTAATAAGAAACAAGGCCCGTTTGGTAGCTCAAGATTATG
CACAGGTAGAAGGTGTGGATTTTGATGAAACTTTTGCACCTGTGGCTAGACTTAAAGCTATTTGCCTCTTGCTCTGTATATCATGTTTCCAAAAATTTAAATTGTTTCAA
ATGGACGTTAAAAGTACCTTCCTGAATGGATACTTAAATGAGGAAGTCTATGAAGCACAACCTAAAGGGTTTGTTGATTCTGAATTTCCTCAGTATGTCTACAAGCTGAA
TAAAGCTCTATATGGGTTAAAACAAGCTCCTCGGGCTTTGTATGAATGA
Protein sequenceShow/hide protein sequence
METINVVVNDFESNVNQFNIEDDETHVTPEVTSTPLDEMSKGDSQPDSAKTDSNITDEVINNETVLVPSAHVKKNHPSSFIIGDPSAGITTRRKEKVDYMKLIADLCYVS
AIEPTSVENALKDEYWINAMQEELLQFKRNNVWTLVPKPNGANVIGTKWIFKNKTDEYGSVIRNKARLVAQDYAQVEGVDFDETFAPVARLKAICLLLCISCFQKFKLFQ
MDVKSTFLNGYLNEEVYEAQPKGFVDSEFPQYVYKLNKALYGLKQAPRALYE