CuGenDBv2

Cmc07g0189841 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc07g0189841
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr07:8680922..8682099
RNA-Seq Expression: Cmc07g0189841
Synteny: Cmc07g0189841
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
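
The genome location field follows a `seqid:start..end` pattern. A minimal sketch of parsing it is shown here; the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("CMiso1.1chr07:8680922..8682099")
span = end - start + 1  # length of the genomic interval under the inclusive assumption
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the interval length is simply `end - start + 1`.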


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e value: 9.8e-154; identity: 69.23%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVD DQWIKA++L+ME MY N  WTLVD  +DV PIGCKWIYKRKRDQAGKVQTFKA+LVAKGYTQKEG+ YEETFSPVAM+KSIRILLSIATFY+YE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND
        IWQMDVK                                                                     K+I+NS VAFL+LYVDDILLIGND
Subjt:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND

Query:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL
        V +LTD+KKWL TQFQMKDLG AQY+LGIQIVRNRKNKTLAMSQ SYIDK+LSRYKM NSKKG L +R+GIHLSKEQC KTPQEVEDM NIPY+S VGSL
Subjt:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL

Query:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS
        MYAML TR D+ YSVGIVSRY+SNPGRDHWT VKNILKY  R ++YMLVY +KDLILTGYTD +FQ+DKDARKSTSG VFTLNGGAVVWRS+KQ+CI DS
Subjt:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS

Query:  TMEAEYVAVCEAAKEA
        TMEAEYVA CEAAKEA
Subjt:  TMEAEYVAVCEAAKEA

KAA0025729.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.9e-149; identity: 79.15%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAIDLKME MYSN  WTLVDQ ++V PIGC+WIYKRKRDQAGKVQTFKA+LVAKGYTQ EGI YEETFSPV MIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR
        IW MDVK                                    KRIINS VAFLVLYVDDILLIGNDVGHLTDIK+WLATQFQM DL NA YV GIQIVR
Subjt:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR

Query:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV
        NRKNKTLA+SQTSYIDKMLSRYKM NSKKGLL YRYGIHLSKEQC KTPQEVEDMSNIPYAS VGSLM+AML TR D+ YSV IVSRY+SNPGRDHWTTV
Subjt:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV

Query:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV
        KNILKY  R KDYMLVY SKDLILTGYTD NFQTDKDARKSTS  VFTLNGGAVV
Subjt:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV

KAA0064270.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.9e-181; identity: 100%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKNKRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYR
        IWQMDVKNKRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYR
Subjt:  IWQMDVKNKRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYR

Query:  YGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTD
        YGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTD
Subjt:  YGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTD

Query:  KDARKSTSGLVFTLNGGAVV
        KDARKSTSGLVFTLNGGAVV
Subjt:  KDARKSTSGLVFTLNGGAVV

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.6e-162; identity: 74.52%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVD DQWIKA+DL+ME MYSN  WTLVDQ NDV PIGCKWIYKRKRDQAGKVQTFKA+LVAKGYTQKEGI YEE FS  AMIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND
        IWQMDVK                                                                     KRI+NSTVAFLVLYVDDILLIGND
Subjt:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND

Query:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL
        VGHL DIKKWLA QFQMKDLGNAQYVLG+QIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLL YRYGIHLSKEQC KTPQEVEDMSNIPYAS VGSL
Subjt:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL

Query:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS
        MY ML TR ++ YSVGIVSR +S PGRDHWTTVKNILKY  R KDYMLVY SKDLILTGYTDF FQTDKDARKSTSGLVFT+NGGAVVWRSIKQSCI DS
Subjt:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS

Query:  TMEAEYVAVCEAAKEA
        TMEAEYVA CEAAKEA
Subjt:  TMEAEYVAVCEAAKEA

TYK06159.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.1e-150; identity: 79.44%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAIDLKME MYSN  WTLVDQ ++V PIGC+WIYKRKRDQAGKVQTFKA+LVAKGYTQKEGI YEETFSPV MIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR
        IW MDVK                                    KRIINS VAFLVLYVDDILLIGNDVGHLTDIK+WLATQFQM DL NA YV GIQIVR
Subjt:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR

Query:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV
        NRKNKTLA+SQTSYIDKMLSRYKM NSKKGLL YRYGIHLSKEQC KTPQEVEDMSNIPYAS VGSLM+AML TR D+ YSV IVSRY+SNPGRDHWTTV
Subjt:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV

Query:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV
        KNILKY  R KDYMLVY SKDLILTGYTD NFQTDKDARKSTS  VFTLNGGAVV
Subjt:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SKC9 Gag/pol protein (e value: 9.3e-150; identity: 79.15%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAIDLKME MYSN  WTLVDQ ++V PIGC+WIYKRKRDQAGKVQTFKA+LVAKGYTQ EGI YEETFSPV MIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR
        IW MDVK                                    KRIINS VAFLVLYVDDILLIGNDVGHLTDIK+WLATQFQM DL NA YV GIQIVR
Subjt:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR

Query:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV
        NRKNKTLA+SQTSYIDKMLSRYKM NSKKGLL YRYGIHLSKEQC KTPQEVEDMSNIPYAS VGSLM+AML TR D+ YSV IVSRY+SNPGRDHWTTV
Subjt:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV

Query:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV
        KNILKY  R KDYMLVY SKDLILTGYTD NFQTDKDARKSTS  VFTLNGGAVV
Subjt:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV

A0A5D3BX45 Gag/pol protein (e value: 1.3e-162; identity: 74.52%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVD DQWIKA+DL+ME MYSN  WTLVDQ NDV PIGCKWIYKRKRDQAGKVQTFKA+LVAKGYTQKEGI YEE FS  AMIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND
        IWQMDVK                                                                     KRI+NSTVAFLVLYVDDILLIGND
Subjt:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND

Query:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL
        VGHL DIKKWLA QFQMKDLGNAQYVLG+QIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLL YRYGIHLSKEQC KTPQEVEDMSNIPYAS VGSL
Subjt:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL

Query:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS
        MY ML TR ++ YSVGIVSR +S PGRDHWTTVKNILKY  R KDYMLVY SKDLILTGYTDF FQTDKDARKSTSGLVFT+NGGAVVWRSIKQSCI DS
Subjt:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS

Query:  TMEAEYVAVCEAAKEA
        TMEAEYVA CEAAKEA
Subjt:  TMEAEYVAVCEAAKEA

A0A5D3C701 Gag/pol protein (e value: 2.4e-150; identity: 79.44%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAIDLKME MYSN  WTLVDQ ++V PIGC+WIYKRKRDQAGKVQTFKA+LVAKGYTQKEGI YEETFSPV MIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR
        IW MDVK                                    KRIINS VAFLVLYVDDILLIGNDVGHLTDIK+WLATQFQM DL NA YV GIQIVR
Subjt:  IWQMDVKN-----------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVR

Query:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV
        NRKNKTLA+SQTSYIDKMLSRYKM NSKKGLL YRYGIHLSKEQC KTPQEVEDMSNIPYAS VGSLM+AML TR D+ YSV IVSRY+SNPGRDHWTTV
Subjt:  NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTV

Query:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV
        KNILKY  R KDYMLVY SKDLILTGYTD NFQTDKDARKSTS  VFTLNGGAVV
Subjt:  KNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVV

A0A5D3D523 Gag/pol protein (e value: 9.2e-182; identity: 100%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKNKRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYR
        IWQMDVKNKRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYR
Subjt:  IWQMDVKNKRIINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYR

Query:  YGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTD
        YGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTD
Subjt:  YGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTD

Query:  KDARKSTSGLVFTLNGGAVV
        KDARKSTSGLVFTLNGGAVV
Subjt:  KDARKSTSGLVFTLNGGAVV

E2GK51 Gag/pol protein (Fragment) (e value: 4.8e-154; identity: 69.23%)
Query:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE
        MNDVD DQWIKA++L+ME MY N  WTLVD  +DV PIGCKWIYKRKRDQAGKVQTFKA+LVAKGYTQKEG+ YEETFSPVAM+KSIRILLSIATFY+YE
Subjt:  MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYE

Query:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND
        IWQMDVK                                                                     K+I+NS VAFL+LYVDDILLIGND
Subjt:  IWQMDVKN--------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGND

Query:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL
        V +LTD+KKWL TQFQMKDLG AQY+LGIQIVRNRKNKTLAMSQ SYIDK+LSRYKM NSKKG L +R+GIHLSKEQC KTPQEVEDM NIPY+S VGSL
Subjt:  VGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL

Query:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS
        MYAML TR D+ YSVGIVSRY+SNPGRDHWT VKNILKY  R ++YMLVY +KDLILTGYTD +FQ+DKDARKSTSG VFTLNGGAVVWRS+KQ+CI DS
Subjt:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDS

Query:  TMEAEYVAVCEAAKEA
        TMEAEYVA CEAAKEA
Subjt:  TMEAEYVAVCEAAKEA

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 8.9e-41; identity: 28.33%)
Query:  DCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQM
        D   W +AI+ ++     N  WT+  +  + N +  +W++  K ++ G    +KA+LVA+G+TQK  I YEETF+PVA I S R +LS+   Y+ ++ QM
Subjt:  DCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQM

Query:  DVK------------------------------NKRI-----------------------INSTV---------------AFLVLYVDDILLIGNDVGHL
        DVK                              NK I                       +NS+V                +++LYVDD+++   D+  +
Subjt:  DVK------------------------------NKRI-----------------------INSTV---------------AFLVLYVDDILLIGNDVGHL

Query:  TDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHN----SKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL
         + K++L  +F+M DL   ++ +GI+I    +   + +SQ++Y+ K+LS++ M N    S        Y +  S E C           N P  S +G L
Subjt:  TDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHN----SKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSL

Query:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSK---DLILTGYTDFNFQTDKDARKSTSGLVFTL-NGGAVVWRSIKQSC
        MY ML TR D+  +V I+SRY S    + W  +K +L+Y     D  L++      +  + GY D ++   +  RKST+G +F + +   + W + +Q+ 
Subjt:  MYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSK---DLILTGYTDFNFQTDKDARKSTSGLVFTL-NGGAVVWRSIKQSC

Query:  IVDSTMEAEYVAVCEAAKEA
        +  S+ EAEY+A+ EA +EA
Subjt:  IVDSTMEAEYVAVCEAAKEA

P0CV72 Secreted RxLR effector protein 161 (e value: 1.1e-22; identity: 42.31%)
Query:  MSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLI-LTGYTDFNFQTDKDARKSTSGLVFTLNGGA
        M N+PY S VG++MY M+ TR D+  +VG++S++ S+P   HW  +K +L+Y    + Y L ++      L GY+D ++  D ++R+STSG +F LNGG 
Subjt:  MSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLI-LTGYTDFNFQTDKDARKSTSGLVFTLNGGA

Query:  VVWRSIKQSCIVDSTMEAEYVAVCEAAKEA
        V WRS KQ  +  S+ E EY+A+ EA +EA
Subjt:  VVWRSIKQSCIVDSTMEAEYVAVCEAAKEA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.9e-68; identity: 36.59%)
Query:  DQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMDV
        +Q +KA+  +ME +  N  + LV+      P+ CKW++K K+D   K+  +KA+LV KG+ QK+GI ++E FSPV  + SIR +LS+A   D E+ Q+DV
Subjt:  DQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMDV

Query:  KN---------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLT
        K                                                                      KR   +    L+LYVDD+L++G D G + 
Subjt:  KN---------------------------------------------------------------------KRIINSTVAFLVLYVDDILLIGNDVGHLT

Query:  DIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAML
         +K  L+  F MKDLG AQ +LG++IVR R ++ L +SQ  YI+++L R+ M N+K         + LSK+ C  T +E  +M+ +PY+S VGSLMYAM+
Subjt:  DIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAML

Query:  YTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTMEAE
         TR D+ ++VG+VSR+  NPG++HW  VK IL+Y        L +   D IL GYTD +   D D RKS++G +FT +GGA+ W+S  Q C+  ST EAE
Subjt:  YTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTMEAE

Query:  YVAVCEAAKE
        Y+A  E  KE
Subjt:  YVAVCEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.0e-33; identity: 28.22%)
Query:  DQWIKAIDLKMELMYSNIFWTLV-DQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMD
        ++W  A+  ++     N  W LV    + V  +GC+WI+ +K +  G +  +KA+LVAKGY Q+ G+ Y ETFSPV    SIRI+L +A    + I Q+D
Subjt:  DQWIKAIDLKMELMYSNIFWTLV-DQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMD

Query:  VKNKRI-------------------------------------------------------INS-------------TVAFLVLYVDDILLIGNDVGHLT
        V N  +                                                       +NS             ++ ++++YVDDIL+ GND   L 
Subjt:  VKNKRI-------------------------------------------------------INS-------------TVAFLVLYVDDILLIGNDVGHLT

Query:  DIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAML
        +    L+ +F +KD     Y LGI+    R    L +SQ  YI  +L+R  M  +K           LS    +K     E      Y   VGSL Y + 
Subjt:  DIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAML

Query:  YTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDY-MLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTMEA
        +TR D+ Y+V  +S++   P  +H   +K IL+Y     ++ + +     L L  Y+D ++  DKD   ST+G +  L    + W S KQ  +V S+ EA
Subjt:  YTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDY-MLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTMEA

Query:  EYVAVCEAAKE
        EY +V   + E
Subjt:  EYVAVCEAAKE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 8.7e-36; identity: 28.47%)
Query:  DQWIKAIDLKMELMYSNIFWTLV-DQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMD
        D+W +A+  ++     N  W LV      V  +GC+WI+ +K +  G +  +KA+LVAKGY Q+ G+ Y ETFSPV    SIRI+L +A    + I Q+D
Subjt:  DQWIKAIDLKMELMYSNIFWTLV-DQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMD

Query:  VKNKRI-------------------------------------------------------INS-------------TVAFLVLYVDDILLIGNDVGHLT
        V N  +                                                       +NS             ++ ++++YVDDIL+ GND   L 
Subjt:  VKNKRI-------------------------------------------------------INS-------------TVAFLVLYVDDILLIGNDVGHLT

Query:  DIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAML
             L+ +F +K+  +  Y LGI+    R  + L +SQ  Y   +L+R  M  +K           L+    +K P   E      Y   VGSL Y + 
Subjt:  DIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAML

Query:  YTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDY-MLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTMEA
        +TR D+ Y+V  +S+Y   P  DHW  +K +L+Y     D+ + +     L L  Y+D ++  D D   ST+G +  L    + W S KQ  +V S+ EA
Subjt:  YTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDY-MLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTMEA

Query:  EYVAVCEAAKE
        EY +V   + E
Subjt:  EYVAVCEAAKE

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 4.4e-35; identity: 26.7%)
Query:  WIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKN
        W  A+D ++  M +   W +     +  PIGCKW+YK K +  G ++ +KA+LVAKGYTQ+EGI + ETFSPV  + S++++L+I+  Y++ + Q+D+ N
Subjt:  WIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKN

Query:  KRI-----------------------------------------------------------------------INSTVAFLVL-YVDDILLIGNDVGHL
          +                                                                       I +T+   VL YVDDI++  N+   +
Subjt:  KRI-----------------------------------------------------------------------INSTVAFLVL-YVDDILLIGNDVGHL

Query:  TDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAM
         ++K  L + F+++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +     +  S           + +    Y   +G LMY  
Subjt:  TDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDMSNIPYASTVGSLMYAM

Query:  LYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSK-DLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTME
        + TR D+ ++V  +S++   P   H   V  IL Y        L YSS+ ++ L  ++D +FQ+ KD R+ST+G    L    + W+S KQ  +  S+ E
Subjt:  LYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSK-DLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIVDSTME

Query:  AEYVAVCEAAKE
        AEY A+  A  E
Subjt:  AEYVAVCEAAKE

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 5.1e-15; identity: 30.3%)
Query:  FLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI  +     L +SQT Y +++L+   M + K   +S    + L+    S +  + 
Subjt:  FLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEV

Query:  EDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDY-MLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNG
         D S+  + S VG+L Y  L TR D+ Y+V IV +    P    +  +K +L+Y      + + ++ +  L +  + D ++      R+ST+G    L  
Subjt:  EDMSNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDY-MLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNG

Query:  GAVVWRSIKQSCIVDSTMEAEYVAVCEAAKE
          + W + +Q  +  S+ E EY A+   A E
Subjt:  GAVVWRSIKQSCIVDSTMEAEYVAVCEAAKE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 1.5e-14; identity: 40.7%)
Query:  WIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIA
        W +A+  +++ +  N  W LV    + N +GCKW++K K    G +   KA+LVAKG+ Q+EGIY+ ET+SPV    +IR +L++A
Subjt:  WIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIA
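
In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes that convention and estimates percent identity for a single block; the function name `midline_identity` is hypothetical. The %identity values reported for each hit cover the whole alignment, gap columns included, so a single block will generally give a different figure.

```python
def midline_identity(query_block, midline_block):
    """Percent identity for one alignment block.

    Assumption: letters in the midline mark identical columns; '+' and
    spaces mark non-identical columns (BLAST-style pairwise output).
    """
    columns = len(query_block)
    if columns == 0:
        raise ValueError("empty alignment block")
    identical = sum(1 for ch in midline_block[:columns] if ch.isalpha())
    return 100.0 * identical / columns


# Final block of the ADJ18449.1 alignment, copied from this record.
query = "TMEAEYVAVCEAAKEA"
midline = "TMEAEYVA CEAAKEA"
print(midline_identity(query, midline))
```

For this block, 15 of the 16 columns carry a letter in the midline.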


Sequences
CDS sequence
ATGAATGATGTGGACTGTGACCAATGGATCAAAGCCATAGACCTCAAAATGGAACTTATGTATTCCAATATTTTCTGGACTCTAGTAGATCAACAAAATGATGTAAATCC
TATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCAACTAGTGGCAAAAGGTTATACACAAAAGGAGGGAATATATT
ATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGAACAAAAGG
ATCATCAATTCAACTGTAGCATTCTTAGTTTTGTATGTAGATGACATTCTACTCATTGGGAATGATGTAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACGCAATT
CCAAATGAAAGATTTGGGAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAACCGAAAGAACAAAACACTAGCCATGTCTCAAACATCTTATATAGACAAAATGT
TGTCAAGATATAAGATGCATAATTCCAAAAAGGGTCTGTTGTCGTACAGATATGGAATTCATTTATCAAAAGAACAATGTTCAAAGACACCTCAAGAAGTTGAGGATATG
AGTAACATTCCCTATGCTTCTACTGTTGGCAGCCTGATGTACGCAATGTTATATACTAGATCTGACATGTACTATTCAGTGGGGATAGTTAGTAGATATAAGTCCAATCC
TGGTCGTGATCATTGGACAACCGTTAAGAATATCCTAAAATATTTTATAAGAAAAAAAGACTACATGCTTGTGTATAGTTCTAAGGATCTGATCCTTACTGGATACACTG
ACTTCAATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATTAGTTTTCACTCTGAATGGAGGAGCAGTAGTATGGAGAAGCATAAAACAATCCTGTATTGTT
GACTCCACTATGGAAGCTGAATATGTAGCTGTCTGTGAGGCAGCAAAAGAAGCAGCATGA
mRNA sequence
ATGAATGATGTGGACTGTGACCAATGGATCAAAGCCATAGACCTCAAAATGGAACTTATGTATTCCAATATTTTCTGGACTCTAGTAGATCAACAAAATGATGTAAATCC
TATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCAACTAGTGGCAAAAGGTTATACACAAAAGGAGGGAATATATT
ATGAAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGAACAAAAGG
ATCATCAATTCAACTGTAGCATTCTTAGTTTTGTATGTAGATGACATTCTACTCATTGGGAATGATGTAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACGCAATT
CCAAATGAAAGATTTGGGAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAACCGAAAGAACAAAACACTAGCCATGTCTCAAACATCTTATATAGACAAAATGT
TGTCAAGATATAAGATGCATAATTCCAAAAAGGGTCTGTTGTCGTACAGATATGGAATTCATTTATCAAAAGAACAATGTTCAAAGACACCTCAAGAAGTTGAGGATATG
AGTAACATTCCCTATGCTTCTACTGTTGGCAGCCTGATGTACGCAATGTTATATACTAGATCTGACATGTACTATTCAGTGGGGATAGTTAGTAGATATAAGTCCAATCC
TGGTCGTGATCATTGGACAACCGTTAAGAATATCCTAAAATATTTTATAAGAAAAAAAGACTACATGCTTGTGTATAGTTCTAAGGATCTGATCCTTACTGGATACACTG
ACTTCAATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATTAGTTTTCACTCTGAATGGAGGAGCAGTAGTATGGAGAAGCATAAAACAATCCTGTATTGTT
GACTCCACTATGGAAGCTGAATATGTAGCTGTCTGTGAGGCAGCAAAAGAAGCAGCATGA
Protein sequence
MNDVDCDQWIKAIDLKMELMYSNIFWTLVDQQNDVNPIGCKWIYKRKRDQAGKVQTFKAQLVAKGYTQKEGIYYEETFSPVAMIKSIRILLSIATFYDYEIWQMDVKNKR
IINSTVAFLVLYVDDILLIGNDVGHLTDIKKWLATQFQMKDLGNAQYVLGIQIVRNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLSYRYGIHLSKEQCSKTPQEVEDM
SNIPYASTVGSLMYAMLYTRSDMYYSVGIVSRYKSNPGRDHWTTVKNILKYFIRKKDYMLVYSSKDLILTGYTDFNFQTDKDARKSTSGLVFTLNGGAVVWRSIKQSCIV
DSTMEAEYVAVCEAAKEAA
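
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and a CDS starting in frame 0; `translate` is a hypothetical helper, not part of CuGenDB. It is applied to the first 30 nucleotides and to the final line of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAATGATGTGGACTGTGACCAATGGATC"
cds_last_line = "GACTCCACTATGGAAGCTGAATATGTAGCTGTCTGTGAGGCAGCAAAAGAAGCAGCATGA"
print(translate(cds_start))
print(translate(cds_last_line))
```

Both fragments reproduce the corresponding ends of the protein sequence, and the final `TGA` codon is read as the stop.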