; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0049721 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0049721
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr02:15609368..15610408
RNA-Seq ExpressionCmc02g0049721
SyntenyCmc02g0049721
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.9e-17487.28Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVKTAFLNGNLEESIYMVQPEGFI + +EQKVCKLQKSIY LKQ SRSWNI FDTAIKSYGFEQNVDEPCVYK+I+NS VAFL+LYVDDILLIGN++ +
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        LTD+K+WL TQFQMK LG AQY+LGIQIVRN KNKTLAMSQ SYIDK+LSRYKMQNSKKG LP+R+GI LSKEQCPKTPQEVEDM NIPY+SA+GSLMYA
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
        MLCTRPDICYSVGIVSRYQSNP RDHWTAVKNILKY RRT++YMLVYG+KDLILTGYTDSDFQ+DKDARKST GSVFTLNG AVVWRS+KQ+CIADSTME
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME

Query:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        A YVAACEA+KEAVWL+KFLTDLEVVPNMHLPITLYCDNSGAVANS
Subjt:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

KAA0046768.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-17290.46Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQK                                 NVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
        MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME

Query:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
Subjt:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

KAA0063026.1 gag/pol protein [Cucumis melo var. makuwa]6.5e-16681.69Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVK AFLNGNLEESIYMVQPEG IQKG+EQKVCKLQKSIY LKQ SRSWNI FD AIKSYGF+QNV+EPCVYKRI NSTVAFLVLYVD+ILLIGN++  
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKM--------------------LSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQ
        LTDIK+WLATQFQMK LGNAQYVLGIQ VRN KNKTL MSQTSYID +                    LSRYKMQNSKKGLL YRY I LSKEQCPKTPQ
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKM--------------------LSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQ

Query:  EVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLN
        +VED+SNIPYAS + SLMYAMLCTRPDICYS+GI+SRYQSNP RDHWT VKNILKY RRTKDYMLVYGSKDLILTGYTDSDFQTDKD RKST GSVFTLN
Subjt:  EVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLN

Query:  GAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        G  VVWRSIKQSCIA+STMEA YVAACEA+KEAVWLKKFL DLEVVPNMHL ITLYCDNS  V NS
Subjt:  GAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-17388.44Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVKT FLN NLEESIYMVQPE FIQKG+EQK+CKLQKSIY LKQ SRS NI FDTAIKSYG EQNVDEPCVYKRI+NSTVAFLVLYVDDILLIGN++GH
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        L DIK+WLA QFQMK LGNAQYVLG+QIVRN KNKTLAMSQTSYIDKMLSRYKM NSKKGLLPYRYGI LSKEQCPKTPQEVEDMSNIPYASA+GSLMY 
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
        MLCTRP+ICYSVGIVSR QS P RDHWT VKNILKY RRTKDYMLVYGSKDLILTGYTD  FQTDKDARKST G VFT+NG AVVWRSIKQSCIADSTME
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME

Query:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        A YVA CEA+KEAVWLKKFLTDLEVVPNMHLP TLYCDNSGAV NS
Subjt:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

TYK11050.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-16891.19Show/hide
Query:  MVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGHLTDIKEWLATQFQMKYL
        MVQPEGFIQKG+EQKVCKL+KSIY LKQV+RSWNI FDTAIKSYGFEQNVDEPCVYKR IN+TVAFL+LYVDDILLIGN++GHLTDIKEWLATQFQMK L
Subjt:  MVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGHLTDIKEWLATQFQMKYL

Query:  GNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSR
        GNAQYVLGIQIVRN KNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGI LSKEQCPKTPQEVEDMSNIPYASAIGSLMYAMLCTR DICYSVGIV+R
Subjt:  GNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSR

Query:  YQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLK
        YQSNP RDHWTAVKNILKY RRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKST GS+FTLN  AVVW+SIKQSCIA STMEA YVAA EA+KEAV  K
Subjt:  YQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLK

Query:  KFLTDLEVVPNMHLPITLYCDNSGAVANS
        KFLTDLEVVPNMHLPITLYCDNSGAV NS
Subjt:  KFLTDLEVVPNMHLPITLYCDNSGAVANS

TrEMBL top hitse value%identityAlignment
A0A5A7TV73 Gag/pol protein7.8e-17390.46Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQK                                 NVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
        MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME

Query:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
Subjt:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

A0A5A7V9B0 Gag/pol protein3.2e-16681.69Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVK AFLNGNLEESIYMVQPEG IQKG+EQKVCKLQKSIY LKQ SRSWNI FD AIKSYGF+QNV+EPCVYKRI NSTVAFLVLYVD+ILLIGN++  
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKM--------------------LSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQ
        LTDIK+WLATQFQMK LGNAQYVLGIQ VRN KNKTL MSQTSYID +                    LSRYKMQNSKKGLL YRY I LSKEQCPKTPQ
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKM--------------------LSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQ

Query:  EVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLN
        +VED+SNIPYAS + SLMYAMLCTRPDICYS+GI+SRYQSNP RDHWT VKNILKY RRTKDYMLVYGSKDLILTGYTDSDFQTDKD RKST GSVFTLN
Subjt:  EVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLN

Query:  GAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        G  VVWRSIKQSCIA+STMEA YVAACEA+KEAVWLKKFL DLEVVPNMHL ITLYCDNS  V NS
Subjt:  GAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

A0A5D3BX45 Gag/pol protein3.5e-17388.44Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVKT FLN NLEESIYMVQPE FIQKG+EQK+CKLQKSIY LKQ SRS NI FDTAIKSYG EQNVDEPCVYKRI+NSTVAFLVLYVDDILLIGN++GH
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        L DIK+WLA QFQMK LGNAQYVLG+QIVRN KNKTLAMSQTSYIDKMLSRYKM NSKKGLLPYRYGI LSKEQCPKTPQEVEDMSNIPYASA+GSLMY 
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
        MLCTRP+ICYSVGIVSR QS P RDHWT VKNILKY RRTKDYMLVYGSKDLILTGYTD  FQTDKDARKST G VFT+NG AVVWRSIKQSCIADSTME
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME

Query:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        A YVA CEA+KEAVWLKKFLTDLEVVPNMHLP TLYCDNSGAV NS
Subjt:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

A0A5D3CI71 Gag/pol protein1.2e-16891.19Show/hide
Query:  MVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGHLTDIKEWLATQFQMKYL
        MVQPEGFIQKG+EQKVCKL+KSIY LKQV+RSWNI FDTAIKSYGFEQNVDEPCVYKR IN+TVAFL+LYVDDILLIGN++GHLTDIKEWLATQFQMK L
Subjt:  MVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGHLTDIKEWLATQFQMKYL

Query:  GNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSR
        GNAQYVLGIQIVRN KNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGI LSKEQCPKTPQEVEDMSNIPYASAIGSLMYAMLCTR DICYSVGIV+R
Subjt:  GNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSR

Query:  YQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLK
        YQSNP RDHWTAVKNILKY RRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKST GS+FTLN  AVVW+SIKQSCIA STMEA YVAA EA+KEAV  K
Subjt:  YQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLK

Query:  KFLTDLEVVPNMHLPITLYCDNSGAVANS
        KFLTDLEVVPNMHLPITLYCDNSGAV NS
Subjt:  KFLTDLEVVPNMHLPITLYCDNSGAVANS

E2GK51 Gag/pol protein (Fragment)1.4e-17487.28Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDVKTAFLNGNLEESIYMVQPEGFI + +EQKVCKLQKSIY LKQ SRSWNI FDTAIKSYGFEQNVDEPCVYK+I+NS VAFL+LYVDDILLIGN++ +
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        LTD+K+WL TQFQMK LG AQY+LGIQIVRN KNKTLAMSQ SYIDK+LSRYKMQNSKKG LP+R+GI LSKEQCPKTPQEVEDM NIPY+SA+GSLMYA
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME
        MLCTRPDICYSVGIVSRYQSNP RDHWTAVKNILKY RRT++YMLVYG+KDLILTGYTDSDFQ+DKDARKST GSVFTLNG AVVWRS+KQ+CIADSTME
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTME

Query:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS
        A YVAACEA+KEAVWL+KFLTDLEVVPNMHLPITLYCDNSGAVANS
Subjt:  AGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.2e-5236.24Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVY---KRIINSTVAFLVLYVDDILLIGNN
        MDVKTAFLNG L+E IYM  P+G         VCKL K+IY LKQ +R W   F+ A+K   F  +  + C+Y   K  IN  + +++LYVDD+++   +
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVY---KRIINSTVAFLVLYVDDILLIGNN

Query:  IGHLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLP----YRYGIDLSKEQCPKTPQEVEDMSNIPYASA
        +  + + K +L  +F+M  L   ++ +GI+I   E    + +SQ++Y+ K+LS++ M+N      P      Y +  S E C           N P  S 
Subjt:  IGHLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLP----YRYGIDLSKEQCPKTPQEVEDMSNIPYASA

Query:  IGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLI----LTGYTDSDFQTDKDARKSTLGSVFTL-NGAAVVWRS
        IG LMY MLCTRPD+  +V I+SRY S    + W  +K +L+Y + T D  L++  K+L     + GY DSD+   +  RKST G +F + +   + W +
Subjt:  IGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLI----LTGYTDSDFQTDKDARKSTLGSVFTL-NGAAVVWRS

Query:  IKQSCIADSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVA
         +Q+ +A S+ EA Y+A  EA +EA+WLK  LT + +   +  PI +Y DN G ++
Subjt:  IKQSCIADSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAVA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-8847.67Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVY-KRIINSTVAFLVLYVDDILLIGNNIG
        +DVKTAFL+G+LEE IYM QPEGF   GK+  VCKL KS+Y LKQ  R W + FD+ +KS  + +   +PCVY KR   +    L+LYVDD+L++G + G
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVY-KRIINSTVAFLVLYVDDILLIGNNIG

Query:  HLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMY
         +  +K  L+  F MK LG AQ +LG++IVR   ++ L +SQ  YI+++L R+ M+N+K    P    + LSK+ CP T +E  +M+ +PY+SA+GSLMY
Subjt:  HLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMY

Query:  AMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTM
        AM+CTRPDI ++VG+VSR+  NP ++HW AVK IL+Y R T    L +G  D IL GYTD+D   D D RKS+ G +FT +G A+ W+S  Q C+A ST 
Subjt:  AMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTM

Query:  EAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAV
        EA Y+AA E  KE +WLK+FL +L +    ++   +YCD+  A+
Subjt:  EAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAV

P25600 Putative transposon Ty5-1 protein YCL074W6.8e-3332.17Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        MDV TAFLN  ++E IY+ QP GF+ +     V +L   +Y LKQ    WN   +  +K  GF ++  E  +Y R  +    ++ +YVDD+L+   +   
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
           +K+ L   + MK LG     LG+ I     N  + +S   YI K  S  ++   K    P    +  SK     T   ++D++  PY S +G L++ 
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGS-KDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIK-QSCIADST
            RPDI Y V ++SR+   PR  H  + + +L+Y   T+   L Y S   L LT Y D+      D   ST G V  L GA V W S K +  I   +
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGS-KDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIK-QSCIADST

Query:  MEAGYVAACEASKE
         EA Y+ A E   E
Subjt:  MEAGYVAACEASKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-4333.24Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        +DV  AFL G L + +YM QP GFI K +   VCKL+K++Y LKQ  R+W +     + + GF  +V +  ++      ++ ++++YVDDIL+ GN+   
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        L +  + L+ +F +K      Y LGI+  R      L +SQ  YI  +L+R  M  +K    P      LS     K     E      Y   +GSL Y 
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTM
        +  TRPDI Y+V  +S++   P  +H  A+K IL+Y   T ++ + +     L L  Y+D+D+  DKD   ST G +  L    + W S KQ  +  S+ 
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTM

Query:  EAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGA
        EA Y +    S E  W+   LT+L +   +  P  +YCDN GA
Subjt:  EAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.1e-4632.94Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH
        +DV  AFL G L + +YM QP GF+ K +   VC+L+K+IY LKQ  R+W +   T + + GF  ++ +  ++      ++ ++++YVDDIL+ GN+   
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGH

Query:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA
        L    + L+ +F +K   +  Y LGI+  R  +   L +SQ  Y   +L+R  M  +K    P      L+     K P   E      Y   +GSL Y 
Subjt:  LTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYA

Query:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTM
        +  TRPD+ Y+V  +S+Y   P  DHW A+K +L+Y   T D+ + +     L L  Y+D+D+  D D   ST G +  L    + W S KQ  +  S+ 
Subjt:  MLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTM

Query:  EAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGA
        EA Y +    S E  W+   LT+L +   +  P  +YCDN GA
Subjt:  EAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGA

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.6e-4331.61Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKE----QKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGN
        +D+  AFLNG+L+E IYM  P G+  +  +      VC L+KSIY LKQ SR W + F   +  +GF Q+  +   + +I  +    +++YVDDI++  N
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGKE----QKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGN

Query:  NIGHLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGS
        N   + ++K  L + F+++ LG  +Y LG++I R+     + + Q  Y   +L    +   K   +P    +  S           + +    Y   IG 
Subjt:  NIGHLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGS

Query:  LMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSK-DLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIA
        LMY  + TR DI ++V  +S++   PR  H  AV  IL Y + T    L Y S+ ++ L  ++D+ FQ+ KD R+ST G    L  + + W+S KQ  ++
Subjt:  LMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVYGSK-DLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIA

Query:  DSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAV
         S+ EA Y A   A+ E +WL +F  +L++   +  P  L+CDN+ A+
Subjt:  DSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMHLPITLYCDNSGAV

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.2e-0434.72Show/hide
Query:  TRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVY-GSKDLILTGYTDSDFQTDKDARKSTLG
        TRPD+ ++V  +S++ S  R     AV  +L Y + T    L Y  + DL L  + DSD+ +  D R+S  G
Subjt:  TRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDYMLVY-GSKDLILTGYTDSDFQTDKDARKSTLG

ATMG00810.1 DNA/RNA polymerases superfamily protein7.5e-1930.93Show/hide
Query:  FLVLYVDDILLIGNNIGHLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSK--KGLLPYRYGIDLSKEQCPKTPQ
        +L+LYVDDILL G++   L  +   L++ F MK LG   Y LGIQI  +     L +SQT Y +++L+   M + K     LP +    +S  + P    
Subjt:  FLVLYVDDILLIGNNIGHLTDIKEWLATQFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSK--KGLLPYRYGIDLSKEQCPKTPQ

Query:  EVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTL
           D S+  + S +G+L Y  L TRPDI Y+V IV +    P    +  +K +L+Y + T  + + ++ +  L +  + DSD+      R+ST G    L
Subjt:  EVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSRYQSNPRRDHWTAVKNILKYHRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTL

Query:  NGAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVW
            + W + +Q  ++ S+ E  Y A    + E  W
Subjt:  NGAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCAAGACAGCCTTTTTGAACGGTAATCTTGAAGAGAGTATTTATATGGTCCAACCAGAGGGATTTATACAAAAGGGCAAAGAACAAAAGGTTTGTAAGCTTCA
GAAATCAATTTATGAATTGAAACAGGTATCTAGATCCTGGAATATAGGATTTGATACTGCGATCAAATCTTATGGTTTTGAACAGAATGTTGATGAACCTTGTGTTTACA
AAAGGATCATCAATTCTACTGTAGCATTCTTAGTTCTATATGTAGATGACATTCTCCTCATTGGGAATAACATAGGTCATCTAACTGATATTAAGGAATGGCTAGCTACG
CAATTCCAAATGAAATATTTGGGAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAACGAAAAGAACAAAACTCTAGCCATGTCTCAAACATCTTATATAGACAA
AATGTTATCAAGATATAAGATGCAGAATTCCAAAAAGGGTTTGCTGCCCTACAGATATGGAATTGATTTATCAAAAGAACAATGTCCAAAGACACCTCAAGAAGTTGAGG
ATATGAGTAACATTCCCTATGCTTCTGCTATTGGGAGCCTGATGTATGCAATGTTATGTACTAGACCTGACATTTGCTATTCAGTGGGGATTGTTAGTAGATATCAGTCC
AATCCTCGACGTGATCATTGGACAGCCGTTAAAAATATTCTAAAATATCATAGAAGAACAAAAGACTACATGCTTGTGTATGGTTCTAAGGATTTGATCCTTACTGGATA
CACTGACTCCGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATTAGGATCAGTTTTCACTCTGAATGGAGCAGCAGTAGTATGGAGAAGTATAAAACAATCTTGTA
TTGCTGACTCCACTATGGAAGCTGGATATGTAGCTGCCTGTGAAGCATCCAAAGAAGCAGTATGGCTTAAAAAGTTCTTAACAGATTTGGAAGTCGTTCCAAATATGCAT
CTGCCAATCACCTTATACTGTGACAACAGTGGTGCAGTTGCAAATTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCAAGACAGCCTTTTTGAACGGTAATCTTGAAGAGAGTATTTATATGGTCCAACCAGAGGGATTTATACAAAAGGGCAAAGAACAAAAGGTTTGTAAGCTTCA
GAAATCAATTTATGAATTGAAACAGGTATCTAGATCCTGGAATATAGGATTTGATACTGCGATCAAATCTTATGGTTTTGAACAGAATGTTGATGAACCTTGTGTTTACA
AAAGGATCATCAATTCTACTGTAGCATTCTTAGTTCTATATGTAGATGACATTCTCCTCATTGGGAATAACATAGGTCATCTAACTGATATTAAGGAATGGCTAGCTACG
CAATTCCAAATGAAATATTTGGGAAATGCACAATATGTTCTTGGTATCCAAATAGTTCGGAACGAAAAGAACAAAACTCTAGCCATGTCTCAAACATCTTATATAGACAA
AATGTTATCAAGATATAAGATGCAGAATTCCAAAAAGGGTTTGCTGCCCTACAGATATGGAATTGATTTATCAAAAGAACAATGTCCAAAGACACCTCAAGAAGTTGAGG
ATATGAGTAACATTCCCTATGCTTCTGCTATTGGGAGCCTGATGTATGCAATGTTATGTACTAGACCTGACATTTGCTATTCAGTGGGGATTGTTAGTAGATATCAGTCC
AATCCTCGACGTGATCATTGGACAGCCGTTAAAAATATTCTAAAATATCATAGAAGAACAAAAGACTACATGCTTGTGTATGGTTCTAAGGATTTGATCCTTACTGGATA
CACTGACTCCGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATTAGGATCAGTTTTCACTCTGAATGGAGCAGCAGTAGTATGGAGAAGTATAAAACAATCTTGTA
TTGCTGACTCCACTATGGAAGCTGGATATGTAGCTGCCTGTGAAGCATCCAAAGAAGCAGTATGGCTTAAAAAGTTCTTAACAGATTTGGAAGTCGTTCCAAATATGCAT
CTGCCAATCACCTTATACTGTGACAACAGTGGTGCAGTTGCAAATTCATGA
Protein sequenceShow/hide protein sequence
MDVKTAFLNGNLEESIYMVQPEGFIQKGKEQKVCKLQKSIYELKQVSRSWNIGFDTAIKSYGFEQNVDEPCVYKRIINSTVAFLVLYVDDILLIGNNIGHLTDIKEWLAT
QFQMKYLGNAQYVLGIQIVRNEKNKTLAMSQTSYIDKMLSRYKMQNSKKGLLPYRYGIDLSKEQCPKTPQEVEDMSNIPYASAIGSLMYAMLCTRPDICYSVGIVSRYQS
NPRRDHWTAVKNILKYHRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTLGSVFTLNGAAVVWRSIKQSCIADSTMEAGYVAACEASKEAVWLKKFLTDLEVVPNMH
LPITLYCDNSGAVANS