; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0015871 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0015871
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:13725616..13726626
RNA-Seq ExpressionCmc01g0015871
SyntenyCmc01g0015871
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.8e-11863.8Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        ++LMG MQTESLGGK+YVLVVVDDYS++TWV FLK+K DTV++C  +CL LQREK +KI RIRSDHGK+FDNE  N+ C  EGIHHEF+A ITPQQNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAG
        E+KNRMLQEM RVMIHAKN PL FW E VNT  HIH RVT R GTT+T YELWK RKPNVKYFH+FGSTCYILADREY +KWD +S QGIFL  +     
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAG

Query:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG
        YRVFN + G+VME INVV+ND +S + Q N E+DET    ++      E+ K D    +  K+    ++E+I  +  + P AHVKKNHP+S IIGD SAG
Subjt:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG

Query:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY
        + TRRKEK+ YMKM+ DLCY S IEP +V++ALKDEY
Subjt:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY

KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.5e-13876.85Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        MDLMG MQT+SLGG                      K DTVEICK +CLKLQRE+ KKITRIRSDHGK+FDNEGFNSFCLLEG HHEFSA ITPQQNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G
        E+KN+ LQEM RVMIHAKNLPLCF+ EAVNT  HIHNRVTIRTGTT+T YE WKERK NVKYFHVFGSTCYILADREY +KWDARSEQGIFL   +I+  
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G

Query:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG
        YRV+NNR  SVMETIN  INDLDSAIK MNDEEDETPNMSE+RT STVE SKADN SD PGKSLKKSSEEII KKL+LIP AHV+KNHP   IIGDPSAG
Subjt:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG

Query:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY
        MQTRRK+KI Y+KMV +LCY STIEPSTVDSALK+EY
Subjt:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY

KAA0054354.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.5e-11183.95Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        MDLM  M+TES GGKRYVLVVV DYSRYTWVCFL+ K DTVEICK +CLKLQREK KKITRIRSDHGK+FDNEGFNSFCLLEGI HEFSA ITPQQNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G
        E+KNR LQEM  VMIHAKNLP+CFW EAVN   HIHNRVTIRTG TVT YELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG  + +  
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G

Query:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIR
        YRVFNNR G+V+ETINVVINDLDSA+KQMNDEED+T NMSE R
Subjt:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIR

KAA0060126.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.3e-11463.08Show/hide
Query:  GGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEMTR
        GG++YVLVVVDDYSR+TWV FLK K+DT ++C  +CL LQREK +KI RIR +HG +F+NE  N+FC  EGIHHEF+A ITPQQNGVVE+KNR LQEM R
Subjt:  GGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEMTR

Query:  VMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAGYRVFNNRFGSVM
        VMIHAKNLPL FW EAVNT  HIH+RVT R+GTTVT YELWK RKPN+KYFH+FGS CYILADR+Y +KWD +S+Q IFLG +     YRVFN + G+VM
Subjt:  VMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAGYRVFNNRFGSVM

Query:  ETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGMQTRRKEKIGYM
        ETINVV+ND +S I Q N E DET    E+ +    E+SK ++  D   K+    ++E+I  +  L+P AHVKKNHP S +IGDPSAG+ TRRKEK+ Y 
Subjt:  ETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGMQTRRKEKIGYM

Query:  KMVVDLCYTSTIEPSTVDSALKDEY
        KM+ DLCY S IEP++V++ALKDEY
Subjt:  KMVVDLCYTSTIEPSTVDSALKDEY

KAA0066729.1 Peptidase aspartic, catalytic [Cucumis melo var. makuwa]5.3e-11266.96Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        MDL+G M+TESLGGKRYVLVVVDDYSRYTWVCFLK + DT+EI K +CLKLQREK KKITRIRSDHGK+FDNEGFNSFCLLEGIHH+FSA ITP+QNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIAGY
        E+KNR LQEM RVMIHAKNLPLCFW EAVNTV HIHN++T                                                          G 
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIAGY

Query:  RVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGM
         V N   G   E+         SAIKQMNDEEDET NMSE RT STVEVSK DNPSDDPGKSLKKS EEIITKK +LIPFAHVKKNHP S IIGDPSAGM
Subjt:  RVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGM

Query:  QTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY
        QTRRKEKI Y+KMV DLCYTSTIEPSTVDSALKDEY
Subjt:  QTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY

TrEMBL top hitse value%identityAlignment
A0A5A7TNK7 Gag-pol polyprotein1.2e-13876.85Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        MDLMG MQT+SLGG                      K DTVEICK +CLKLQRE+ KKITRIRSDHGK+FDNEGFNSFCLLEG HHEFSA ITPQQNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G
        E+KN+ LQEM RVMIHAKNLPLCF+ EAVNT  HIHNRVTIRTGTT+T YE WKERK NVKYFHVFGSTCYILADREY +KWDARSEQGIFL   +I+  
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G

Query:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG
        YRV+NNR  SVMETIN  INDLDSAIK MNDEEDETPNMSE+RT STVE SKADN SD PGKSLKKSSEEII KKL+LIP AHV+KNHP   IIGDPSAG
Subjt:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG

Query:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY
        MQTRRK+KI Y+KMV +LCY STIEPSTVDSALK+EY
Subjt:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY

A0A5A7V0X1 Gag-pol polyprotein3.6e-11463.08Show/hide
Query:  GGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEMTR
        GG++YVLVVVDDYSR+TWV FLK K+DT ++C  +CL LQREK +KI RIR +HG +F+NE  N+FC  EGIHHEF+A ITPQQNGVVE+KNR LQEM R
Subjt:  GGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEMTR

Query:  VMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAGYRVFNNRFGSVM
        VMIHAKNLPL FW EAVNT  HIH+RVT R+GTTVT YELWK RKPN+KYFH+FGS CYILADR+Y +KWD +S+Q IFLG +     YRVFN + G+VM
Subjt:  VMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAGYRVFNNRFGSVM

Query:  ETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGMQTRRKEKIGYM
        ETINVV+ND +S I Q N E DET    E+ +    E+SK ++  D   K+    ++E+I  +  L+P AHVKKNHP S +IGDPSAG+ TRRKEK+ Y 
Subjt:  ETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGMQTRRKEKIGYM

Query:  KMVVDLCYTSTIEPSTVDSALKDEY
        KM+ DLCY S IEP++V++ALKDEY
Subjt:  KMVVDLCYTSTIEPSTVDSALKDEY

A0A5D3BA69 Gag-pol polyprotein1.8e-11863.8Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        ++LMG MQTESLGGK+YVLVVVDDYS++TWV FLK+K DTV++C  +CL LQREK +KI RIRSDHGK+FDNE  N+ C  EGIHHEF+A ITPQQNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAG
        E+KNRMLQEM RVMIHAKN PL FW E VNT  HIH RVT R GTT+T YELWK RKPNVKYFH+FGSTCYILADREY +KWD +S QGIFL  +     
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAG

Query:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG
        YRVFN + G+VME INVV+ND +S + Q N E+DET    ++      E+ K D    +  K+    ++E+I  +  + P AHVKKNHP+S IIGD SAG
Subjt:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAG

Query:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY
        + TRRKEK+ YMKM+ DLCY S IEP +V++ALKDEY
Subjt:  MQTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY

A0A5D3CM30 Gag-pol polyprotein7.4e-11283.95Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        MDLM  M+TES GGKRYVLVVV DYSRYTWVCFL+ K DTVEICK +CLKLQREK KKITRIRSDHGK+FDNEGFNSFCLLEGI HEFSA ITPQQNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G
        E+KNR LQEM  VMIHAKNLP+CFW EAVN   HIHNRVTIRTG TVT YELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG  + +  
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIA-G

Query:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIR
        YRVFNNR G+V+ETINVVINDLDSA+KQMNDEED+T NMSE R
Subjt:  YRVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIR

A0A5D3DVN5 Peptidase aspartic, catalytic2.5e-11266.96Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV
        MDL+G M+TESLGGKRYVLVVVDDYSRYTWVCFLK + DT+EI K +CLKLQREK KKITRIRSDHGK+FDNEGFNSFCLLEGIHH+FSA ITP+QNGVV
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVV

Query:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIAGY
        E+KNR LQEM RVMIHAKNLPLCFW EAVNTV HIHN++T                                                          G 
Subjt:  EKKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIAGY

Query:  RVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGM
         V N   G   E+         SAIKQMNDEEDET NMSE RT STVEVSK DNPSDDPGKSLKKS EEIITKK +LIPFAHVKKNHP S IIGDPSAGM
Subjt:  RVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGM

Query:  QTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY
        QTRRKEKI Y+KMV DLCYTSTIEPSTVDSALKDEY
Subjt:  QTRRKEKIGYMKMVVDLCYTSTIEPSTVDSALKDEY

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.4e-2129.9Show/hide
Query:  DLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVE
        D+ G +   +L  K Y ++ VD ++ Y     +K K D   + +    K +     K+  +  D+G+++ +     FC+ +GI +  +   TPQ NGV E
Subjt:  DLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVE

Query:  KKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIR--TGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG
        +  R + E  R M+    L   FW EAV T +++ NR+  R    ++ T YE+W  +KP +K+  VFG+T Y+   +  + K+D +S + IF+G
Subjt:  KKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIR--TGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.4e-2729.93Show/hide
Query:  DLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVE
        D+ G M+ ES+GG +Y +  +DD SR  WV  LK K    ++ +K    ++RE  +K+ R+RSD+G ++ +  F  +C   GI HE +   TPQ NGV E
Subjt:  DLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVE

Query:  KKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAGY
        + NR + E  R M+    LP  FW EAV T  ++ NR             +W  ++ +  +  VFG   +    +E R K D +S   IF+G      GY
Subjt:  KKNRMLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG-TLRIAGY

Query:  RVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPN---MSEIRTPSTV-EVSKADNPSDDPGKSLKKSSEEI
        R+++     V+ + +VV    +S ++   D  ++  N    + +  PST    + A++ +D+  +  ++  E I
Subjt:  RVFNNRFGSVMETINVVINDLDSAIKQMNDEEDETPN---MSEIRTPSTV-EVSKADNPSDDPGKSLKKSSEEI

P22382 Gag-Pol polyprotein1.3e-0433.03Show/hide
Query:  FLKDKIDTVEICKK---MCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEM-TRVMIHAKNLPLCFWVEA
        F+K ++ T E  KK     LKL  + W  I+++ +D+G  F ++   + C   GI H F     PQ  GVVE KN+ L+E+  ++    K L       A
Subjt:  FLKDKIDTVEICKK---MCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEM-TRVMIHAKNLPLCFWVEA

Query:  VNTVSHIHN
        V   + IHN
Subjt:  VNTVSHIHN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.4e-1127.53Show/hide
Query:  RYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEMTRVMI
        RY ++ VD ++RYTW+  LK K    E        L+     +I    SD+G +F       +    GI H  S   TP+ NG+ E+K+R + E    ++
Subjt:  RYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRMLQEMTRVMI

Query:  HAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG
           ++P  +W  A     ++ NR+        + ++      PN     VFG  CY       + K D +S Q +FLG
Subjt:  HAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-1227.96Show/hide
Query:  SLGGKRYVLVVVDDYSRYTWVCFLKDKI---DTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRML
        S+   RY ++ VD ++RYTW+  LK K    DT  I K +   ++     +I  + SD+G +F       +    GI H  S   TP+ NG+ E+K+R +
Subjt:  SLGGKRYVLVVVDDYSRYTWVCFLKDKI---DTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNRML

Query:  QEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG
         EM   ++   ++P  +W  A +   ++ NR+        + ++    + PN +   VFG  CY       R K + +S+Q  F+G
Subjt:  QEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCATGGGTCTAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTATGTGCTGGTAGTAGTTGATGATTACTCAAGATATACTTGGGTTTGCTTTCTCAAA
GACAAAATAGATACTGTTGAAATATGCAAAAAAATGTGTTTGAAGCTACAGCGTGAAAAATGGAAGAAGATAACCAGGATCCGAAGTGATCATGGTAAAAAGTTT
GATAATGAAGGCTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTTTCTGCACTTATAACTCCTCAACAAAATGGTGTAGTAGAAAAAAAGAACAGG
ATGTTACAAGAAATGACACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGTAGAAGCTGTAAATACTGTCAGTCACATTCATAACAGGGTAACT
ATTAGAACTGGAACGACTGTTACATTTTATGAACTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCATGTGTTTGGAAGTACGTGTTATATCTTAGCTGAC
AGGGAATACCGTCAGAAATGGGATGCAAGGTCAGAACAAGGAATCTTTCTCGGGACTCTCAGAATAGCCGGGTATAGAGTTTTCAATAACAGATTTGGAAGTGTT
ATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAATTAGAACTCCGAGT
ACTGTAGAAGTTTCTAAAGCTGATAACCCATCTGATGATCCAGGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCACTAAAAAATTAAAACTAATTCCATTT
GCTCATGTGAAGAAAAATCATCCAACAAGCTTTATTATAGGTGATCCGTCAGCTGGGATGCAGACCAGAAGGAAAGAAAAGATTGGTTACATGAAGATGGTTGTT
GATTTATGTTATACTTCCACCATTGAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCTCATGGGTCTAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTATGTGCTGGTAGTAGTTGATGATTACTCAAGATATACTTGGGTTTGCTTTCTCAAA
GACAAAATAGATACTGTTGAAATATGCAAAAAAATGTGTTTGAAGCTACAGCGTGAAAAATGGAAGAAGATAACCAGGATCCGAAGTGATCATGGTAAAAAGTTT
GATAATGAAGGCTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTTTCTGCACTTATAACTCCTCAACAAAATGGTGTAGTAGAAAAAAAGAACAGG
ATGTTACAAGAAATGACACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGTAGAAGCTGTAAATACTGTCAGTCACATTCATAACAGGGTAACT
ATTAGAACTGGAACGACTGTTACATTTTATGAACTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCATGTGTTTGGAAGTACGTGTTATATCTTAGCTGAC
AGGGAATACCGTCAGAAATGGGATGCAAGGTCAGAACAAGGAATCTTTCTCGGGACTCTCAGAATAGCCGGGTATAGAGTTTTCAATAACAGATTTGGAAGTGTT
ATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAATTAGAACTCCGAGT
ACTGTAGAAGTTTCTAAAGCTGATAACCCATCTGATGATCCAGGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCACTAAAAAATTAAAACTAATTCCATTT
GCTCATGTGAAGAAAAATCATCCAACAAGCTTTATTATAGGTGATCCGTCAGCTGGGATGCAGACCAGAAGGAAAGAAAAGATTGGTTACATGAAGATGGTTGTT
GATTTATGTTATACTTCCACCATTGAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAGTATTGA
Protein sequenceShow/hide protein sequence
MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKDKIDTVEICKKMCLKLQREKWKKITRIRSDHGKKFDNEGFNSFCLLEGIHHEFSALITPQQNGVVEKKNR
MLQEMTRVMIHAKNLPLCFWVEAVNTVSHIHNRVTIRTGTTVTFYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGTLRIAGYRVFNNRFGSV
METINVVINDLDSAIKQMNDEEDETPNMSEIRTPSTVEVSKADNPSDDPGKSLKKSSEEIITKKLKLIPFAHVKKNHPTSFIIGDPSAGMQTRRKEKIGYMKMVV
DLCYTSTIEPSTVDSALKDEY