; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0104761 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0104761
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:22909195..22910087
RNA-Seq ExpressionCmc04g0104761
SyntenyCmc04g0104761
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033336.1 gag/pol protein [Cucumis melo var. makuwa]9.1e-12685.14Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK
        MKDLGNAQYVLGIQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+S KGLL Y Y IHLSKEQCPKTPQEV+DMSNI YAS VGSL+Y S   GRDHWTAVK
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK

Query:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL
        NILK L+RTKDYML+YGSKDLIL GYTDSDFQ+DKDARKSTS S+FTLNGGAVVWRSI+QSCIADSTME EY+AAC AA EAVWLKKFLTDLE+VPNMHL
Subjt:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        PIT+Y DNSGAVANSREPRS KRGKHI+RKYH IREIV++GDVTVTKISSEQNMAD F KA +AKVFES+LHGLGL
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

KAA0052272.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-12178.77Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        MKDL NAQYVLGI+IVRN+KNKT  MSQTSYIDKMLSRYKMQ+SKK LL Y Y IHLSKEQCPKTPQEV+DMSNI YASAVGSLMY  +           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                 G DHWTAVKNILK L+RTKDYML+YGSKDLIL  YTDSDFQSDKD RKSTSES+FTLNG AVVW+ IKQSCIADSTMEAEY+A+C AA EA
Subjt:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGL
        VWLKKFLTDLEIVPN+HLPITLYCDNSGAVANS+EPRSHKR KHIERKYH IREI+HRGDVT+TKISSEQN+ DPF KAL AKVFES+LH L
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGL

KAA0058854.1 gag/pol protein [Cucumis melo var. makuwa]6.1e-12278.57Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        MK LGNA YVL IQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+SKK LL Y Y IHLSKEQCPKTPQEVEDMSNI YA A+GSLMY  +           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                 GRDHWT VKNI+K L+RTKDYM +YGSKDLIL  YTDSDFQ+DKDARKSTS S+F LNGGAVVWRSIKQSCIADSTME +Y+AAC AA EA
Subjt:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        VWLKKFLTDLE+VPNM LPITLYCDNSGAVANSREPRSHK GKHIERKYH IR+IVHRGDVTVTKISSE+NMADPFIKAL AK+FES+LHGLGL
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

TYK21571.1 gag/pol protein [Cucumis melo var. makuwa]9.1e-12685.14Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK
        MKDLGNAQYVLGIQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+S KGLL Y Y IHLSKEQCPKTPQEV+DMSNI YAS VGSL+Y S   GRDHWTAVK
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK

Query:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL
        NILK L+RTKDYML+YGSKDLIL GYTDSDFQ+DKDARKSTS S+FTLNGGAVVWRSI+QSCIADSTME EY+AAC AA EAVWLKKFLTDLE+VPNMHL
Subjt:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        PIT+Y DNSGAVANSREPRS KRGKHI+RKYH IREIV++GDVTVTKISSEQNMAD F KA +AKVFES+LHGLGL
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

TYK23767.1 gag/pol protein [Cucumis melo var. makuwa]6.1e-12278.57Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        MK LGNA YVL IQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+SKK LL Y Y IHLSKEQCPKTPQEVEDMSNI YA A+GSLMY  +           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                 GRDHWT VKNI+K L+RTKDYM +YGSKDLIL  YTDSDFQ+DKDARKSTS S+F LNGGAVVWRSIKQSCIADSTME +Y+AAC AA EA
Subjt:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        VWLKKFLTDLE+VPNM LPITLYCDNSGAVANSREPRSHK GKHIERKYH IR+IVHRGDVTVTKISSE+NMADPFIKAL AK+FES+LHGLGL
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

TrEMBL top hitse value%identityAlignment
A0A5A7STL2 Gag/pol protein4.4e-12685.14Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK
        MKDLGNAQYVLGIQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+S KGLL Y Y IHLSKEQCPKTPQEV+DMSNI YAS VGSL+Y S   GRDHWTAVK
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK

Query:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL
        NILK L+RTKDYML+YGSKDLIL GYTDSDFQ+DKDARKSTS S+FTLNGGAVVWRSI+QSCIADSTME EY+AAC AA EAVWLKKFLTDLE+VPNMHL
Subjt:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        PIT+Y DNSGAVANSREPRS KRGKHI+RKYH IREIV++GDVTVTKISSEQNMAD F KA +AKVFES+LHGLGL
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

A0A5A7U945 Gag/pol protein8.6e-12278.77Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        MKDL NAQYVLGI+IVRN+KNKT  MSQTSYIDKMLSRYKMQ+SKK LL Y Y IHLSKEQCPKTPQEV+DMSNI YASAVGSLMY  +           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                 G DHWTAVKNILK L+RTKDYML+YGSKDLIL  YTDSDFQSDKD RKSTSES+FTLNG AVVW+ IKQSCIADSTMEAEY+A+C AA EA
Subjt:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGL
        VWLKKFLTDLEIVPN+HLPITLYCDNSGAVANS+EPRSHKR KHIERKYH IREI+HRGDVT+TKISSEQN+ DPF KAL AKVFES+LH L
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGL

A0A5A7UUR4 Gag/pol protein2.9e-12278.57Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        MK LGNA YVL IQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+SKK LL Y Y IHLSKEQCPKTPQEVEDMSNI YA A+GSLMY  +           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                 GRDHWT VKNI+K L+RTKDYM +YGSKDLIL  YTDSDFQ+DKDARKSTS S+F LNGGAVVWRSIKQSCIADSTME +Y+AAC AA EA
Subjt:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        VWLKKFLTDLE+VPNM LPITLYCDNSGAVANSREPRSHK GKHIERKYH IR+IVHRGDVTVTKISSE+NMADPFIKAL AK+FES+LHGLGL
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

A0A5D3DD25 Gag/pol protein4.4e-12685.14Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK
        MKDLGNAQYVLGIQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+S KGLL Y Y IHLSKEQCPKTPQEV+DMSNI YAS VGSL+Y S   GRDHWTAVK
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVK

Query:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL
        NILK L+RTKDYML+YGSKDLIL GYTDSDFQ+DKDARKSTS S+FTLNGGAVVWRSI+QSCIADSTME EY+AAC AA EAVWLKKFLTDLE+VPNMHL
Subjt:  NILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHL

Query:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        PIT+Y DNSGAVANSREPRS KRGKHI+RKYH IREIV++GDVTVTKISSEQNMAD F KA +AKVFES+LHGLGL
Subjt:  PITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

A0A5D3DJL5 Gag/pol protein2.9e-12278.57Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        MK LGNA YVL IQIVRN+KNKT  MSQTSYIDKMLSRYKMQ+SKK LL Y Y IHLSKEQCPKTPQEVEDMSNI YA A+GSLMY  +           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                 GRDHWT VKNI+K L+RTKDYM +YGSKDLIL  YTDSDFQ+DKDARKSTS S+F LNGGAVVWRSIKQSCIADSTME +Y+AAC AA EA
Subjt:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
        VWLKKFLTDLE+VPNM LPITLYCDNSGAVANSREPRSHK GKHIERKYH IR+IVHRGDVTVTKISSE+NMADPFIKAL AK+FES+LHGLGL
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-2732.34Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQS----SKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYI---------
        M DL   ++ +GI+I   Q++K   +SQ++Y+ K+LS++ M++    S        YE+  S E C           N    S +G LMYI         
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQS----SKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYI---------

Query:  ---------SIQSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLI----LLGYTDSDFQSDKDARKSTSESIFTL-NGGAVVWRSIKQSCIADSTMEAEYI
                 S ++  + W  +K +L+ LK T D  L++  K+L     ++GY DSD+   +  RKST+  +F + +   + W + +Q+ +A S+ EAEY+
Subjt:  ---------SIQSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLI----LLGYTDSDFQSDKDARKSTSESIFTL-NGGAVVWRSIKQSCIADSTMEAEYI

Query:  AACVAAYEAVWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHG
        A   A  EA+WLK  LT + I   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE V    + +  I +E  +AD F K L A  F      
Subjt:  AACVAAYEAVWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHG

Query:  LGL
        LGL
Subjt:  LGL

P0CV72 Secreted RxLR effector protein 1614.2e-1739.1Show/hide
Query:  MSNILYASAVGSLMYISIQSGRD------------------HWTAVKNILKCLKRTKDYMLMYGSKDLI-LLGYTDSDFQSDKDARKSTSESIFTLNGGA
        M N+ Y SAVG++MY+ + +  D                  HW A+K +L+ L+ T+ Y L +       L+GY+D+D+  D ++R+STS  +F LNGG 
Subjt:  MSNILYASAVGSLMYISIQSGRD------------------HWTAVKNILKCLKRTKDYMLMYGSKDLI-LLGYTDSDFQSDKDARKSTSESIFTLNGGA

Query:  VVWRSIKQSCIADSTMEAEYIAACVAAYEAVWL
        V WRS KQ  +A S+ E EY+A   A  EAVWL
Subjt:  VVWRSIKQSCIADSTMEAEYIAACVAAYEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-5038.98Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        MKDLG AQ +LG++IVR + ++   +SQ  YI+++L R+ M+++K         + LSK+ CP T +E  +M+ + Y+SAVGSLMY  +           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                 G++HW AVK IL+ L+ T    L +G  D IL GYTD+D   D D RKS++  +FT +GGA+ W+S  Q C+A ST EAEYIAA     E 
Subjt:  -------QSGRDHWTAVKNILKCLKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGLY
        +WLK+FL +L +    ++   +YCD+  A+  S+    H R KHI+ +YH IRE+V    + V KIS+ +N AD   K +    FE     +G++
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-2128.91Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        +K+  +  Y LGI+  R  +     +SQ  Y   +L+R  M ++K           L+     K P   E      Y   VGSL Y++            
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  ------QSGRDHWTAVKNILKCLKRTKDY-MLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
                  DHW A+K +L+ L  T D+ + +     L L  Y+D+D+  D D   ST+  I  L    + W S KQ  +  S+ EAEY +    + E 
Subjt:  ------QSGRDHWTAVKNILKCLKRTKDY-MLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL
         W+   LT+L I   +  P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +S+   +AD   K L    F+++   +G+
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.7e-1727.95Show/hide
Query:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------
        ++DLG  +Y LG++I R+       + Q  Y   +L    +   K         + +            + +    Y   +G LMY+ I           
Subjt:  MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISI-----------

Query:  -----QSGR-DHWTAVKNILKCLKRTKDYMLMYGSK-DLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA
             ++ R  H  AV  IL  +K T    L Y S+ ++ L  ++D+ FQS KD R+ST+     L    + W+S KQ  ++ S+ EAEY A   A  E 
Subjt:  -----QSGR-DHWTAVKNILKCLKRTKDYMLMYGSK-DLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEA

Query:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIRE
        +WL +F  +L++   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VWLKKFLTDLEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHRIRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATTTGGGAAATGCTCAATACGTTCTTGGTATCCAAATAGTTCGGAACCAGAAGAATAAAACACAAACCATGTCTCAAACATCTTATATAGACAAAATG
TTGTCAAGATATAAGATGCAGAGTTCCAAAAAGGGTCTGTTGTTGTACATATATGAAATTCATTTATCAAAAGAACAATGTCCAAAGACACCTCAAGAAGTTGAG
GATATGAGTAACATTCTCTATGCTTCTGCTGTTGGGAGCCTGATGTATATATCAATCCAATCTGGACGTGATCATTGGACAGCCGTTAAGAATATTTTAAAATGT
CTTAAAAGAACAAAAGACTACATGCTCATGTATGGTTCTAAGGATCTGATCCTTCTTGGATACACCGACTCTGATTTTCAATCTGATAAAGATGCTAGAAAGTCT
ACATCAGAATCAATTTTCACTCTGAATGGAGGAGCGGTAGTATGGAGAAGCATAAAACAATCATGTATTGCCGACTCCACTATGGAAGCTGAATATATAGCAGCT
TGTGTAGCAGCTTATGAAGCAGTATGGCTTAAAAAGTTCTTAACAGATTTGGAAATTGTTCCAAATATGCATCTACCAATCACCTTATACTGTGACAATAGTGGT
GCAGTTGCAAATTCAAGAGAGCCTAGAAGTCATAAACGAGGAAAGCACATTGAACGAAAGTACCATCGTATCAGGGAAATCGTACATCGGGGAGATGTTACAGTA
ACAAAAATCTCCTCCGAGCAAAACATGGCTGATCCGTTTATAAAAGCTCTCATGGCTAAAGTGTTTGAGAGCTATCTACATGGTCTAGGTCTATATTGTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATTTGGGAAATGCTCAATACGTTCTTGGTATCCAAATAGTTCGGAACCAGAAGAATAAAACACAAACCATGTCTCAAACATCTTATATAGACAAAATG
TTGTCAAGATATAAGATGCAGAGTTCCAAAAAGGGTCTGTTGTTGTACATATATGAAATTCATTTATCAAAAGAACAATGTCCAAAGACACCTCAAGAAGTTGAG
GATATGAGTAACATTCTCTATGCTTCTGCTGTTGGGAGCCTGATGTATATATCAATCCAATCTGGACGTGATCATTGGACAGCCGTTAAGAATATTTTAAAATGT
CTTAAAAGAACAAAAGACTACATGCTCATGTATGGTTCTAAGGATCTGATCCTTCTTGGATACACCGACTCTGATTTTCAATCTGATAAAGATGCTAGAAAGTCT
ACATCAGAATCAATTTTCACTCTGAATGGAGGAGCGGTAGTATGGAGAAGCATAAAACAATCATGTATTGCCGACTCCACTATGGAAGCTGAATATATAGCAGCT
TGTGTAGCAGCTTATGAAGCAGTATGGCTTAAAAAGTTCTTAACAGATTTGGAAATTGTTCCAAATATGCATCTACCAATCACCTTATACTGTGACAATAGTGGT
GCAGTTGCAAATTCAAGAGAGCCTAGAAGTCATAAACGAGGAAAGCACATTGAACGAAAGTACCATCGTATCAGGGAAATCGTACATCGGGGAGATGTTACAGTA
ACAAAAATCTCCTCCGAGCAAAACATGGCTGATCCGTTTATAAAAGCTCTCATGGCTAAAGTGTTTGAGAGCTATCTACATGGTCTAGGTCTATATTGTTTGTAA
Protein sequenceShow/hide protein sequence
MKDLGNAQYVLGIQIVRNQKNKTQTMSQTSYIDKMLSRYKMQSSKKGLLLYIYEIHLSKEQCPKTPQEVEDMSNILYASAVGSLMYISIQSGRDHWTAVKNILKC
LKRTKDYMLMYGSKDLILLGYTDSDFQSDKDARKSTSESIFTLNGGAVVWRSIKQSCIADSTMEAEYIAACVAAYEAVWLKKFLTDLEIVPNMHLPITLYCDNSG
AVANSREPRSHKRGKHIERKYHRIREIVHRGDVTVTKISSEQNMADPFIKALMAKVFESYLHGLGLYCL