; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G010855 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G010855
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag/pol protein
Genome locationCG_Chr05:12351334..12352122
RNA-Seq ExpressionClCG05G010855
SyntenyClCG05G010855
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.9e-11079.39Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QY+LGIQIVRNRKNK LAMSQASYIDK+ SRYKMQNSK+G LPFRH IHLSKEQCPKTPQEVEDMR I Y+ AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +KNILKYLRRTR+Y  V+   +LIL GYT+SDFQ+D D+RKSTS SV  LN GA+VW+S KQ CIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NMHLPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIV RGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-10978.63Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +K +LKYLRRTRDY  V+   +LIL GYT+SDFQTD DSRKSTS SV  LN GA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]5.9e-11180.15Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +K ILKYLRRTRDY  V+   +LIL GYTNSDFQTD DSRKSTSRSV  LN GA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWLKKFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-10978.63Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +K +LKYLRRTRDY  V+   +LIL GYT+SDFQTD DSRKSTS SV  LN GA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-11079.77Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWTT+K ILKYLRRTRDY  V+   +LIL GYT+SDFQTD DSRKSTS SV  LNEGA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein2.9e-11180.15Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +K ILKYLRRTRDY  V+   +LIL GYTNSDFQTD DSRKSTSRSV  LN GA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWLKKFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

A0A5A7TZD0 Gag/pol protein2.1e-10978.63Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +K +LKYLRRTRDY  V+   +LIL GYT+SDFQTD DSRKSTS SV  LN GA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

A0A5A7UYE8 Gag/pol protein2.1e-10978.63Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +K +LKYLRRTRDY  V+   +LIL GYT+SDFQTD DSRKSTS SV  LN GA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

A0A5A7V1F5 Gag/pol protein6.4e-11179.77Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QYVLGIQI+R+RKNK LA+SQA+YIDK+  RY MQNSK+GLLPFRH +HLSKEQ PKTPQEVEDMRRI YA AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWTT+K ILKYLRRTRDY  V+   +LIL GYT+SDFQTD DSRKSTS SV  LNEGA+VW+S KQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NM+LPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIVQRGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

E2GK51 Gag/pol protein (Fragment)1.4e-11079.39Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG+ QY+LGIQIVRNRKNK LAMSQASYIDK+ SRYKMQNSK+G LPFRH IHLSKEQCPKTPQEVEDMR I Y+ AVGSLMYAMLCTR +ICY VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        IVS Y+SNP  DHWT +KNILKYLRRTR+Y  V+   +LIL GYT+SDFQ+D D+RKSTS SV  LN GA+VW+S KQ CIADSTMEAEYVAACEAAKE 
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW-RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM
        VWL+KFL  LE+V NMHLPITLYCDNSGAVANSKE RSHKRGKHIE KY LIREIV RGDV+
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVM

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.3e-3033.21Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLP----FRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNIC
        M DL + ++ +GI+I    +   + +SQ++Y+ K+ S++ M+N      P      +E+  S E C  TP              +G LMY MLCTR ++ 
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLP----FRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNIC

Query:  YVVGIVSSYESNPEYDHWTTIKNILKYLRRTRDYARVWRLNLI----LIGYTNSDFQ-TDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVA
          V I+S Y S    + W  +K +L+YL+ T D   +++ NL     +IGY +SD+  ++ID + +T     + +   I W + +Q  +A S+ EAEY+A
Subjt:  YVVGIVSSYESNPEYDHWTTIKNILKYLRRTRDYARVWRLNLI----LIGYTNSDFQ-TDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVA

Query:  ACEAAKEVVWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQ
          EA +E +WLK  L  + I L    PI +Y DN G ++ +     HKR KHI+ KY   RE VQ
Subjt:  ACEAAKEVVWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQ

P0CV72 Secreted RxLR effector protein 1611.2e-2139.1Show/hide
Query:  MRRISYALAVGSLMYAMLCTRSNICYVVGIVSSYESNPEYDHWTTIKNILKYLRRTRDYARVWRL--NLILIGYTNSDFQTDIDSRKSTSRSVLILNEGA
        M+ + Y  AVG++MY M+ TR ++   VG++S + S+P   HW  +K +L+YL+ T+ Y   +       L+GY+++D+  D++SR+STS  +  LN G 
Subjt:  MRRISYALAVGSLMYAMLCTRSNICYVVGIVSSYESNPEYDHWTTIKNILKYLRRTRDYARVWRL--NLILIGYTNSDFQTDIDSRKSTSRSVLILNEGA

Query:  IVWKSTKQGCIADSTMEAEYVAACEAAKEVVWL
        + W+S KQ  +A S+ E EY+A  EA +E VWL
Subjt:  IVWKSTKQGCIADSTMEAEYVAACEAAKEVVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.5e-5244.14Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        MKDLG  Q +LG++IVR R ++ L +SQ  YI+++  R+ M+N+K    P    + LSK+ CP T +E  +M ++ Y+ AVGSLMYAM+CTR +I + VG
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLR-RTRDYARVWRLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV
        +VS +  NP  +HW  +K IL+YLR  T D       + IL GYT++D   DID+RKS++  +   + GAI W+S  Q C+A ST EAEY+AA E  KE+
Subjt:  IVSSYESNPEYDHWTTIKNILKYLR-RTRDYARVWRLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIV
        +WLK+FL  L +    ++   +YCD+  A+  SK    H R KHI+ +Y  IRE+V
Subjt:  VWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-1832.14Show/hide
Query:  YVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVGIVSSYESN
        Y LGI+    R    L +SQ  YI  + +R  M  +K    P      LS     K     E      Y   VGSL Y +  TR +I Y V  +S +   
Subjt:  YVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVGIVSSYESN

Query:  PEYDHWTTIKNILKYLRRTRDYARVWRL--NLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEVVWLKKFL
        P  +H   +K IL+YL  T ++    +    L L  Y+++D+  D D   ST+  ++ L    I W S KQ  +  S+ EAEY +    + E+ W+   L
Subjt:  PEYDHWTTIKNILKYLRRTRDYARVWRL--NLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEVVWLKKFL

Query:  AHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRG
          L I L    P  +YCDN GA         H R KHI   Y  IR  VQ G
Subjt:  AHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.1e-2232.94Show/hide
Query:  YVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVGIVSSYESN
        Y LGI+    R  + L +SQ  Y   + +R  M  +K    P      L+     K P   E      Y   VGSL Y +  TR ++ Y V  +S Y   
Subjt:  YVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVGIVSSYESN

Query:  PEYDHWTTIKNILKYLRRTRDYARVWRL--NLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEVVWLKKFL
        P  DHW  +K +L+YL  T D+    +    L L  Y+++D+  D D   ST+  ++ L    I W S KQ  +  S+ EAEY +    + E+ W+   L
Subjt:  PEYDHWTTIKNILKYLRRTRDYARVWRL--NLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEVVWLKKFL

Query:  AHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRG
          L I L+ H P+ +YCDN GA         H R KHI   Y  IR  VQ G
Subjt:  AHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-1927.84Show/hide
Query:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG
        ++DLG  +Y LG++I R+     + + Q  Y   +     +   K   +P    +  S      +  +  D +  +Y   +G LMY  + TR +I + V 
Subjt:  MKDLGD-QYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVG

Query:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW--RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKE
         +S +   P   H   +  IL Y++ T      +  +  + L  ++++ FQ+  D+R+ST+   + L    I WKS KQ  ++ S+ EAEY A   A  E
Subjt:  IVSSYESNPEYDHWTTIKNILKYLRRTRDYARVW--RLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKE

Query:  VVWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIRE
        ++WL +F   L++ L+   P  L+CDN+ A+  +     H+R KHIE     +RE
Subjt:  VVWLKKFLAHLEIVLNMHLPITLYCDNSGAVANSKELRSHKRGKHIEHKYQLIRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATTTGGGAGACCAGTATGTTCTTGGAATCCAAATAGTTCGGAATCGCAAGAACAAGATGTTAGCTATGTCTCAGGCATCATATATTGACAAAATGTTCTCTAG
ATATAAGATGCAAAATTCCAAGAGAGGTCTATTACCGTTCAGGCATGAAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGCA
TTTCTTATGCTTTAGCAGTCGGAAGTCTGATGTATGCTATGTTGTGTACACGATCTAACATATGCTATGTAGTAGGAATAGTCAGCAGCTATGAGTCCAATCCAGAATAT
GATCATTGGACTACCATTAAAAATATCCTCAAGTATCTAAGGAGAACGAGGGACTATGCTCGTGTATGGCGCTTAAATCTGATCCTTATAGGATACACTAACTCTGATTT
TCAGACCGATATAGATTCAAGGAAATCAACATCGAGATCAGTGTTAATTCTAAATGAAGGAGCAATAGTTTGGAAGAGTACCAAACAAGGCTGTATAGCTGACTCCACCA
TGGAAGCTGAGTATGTAGCTGCATGTGAAGCAGCAAAAGAAGTAGTATGGCTTAAAAAGTTCTTAGCGCATTTGGAAATTGTTCTAAATATGCATCTGCCTATCACTCTC
TATTGTGATAATAGTGGTGCAGTTGCAAATTCTAAAGAACTCAGAAGCCATAAGCGGGGCAAACACATTGAACACAAATATCAGCTCATCAGGGAGATTGTGCAAAGAGG
AGACGTAATGGCCAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATTTGGGAGACCAGTATGTTCTTGGAATCCAAATAGTTCGGAATCGCAAGAACAAGATGTTAGCTATGTCTCAGGCATCATATATTGACAAAATGTTCTCTAG
ATATAAGATGCAAAATTCCAAGAGAGGTCTATTACCGTTCAGGCATGAAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGCA
TTTCTTATGCTTTAGCAGTCGGAAGTCTGATGTATGCTATGTTGTGTACACGATCTAACATATGCTATGTAGTAGGAATAGTCAGCAGCTATGAGTCCAATCCAGAATAT
GATCATTGGACTACCATTAAAAATATCCTCAAGTATCTAAGGAGAACGAGGGACTATGCTCGTGTATGGCGCTTAAATCTGATCCTTATAGGATACACTAACTCTGATTT
TCAGACCGATATAGATTCAAGGAAATCAACATCGAGATCAGTGTTAATTCTAAATGAAGGAGCAATAGTTTGGAAGAGTACCAAACAAGGCTGTATAGCTGACTCCACCA
TGGAAGCTGAGTATGTAGCTGCATGTGAAGCAGCAAAAGAAGTAGTATGGCTTAAAAAGTTCTTAGCGCATTTGGAAATTGTTCTAAATATGCATCTGCCTATCACTCTC
TATTGTGATAATAGTGGTGCAGTTGCAAATTCTAAAGAACTCAGAAGCCATAAGCGGGGCAAACACATTGAACACAAATATCAGCTCATCAGGGAGATTGTGCAAAGAGG
AGACGTAATGGCCAAGTAG
Protein sequenceShow/hide protein sequence
MKDLGDQYVLGIQIVRNRKNKMLAMSQASYIDKMFSRYKMQNSKRGLLPFRHEIHLSKEQCPKTPQEVEDMRRISYALAVGSLMYAMLCTRSNICYVVGIVSSYESNPEY
DHWTTIKNILKYLRRTRDYARVWRLNLILIGYTNSDFQTDIDSRKSTSRSVLILNEGAIVWKSTKQGCIADSTMEAEYVAACEAAKEVVWLKKFLAHLEIVLNMHLPITL
YCDNSGAVANSKELRSHKRGKHIEHKYQLIREIVQRGDVMAK