; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G009090 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G009090
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag/pol protein
Genome locationCG_Chr01:11431513..11435230
RNA-Seq ExpressionClCG01G009090
SyntenyClCG01G009090
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-8634.01Show/hide
Query:  RIAAPRRLVTQVRAYTKSRPSSLRKKNMMKKKGKGKTTA-SFKGKKKTKLRKKY-----------------------KETSSWKQLRDGEITLKVRLGEI
        +I     +V   R +  S   S + +   + K KG T A   KGK K  +++KY                       +ETSS+KQL + E+TLKV  G++
Subjt:  RIAAPRRLVTQVRAYTKSRPSSLRKKNMMKKKGKGKTTA-SFKGKKKTKLRKKY-----------------------KETSSWKQLRDGEITLKVRLGEI

Query:  ISAKT-------------------------LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF-----------
        ISA+                          L+D SLPPCESCLEGK+TK   F+EKG RAKE LEL+HSDLCG MN+ ARGG +  I F           
Subjt:  ISAKT-------------------------LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF-----------

Query:  -------------EYDS------------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPA
                     EY +                                                      SKSV ETP+ELWRG K SL  FRIWGCP 
Subjt:  -------------EYDS------------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPA

Query:  HVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG---------------------
        HVLV + KKL+ RS LC FVGYPKET+GGLF+DPQENRV VS NATFLEE+H+R+H+PR KL++     KS+  +D VG                     
Subjt:  HVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG---------------------

Query:  ------------ELFINPTIT-------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-----------
                      ++  T T                          W+KAMDLE+ESMYFN VW+LVD   GVKPIGCKWIYKRKRD            
Subjt:  ------------ELFINPTIT-------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-----------

Query:  ------------------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI-------------------------
                                            DYEIWQMDVKT FLNG+LE+SI+M++ EGF+ +GQE+K+                         
Subjt:  ------------------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI-------------------------

Query:  --------------------------------------------------------------PQKVEDIRCITYTSAVRSLMYAILCTRSDICYVVRIVS
                                                                      PQ+VED+R I Y SAV SLMYA+LCTR DICY V IVS
Subjt:  --------------------------------------------------------------PQKVEDIRCITYTSAVRSLMYAILCTRSDICYVVRIVS

Query:  KYQFNSRYDHWNPIE------RSTKDQKLLKKEQD
        +YQ N   DHW  ++      R T+D  L+   +D
Subjt:  KYQFNSRYDHWNPIE------RSTKDQKLLKKEQD

KAA0033228.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-8836.06Show/hide
Query:  KNMMKKKGKGKTTASFKGKKKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAK-----------------TLEDYSLPPCESCLEGKITKSFVFSEKG
        K ++KKK +G T           +    +ETSS+KQL + E+TLKV  G++ISA+                  LED SLPPCESCLEGK+TK   F+ KG
Subjt:  KNMMKKKGKGKTTASFKGKKKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAK-----------------TLEDYSLPPCESCLEGKITKSFVFSEKG

Query:  LRAKESLELVHSDLCGPMNINARGGRDCSIYF------------------------EYDS----------------------------------------
         RAKE LEL+H DLCGPMN+ ARGG +  I F                        EY +                                        
Subjt:  LRAKESLELVHSDLCGPMNINARGGRDCSIYF------------------------EYDS----------------------------------------

Query:  ---------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFV
                                                           SKSV ETP+ELWRG K SL  F+IWGC AHVLV + KKLE RS LC FV
Subjt:  ---------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFV

Query:  GYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTITW----WIKAMDLEIESMYFNFVWKLVDQSNG
        GYPKET+GGLF+DPQEN+V VSKNATFLEE+H+RDH+PR KL++     KS+   D VG    ++ T T     W+KAMDLEIESMYFN +W+LVD   G
Subjt:  GYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTITW----WIKAMDLEIESMYFNFVWKLVDQSNG

Query:  VKPIGCKWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------
        V+PIGCKWIYKRKRD                +YEIWQMDVKT FLNG+LE+SI+M+Q +GF+ QGQE+K+                              
Subjt:  VKPIGCKWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------

Query:  -----------------------------------------------------------------------------------------PQKVEDIRCIT
                                                                                                 P +VED+R I 
Subjt:  -----------------------------------------------------------------------------------------PQKVEDIRCIT

Query:  YTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKE
        Y SAV SLM+A+LCTR DICY V IVS+YQ N   DHW  ++      R T+D  L+ K+
Subjt:  YTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKE

KAA0039205.1 gag/pol protein [Cucumis melo var. makuwa]9.9e-9642.7Show/hide
Query:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKY---------KETSSWKQLRDGEITLKVRLGEIISAKT-------------------------LE
        SS  KK   +K+GKGK     A  KGK K  ++ K+         +ETSS KQL + E+TLKV  G++ISA+                          LE
Subjt:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKY---------KETSSWKQLRDGEITLKVRLGEIISAKT-------------------------LE

Query:  DYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF--EYDSSK------------------------------SVLET
        D SLPPCESCLEGK+TK    + K  RAKESLEL+HSDLCGPMN+ ARGG +  I F  +Y S K                              S    
Subjt:  DYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF--EYDSSK------------------------------SVLET

Query:  PYE--------------LWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLV
        P +              LWRG K SL  FRIWGCPAHVLV++ KKLE RS LC F GYPKET+GGLF++PQENRV VS NATFLEE+H+RDH+PR KL++
Subjt:  PYE--------------LWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLV

Query:  EHILLKSSECLDVVG-ELFINPTIT-------------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-
           + +S+  +D VG    ++ T T                               + +KAMDLE+ESMYFN VW+LVD   GVKP GCKWIYKRKRD  
Subjt:  EHILLKSSECLDVVG-ELFINPTIT-------------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-

Query:  --------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------------------PQKVE
                      DYEIWQMDVKT  LNG+LE+ I+M+Q EGF+ QGQE+K+                                          PQ+VE
Subjt:  --------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------------------PQKVE

Query:  DIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE
        D+R I+Y S V SL++A+LCTR DICY V IVS+YQ N   DHW  ++
Subjt:  DIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE

KAA0066736.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-8836.85Show/hide
Query:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKYK------------ETSSWKQLRDGEITLKVRLGEIISAKT------------------------
        SS  +K   +K+GKGK        KGK K  ++KK K            ETSS+KQL + E+TLKV   +IISA+                         
Subjt:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKYK------------ETSSWKQLRDGEITLKVRLGEIISAKT------------------------

Query:  -LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF-------------------------------EYD------
         LED SLPPCESCLEGK+TK   F+ KG RAKE LEL+HSDLCGPMN+ ARGG +  I F                                YD      
Subjt:  -LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF-------------------------------EYD------

Query:  -----------------------SSKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATF
                                SKSV ETP++LWRG K SL  FRIWGCP H+LV + KKLE R  LC FVGYPKET+ GLF+DPQENRV VS NATF
Subjt:  -----------------------SSKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATF

Query:  LEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT---------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIY
        LEE+H+RDH+PR KL++     +S+  +D VG    ++ T T                      W+KAM+LE+ES+YFN VW+LVD   GVKPIGCKWIY
Subjt:  LEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT---------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIY

Query:  KRKRDQ------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI-------------------------------
        KRKRD                         DYEIWQMDVKT FLNG+LE+ I+++QLEGF+ +GQE+K+                               
Subjt:  KRKRDQ------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI-------------------------------

Query:  ---------------------------------------------------------------------------------PQKVEDIRCITYTSAVRSL
                                                                                         PQ+VED+R I Y S V SL
Subjt:  ---------------------------------------------------------------------------------PQKVEDIRCITYTSAVRSL

Query:  MYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKEQD
        MY +LCTR DICY V IVS+Y+ N   D W  ++      R T+D  L+   +D
Subjt:  MYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKEQD

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-8436.54Show/hide
Query:  KNMMKKKGKGKTTASFKGKKKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAKT-------------------------LEDYSLPPCESCLEGKITK
        K ++KKK K   T          +    +ETSS+KQL D E+TLKV  G++ISA+                          L+D SLPPCESCLEGK+TK
Subjt:  KNMMKKKGKGKTTASFKGKKKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAKT-------------------------LEDYSLPPCESCLEGKITK

Query:  SFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF------------------------EYDS--------------------------------
           F+ KG RAKE LEL+HSDLCGPMN+ ARGG +  I F                        EY +                                
Subjt:  SFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF------------------------EYDS--------------------------------

Query:  -----------------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLER
                                                                   SKSV ETP+ELWRG K SL  FRIWGCPAHVLV + KKLE 
Subjt:  -----------------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLER

Query:  RSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG---------------------------------
        RS LC FVGYPKET+GGLF+DPQENRV VS NATFLEE+H+R+H+PR KL++     +S+  +D VG                                 
Subjt:  RSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG---------------------------------

Query:  ELFINPTIT-------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-----------------------
          ++  T T                          W+KAMDLE+ESMYFN VW+LVD   GVKPIGCKWIYKRKRD                        
Subjt:  ELFINPTIT-------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-----------------------

Query:  ------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI---------------------------------PQKV
                                DYEIWQMDVKT FLNG+LE+SI+M+Q E F+ QGQE+K+                                 PQ+V
Subjt:  ------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI---------------------------------PQKV

Query:  EDIRCITYTSAVRSLMYAILCTRSDICYVVRIVSK
        ED+R I Y SAV SLMYA+LCTR DICY V IVS+
Subjt:  EDIRCITYTSAVRSLMYAILCTRSDICYVVRIVSK

TrEMBL top hitse value%identityAlignment
A0A5A7SRR6 Gag/pol protein2.2e-8836.06Show/hide
Query:  KNMMKKKGKGKTTASFKGKKKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAK-----------------TLEDYSLPPCESCLEGKITKSFVFSEKG
        K ++KKK +G T           +    +ETSS+KQL + E+TLKV  G++ISA+                  LED SLPPCESCLEGK+TK   F+ KG
Subjt:  KNMMKKKGKGKTTASFKGKKKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAK-----------------TLEDYSLPPCESCLEGKITKSFVFSEKG

Query:  LRAKESLELVHSDLCGPMNINARGGRDCSIYF------------------------EYDS----------------------------------------
         RAKE LEL+H DLCGPMN+ ARGG +  I F                        EY +                                        
Subjt:  LRAKESLELVHSDLCGPMNINARGGRDCSIYF------------------------EYDS----------------------------------------

Query:  ---------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFV
                                                           SKSV ETP+ELWRG K SL  F+IWGC AHVLV + KKLE RS LC FV
Subjt:  ---------------------------------------------------SKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFV

Query:  GYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTITW----WIKAMDLEIESMYFNFVWKLVDQSNG
        GYPKET+GGLF+DPQEN+V VSKNATFLEE+H+RDH+PR KL++     KS+   D VG    ++ T T     W+KAMDLEIESMYFN +W+LVD   G
Subjt:  GYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTITW----WIKAMDLEIESMYFNFVWKLVDQSNG

Query:  VKPIGCKWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------
        V+PIGCKWIYKRKRD                +YEIWQMDVKT FLNG+LE+SI+M+Q +GF+ QGQE+K+                              
Subjt:  VKPIGCKWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------

Query:  -----------------------------------------------------------------------------------------PQKVEDIRCIT
                                                                                                 P +VED+R I 
Subjt:  -----------------------------------------------------------------------------------------PQKVEDIRCIT

Query:  YTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKE
        Y SAV SLM+A+LCTR DICY V IVS+YQ N   DHW  ++      R T+D  L+ K+
Subjt:  YTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKE

A0A5A7T1D8 Gag/pol protein1.1e-8435.76Show/hide
Query:  SHNRIAAPRRLVTQVRAYTKSRPSSLRKKNMMKKKGKGKTTASFKGK--------------KKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAKT--
        +H+R  AP    ++     K R     K   +  +GKGKT    KGK               K  ++KK KE SS+KQL + E+TL V +G++ISA+   
Subjt:  SHNRIAAPRRLVTQVRAYTKSRPSSLRKKNMMKKKGKGKTTASFKGK--------------KKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAKT--

Query:  -----------------------LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYFEYDSS-------------
                               LED SLPPCESCL+GK+TK   F+ KG RAKE LEL++SDLCGPMN+ ARGG +C I F  D S             
Subjt:  -----------------------LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYFEYDSS-------------

Query:  -----------------------------------------------------------------KSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDK
                                                                          SV ETP+ELWRG K SL  F+IWGCPAHVL  + 
Subjt:  -----------------------------------------------------------------KSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDK

Query:  KKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT------------------
        KKLE RS LC F+GYPKET+GGLF+DPQENRV VS NATFLEE+H+RDH+ R KL++  +  +S+  ++ VG    ++ T T                  
Subjt:  KKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT------------------

Query:  ---WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI
            W KAMDLE+ESMYFN VW+LVD   GVKPIGCKWIYKRKRD                DYEIWQMDVKT FLNG+LE+SI+M+Q EGF+ QGQ++K+
Subjt:  ---WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------PQKVEDIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDH
                     Q+VED+R I Y S V SLMY +LCTR DICY + IVS+YQ +   DH
Subjt:  ------------PQKVEDIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDH

A0A5D3BH07 Gag/pol protein9.4e-8437.35Show/hide
Query:  KETSSWKQLRDGEITLKVRLGEIISAKTLEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYFEYDSSK-------
        +ETSS+KQL + E+TLKV  G++ISA+ + D            K      F+EKG RAKE LEL+HSDLC PMN+ ARGG +  I F  D S+       
Subjt:  KETSSWKQLRDGEITLKVRLGEIISAKTLEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYFEYDSSK-------

Query:  ------------------------------SVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSK
                                      SV ETP+ELWRG K SL  FRIWGCPAHVLV + KKLE RS LC FVGYPKET+GGLF+DPQ NRV VS 
Subjt:  ------------------------------SVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSK

Query:  NATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT---------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGC
        NATFLEE+H+RDH+PR KL++     +S+  +D VG    +N T T                      W+KAMDLE+ESMYFN +W+LVD   GVKPIGC
Subjt:  NATFLEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT---------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGC

Query:  KWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------------
        KWIYKRKRD                DYEIWQMDVK  +LN + E+SI+M+Q EGF+ QGQE+K+                                    
Subjt:  KWIYKRKRDQ---------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------PQKVEDIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIERST
                 PQ+VED+R I Y SAV SLMY +LCT+SDI   V IVS+YQ N   DHW    +ST
Subjt:  ---------PQKVEDIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIERST

A0A5D3D4K7 Gag/pol protein4.8e-9642.7Show/hide
Query:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKY---------KETSSWKQLRDGEITLKVRLGEIISAKT-------------------------LE
        SS  KK   +K+GKGK     A  KGK K  ++ K+         +ETSS KQL + E+TLKV  G++ISA+                          LE
Subjt:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKY---------KETSSWKQLRDGEITLKVRLGEIISAKT-------------------------LE

Query:  DYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF--EYDSSK------------------------------SVLET
        D SLPPCESCLEGK+TK    + K  RAKESLEL+HSDLCGPMN+ ARGG +  I F  +Y S K                              S    
Subjt:  DYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF--EYDSSK------------------------------SVLET

Query:  PYE--------------LWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLV
        P +              LWRG K SL  FRIWGCPAHVLV++ KKLE RS LC F GYPKET+GGLF++PQENRV VS NATFLEE+H+RDH+PR KL++
Subjt:  PYE--------------LWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLEENHIRDHQPR-KLLV

Query:  EHILLKSSECLDVVG-ELFINPTIT-------------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-
           + +S+  +D VG    ++ T T                               + +KAMDLE+ESMYFN VW+LVD   GVKP GCKWIYKRKRD  
Subjt:  EHILLKSSECLDVVG-ELFINPTIT-------------------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQ-

Query:  --------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------------------PQKVE
                      DYEIWQMDVKT  LNG+LE+ I+M+Q EGF+ QGQE+K+                                          PQ+VE
Subjt:  --------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI------------------------------------------PQKVE

Query:  DIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE
        D+R I+Y S V SL++A+LCTR DICY V IVS+YQ N   DHW  ++
Subjt:  DIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE

A0A5D3DWJ5 Gag/pol protein2.2e-8836.85Show/hide
Query:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKYK------------ETSSWKQLRDGEITLKVRLGEIISAKT------------------------
        SS  +K   +K+GKGK        KGK K  ++KK K            ETSS+KQL + E+TLKV   +IISA+                         
Subjt:  SSLRKKNMMKKKGKGK---TTASFKGKKKTKLRKKYK------------ETSSWKQLRDGEITLKVRLGEIISAKT------------------------

Query:  -LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF-------------------------------EYD------
         LED SLPPCESCLEGK+TK   F+ KG RAKE LEL+HSDLCGPMN+ ARGG +  I F                                YD      
Subjt:  -LEDYSLPPCESCLEGKITKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYF-------------------------------EYD------

Query:  -----------------------SSKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATF
                                SKSV ETP++LWRG K SL  FRIWGCP H+LV + KKLE R  LC FVGYPKET+ GLF+DPQENRV VS NATF
Subjt:  -----------------------SSKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATF

Query:  LEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT---------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIY
        LEE+H+RDH+PR KL++     +S+  +D VG    ++ T T                      W+KAM+LE+ES+YFN VW+LVD   GVKPIGCKWIY
Subjt:  LEENHIRDHQPR-KLLVEHILLKSSECLDVVG-ELFINPTIT---------------------WWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIY

Query:  KRKRDQ------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI-------------------------------
        KRKRD                         DYEIWQMDVKT FLNG+LE+ I+++QLEGF+ +GQE+K+                               
Subjt:  KRKRDQ------------------------DYEIWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI-------------------------------

Query:  ---------------------------------------------------------------------------------PQKVEDIRCITYTSAVRSL
                                                                                         PQ+VED+R I Y S V SL
Subjt:  ---------------------------------------------------------------------------------PQKVEDIRCITYTSAVRSL

Query:  MYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKEQD
        MY +LCTR DICY V IVS+Y+ N   D W  ++      R T+D  L+   +D
Subjt:  MYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIE------RSTKDQKLLKKEQD

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-1522.32Show/hide
Query:  CSIYFEYDSSKSVLETPYELWRGHKGSLCLFRIWGCP--AHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLE-----------
        C +     S     E P  +W   + S    +++GC   AHV  E + KL+ +S  C+F+GY  E  G   +DP + +V+ S++  F E           
Subjt:  CSIYFEYDSSKSVLETPYELWRGHKGSLCLFRIWGCP--AHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQENRVLVSKNATFLE-----------

Query:  --ENHI-------------------------------------------------------RDHQP------------RKLLVEHILLKSSECLDVVGEL
          +N I                                                         HQP            R    E++L+      + + E+
Subjt:  --ENHI-------------------------------------------------------RDHQP------------RKLLVEHILLKSSECLDVVGEL

Query:  FINPTITWWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRD-----------------------------------------------QDYE
          +P     +KAM  E+ES+  N  +KLV+   G +P+ CKW++K K+D                                                D E
Subjt:  FINPTITWWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRD-----------------------------------------------QDYE

Query:  IWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI
        + Q+DVKT FL+G LE+ IYM Q EGF   G++  +
Subjt:  IWQMDVKTTFLNGHLEDSIYMTQLEGFLEQGQEKKI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.9e-0524.24Show/hide
Query:  WIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQD-----------------------------------------------YEIWQMDVKT
        W  AMD EI +M     W++       KPIGCKW+YK K + D                                               + + Q+D+  
Subjt:  WIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQD-----------------------------------------------YEIWQMDVKT

Query:  TFLNGHLEDSIYMTQLEGFLEQGQEKKIPQKV
         FLNG L++ IYM    G+  +  +   P  V
Subjt:  TFLNGHLEDSIYMTQLEGFLEQGQEKKIPQKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTTCTCCCTTTGAAACGCATAGCCACAATCGCATTGCCGCACCCCGACGCTTGGTAACGCAAGTCCGTGCTTACACTAAGTCTAGACCTTCATCCTTGAGGAA
AAAGAATATGATGAAGAAGAAAGGAAAAGGAAAAACTACTGCAAGTTTCAAAGGCAAGAAAAAAACTAAGCTGAGAAAGAAATACAAAGAAACTAGTTCCTGGAAACAAC
TTAGAGATGGCGAGATAACTCTCAAAGTTAGATTAGGGGAGATCATCTCAGCCAAAACATTAGAAGACTACTCTTTACCTCCATGTGAGTCTTGTCTTGAGGGTAAGATA
ACCAAAAGTTTTGTTTTTTCTGAAAAAGGTCTTAGAGCCAAGGAATCCTTAGAACTCGTGCATTCAGACCTCTGTGGTCCTATGAATATTAATGCACGAGGAGGTAGAGA
CTGCAGTATATATTTTGAATATGATTCCTCTAAAAGTGTTTTAGAAACACCGTATGAATTATGGAGAGGACATAAAGGTAGTTTATGTCTCTTCAGAATTTGGGGATGCC
CAGCACATGTGTTGGTGGAAGATAAAAAGAAGTTGGAACGTCGTTCAACACTATGCCTATTTGTAGGATATCCCAAAGAAACAAAAGGTGGTTTGTTTTATGATCCTCAA
GAAAATAGGGTGCTTGTATCAAAAAACGCCACATTCTTAGAGGAAAACCACATAAGAGATCATCAACCTCGTAAGTTGTTAGTAGAACACATCCTTCTCAAGAGTTCAGA
GTGCCTCGACGTAGTGGGAGAGTTATTCATTAACCCGACCATTACATGGTGGATCAAAGCCATGGATCTCGAAATAGAGTCTATGTACTTCAATTTTGTATGGAAACTTG
TAGATCAATCAAATGGGGTAAAACCTATTGGTTGCAAATGGATCTACAAGAGAAAACGAGACCAAGATTATGAAATTTGGCAAATGGATGTCAAGACAACTTTTCTGAAT
GGGCATCTTGAAGATAGTATTTATATGACTCAACTAGAGGGGTTCCTTGAACAAGGTCAAGAGAAAAAGATACCTCAAAAAGTTGAGGATATAAGATGCATTACCTATAC
ATCAGCAGTCAGAAGTCTGATGTATGCCATATTGTGTACACGATCAGACATATGCTATGTAGTAAGAATAGTCAGCAAGTATCAGTTCAATTCAAGATACGATCATTGGA
ATCCTATTGAGAGGTCCACTAAAGATCAGAAATTGTTGAAGAAAGAACAAGATGCATCTTTTTCAAGGTTTTCTCTCATCATTTGCCAATGTGTCCTTCGCTTACCTAAA
AAGCATAACATGAAAAGAATGAGTACAAAATACTCAGTAAGTGACCCACTACTGAGCCTACTAGTAGCGGACCTAGAGGCACGCAGTGATCCCGAAGGTCACAATCATAA
ACATAAACATAGGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACTTCTCCCTTTGAAACGCATAGCCACAATCGCATTGCCGCACCCCGACGCTTGGTAACGCAAGTCCGTGCTTACACTAAGTCTAGACCTTCATCCTTGAGGAA
AAAGAATATGATGAAGAAGAAAGGAAAAGGAAAAACTACTGCAAGTTTCAAAGGCAAGAAAAAAACTAAGCTGAGAAAGAAATACAAAGAAACTAGTTCCTGGAAACAAC
TTAGAGATGGCGAGATAACTCTCAAAGTTAGATTAGGGGAGATCATCTCAGCCAAAACATTAGAAGACTACTCTTTACCTCCATGTGAGTCTTGTCTTGAGGGTAAGATA
ACCAAAAGTTTTGTTTTTTCTGAAAAAGGTCTTAGAGCCAAGGAATCCTTAGAACTCGTGCATTCAGACCTCTGTGGTCCTATGAATATTAATGCACGAGGAGGTAGAGA
CTGCAGTATATATTTTGAATATGATTCCTCTAAAAGTGTTTTAGAAACACCGTATGAATTATGGAGAGGACATAAAGGTAGTTTATGTCTCTTCAGAATTTGGGGATGCC
CAGCACATGTGTTGGTGGAAGATAAAAAGAAGTTGGAACGTCGTTCAACACTATGCCTATTTGTAGGATATCCCAAAGAAACAAAAGGTGGTTTGTTTTATGATCCTCAA
GAAAATAGGGTGCTTGTATCAAAAAACGCCACATTCTTAGAGGAAAACCACATAAGAGATCATCAACCTCGTAAGTTGTTAGTAGAACACATCCTTCTCAAGAGTTCAGA
GTGCCTCGACGTAGTGGGAGAGTTATTCATTAACCCGACCATTACATGGTGGATCAAAGCCATGGATCTCGAAATAGAGTCTATGTACTTCAATTTTGTATGGAAACTTG
TAGATCAATCAAATGGGGTAAAACCTATTGGTTGCAAATGGATCTACAAGAGAAAACGAGACCAAGATTATGAAATTTGGCAAATGGATGTCAAGACAACTTTTCTGAAT
GGGCATCTTGAAGATAGTATTTATATGACTCAACTAGAGGGGTTCCTTGAACAAGGTCAAGAGAAAAAGATACCTCAAAAAGTTGAGGATATAAGATGCATTACCTATAC
ATCAGCAGTCAGAAGTCTGATGTATGCCATATTGTGTACACGATCAGACATATGCTATGTAGTAAGAATAGTCAGCAAGTATCAGTTCAATTCAAGATACGATCATTGGA
ATCCTATTGAGAGGTCCACTAAAGATCAGAAATTGTTGAAGAAAGAACAAGATGCATCTTTTTCAAGGTTTTCTCTCATCATTTGCCAATGTGTCCTTCGCTTACCTAAA
AAGCATAACATGAAAAGAATGAGTACAAAATACTCAGTAAGTGACCCACTACTGAGCCTACTAGTAGCGGACCTAGAGGCACGCAGTGATCCCGAAGGTCACAATCATAA
ACATAAACATAGGTGGTGA
Protein sequenceShow/hide protein sequence
MTTSPFETHSHNRIAAPRRLVTQVRAYTKSRPSSLRKKNMMKKKGKGKTTASFKGKKKTKLRKKYKETSSWKQLRDGEITLKVRLGEIISAKTLEDYSLPPCESCLEGKI
TKSFVFSEKGLRAKESLELVHSDLCGPMNINARGGRDCSIYFEYDSSKSVLETPYELWRGHKGSLCLFRIWGCPAHVLVEDKKKLERRSTLCLFVGYPKETKGGLFYDPQ
ENRVLVSKNATFLEENHIRDHQPRKLLVEHILLKSSECLDVVGELFINPTITWWIKAMDLEIESMYFNFVWKLVDQSNGVKPIGCKWIYKRKRDQDYEIWQMDVKTTFLN
GHLEDSIYMTQLEGFLEQGQEKKIPQKVEDIRCITYTSAVRSLMYAILCTRSDICYVVRIVSKYQFNSRYDHWNPIERSTKDQKLLKKEQDASFSRFSLIICQCVLRLPK
KHNMKRMSTKYSVSDPLLSLLVADLEARSDPEGHNHKHKHRW