; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G26330 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G26330
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr2:22360018..22362768
RNA-Seq ExpressionCSPI02G26330
SyntenyCSPI02G26330
Gene Ontology termsGO:0030244 - cellulose biosynthetic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016760 - cellulose synthase (UDP-forming) activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005150 - Cellulose synthase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-9337.15Show/hide
Query:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ
        KE C  A+DFRPISLTT+++K++AK LA+RLK  LP TIS  QM FV GRQIT+AIL+ANEA+D+W++KK RG + KLDIE AF K+NW FIDF+L KK 
Subjt:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ

Query:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN
        +  KWR  I SCISSVQYSI+ING+PRGRI P+RGIRQGDPLSPFIFVLAMDYLSR+L +L  +++I G+    ++NLTH+LFA DIL+FVED D+Y+ N
Subjt:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN

Query:  LHFAIHLFVKATGLNIILNKSTIFPVN---------DASWGQSKN---------------------KTVLEQYYREDSE-KAKQLE--------------
        L   +HLF  A+GLNI L+KSTIFP+N           SWG SK                        VL++  ++ S  K  QL               
Subjt:  LHFAIHLFVKATGLNIILNKSTIFPVN---------DASWGQSKN---------------------KTVLEQYYREDSE-KAKQLE--------------

Query:  -------------------------------------------------------------------------------------IILHLQGRNMDMFFT
                                                                                             II       M  F +
Subjt:  -------------------------------------------------------------------------------------IILHLQGRNMDMFFT

Query:  IAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNW----------------------STKSI--PSNDDSFL----------------VKSVL
          K+S +N+PW+++ + + WF   + WKVN+G  +SFW  NW                      S K    PS++D  L                +K+ L
Subjt:  IAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNW----------------------STKSI--PSNDDSFL----------------VKSVL

Query:  QTNAPNARNPDIAASL------------------------------KKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEIT
         T  PN  +P    +L                              K LWK + PKKCKFFI T  +  I T +++Q+RL N  L+PNWC +C KS E  
Subjt:  QTNAPNARNPDIAASL------------------------------KKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEIT

Query:  DHL
        +HL
Subjt:  DHL

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.2e-9135.81Show/hide
Query:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ
        KEKC   SD+RPISLTTSL+K++AKALANRLK  LP TI+  QM F+ GRQI DAIL+ANEA+D WK +K +G + KLD+E AF KI+W+FIDF+L KK 
Subjt:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ

Query:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIKD-INLTHLLFAHDILLFVEDSDEYIRN
        FP KWR WI +CIS+VQYSI++NG P+GRI   RGIRQGDPLSPFIFVLAMDYLSR+L HLE +  IKG++  +  N++HLLFA D+L+FVED++ Y+ N
Subjt:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIKD-INLTHLLFAHDILLFVEDSDEYIRN

Query:  LHFAIHLFVKATGLNIILNKSTIFPVN-------------------------------------------------------------------------
        L  A+ LF KA+GL    +KSTI P+N                                                                         
Subjt:  LHFAIHLFVKATGLNIILNKSTIFPVN-------------------------------------------------------------------------

Query:  ------------------------DASWGQSKNK----------------------------------TVLEQYYREDSEKAKQLEIILHLQGRNMDMFF
                                D  WG S++K                                    L +Y+ E +   K+     + +    D+  
Subjt:  ------------------------DASWGQSKNK----------------------------------TVLEQYYREDSEKAKQLEIILHLQGRNMDMFF

Query:  TIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIP-----------SNDDSFLVKSV--------------------------LQT
         + + S +N+PW +I K+ DW+ SK+ W  N+GSSLSFWH+ W   +IP           SN  S  VK +                          ++ 
Subjt:  TIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIP-----------SNDDSFLVKSV--------------------------LQT

Query:  NAPNARN----------------------PDIA-------------ASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSN
        + P   N                       DIA               LK LW+S +P+KCKFFI T  + ++ TM+KIQ+R  ++ LNP+WC+ C+ SN
Subjt:  NAPNARN----------------------PDIA-------------ASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSN

Query:  EITDHL
        E  +HL
Subjt:  EITDHL

KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]7.7e-9440.38Show/hide
Query:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW
        DFRPISLTTS++KI+AK L+NRLK  LP TISG Q+ F+  RQITDAIL+ANEAVDYWK KK +G I KLDIE  F+ +NW+FID++L KK FP  WR W
Subjt:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW

Query:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF
        I  CIS+V YS++ING+P+GRI  NRG+RQGDPLSPF+FV+AMDY SR+L HLE    IKG+++  + N++H+LFA DILLFVED+D ++ NL  A+ LF
Subjt:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF

Query:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE
         KA+GL I L KS + PVN          + WG                                              Q  NK +L     +Y+ E + 
Subjt:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE

Query:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------
          ++L I    +G++     +    S S APWRSI   +DWF S   W +NNG  +SFW++NWS +   S           D    VK            
Subjt:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------

Query:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR
                           +L T  PN  +      PD   S                         L+ +WKS +P K KFF+       I TME IQ+
Subjt:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR

Query:  RLKNLCLNPNWCVLCKKSNEITDHL
        R+ N  L PNWCVLC K NE  +HL
Subjt:  RLKNLCLNPNWCVLCKKSNEITDHL

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]7.7e-9440.38Show/hide
Query:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW
        DFRPISLTTS++KI+AK L+NRLK  LP TISG Q+ F+  RQITDAIL+ANEAVDYWK KK +G I KLDIE  F+ +NW+FID++L KK FP  WR W
Subjt:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW

Query:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF
        I  CIS+V YS++ING+P+GRI  NRG+RQGDPLSPF+FV+AMDY SR+L HLE    IKG+++  + N++H+LFA DILLFVED+D ++ NL  A+ LF
Subjt:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF

Query:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE
         KA+GL I L KS + PVN          + WG                                              Q  NK +L     +Y+ E + 
Subjt:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE

Query:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------
          ++L I    +G++     +    S S APWRSI   +DWF S   W +NNG  +SFW++NWS +   S           D    VK            
Subjt:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------

Query:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR
                           +L T  PN  +      PD   S                         L+ +WKS +P K KFF+       I TME IQ+
Subjt:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR

Query:  RLKNLCLNPNWCVLCKKSNEITDHL
        R+ N  L PNWCVLC K NE  +HL
Subjt:  RLKNLCLNPNWCVLCKKSNEITDHL

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]3.6e-9137.07Show/hide
Query:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ
        KEKC   +D+RPISLTTS++K++AK +A RLK  LP T++  QM FV GRQI DAILVANEA+DYW+ KK +G + KLDIE AF K+NW FIDF+L KK 
Subjt:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ

Query:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN
        +P KWR WI +CISSVQYSI+ING+PRG+I P+RGIRQGDP+SPFIFVLAMDY+SR+L  + +  +IKG+ ++ +INLTHLLFA DILLFVED +  I+N
Subjt:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN

Query:  LHFAIHLFVKATGLNIILNKSTIFPVN-DAS--------WGQSK-------------NKTVLEQYYREDSEKAK--------------------------
        L   I+LF  A+GL+I LNKSTI P+N DAS        WG S               K + + +++   EK                            
Subjt:  LHFAIHLFVKATGLNIILNKSTIFPVN-DAS--------WGQSK-------------NKTVLEQYYREDSEKAK--------------------------

Query:  ----QLEII---------------------------LHLQ----------------GRNMDMFFTI---------------------AKY----------
            QL I                            LHL                  R  D  F +                     AKY          
Subjt:  ----QLEII---------------------------LHLQ----------------GRNMDMFFTI---------------------AKY----------

Query:  ----SRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKS-----------IPSNDDSFL----------------------------------
            S S +PW SICK ++WF   + WK+ NG S SFWH++W   S           + +N +S +                                  
Subjt:  ----SRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKS-----------IPSNDDSFL----------------------------------

Query:  --------------------------VKSVLQTNAPNARNPDIAASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEI
                                  VK  LQ    N  +     + K LWK+ +PKKC FFI T  Y+ + T E++ +RL NLC  P+WCV+CK+++E 
Subjt:  --------------------------VKSVLQTNAPNARNPDIAASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEI

Query:  TDHLDML
          HL +L
Subjt:  TDHLDML

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein6.4e-9437.15Show/hide
Query:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ
        KE C  A+DFRPISLTT+++K++AK LA+RLK  LP TIS  QM FV GRQIT+AIL+ANEA+D+W++KK RG + KLDIE AF K+NW FIDF+L KK 
Subjt:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ

Query:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN
        +  KWR  I SCISSVQYSI+ING+PRGRI P+RGIRQGDPLSPFIFVLAMDYLSR+L +L  +++I G+    ++NLTH+LFA DIL+FVED D+Y+ N
Subjt:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN

Query:  LHFAIHLFVKATGLNIILNKSTIFPVN---------DASWGQSKN---------------------KTVLEQYYREDSE-KAKQLE--------------
        L   +HLF  A+GLNI L+KSTIFP+N           SWG SK                        VL++  ++ S  K  QL               
Subjt:  LHFAIHLFVKATGLNIILNKSTIFPVN---------DASWGQSKN---------------------KTVLEQYYREDSE-KAKQLE--------------

Query:  -------------------------------------------------------------------------------------IILHLQGRNMDMFFT
                                                                                             II       M  F +
Subjt:  -------------------------------------------------------------------------------------IILHLQGRNMDMFFT

Query:  IAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNW----------------------STKSI--PSNDDSFL----------------VKSVL
          K+S +N+PW+++ + + WF   + WKVN+G  +SFW  NW                      S K    PS++D  L                +K+ L
Subjt:  IAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNW----------------------STKSI--PSNDDSFL----------------VKSVL

Query:  QTNAPNARNPDIAASL------------------------------KKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEIT
         T  PN  +P    +L                              K LWK + PKKCKFFI T  +  I T +++Q+RL N  L+PNWC +C KS E  
Subjt:  QTNAPNARNPDIAASL------------------------------KKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEIT

Query:  DHL
        +HL
Subjt:  DHL

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein6.0e-9235.81Show/hide
Query:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ
        KEKC   SD+RPISLTTSL+K++AKALANRLK  LP TI+  QM F+ GRQI DAIL+ANEA+D WK +K +G + KLD+E AF KI+W+FIDF+L KK 
Subjt:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ

Query:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIKD-INLTHLLFAHDILLFVEDSDEYIRN
        FP KWR WI +CIS+VQYSI++NG P+GRI   RGIRQGDPLSPFIFVLAMDYLSR+L HLE +  IKG++  +  N++HLLFA D+L+FVED++ Y+ N
Subjt:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIKD-INLTHLLFAHDILLFVEDSDEYIRN

Query:  LHFAIHLFVKATGLNIILNKSTIFPVN-------------------------------------------------------------------------
        L  A+ LF KA+GL    +KSTI P+N                                                                         
Subjt:  LHFAIHLFVKATGLNIILNKSTIFPVN-------------------------------------------------------------------------

Query:  ------------------------DASWGQSKNK----------------------------------TVLEQYYREDSEKAKQLEIILHLQGRNMDMFF
                                D  WG S++K                                    L +Y+ E +   K+     + +    D+  
Subjt:  ------------------------DASWGQSKNK----------------------------------TVLEQYYREDSEKAKQLEIILHLQGRNMDMFF

Query:  TIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIP-----------SNDDSFLVKSV--------------------------LQT
         + + S +N+PW +I K+ DW+ SK+ W  N+GSSLSFWH+ W   +IP           SN  S  VK +                          ++ 
Subjt:  TIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIP-----------SNDDSFLVKSV--------------------------LQT

Query:  NAPNARN----------------------PDIA-------------ASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSN
        + P   N                       DIA               LK LW+S +P+KCKFFI T  + ++ TM+KIQ+R  ++ LNP+WC+ C+ SN
Subjt:  NAPNARN----------------------PDIA-------------ASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSN

Query:  EITDHL
        E  +HL
Subjt:  EITDHL

A0A5A7UV84 Reverse transcriptase domain-containing protein3.7e-9440.38Show/hide
Query:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW
        DFRPISLTTS++KI+AK L+NRLK  LP TISG Q+ F+  RQITDAIL+ANEAVDYWK KK +G I KLDIE  F+ +NW+FID++L KK FP  WR W
Subjt:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW

Query:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF
        I  CIS+V YS++ING+P+GRI  NRG+RQGDPLSPF+FV+AMDY SR+L HLE    IKG+++  + N++H+LFA DILLFVED+D ++ NL  A+ LF
Subjt:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF

Query:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE
         KA+GL I L KS + PVN          + WG                                              Q  NK +L     +Y+ E + 
Subjt:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE

Query:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------
          ++L I    +G++     +    S S APWRSI   +DWF S   W +NNG  +SFW++NWS +   S           D    VK            
Subjt:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------

Query:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR
                           +L T  PN  +      PD   S                         L+ +WKS +P K KFF+       I TME IQ+
Subjt:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR

Query:  RLKNLCLNPNWCVLCKKSNEITDHL
        R+ N  L PNWCVLC K NE  +HL
Subjt:  RLKNLCLNPNWCVLCKKSNEITDHL

A0A5D3CI86 Reverse transcriptase domain-containing protein3.7e-9440.38Show/hide
Query:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW
        DFRPISLTTS++KI+AK L+NRLK  LP TISG Q+ F+  RQITDAIL+ANEAVDYWK KK +G I KLDIE  F+ +NW+FID++L KK FP  WR W
Subjt:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIW

Query:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF
        I  CIS+V YS++ING+P+GRI  NRG+RQGDPLSPF+FV+AMDY SR+L HLE    IKG+++  + N++H+LFA DILLFVED+D ++ NL  A+ LF
Subjt:  IHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITI-KDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF

Query:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE
         KA+GL I L KS + PVN          + WG                                              Q  NK +L     +Y+ E + 
Subjt:  VKATGLNIILNKSTIFPVN---------DASWG----------------------------------------------QSKNKTVLE----QYYREDSE

Query:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------
          ++L I    +G++     +    S S APWRSI   +DWF S   W +NNG  +SFW++NWS +   S           D    VK            
Subjt:  KAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSN----------DDSFLVK------------

Query:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR
                           +L T  PN  +      PD   S                         L+ +WKS +P K KFF+       I TME IQ+
Subjt:  ------------------SVLQTNAPNARN------PDIAAS-------------------------LKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQR

Query:  RLKNLCLNPNWCVLCKKSNEITDHL
        R+ N  L PNWCVLC K NE  +HL
Subjt:  RLKNLCLNPNWCVLCKKSNEITDHL

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein1.7e-9137.07Show/hide
Query:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ
        KEKC   +D+RPISLTTS++K++AK +A RLK  LP T++  QM FV GRQI DAILVANEA+DYW+ KK +G + KLDIE AF K+NW FIDF+L KK 
Subjt:  KEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQ

Query:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN
        +P KWR WI +CISSVQYSI+ING+PRG+I P+RGIRQGDP+SPFIFVLAMDY+SR+L  + +  +IKG+ ++ +INLTHLLFA DILLFVED +  I+N
Subjt:  FPVKWRIWIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIK-DINLTHLLFAHDILLFVEDSDEYIRN

Query:  LHFAIHLFVKATGLNIILNKSTIFPVN-DAS--------WGQSK-------------NKTVLEQYYREDSEKAK--------------------------
        L   I+LF  A+GL+I LNKSTI P+N DAS        WG S               K + + +++   EK                            
Subjt:  LHFAIHLFVKATGLNIILNKSTIFPVN-DAS--------WGQSK-------------NKTVLEQYYREDSEKAK--------------------------

Query:  ----QLEII---------------------------LHLQ----------------GRNMDMFFTI---------------------AKY----------
            QL I                            LHL                  R  D  F +                     AKY          
Subjt:  ----QLEII---------------------------LHLQ----------------GRNMDMFFTI---------------------AKY----------

Query:  ----SRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKS-----------IPSNDDSFL----------------------------------
            S S +PW SICK ++WF   + WK+ NG S SFWH++W   S           + +N +S +                                  
Subjt:  ----SRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKS-----------IPSNDDSFL----------------------------------

Query:  --------------------------VKSVLQTNAPNARNPDIAASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEI
                                  VK  LQ    N  +     + K LWK+ +PKKC FFI T  Y+ + T E++ +RL NLC  P+WCV+CK+++E 
Subjt:  --------------------------VKSVLQTNAPNARNPDIAASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEI

Query:  TDHLDML
          HL +L
Subjt:  TDHLDML

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein2.9e-1932.87Show/hide
Query:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYW-KTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRI
        +FRPISL     KIL K LANR++  + + I   Q+ F+ G Q    I  +   + Y  K K    +I  LD E AF KI   F+  +L++      +  
Subjt:  DFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYW-KTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRI

Query:  WIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIKDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF
         I +  S    +I +NG+    I    G RQG PLSP++F + ++ L+R ++   Q+K+IKGI I    +   L A D+++++ D     R L   I+ F
Subjt:  WIHSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIKDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLF

Query:  VKATGLNIILNKSTIF
         +  G  I  NKS  F
Subjt:  VKATGLNIILNKSTIF

Q339N5 Cellulose synthase-like protein H11.9e-1852.33Show/hide
Query:  QSADCHHLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVV
        +S + HH          TRVS ++TNAP++LN+DCDMF N+P+VVLHAMC+ L    ++    +VQTPQ FY  L+DDPFGNQL V
Subjt:  QSADCHHLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVV

Q7PC71 Cellulose synthase-like protein H28.5e-1960.27Show/hide
Query:  QTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFE
        +TRVS V+TNAP +LN+DCDMF N+PQ VLHAMC+ L    D    G+VQ PQ FYD L+DDPFGNQ+   F+
Subjt:  QTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFE

Q7XUT9 Cellulose synthase-like protein H28.5e-1960.27Show/hide
Query:  QTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFE
        +TRVS V+TNAP +LN+DCDMF N+PQ VLHAMC+ L    D    G+VQ PQ FYD L+DDPFGNQ+   F+
Subjt:  QTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFE

Q7XUU0 Putative cellulose synthase-like protein H31.2e-2058.14Show/hide
Query:  HHLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVL
        HH          TRVS V+TNAP +LNVDCDMFANDPQVVLHAMC+ L    ++   G+VQ PQ FY  L+DDPFGN+L VI++ L
Subjt:  HHLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVL

Arabidopsis top hitse value%identityAlignment
AT2G32530.1 cellulose synthase-like B32.2e-1430.91Show/hide
Query:  EKIQRRLKNLCLNPNWCVLCKKSNEIT-----DHLDMLKQHKDIKRGGYLQSADCHHLVHLVGKKHQ--------------TRVSGVLTNAPYILNVDCD
        EK+ RR+++   + +W        + +     DH  ++K   +  +GG     +  H V++  +K                 RVSG++TNAPY+LNVDCD
Subjt:  EKIQRRLKNLCLNPNWCVLCKKSNEIT-----DHLDMLKQHKDIKRGGYLQSADCHHLVHLVGKKHQ--------------TRVSGVLTNAPYILNVDCD

Query:  MFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVLKGYPPSGMLG
        M+AN+  VV  AMC+FL    +     +VQ PQ FYD   D+           VL+ Y   G+ G
Subjt:  MFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVLKGYPPSGMLG

AT2G32610.1 cellulose synthase-like B17.6e-1539.45Show/hide
Query:  KRGGYLQSADCHHLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVLKGY
        KR  Y+ +  C  +  L       RVSG++TNAPYILNVDCDM+AND  VV  AMC+ L    +++   +VQ  Q FYD         +L+V+ +   G 
Subjt:  KRGGYLQSADCHHLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVLKGY

Query:  PPSGMLGRI
          +G+ G I
Subjt:  PPSGMLGRI

AT2G32620.1 cellulose synthase-like B4.9e-1433.83Show/hide
Query:  DHLDMLKQHKDIKRGGYLQSADCHHLVHLVGKKHQ--------------TRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTP
        DH  ++K   +  +GG     +  H+V++  +K                 RVSG++TNAPY+LNVDCDM+AN+  VV  AMC+FL    +     +VQ P
Subjt:  DHLDMLKQHKDIKRGGYLQSADCHHLVHLVGKKHQ--------------TRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTP

Query:  QCFYDGLEDDPFGNQLVVIFEVLKGYPPSGMLG
        Q FYD            +   V+K Y   G+ G
Subjt:  QCFYDGLEDDPFGNQLVVIFEVLKGYPPSGMLG

AT4G15290.1 Cellulose synthase family protein1.2e-1540Show/hide
Query:  RGGYLQSADCHHLVHLVGKKHQT--------------RVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFG
        +GG     +  HLV++  +K                 RVSG++TNAPY LNVDCDM+AN+P VV  AMCVFL +  +     +VQ PQ FYD      + 
Subjt:  RGGYLQSADCHHLVHLVGKKHQT--------------RVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFG

Query:  NQLVVIFEVL
        N+L V+  +L
Subjt:  NQLVVIFEVL

AT4G15320.1 cellulose synthase-like B61.4e-1647.67Show/hide
Query:  HLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVLK
        +++ L+    Q RVSG++TNAPY+LNVDCDM+AN+P VV  AMCVFL +  +     +VQ PQ FYD      + N+LVV+   +K
Subjt:  HLVHLVGKKHQTRVSGVLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAAAATGTAGATTGGCATCAGATTTTCGACCAATCAGTCTCACTACATCTTTATTTAAGATCCTTGCAAAGGCACTAGCAAATAGACTGAAACCCCTTCTTCC
AAGCACAATATCAGGTCAACAAATGACGTTTGTTAATGGAAGACAAATCACTGATGCAATTTTAGTTGCAAATGAAGCGGTAGACTACTGGAAGACAAAGAAGACAAGAG
GCTTAATTTTCAAGCTGGATATAGAAAATGCTTTTCACAAGATTAATTGGAACTTCATTGATTTCATCCTGAAAAAGAAGCAGTTCCCTGTCAAATGGAGGATATGGATA
CATTCTTGTATATCATCTGTTCAGTATTCCATCATGATCAATGGCAAACCTAGAGGTAGAATATTTCCAAATAGAGGAATCAGACAAGGAGATCCTTTATCCCCTTTCAT
CTTTGTGCTAGCCATGGATTATCTCAGCAGGATTCTACAACATCTTGAACAAGAGAAGCAAATCAAAGGTATCACAATAAAAGACATAAACCTAACTCATCTTCTCTTTG
CACATGACATTTTGCTCTTTGTTGAAGATAGCGATGAGTACATTAGAAATCTTCATTTCGCTATTCACCTCTTTGTAAAGGCCACTGGTTTAAACATTATCCTCAATAAG
TCCACTATTTTCCCTGTCAATGATGCCTCTTGGGGGCAATCCAAAAACAAAACCGTTTTGGAGCAATATTACAGAGAAGATTCAGAAAAAGCTAAACAATTGGAAATAAT
CCTTCATCTCCAAGGGAGGAATATGGATATGTTTTTTACAATTGCTAAATACAGTAGATCCAATGCTCCATGGAGATCCATCTGCAAATATGTTGATTGGTTCAACTCAA
AGATGAAATGGAAAGTGAATAATGGCAGCTCTTTATCCTTCTGGCATACCAATTGGAGCACGAAGAGCATCCCAAGCAACGATGACTCTTTCTTGGTTAAATCTGTGTTA
CAAACAAATGCTCCAAATGCTAGAAATCCAGATATTGCAGCTAGTCTCAAGAAATTATGGAAATCCCAAGTTCCAAAGAAATGCAAATTCTTCATCTTGACAGCAGCATA
CAATGAAATTTTCACGATGGAAAAGATTCAAAGGAGGTTGAAAAATCTTTGTCTCAACCCAAACTGGTGCGTTCTTTGTAAGAAAAGCAATGAAATAACCGATCACTTGG
ATATGCTCAAACAACATAAAGACATAAAGAGGGGTGGTTATCTTCAATCTGCTGATTGCCATCATTTGGTCCATTTGGTTGGAAAGAAACACCAGACAAGAGTGTCTGGT
GTCTTGACAAATGCTCCATACATATTAAATGTGGATTGTGACATGTTTGCCAATGATCCCCAAGTTGTGTTACATGCAATGTGTGTATTTCTCAACTCCAAATATGATTT
GGAAGATATTGGATATGTTCAAACTCCCCAATGCTTTTATGATGGCCTTGAGGACGACCCCTTTGGAAATCAACTAGTGGTTATATTTGAGGTGCTAAAAGGATACCCAC
CTAGTGGGATGCTTGGACGCATCACTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGTAGATATTCCTTATTGCCATGAAAGAAAAATGTAGATTGGCATCAGATTTTCGACCAATCAGTCTCACTACATCTTTATTTAAGATCCTTGCAAAGGCACTA
GCAAATAGACTGAAACCCCTTCTTCCAAGCACAATATCAGGTCAACAAATGACGTTTGTTAATGGAAGACAAATCACTGATGCAATTTTAGTTGCAAATGAAGCGGTAGA
CTACTGGAAGACAAAGAAGACAAGAGGCTTAATTTTCAAGCTGGATATAGAAAATGCTTTTCACAAGATTAATTGGAACTTCATTGATTTCATCCTGAAAAAGAAGCAGT
TCCCTGTCAAATGGAGGATATGGATACATTCTTGTATATCATCTGTTCAGTATTCCATCATGATCAATGGCAAACCTAGAGGTAGAATATTTCCAAATAGAGGAATCAGA
CAAGGAGATCCTTTATCCCCTTTCATCTTTGTGCTAGCCATGGATTATCTCAGCAGGATTCTACAACATCTTGAACAAGAGAAGCAAATCAAAGGTATCACAATAAAAGA
CATAAACCTAACTCATCTTCTCTTTGCACATGACATTTTGCTCTTTGTTGAAGATAGCGATGAGTACATTAGAAATCTTCATTTCGCTATTCACCTCTTTGTAAAGGCCA
CTGGTTTAAACATTATCCTCAATAAGTCCACTATTTTCCCTGTCAATGATGCCTCTTGGGGGCAATCCAAAAACAAAACCGTTTTGGAGCAATATTACAGAGAAGATTCA
GAAAAAGCTAAACAATTGGAAATAATCCTTCATCTCCAAGGGAGGAATATGGATATGTTTTTTACAATTGCTAAATACAGTAGATCCAATGCTCCATGGAGATCCATCTG
CAAATATGTTGATTGGTTCAACTCAAAGATGAAATGGAAAGTGAATAATGGCAGCTCTTTATCCTTCTGGCATACCAATTGGAGCACGAAGAGCATCCCAAGCAACGATG
ACTCTTTCTTGGTTAAATCTGTGTTACAAACAAATGCTCCAAATGCTAGAAATCCAGATATTGCAGCTAGTCTCAAGAAATTATGGAAATCCCAAGTTCCAAAGAAATGC
AAATTCTTCATCTTGACAGCAGCATACAATGAAATTTTCACGATGGAAAAGATTCAAAGGAGGTTGAAAAATCTTTGTCTCAACCCAAACTGGTGCGTTCTTTGTAAGAA
AAGCAATGAAATAACCGATCACTTGGATATGCTCAAACAACATAAAGACATAAAGAGGGGTGGTTATCTTCAATCTGCTGATTGCCATCATTTGGTCCATTTGGTTGGAA
AGAAACACCAGACAAGAGTGTCTGGTGTCTTGACAAATGCTCCATACATATTAAATGTGGATTGTGACATGTTTGCCAATGATCCCCAAGTTGTGTTACATGCAATGTGT
GTATTTCTCAACTCCAAATATGATTTGGAAGATATTGGATATGTTCAAACTCCCCAATGCTTTTATGATGGCCTTGAGGACGACCCCTTTGGAAATCAACTAGTGGTTAT
ATTTGAGGTGCTAAAAGGATACCCACCTAGTGGGATGCTTGGACGCATCACTGACTAG
Protein sequenceShow/hide protein sequence
MKEKCRLASDFRPISLTTSLFKILAKALANRLKPLLPSTISGQQMTFVNGRQITDAILVANEAVDYWKTKKTRGLIFKLDIENAFHKINWNFIDFILKKKQFPVKWRIWI
HSCISSVQYSIMINGKPRGRIFPNRGIRQGDPLSPFIFVLAMDYLSRILQHLEQEKQIKGITIKDINLTHLLFAHDILLFVEDSDEYIRNLHFAIHLFVKATGLNIILNK
STIFPVNDASWGQSKNKTVLEQYYREDSEKAKQLEIILHLQGRNMDMFFTIAKYSRSNAPWRSICKYVDWFNSKMKWKVNNGSSLSFWHTNWSTKSIPSNDDSFLVKSVL
QTNAPNARNPDIAASLKKLWKSQVPKKCKFFILTAAYNEIFTMEKIQRRLKNLCLNPNWCVLCKKSNEITDHLDMLKQHKDIKRGGYLQSADCHHLVHLVGKKHQTRVSG
VLTNAPYILNVDCDMFANDPQVVLHAMCVFLNSKYDLEDIGYVQTPQCFYDGLEDDPFGNQLVVIFEVLKGYPPSGMLGRITD