CuGenDBv2

MS005392 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS005392
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: ARM repeat superfamily protein
Genome location: scaffold451:72469..74118
RNA-Seq Expression: MS005392
Synteny: MS005392
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR000225 - Armadillo
  IPR011989 - Armadillo-like helical
  IPR016024 - Armadillo-type fold


Homology
GenBank top hits | e value | % identity (alignment below each hit)
KAA0040032.1 vacuolar protein 8 [Cucumis melo var. makuwa] | 2.0e-270 | 90.91
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++SNIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

TYK24469.1 vacuolar protein 8 [Cucumis melo var. makuwa] | 1.2e-270 | 90.91
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++SNIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G+KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

XP_008460169.1 PREDICTED: uncharacterized protein LOC103499059 isoform X1 [Cucumis melo] | 3.5e-270 | 90.73
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++ NIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G+KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

XP_008460170.1 PREDICTED: uncharacterized protein LOC103499059 isoform X2 [Cucumis melo] | 3.5e-270 | 90.73
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++ NIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G+KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

XP_022156111.1 protein CELLULOSE SYNTHASE INTERACTIVE 1-like [Momordica charantia] | 1.6e-296 | 99.64
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEATPAL VSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACL L
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
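
The % identity column can be related to the alignment blocks by counting the middle (match) line: a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The following is only a minimal sketch of that arithmetic in Python, not the database's own code. The function name alignment_stats is hypothetical, and the 17-residue fragment is taken from the final block of the KAA0040032.1 alignment above.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment segment.

    `query` is the aligned query segment; `midline` is the match line with
    its leading label padding already removed. Trailing spaces that were
    stripped from the match line are restored so both have equal width.
    """
    midline = midline.ljust(len(query))
    # Letters on the match line mark identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    # "+" marks a conservative (similar) substitution.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Fragment of the final alignment block of the first GenBank hit.
query = "KKLSDIDVSGAKKLLEK"
midline = "KKLS++D  G KKLLEK"

identities, positives, columns = alignment_stats(query, midline)
pct = 100.0 * identities / columns
print(f"{identities}/{columns} identical ({pct:.2f}%), {positives} positives")
```

Summing identities and columns over all blocks of one hit, then dividing, gives the per-hit figure under this assumption about how the column is computed.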

TrEMBL top hits | e value | % identity (alignment below each hit)
A0A1S3CBX6 uncharacterized protein LOC103499059 isoform X1 | 1.7e-270 | 90.73
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++ NIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G+KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

A0A1S3CCD8 uncharacterized protein LOC103499059 isoform X2 | 1.7e-270 | 90.73
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++ NIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G+KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

A0A5A7T9D4 Vacuolar protein 8 | 9.8e-271 | 90.91
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++SNIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

A0A5D3DME2 Vacuolar protein 8 | 5.7e-271 | 90.91
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        E  S EEWLLQAQKLVPVAL KA+EVKVFPGRWK IVSKLE++PSRLSDLSSHPCFSKN LC+EQLQAVL SLKET+ELA+LCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEAT  L +SG SSQ E  ++SNIRELLARLQIGHMEAKHRALDSLVEI+KEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GS+GAQQAAASAICRVC+TPEMKKL+GE  ECIPLLIKLLE+KSNSVREVAAQAISSL+TLSQNCREVKRDEKSVP+LVQLLDP PQNTAKKYAVACL  
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLS++D  G+KKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

A0A6J1DTV2 protein CELLULOSE SYNTHASE INTERACTIVE 1-like | 8.0e-297 | 99.64
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
        LSGKLDLNLRDCGLLIKTGVLGEATPAL VSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPC

Query:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
        IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV
Subjt:  IREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEV

Query:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
        RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS
Subjt:  RQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKS

Query:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL
        GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACL L
Subjt:  GSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLAL

Query:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
Subjt:  LSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

SwissProt top hits | e value | % identity (alignment below each hit)
O22161 Protein ARABIDILLO 1 | 1.5e-10 | 23.66
Query:  LLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSA
        LL  +Q    + + R+   L   +  DD+N     GR         +  L++L  +    ++ +    I  L+ + +    +  EG +  L  L +S + 
Subjt:  LLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSA

Query:  VAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNL
        +  E+A   L  LS+  +   AI   GGV+ L++L        D V + AA   L N++A  +    +A+ G +  ++ L       G +E AA  L NL
Subjt:  VAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNL

Query:  TASNDALRRSVI---SEGGLRCILAYLDGPLP--QESAVGALRNL----VSLVSMELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCTTPEMKKLVG
         A  D+   +       G L  ++     P    ++ A GAL NL     +  S+ +   +  L  L     + S G Q+ AA A+  +  +      +G
Subjt:  TASNDALRRSVI---SEGGLRCILAYLDGPLP--QESAVGALRNL----VSLVSMELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCTTPEMKKLVG

Query:  ETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSSSRKCKKLMISYGAIGYLKKLSDID
             +P LI L  +++  V E AA A+ +L     N   +  +E  VP+LV L   S    A+  A   LA +   R   +  +  G         +I 
Subjt:  ETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSSSRKCKKLMISYGAIGYLKKLSDID

Query:  VSGAKKLLEK
        + GA+ +  K
Subjt:  VSGAKKLLEK

O22193 U-box domain-containing protein 4 | 2.1e-12 | 25.91
Query:  VAALVQLLTATSPCIREKTIAIICLLAESGSCENWLV--SEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQ
        V  LV+ L ++S   + +  A + LLA+  + +N +V  + G +  L+ L+ S  +  +E AV +L  LS++ +  +AI   G + PLI + + G S ++
Subjt:  VAALVQLLTATSPCIREKTIAIICLLAESGSCENWLV--SEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQ-ESAVGALRNLVSL-VSM
          +A TL ++S + E +  + + G I  +++L+  G   G K+ AA  L NL+   +  +  ++  G +R ++  +D      + AV  L NL ++    
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQ-ESAVGALRNLVSL-VSM

Query:  ELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAIS
          +   G +P LV V++ GS   ++ AA+A+ ++ T       +      +P L+ L ++ +   RE A   +S
Subjt:  ELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAIS

Q9FL17 U-box domain-containing protein 40 | 2.1e-12 | 25.09
Query:  SGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPP
        SG     EP        LL +L+   +     AL S+  I + D+ + +S+     ++AL  L+ +    ++    A++  L+   S +  +V  G++PP
Subjt:  SGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPP

Query:  LIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAA
        LI +++ GS  A+E +   +  L++  +   AI   GG+ PL+ L + G  +++  +A  L ++S V   R  L + G +++++ +V  G ++G      
Subjt:  LIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAA

Query:  ECLQNLTASNDALRRSVISEGGLRCILAYLD-----GPLPQESAVGALRNLV---SLVSMELLLSLGFLPRLVHVLKSGSLGAQQAA
          L N+ AS    R +++  GG+ C++  L          +ES V  L  L     L    L ++   +  LV V +SG   A+Q A
Subjt:  ECLQNLTASNDALRRSVISEGGLRCILAYLD-----GPLPQESAVGALRNLV---SLVSMELLLSLGFLPRLVHVLKSGSLGAQQAA

Q9M224 Protein ARABIDILLO 2 | 8.1e-12 | 25.45
Query:  LLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSA
        LL+ +Q    + + RA   L   +  DD+N     GR         +  L++L  +    ++ +    I  L+ +      +  EG +  L  L +S + 
Subjt:  LLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSA

Query:  VAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNL
        +  E+A   L  LS+  +   AI   GGV  L++L        D V + AA   L N++A  +    +A  G +  ++ L       G++E AA  L NL
Subjt:  VAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNL

Query:  TASNDALRRSVI---SEGGLRCILAYLDGPLP--QESAVGALRNLV-SLVSMELLLSLGFLPRLVHVLKS---GSLGAQQAAASAICRVCTTPEMKKLVG
         A  D+   +       G L  ++     P    ++ A GAL NL     + E + + G +  LV + KS    S G Q+  A A+  +  +      +G
Subjt:  TASNDALRRSVI---SEGGLRCILAYLDGPLP--QESAVGALRNLV-SLVSMELLLSLGFLPRLVHVLKS---GSLGAQQAAASAICRVCTTPEMKKLVG

Query:  ETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSSSRKCKKLMI
             IP LI L+ +++  V E AA A+ +L     N   +  +E  V +LVQL   S    A+  A   LA +   R  +  MI
Subjt:  ETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSSSRKCKKLMI

Q9SNC6 U-box domain-containing protein 13 | 7.5e-10 | 25
Query:  SSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPPLIR
        SS S PAE + I +L+ RL  G+ E +  A   +  + K + DN +++     +  LV LL+     I+E ++  +  L+   + +  +VS G +P +++
Subjt:  SSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTIAIICLLAESGSCENWLVSEGVLPPLIR

Query:  LVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNI-------------SAVPEVRQALAEDGIIRVMINLVDCG
        +++ GS  A+E A  +L  LS+  +    I   G + PL+ L   G    +  AA  L N+               +P + + L E G   V   L    
Subjt:  LVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNI-------------SAVPEVRQALAEDGIIRVMINLVDCG

Query:  ILLGSKEYAAECLQNLTASNDALRRSV--ISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELL--LSLGFLPRLVHVLKSGSLGAQQAAASAICRV
        IL    E  A     +  S+DA+   V  I  G           P  +E+A   L +L S     L+    LG +  L+ +  +G+   ++ AA  + R+
Subjt:  ILLGSKEYAAECLQNLTASNDALRRSV--ISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELL--LSLGFLPRLVHVLKSGSLGAQQAAASAICRV

Query:  CTTPEMKK
            E +K
Subjt:  CTTPEMKK

Arabidopsis top hits | e value | % identity (alignment below each hit)
AT1G01830.1 ARM repeat superfamily protein | 6.9e-192 | 65.7
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        ++ S EEWL +   L+P  L KA  VK F GRWK I+SK+E+IP+ LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +K+EGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVL-SVFGRNNVAALVQLLTATSP
        LSGKLDLNLRDCG+LIKTGVLGEAT  L +   SS SE  + S+++ELLARLQIGH+E+KH AL+SL+  ++ED+  VL  + GR NVAALVQLLTATS 
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVL-SVFGRNNVAALVQLLTATSP

Query:  CIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPE
         IREK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQAA+A  LKN+SAV E
Subjt:  CIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPE

Query:  VRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLK
        +RQ LAE+GIIRV I+L++ GILLGS+E+ AECLQNLTA++DALR +++SEGG+  +LAYLDGPLPQ+ AV ALRNL+  V+ E+ ++L  LPRL HVLK
Subjt:  VRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLK

Query:  SGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PSLVQLLDPSPQNTAKKYAVACL
        SGSLGAQQAAASAICR   +PE K+LVGE+  CIP ++KLLE+KSN  RE AAQAI+ L+   +  RE+K+D KSV  +LV LLD +P NTAKKYAVA L
Subjt:  SGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PSLVQLLDPSPQNTAKKYAVACL

Query:  ALLSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSR
          +S S K KK+M+SYGAIGYLKKLS+++V GA KLLEKLERGKLRS F R
Subjt:  ALLSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSR

AT1G01830.2 ARM repeat superfamily protein | 6.9e-192 | 65.7
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        ++ S EEWL +   L+P  L KA  VK F GRWK I+SK+E+IP+ LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +K+EGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVL-SVFGRNNVAALVQLLTATSP
        LSGKLDLNLRDCG+LIKTGVLGEAT  L +   SS SE  + S+++ELLARLQIGH+E+KH AL+SL+  ++ED+  VL  + GR NVAALVQLLTATS 
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVL-SVFGRNNVAALVQLLTATSP

Query:  CIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPE
         IREK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQAA+A  LKN+SAV E
Subjt:  CIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPE

Query:  VRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLK
        +RQ LAE+GIIRV I+L++ GILLGS+E+ AECLQNLTA++DALR +++SEGG+  +LAYLDGPLPQ+ AV ALRNL+  V+ E+ ++L  LPRL HVLK
Subjt:  VRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLK

Query:  SGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PSLVQLLDPSPQNTAKKYAVACL
        SGSLGAQQAAASAICR   +PE K+LVGE+  CIP ++KLLE+KSN  RE AAQAI+ L+   +  RE+K+D KSV  +LV LLD +P NTAKKYAVA L
Subjt:  SGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PSLVQLLDPSPQNTAKKYAVACL

Query:  ALLSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSR
          +S S K KK+M+SYGAIGYLKKLS+++V GA KLLEKLERGKLRS F R
Subjt:  ALLSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSR

AT1G01830.3 ARM repeat superfamily protein | 6.9e-192 | 65.7
Query:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS
        ++ S EEWL +   L+P  L KA  VK F GRWK I+SK+E+IP+ LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +K+EGKLRMQSDLDS
Subjt:  EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDS

Query:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVL-SVFGRNNVAALVQLLTATSP
        LSGKLDLNLRDCG+LIKTGVLGEAT  L +   SS SE  + S+++ELLARLQIGH+E+KH AL+SL+  ++ED+  VL  + GR NVAALVQLLTATS 
Subjt:  LSGKLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVL-SVFGRNNVAALVQLLTATSP

Query:  CIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPE
         IREK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQAA+A  LKN+SAV E
Subjt:  CIREKTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPE

Query:  VRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLK
        +RQ LAE+GIIRV I+L++ GILLGS+E+ AECLQNLTA++DALR +++SEGG+  +LAYLDGPLPQ+ AV ALRNL+  V+ E+ ++L  LPRL HVLK
Subjt:  VRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLK

Query:  SGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PSLVQLLDPSPQNTAKKYAVACL
        SGSLGAQQAAASAICR   +PE K+LVGE+  CIP ++KLLE+KSN  RE AAQAI+ L+   +  RE+K+D KSV  +LV LLD +P NTAKKYAVA L
Subjt:  SGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PSLVQLLDPSPQNTAKKYAVACL

Query:  ALLSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSR
          +S S K KK+M+SYGAIGYLKKLS+++V GA KLLEKLERGKLRS F R
Subjt:  ALLSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSR

AT2G45720.1 ARM repeat superfamily protein | 6.6e-211 | 69.1
Query:  SAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDSLSG
        + E+ LLQAQ+LVP+AL KA  VK F  RW+ I+S+LE+IP+ LSDLSSHPCFSK+ LC+EQLQAVL++LKETIELA +CV EK EGKL+MQSDLDSLS 
Subjt:  SAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDSLSG

Query:  KLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIRE
        K+DL+L+DCGLL+KTGVLGE T  L     SS ++  E  ++RELLARLQIGH+E+K +AL+ LVE++KED+  V++  GR NVA+LVQLLTATSP +RE
Subjt:  KLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIRE

Query:  KTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQA
          + +IC LAESG CENWL+SE  LP LIRL+ESGS VAKEKAVISLQR+S+SS+T+R+IVGHGGV PLIE+CKTGDSVSQ+A+ACTLKNISAVPEVRQ 
Subjt:  KTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQA

Query:  LAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGSL
        LAE+GI++VMIN+++CGILLGSKEYAAECLQNLT+SN+ LRRSVISE G++ +LAYLDGPLPQES V A+RNLV  VS+E    +  +P LVHVLKSGS+
Subjt:  LAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGSL

Query:  GAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSS
        GAQQAAAS ICR+ T+ E K+++GE+  CIPLLI++LE K++  REVAAQAI+SL+T+ +NCREVKRDEKSV SLV LL+PSP N+AKKYAV+ LA L S
Subjt:  GAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSS

Query:  SRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        SRKCKKLM+S+GA+GYLKKLS+++V G+KKLLE++E+GKL+S FSRK
Subjt:  SRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK

AT2G45720.2 ARM repeat superfamily protein    e value: 6.6e-211    % identity: 69.1
Query:  SAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDSLSG
        + E+ LLQAQ+LVP+AL KA  VK F  RW+ I+S+LE+IP+ LSDLSSHPCFSK+ LC+EQLQAVL++LKETIELA +CV EK EGKL+MQSDLDSLS 
Subjt:  SAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDSLSG

Query:  KLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIRE
        K+DL+L+DCGLL+KTGVLGE T  L     SS ++  E  ++RELLARLQIGH+E+K +AL+ LVE++KED+  V++  GR NVA+LVQLLTATSP +RE
Subjt:  KLDLNLRDCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIRE

Query:  KTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQA
          + +IC LAESG CENWL+SE  LP LIRL+ESGS VAKEKAVISLQR+S+SS+T+R+IVGHGGV PLIE+CKTGDSVSQ+A+ACTLKNISAVPEVRQ 
Subjt:  KTIAIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQA

Query:  LAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGSL
        LAE+GI++VMIN+++CGILLGSKEYAAECLQNLT+SN+ LRRSVISE G++ +LAYLDGPLPQES V A+RNLV  VS+E    +  +P LVHVLKSGS+
Subjt:  LAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGSL

Query:  GAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSS
        GAQQAAAS ICR+ T+ E K+++GE+  CIPLLI++LE K++  REVAAQAI+SL+T+ +NCREVKRDEKSV SLV LL+PSP N+AKKYAV+ LA L S
Subjt:  GAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLLETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSS

Query:  SRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
        SRKCKKLM+S+GA+GYLKKLS+++V G+KKLLE++E+GKL+S FSRK
Subjt:  SRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
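The middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how identity and positive counts could be tallied from one block is given below. The function name `midline_stats` is hypothetical and is not part of any database tooling; the example segment is a fragment of the last AT2G45720 block, so its percentage will not match the whole-hit value reported in the table.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the aligned query segment (gaps included) and `midline` is the
    match line printed beneath it. Trailing spaces are often stripped from the
    midline, so it is padded back to the query length before counting.
    """
    midline = midline.ljust(len(query))
    length = len(query)
    # A letter on the midline means the subject residue equals the query residue.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length if length else 0.0,
    }


if __name__ == "__main__":
    # Fragment of the final alignment block (illustrative only).
    stats = midline_stats("SRKCKKLMISYGAIGYLKKLS", "SRKCKKLM+S+GA+GYLKKLS")
    print(stats)
```

Summing `identities` and `length` over every block of a hit before dividing gives the per-hit percentage; a single block only describes its own columns.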


Sequences
CDS sequence
GAGGAAGGATCGGCGGAAGAATGGCTGCTGCAAGCTCAGAAGCTTGTTCCTGTGGCTCTGGGGAAGGCTATGGAGGTGAAGGTGTTCCCGGGGCGATGGAAGGCGATTGT
TTCGAAGCTCGAGCGGATTCCTTCGCGGTTATCGGATTTGTCGAGCCATCCTTGCTTTTCGAAGAATGCTCTCTGCAGGGAGCAACTGCAGGCTGTGTTGAAGAGCTTGA
AGGAGACCATTGAATTGGCGGAGCTTTGTGTTAGGGAGAAATTTGAAGGCAAGCTGCGAATGCAGAGCGATCTCGACTCGCTATCGGGGAAGTTGGATTTGAATTTGCGA
GATTGTGGCCTGCTGATCAAGACGGGGGTGCTGGGTGAGGCTACTCCGGCTCTATCCGTGTCGGGTTCTTCGTCGCAATCGGAGCCTGCTGAGTATAGCAACATAAGGGA
ATTGCTTGCAAGGCTGCAGATTGGCCATATGGAAGCGAAACACCGAGCTCTCGACAGCCTTGTAGAGATCCTGAAAGAGGATGATGATAATGTATTGTCTGTCTTTGGTC
GTAATAATGTTGCCGCGTTAGTCCAACTCCTCACTGCAACCTCCCCGTGCATCCGGGAAAAGACGATCGCCATAATTTGCTTGCTAGCCGAATCGGGAAGTTGCGAGAAT
TGGCTTGTTTCAGAGGGCGTTTTGCCACCTCTAATAAGGCTTGTTGAGTCTGGCAGTGCTGTGGCTAAAGAGAAGGCTGTGATTTCGCTGCAGAGGTTGTCAATGTCAAG
CGATACCGCCCGTGCAATAGTTGGGCATGGCGGGGTTCGACCCCTTATCGAACTGTGTAAGACTGGTGATTCTGTATCACAGGCTGCTGCTGCTTGCACATTGAAGAACA
TCTCGGCGGTGCCCGAGGTTCGACAAGCTTTGGCCGAAGATGGGATCATAAGGGTGATGATCAATCTCGTGGACTGTGGAATTCTGTTGGGATCAAAAGAGTATGCAGCT
GAATGCCTGCAAAATCTCACCGCTAGCAACGACGCGCTAAGGAGATCGGTCATCTCCGAGGGCGGTCTGCGCTGCATATTGGCATATCTCGACGGCCCCCTCCCTCAAGA
ATCTGCAGTTGGGGCATTGAGGAATTTAGTCAGCTTGGTTTCAATGGAACTTCTGTTGTCTCTTGGCTTCCTCCCGCGTCTAGTGCACGTGCTCAAATCGGGATCGCTCG
GGGCACAGCAGGCTGCAGCATCAGCAATCTGCAGGGTATGCACCACACCAGAGATGAAGAAGCTAGTAGGTGAAACCCCCGAGTGCATCCCACTCCTCATCAAACTCCTC
GAGACGAAATCGAACAGCGTTCGGGAAGTTGCAGCACAGGCAATCTCAAGTCTAATGACCCTTTCCCAGAACTGCAGAGAAGTCAAAAGGGATGAGAAGAGTGTCCCAAG
TCTCGTCCAGCTGCTCGATCCCAGCCCCCAAAACACAGCGAAGAAGTACGCCGTGGCGTGCCTCGCCTTGCTCTCGTCAAGCAGGAAATGCAAGAAGCTGATGATTTCAT
ACGGGGCGATCGGGTATCTGAAGAAGCTATCGGACATCGACGTATCGGGTGCTAAGAAACTGCTGGAGAAGCTGGAAAGAGGGAAGTTGAGAAGTCTGTTCAGTAGGAAA
mRNA sequence
GAGGAAGGATCGGCGGAAGAATGGCTGCTGCAAGCTCAGAAGCTTGTTCCTGTGGCTCTGGGGAAGGCTATGGAGGTGAAGGTGTTCCCGGGGCGATGGAAGGCGATTGT
TTCGAAGCTCGAGCGGATTCCTTCGCGGTTATCGGATTTGTCGAGCCATCCTTGCTTTTCGAAGAATGCTCTCTGCAGGGAGCAACTGCAGGCTGTGTTGAAGAGCTTGA
AGGAGACCATTGAATTGGCGGAGCTTTGTGTTAGGGAGAAATTTGAAGGCAAGCTGCGAATGCAGAGCGATCTCGACTCGCTATCGGGGAAGTTGGATTTGAATTTGCGA
GATTGTGGCCTGCTGATCAAGACGGGGGTGCTGGGTGAGGCTACTCCGGCTCTATCCGTGTCGGGTTCTTCGTCGCAATCGGAGCCTGCTGAGTATAGCAACATAAGGGA
ATTGCTTGCAAGGCTGCAGATTGGCCATATGGAAGCGAAACACCGAGCTCTCGACAGCCTTGTAGAGATCCTGAAAGAGGATGATGATAATGTATTGTCTGTCTTTGGTC
GTAATAATGTTGCCGCGTTAGTCCAACTCCTCACTGCAACCTCCCCGTGCATCCGGGAAAAGACGATCGCCATAATTTGCTTGCTAGCCGAATCGGGAAGTTGCGAGAAT
TGGCTTGTTTCAGAGGGCGTTTTGCCACCTCTAATAAGGCTTGTTGAGTCTGGCAGTGCTGTGGCTAAAGAGAAGGCTGTGATTTCGCTGCAGAGGTTGTCAATGTCAAG
CGATACCGCCCGTGCAATAGTTGGGCATGGCGGGGTTCGACCCCTTATCGAACTGTGTAAGACTGGTGATTCTGTATCACAGGCTGCTGCTGCTTGCACATTGAAGAACA
TCTCGGCGGTGCCCGAGGTTCGACAAGCTTTGGCCGAAGATGGGATCATAAGGGTGATGATCAATCTCGTGGACTGTGGAATTCTGTTGGGATCAAAAGAGTATGCAGCT
GAATGCCTGCAAAATCTCACCGCTAGCAACGACGCGCTAAGGAGATCGGTCATCTCCGAGGGCGGTCTGCGCTGCATATTGGCATATCTCGACGGCCCCCTCCCTCAAGA
ATCTGCAGTTGGGGCATTGAGGAATTTAGTCAGCTTGGTTTCAATGGAACTTCTGTTGTCTCTTGGCTTCCTCCCGCGTCTAGTGCACGTGCTCAAATCGGGATCGCTCG
GGGCACAGCAGGCTGCAGCATCAGCAATCTGCAGGGTATGCACCACACCAGAGATGAAGAAGCTAGTAGGTGAAACCCCCGAGTGCATCCCACTCCTCATCAAACTCCTC
GAGACGAAATCGAACAGCGTTCGGGAAGTTGCAGCACAGGCAATCTCAAGTCTAATGACCCTTTCCCAGAACTGCAGAGAAGTCAAAAGGGATGAGAAGAGTGTCCCAAG
TCTCGTCCAGCTGCTCGATCCCAGCCCCCAAAACACAGCGAAGAAGTACGCCGTGGCGTGCCTCGCCTTGCTCTCGTCAAGCAGGAAATGCAAGAAGCTGATGATTTCAT
ACGGGGCGATCGGGTATCTGAAGAAGCTATCGGACATCGACGTATCGGGTGCTAAGAAACTGCTGGAGAAGCTGGAAAGAGGGAAGTTGAGAAGTCTGTTCAGTAGGAAA
Protein sequence
EEGSAEEWLLQAQKLVPVALGKAMEVKVFPGRWKAIVSKLERIPSRLSDLSSHPCFSKNALCREQLQAVLKSLKETIELAELCVREKFEGKLRMQSDLDSLSGKLDLNLR
DCGLLIKTGVLGEATPALSVSGSSSQSEPAEYSNIRELLARLQIGHMEAKHRALDSLVEILKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTIAIICLLAESGSCEN
WLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSSDTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAA
ECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCTTPEMKKLVGETPECIPLLIKLL
ETKSNSVREVAAQAISSLMTLSQNCREVKRDEKSVPSLVQLLDPSPQNTAKKYAVACLALLSSSRKCKKLMISYGAIGYLKKLSDIDVSGAKKLLEKLERGKLRSLFSRK
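The protein sequence corresponds to the CDS read in frame 1 under the standard genetic code. The sketch below shows one way to check that correspondence. It assumes the standard code (NCBI translation table 1) and uses a hypothetical helper named `translate`; only the first 60 nucleotides of the CDS are used, so the full sequence would need to be joined from the wrapped lines before translating it.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a nucleotide sequence in frame 1; unknown codons become 'X'.

    Whitespace (for example, line wraps) is removed first, and any trailing
    partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First 60 nucleotides of the CDS listed above.
    cds_start = "GAGGAAGGATCGGCGGAAGAATGGCTGCTGCAAGCTCAGAAGCTTGTTCCTGTGGCTCTG"
    print(translate(cds_start))
```

Comparing the output with the start of the listed protein sequence is a quick sanity check that the reading frame and codon table are the ones intended.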