; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025745 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025745
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionARM repeat superfamily protein
Genome locationtig00152936:2478057..2479745
RNA-Seq ExpressionSgr025745
SyntenySgr025745
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040032.1 vacuolar protein 8 [Cucumis melo var. makuwa]7.8e-27890.8Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+SNIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

TYK24469.1 vacuolar protein 8 [Cucumis melo var. makuwa]4.6e-27890.8Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+SNIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

XP_008460169.1 PREDICTED: uncharacterized protein LOC103499059 isoform X1 [Cucumis melo]1.3e-27790.62Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+ NIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

XP_008460170.1 PREDICTED: uncharacterized protein LOC103499059 isoform X2 [Cucumis melo]1.3e-27790.62Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+ NIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

XP_022156111.1 protein CELLULOSE SYNTHASE INTERACTIVE 1-like [Momordica charantia]1.3e-27792.36Show/hide
Query:  MVEEEGAGRVSGGAVESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREK
        MVEEEG          SAEEWLLQAQKLVPVAL KAMEVKVFPGRWK I+SKLE+IP RLSDLSSHPCFSKNALC+EQLQAVLKSLKE IELAELCVREK
Subjt:  MVEEEGAGRVSGGAVESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREK

Query:  YEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNV
        +EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEAT  L VSGSS Q E  +YSNIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGRNNV
Subjt:  YEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNV

Query:  AALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAA
        AALVQLLTATSPCIREKTI IICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAA
Subjt:  AALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAA

Query:  ACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLS
        ACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLS
Subjt:  ACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLS

Query:  LGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEA-ECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQ
        LGFLPRLVHVLKSGSLGAQQAAASAICRVC++PEMKKLVGE  ECIPLLIKLLE K NSVREVAAQAISSLMTLSQNCREVKRDEKSVP+LVQLLDPSPQ
Subjt:  LGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEA-ECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQ

Query:  NTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        NTAKKYAVACL LLSSSRKCKKLMISYGAIGYLKKLSD+++ GAKKLLEKLERGKLRSLFSRK
Subjt:  NTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

TrEMBL top hitse value%identityAlignment
A0A1S3CBX6 uncharacterized protein LOC103499059 isoform X16.5e-27890.62Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+ NIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

A0A1S3CCD8 uncharacterized protein LOC103499059 isoform X26.5e-27890.62Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+ NIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

A0A5A7T9D4 Vacuolar protein 83.8e-27890.8Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+SNIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

A0A5D3DME2 Vacuolar protein 82.2e-27890.8Show/hide
Query:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV
        MVEE    R SGG     ES EEWLLQAQKLVPVALEKA+EVKVFPGRWKMI+SKLEQ+P RLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEEGAGRVSGGA---VESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCV

Query:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR
        REK+EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SG S Q ESTD+SNIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGR
Subjt:  REKYEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL
        AAAACTLKNISAVPEVRQ LAE+GIIRVMI+LVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVS VSMEL
Subjt:  AAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRVCS+PEMKKL+GEAECIPLLIKLLE+K NSVREVAAQAISSL+TLSQNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL  LSSSRKCKKLMISYGAIGYLKKLS+M+ PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

A0A6J1DTV2 protein CELLULOSE SYNTHASE INTERACTIVE 1-like6.5e-27892.36Show/hide
Query:  MVEEEGAGRVSGGAVESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREK
        MVEEEG          SAEEWLLQAQKLVPVAL KAMEVKVFPGRWK I+SKLE+IP RLSDLSSHPCFSKNALC+EQLQAVLKSLKE IELAELCVREK
Subjt:  MVEEEGAGRVSGGAVESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREK

Query:  YEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNV
        +EGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEAT  L VSGSS Q E  +YSNIRELLARLQIGHMEAKH ALDSL+EI+KEDDDNVLSVFGRNNV
Subjt:  YEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNV

Query:  AALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAA
        AALVQLLTATSPCIREKTI IICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQAAA
Subjt:  AALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAA

Query:  ACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLS
        ACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLS
Subjt:  ACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLS

Query:  LGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEA-ECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQ
        LGFLPRLVHVLKSGSLGAQQAAASAICRVC++PEMKKLVGE  ECIPLLIKLLE K NSVREVAAQAISSLMTLSQNCREVKRDEKSVP+LVQLLDPSPQ
Subjt:  LGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEA-ECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQ

Query:  NTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        NTAKKYAVACL LLSSSRKCKKLMISYGAIGYLKKLSD+++ GAKKLLEKLERGKLRSLFSRK
Subjt:  NTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

SwissProt top hitse value%identityAlignment
O22193 U-box domain-containing protein 41.8e-1425.09Show/hide
Query:  SNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVA
        + +++L+  L+   ++ +  A   L  + K + DN + +     +  LV+LL +T    +E  +T +  L+ + + +  +   G + PLI ++E+GS+ A
Subjt:  SNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVA

Query:  KEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDA
        KE +  +L  LS+  +    I   G + PL++L   G    +  AA  L N+S   E +  + + G +R +I+L+D     G  + A   L NL    + 
Subjt:  KEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDA

Query:  LRRSVISEGGLRCI--LAYLDGPLPQESAVGALRNLVSLVS--MELLLSLGFLPRLVHVLKSGSLGAQQAAASAI
         R ++  EGG+  +  +  L     +E+A  AL  L +       ++L  G +P LV + +SG+  A++ A + +
Subjt:  LRRSVISEGGLRCI--LAYLDGPLPQESAVGALRNLVSLVS--MELLLSLGFLPRLVHVLKSGSLGAQQAAASAI

P39968 Vacuolar protein 83.8e-0922.94Show/hide
Query:  HMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLA--ESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRL
        H+  +  A  +L+  M   ++N   +     V  LV LL++T P ++    T +  +A  E+   +       ++  L+ L++S S+  K +A ++L+ L
Subjt:  HMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLA--ESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRL

Query:  SMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGL
        +        IV  GG+  L++L ++       A+   ++NIS  P     + + G ++ ++ L+D       + +A   L+NL AS++  R+     G +
Subjt:  SMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGL

Query:  -RCILAYLDGPLPQESAVGALRNLVSL--VSMELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIP------LLIKLLEAKPN
         +C    LD P+  +S + A   +++L  VS   LL    L  L+ +  S +      AA+A+  +CS       + EA   P       LI+ L++   
Subjt:  -RCILAYLDGPLPQESAVGALRNLVSL--VSMELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVGEAECIP------LLIKLLEAKPN

Query:  SVREVAAQAISSLMTLSQNCRE--VKRDEKSVPNLVQLLD
        +   +A   I  L+    +  E  VK D+  +  + ++ D
Subjt:  SVREVAAQAISSLMTLSQNCRE--VKRDEKSVPNLVQLLD

Q681N2 U-box domain-containing protein 151.5e-0824.01Show/hide
Query:  LLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAV
        L+  L    +E +  ++  +  + +E+ +N + +     +  LVQLL+     I+E  +T +  L+     +  + +EG +P +I ++E+G+  A+E + 
Subjt:  LLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAV

Query:  ISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSV
         +L  LSM  +    I    G+ PL++L + G    +  A   L N+S     +    + GI++ ++NL+     LG  + A   L  L AS+   R+++
Subjt:  ISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSV

Query:  ISEGGLRCILAYL--DGPLPQESAVGALRNLVSLVSMELL--LSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEM
             +  ++ ++    P  +E A   L  L S  S  +L  L  G    LV +  SG+  AQ+ A + I  +  S ++
Subjt:  ISEGGLRCILAYL--DGPLPQESAVGALRNLVSLVSMELL--LSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEM

Q9FL17 U-box domain-containing protein 405.3e-1125.17Show/hide
Query:  SGSSLQLESTDYSNIRE---LLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGV
        S SS ++ES + +   E   LL +L+   +     AL S+  I + D+ + +S+     ++AL  L+ +    ++     ++  L+   S +  +V  G+
Subjt:  SGSSLQLESTDYSNIRE---LLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGV

Query:  LPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKE
        +PPLI +++ GS  A+E +   +  L++  +   AI   GG+ PL+ L + G  +++  +A  L ++S V   R  L + G +++++ +V  G ++G   
Subjt:  LPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKE

Query:  YAAECLQNLTASNDALRRSVISEGGLRCILAYLD-----GPLPQESAVGALRNLV---SLVSMELLLSLGFLPRLVHVLKSGSLGAQQAA
             L N+ AS    R +++  GG+ C++  L          +ES V  L  L     L    L ++   +  LV V +SG   A+Q A
Subjt:  YAAECLQNLTASNDALRRSVISEGGLRCILAYLD-----GPLPQESAVGALRNLV---SLVSMELLLSLGFLPRLVHVLKSGSLGAQQAA

Q9M224 Protein ARABIDILLO 27.0e-1125.26Show/hide
Query:  LLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSA
        LL+ +Q    + +  A   L   +  DD+N     GR         +  L++L  +    ++ +    I  L+ +      +  EG +  L  L +S + 
Subjt:  LLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSA

Query:  VAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNL
        +  E+A   L  LS+  +   AI   GGV  L++L        D V + AA   L N++A  +    +A  G +  ++ L       G++E AA  L NL
Subjt:  VAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLVDCGILLGSKEYAAECLQNL

Query:  TASNDALRRSVI---SEGGLRCILAYLDGPLP--QESAVGALRNLV-SLVSMELLLSLGFLPRLVHVLKS---GSLGAQQAAASAICRVCSSPEMKKLVG
         A  D+   +       G L  ++     P    ++ A GAL NL     + E + + G +  LV + KS    S G Q+  A A+  +  S      +G
Subjt:  TASNDALRRSVI---SEGGLRCILAYLDGPLP--QESAVGALRNLV-SLVSMELLLSLGFLPRLVHVLKS---GSLGAQQAAASAICRVCSSPEMKKLVG

Query:  EAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGLLSSSRKCKKLMI
            IP LI L+ ++   V E AA A+ +L     N   +  +E  V  LVQL   S    A+  A   L  +   R  +  MI
Subjt:  EAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGLLSSSRKCKKLMI

Arabidopsis top hitse value%identityAlignment
AT1G01830.1 ARM repeat superfamily protein1.6e-19667.52Show/hide
Query:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS
        +S EEWL +   L+P  L KA  VK F GRWK IISK+EQIP  LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +KYEGKLRMQSDLDSLS
Subjt:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS

Query:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVL-SVFGRNNVAALVQLLTATSPCI
        GKLDLNLRDCG+LIKTGVLGEATLPL +S SS   E+   S+++ELLARLQIGH+E+KH AL+SL+  M+ED+  VL  + GR NVAALVQLLTATS  I
Subjt:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVL-SVFGRNNVAALVQLLTATSPCI

Query:  REKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVR
        REK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQAA+A  LKN+SAV E+R
Subjt:  REKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVR

Query:  QALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSG
        Q LAE+GIIRV I+L++ GILLGS+E+ AECLQNLTA++DALR +++SEGG+  +LAYLDGPLPQ+ AV ALRNL+  V+ E+ ++L  LPRL HVLKSG
Subjt:  QALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSG

Query:  SLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PNLVQLLDPSPQNTAKKYAVACLGLL
        SLGAQQAAASAICR   SPE K+LVGE+ CIP ++KLLE+K N  RE AAQAI+ L+   +  RE+K+D KSV  NLV LLD +P NTAKKYAVA L  +
Subjt:  SLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PNLVQLLDPSPQNTAKKYAVACLGLL

Query:  SSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSR
        S S K KK+M+SYGAIGYLKKLS+ME+ GA KLLEKLERGKLRS F R
Subjt:  SSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSR

AT1G01830.2 ARM repeat superfamily protein1.6e-19667.52Show/hide
Query:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS
        +S EEWL +   L+P  L KA  VK F GRWK IISK+EQIP  LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +KYEGKLRMQSDLDSLS
Subjt:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS

Query:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVL-SVFGRNNVAALVQLLTATSPCI
        GKLDLNLRDCG+LIKTGVLGEATLPL +S SS   E+   S+++ELLARLQIGH+E+KH AL+SL+  M+ED+  VL  + GR NVAALVQLLTATS  I
Subjt:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVL-SVFGRNNVAALVQLLTATSPCI

Query:  REKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVR
        REK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQAA+A  LKN+SAV E+R
Subjt:  REKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVR

Query:  QALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSG
        Q LAE+GIIRV I+L++ GILLGS+E+ AECLQNLTA++DALR +++SEGG+  +LAYLDGPLPQ+ AV ALRNL+  V+ E+ ++L  LPRL HVLKSG
Subjt:  QALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSG

Query:  SLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PNLVQLLDPSPQNTAKKYAVACLGLL
        SLGAQQAAASAICR   SPE K+LVGE+ CIP ++KLLE+K N  RE AAQAI+ L+   +  RE+K+D KSV  NLV LLD +P NTAKKYAVA L  +
Subjt:  SLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PNLVQLLDPSPQNTAKKYAVACLGLL

Query:  SSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSR
        S S K KK+M+SYGAIGYLKKLS+ME+ GA KLLEKLERGKLRS F R
Subjt:  SSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSR

AT1G01830.3 ARM repeat superfamily protein1.6e-19667.52Show/hide
Query:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS
        +S EEWL +   L+P  L KA  VK F GRWK IISK+EQIP  LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +KYEGKLRMQSDLDSLS
Subjt:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS

Query:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVL-SVFGRNNVAALVQLLTATSPCI
        GKLDLNLRDCG+LIKTGVLGEATLPL +S SS   E+   S+++ELLARLQIGH+E+KH AL+SL+  M+ED+  VL  + GR NVAALVQLLTATS  I
Subjt:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVL-SVFGRNNVAALVQLLTATSPCI

Query:  REKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVR
        REK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQAA+A  LKN+SAV E+R
Subjt:  REKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVR

Query:  QALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSG
        Q LAE+GIIRV I+L++ GILLGS+E+ AECLQNLTA++DALR +++SEGG+  +LAYLDGPLPQ+ AV ALRNL+  V+ E+ ++L  LPRL HVLKSG
Subjt:  QALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSG

Query:  SLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PNLVQLLDPSPQNTAKKYAVACLGLL
        SLGAQQAAASAICR   SPE K+LVGE+ CIP ++KLLE+K N  RE AAQAI+ L+   +  RE+K+D KSV  NLV LLD +P NTAKKYAVA L  +
Subjt:  SLGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSV-PNLVQLLDPSPQNTAKKYAVACLGLL

Query:  SSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSR
        S S K KK+M+SYGAIGYLKKLS+ME+ GA KLLEKLERGKLRS F R
Subjt:  SSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSR

AT2G45720.1 ARM repeat superfamily protein3.2e-21369.65Show/hide
Query:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS
        ++ E+ LLQAQ+LVP+AL KA  VK F  RW++IIS+LE+IP  LSDLSSHPCFSK+ LCKEQLQAVL++LKE IELA +CV EK EGKL+MQSDLDSLS
Subjt:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS

Query:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIR
         K+DL+L+DCGLL+KTGVLGE T PLS S   L+  S     +RELLARLQIGH+E+K  AL+ L+E+MKED+  V++  GR NVA+LVQLLTATSP +R
Subjt:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIR

Query:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ
        E  +T+IC LAESG CENWL+SE  LP LIRL+ESGS VAKEKAVISLQR+S+S++T+R+IVGHGGV PLIE+CKTGDSVSQ+A+ACTLKNISAVPEVRQ
Subjt:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ

Query:  ALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGS
         LAE+GI++VMIN+++CGILLGSKEYAAECLQNLT+SN+ LRRSVISE G++ +LAYLDGPLPQES V A+RNLV  VS+E    +  +P LVHVLKSGS
Subjt:  ALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGS

Query:  LGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGLLSS
        +GAQQAAAS ICR+ +S E K+++GE+ CIPLLI++LEAK +  REVAAQAI+SL+T+ +NCREVKRDEKSV +LV LL+PSP N+AKKYAV+ L  L S
Subjt:  LGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGLLSS

Query:  SRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        SRKCKKLM+S+GA+GYLKKLS++E+PG+KKLLE++E+GKL+S FSRK
Subjt:  SRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK

AT2G45720.2 ARM repeat superfamily protein3.2e-21369.65Show/hide
Query:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS
        ++ E+ LLQAQ+LVP+AL KA  VK F  RW++IIS+LE+IP  LSDLSSHPCFSK+ LCKEQLQAVL++LKE IELA +CV EK EGKL+MQSDLDSLS
Subjt:  ESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSDLDSLS

Query:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIR
         K+DL+L+DCGLL+KTGVLGE T PLS S   L+  S     +RELLARLQIGH+E+K  AL+ L+E+MKED+  V++  GR NVA+LVQLLTATSP +R
Subjt:  GKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIR

Query:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ
        E  +T+IC LAESG CENWL+SE  LP LIRL+ESGS VAKEKAVISLQR+S+S++T+R+IVGHGGV PLIE+CKTGDSVSQ+A+ACTLKNISAVPEVRQ
Subjt:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ

Query:  ALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGS
         LAE+GI++VMIN+++CGILLGSKEYAAECLQNLT+SN+ LRRSVISE G++ +LAYLDGPLPQES V A+RNLV  VS+E    +  +P LVHVLKSGS
Subjt:  ALAEDGIIRVMINLVDCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGS

Query:  LGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGLLSS
        +GAQQAAAS ICR+ +S E K+++GE+ CIPLLI++LEAK +  REVAAQAI+SL+T+ +NCREVKRDEKSV +LV LL+PSP N+AKKYAV+ L  L S
Subjt:  LGAQQAAASAICRVCSSPEMKKLVGEAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGLLSS

Query:  SRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK
        SRKCKKLM+S+GA+GYLKKLS++E+PG+KKLLE++E+GKL+S FSRK
Subjt:  SRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKLERGKLRSLFSRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAAGAGGAAGGAGCTGGTCGAGTTTCGGGCGGGGCGGTGGAGTCGGCTGAAGAATGGCTGTTGCAAGCTCAAAAGCTTGTTCCTGTGGCTCTGGAGAAGGCTAT
GGAGGTCAAGGTGTTTCCGGGTAGATGGAAGATGATCATTTCAAAGCTCGAGCAGATTCCTTTGCGGTTATCGGATTTGTCGAGCCATCCTTGCTTTTCGAAGAATGCCC
TCTGCAAGGAGCAATTGCAGGCTGTTTTGAAGAGCTTGAAGGAAGGCATTGAATTGGCGGAGCTTTGTGTGAGGGAGAAATACGAAGGCAAGCTTCGGATGCAGAGCGAT
CTCGACTCGCTCTCTGGGAAATTGGATTTGAATTTGCGCGATTGTGGTCTTCTGATCAAGACGGGCGTGCTCGGTGAGGCTACTTTGCCTCTATCCGTGTCAGGTTCTTC
GTTGCAATTGGAGTCTACTGACTATAGCAACATAAGGGAACTGCTTGCAAGGCTGCAGATTGGCCATATGGAGGCAAAACATCTAGCTCTGGACAGCCTCATCGAGATCA
TGAAAGAAGACGATGATAATGTTTTGTCTGTCTTTGGTCGTAATAATGTCGCAGCTTTAGTCCAATTGCTCACTGCAACCTCTCCATGTATCCGGGAGAAGACGATCACC
ATAATTTGCTTGCTAGCAGAATCGGGAAGTTGCGAGAATTGGCTTGTTTCGGAAGGCGTTTTGCCACCTCTAATAAGGCTTGTTGAGTCTGGCAGTGCTGTGGCTAAAGA
AAAGGCTGTGATTTCATTGCAGAGGTTGTCAATGTCGGCTGATACCGCACGTGCAATAGTTGGACATGGCGGGGTTCGGCCTCTTATTGAACTATGCAAAACCGGTGATT
CTGTATCACAGGCTGCGGCTGCTTGCACGTTGAAGAATATATCAGCCGTGCCCGAGGTTCGACAAGCTTTAGCTGAAGACGGGATCATAAGGGTGATGATCAATCTTGTG
GATTGTGGAATTCTGTTGGGATCAAAAGAATATGCAGCTGAATGCCTGCAAAATCTCACTGCTAGCAACGACGCACTAAGGAGATCGGTTATCTCAGAAGGCGGTCTCCG
CTGCATATTGGCATATCTCGATGGCCCTCTCCCTCAAGAATCCGCAGTCGGGGCATTGAGGAATTTGGTCAGCTTGGTTTCAATGGAACTTCTGTTGTCTCTCGGCTTCC
TCCCTCGTCTAGTACACGTGCTCAAATCGGGATCGCTGGGCGCACAGCAGGCTGCAGCATCAGCAATCTGCCGGGTATGCAGCTCACCAGAGATGAAGAAGCTAGTAGGT
GAAGCCGAGTGCATCCCCCTCCTCATCAAGCTACTCGAGGCGAAACCAAACAGCGTCCGGGAAGTCGCAGCGCAGGCAATCTCAAGCCTGATGACCCTTTCACAGAACTG
CAGAGAAGTCAAAAGGGATGAAAAGAGTGTCCCAAATCTCGTCCAGCTGCTCGACCCAAGCCCCCAAAACACAGCAAAGAAATACGCCGTCGCGTGCCTCGGGTTGCTTT
CGTCAAGCAGGAAATGCAAGAAGCTGATGATTTCGTATGGGGCGATCGGGTATCTGAAGAAGCTTTCGGACATGGAAATCCCAGGAGCAAAGAAATTGCTAGAGAAGCTG
GAAAGAGGGAAGTTGAGAAGCTTGTTCAGTAGGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAAGAGGAAGGAGCTGGTCGAGTTTCGGGCGGGGCGGTGGAGTCGGCTGAAGAATGGCTGTTGCAAGCTCAAAAGCTTGTTCCTGTGGCTCTGGAGAAGGCTAT
GGAGGTCAAGGTGTTTCCGGGTAGATGGAAGATGATCATTTCAAAGCTCGAGCAGATTCCTTTGCGGTTATCGGATTTGTCGAGCCATCCTTGCTTTTCGAAGAATGCCC
TCTGCAAGGAGCAATTGCAGGCTGTTTTGAAGAGCTTGAAGGAAGGCATTGAATTGGCGGAGCTTTGTGTGAGGGAGAAATACGAAGGCAAGCTTCGGATGCAGAGCGAT
CTCGACTCGCTCTCTGGGAAATTGGATTTGAATTTGCGCGATTGTGGTCTTCTGATCAAGACGGGCGTGCTCGGTGAGGCTACTTTGCCTCTATCCGTGTCAGGTTCTTC
GTTGCAATTGGAGTCTACTGACTATAGCAACATAAGGGAACTGCTTGCAAGGCTGCAGATTGGCCATATGGAGGCAAAACATCTAGCTCTGGACAGCCTCATCGAGATCA
TGAAAGAAGACGATGATAATGTTTTGTCTGTCTTTGGTCGTAATAATGTCGCAGCTTTAGTCCAATTGCTCACTGCAACCTCTCCATGTATCCGGGAGAAGACGATCACC
ATAATTTGCTTGCTAGCAGAATCGGGAAGTTGCGAGAATTGGCTTGTTTCGGAAGGCGTTTTGCCACCTCTAATAAGGCTTGTTGAGTCTGGCAGTGCTGTGGCTAAAGA
AAAGGCTGTGATTTCATTGCAGAGGTTGTCAATGTCGGCTGATACCGCACGTGCAATAGTTGGACATGGCGGGGTTCGGCCTCTTATTGAACTATGCAAAACCGGTGATT
CTGTATCACAGGCTGCGGCTGCTTGCACGTTGAAGAATATATCAGCCGTGCCCGAGGTTCGACAAGCTTTAGCTGAAGACGGGATCATAAGGGTGATGATCAATCTTGTG
GATTGTGGAATTCTGTTGGGATCAAAAGAATATGCAGCTGAATGCCTGCAAAATCTCACTGCTAGCAACGACGCACTAAGGAGATCGGTTATCTCAGAAGGCGGTCTCCG
CTGCATATTGGCATATCTCGATGGCCCTCTCCCTCAAGAATCCGCAGTCGGGGCATTGAGGAATTTGGTCAGCTTGGTTTCAATGGAACTTCTGTTGTCTCTCGGCTTCC
TCCCTCGTCTAGTACACGTGCTCAAATCGGGATCGCTGGGCGCACAGCAGGCTGCAGCATCAGCAATCTGCCGGGTATGCAGCTCACCAGAGATGAAGAAGCTAGTAGGT
GAAGCCGAGTGCATCCCCCTCCTCATCAAGCTACTCGAGGCGAAACCAAACAGCGTCCGGGAAGTCGCAGCGCAGGCAATCTCAAGCCTGATGACCCTTTCACAGAACTG
CAGAGAAGTCAAAAGGGATGAAAAGAGTGTCCCAAATCTCGTCCAGCTGCTCGACCCAAGCCCCCAAAACACAGCAAAGAAATACGCCGTCGCGTGCCTCGGGTTGCTTT
CGTCAAGCAGGAAATGCAAGAAGCTGATGATTTCGTATGGGGCGATCGGGTATCTGAAGAAGCTTTCGGACATGGAAATCCCAGGAGCAAAGAAATTGCTAGAGAAGCTG
GAAAGAGGGAAGTTGAGAAGCTTGTTCAGTAGGAAATAG
Protein sequenceShow/hide protein sequence
MVEEEGAGRVSGGAVESAEEWLLQAQKLVPVALEKAMEVKVFPGRWKMIISKLEQIPLRLSDLSSHPCFSKNALCKEQLQAVLKSLKEGIELAELCVREKYEGKLRMQSD
LDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSVSGSSLQLESTDYSNIRELLARLQIGHMEAKHLALDSLIEIMKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTIT
IICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQALAEDGIIRVMINLV
DCGILLGSKEYAAECLQNLTASNDALRRSVISEGGLRCILAYLDGPLPQESAVGALRNLVSLVSMELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVCSSPEMKKLVG
EAECIPLLIKLLEAKPNSVREVAAQAISSLMTLSQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGLLSSSRKCKKLMISYGAIGYLKKLSDMEIPGAKKLLEKL
ERGKLRSLFSRK