; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009041 (gene) of Snake gourd v1 genome

Gene IDTan0009041
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionARM repeat superfamily protein
Genome locationLG06:70205915..70209224
RNA-Seq ExpressionTan0009041
SyntenyTan0009041
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040032.1 vacuolar protein 8 [Cucumis melo var. makuwa]2.1e-28693.45Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEE  VDRG GG  +EAES EEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTD+SNIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLSVFGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV S+PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

TYK24469.1 vacuolar protein 8 [Cucumis melo var. makuwa]1.2e-28693.45Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEE  VDRG GG  +EAES EEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTD+SNIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLSVFGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV S+PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

XP_008460170.1 PREDICTED: uncharacterized protein LOC103499059 isoform X2 [Cucumis melo]3.5e-28693.27Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEE  VDRG GG  +EAES EEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTD+ NIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLSVFGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV S+PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

XP_038876426.1 protein CELLULOSE SYNTHASE INTERACTIVE 1 isoform X1 [Benincasa hispida]5.4e-28793.98Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEEGGVDRGLGG  AEAESAEEWLLQAQ+LVP ALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVL SLKEAIELAELC 
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCS QWE TDYSNIRELLARLQIGHMEAKHRALDSLVE+LKEDDDNVLSVF R
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATS CIREKTIT+ICLLAESGSCE+WLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASN++LRRSVIS GGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGSLGAQQAAAS IC+V SS EMKKLIGEAECIP LIKLLEAKSNS+REVAAQAISSL+TL++NCREVKRDEKSVPNLVQLLDPS
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYA+ACLGSLSSSRKCKKLMISYGAIGYLKKL EMD  GAKKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

XP_038876427.1 protein CELLULOSE SYNTHASE INTERACTIVE 1 isoform X2 [Benincasa hispida]5.4e-28793.98Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEEGGVDRGLGG  AEAESAEEWLLQAQ+LVP ALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVL SLKEAIELAELC 
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCS QWE TDYSNIRELLARLQIGHMEAKHRALDSLVE+LKEDDDNVLSVF R
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATS CIREKTIT+ICLLAESGSCE+WLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMS+DTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASN++LRRSVIS GGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGSLGAQQAAAS IC+V SS EMKKLIGEAECIP LIKLLEAKSNS+REVAAQAISSL+TL++NCREVKRDEKSVPNLVQLLDPS
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYA+ACLGSLSSSRKCKKLMISYGAIGYLKKL EMD  GAKKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

TrEMBL top hitse value%identityAlignment
A0A0A0KCS8 Uncharacterized protein2.1e-28492.39Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEEG V R  GG  +EAESAEEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLR+QSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTDY NIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLS+FGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATS CIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESG+AVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASN++LRRSVISEGGLR ILAYLDGPLPQESAVGALRN+VSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV ++PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG+KKLLEKLERGKLRSLF RK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

A0A1S3CBX6 uncharacterized protein LOC103499059 isoform X11.7e-28693.27Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEE  VDRG GG  +EAES EEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTD+ NIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLSVFGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV S+PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

A0A1S3CCD8 uncharacterized protein LOC103499059 isoform X21.7e-28693.27Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEE  VDRG GG  +EAES EEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTD+ NIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLSVFGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV S+PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

A0A5A7T9D4 Vacuolar protein 81.0e-28693.45Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEE  VDRG GG  +EAES EEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTD+SNIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLSVFGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV S+PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

A0A5D3DME2 Vacuolar protein 85.9e-28793.45Show/hide
Query:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV
        MVEE  VDRG GG  +EAES EEWLLQAQ+LVPVALEKA+EVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKN LCKEQLQAVL SLKE +ELA+LCV
Subjt:  MVEEGGVDRGLGG--AEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCV

Query:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR
         EKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPL +SGCSSQWESTD+SNIRELLARLQIGHMEAKHRALDSLVE++KEDDDNVLSVFGR
Subjt:  GEKFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR

Query:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        NNVAALVQLLTATSPCIREKTI +ICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
Subjt:  NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL
        AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTA N++LRRSVISEGGLR ILAYLDGPLPQESAVGALRNLVSSVSMEL
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMEL

Query:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS
        LLSLGFLPRLVHVLKSGS+GAQQAAASAICRV S+PEMKKLIGEAECIPLLIKLLE+KSNSVREVAAQAISSL+TL+QNCREVKRDEKSVPNLVQLLDP 
Subjt:  LLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        PQNTAKKYAVACL SLSSSRKCKKLMISYGAIGYLKKL EMD PG+KKLLEKLERGKLRSLFSRK
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

SwissProt top hitse value%identityAlignment
O22161 Protein ARABIDILLO 14.8e-1225.37Show/hide
Query:  LLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSA
        LL  +Q    + + R+   L   +  DD+N     GR         +  L++L  +    ++ +    I  L+ + +    +  EG +  L  L +S + 
Subjt:  LLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGR-------NNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSA

Query:  VAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNL
        +  E+A   L  LS+  +   AI   GGV+ L++L        D V + AA   L N++A  +    +A+ G +  ++ L       G +E AA  L NL
Subjt:  VAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELC----KTGDSVSQAAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNL

Query:  TASNDTLRRSVI---SEGGLRSILAYLDGPLP--QESAVGALRNL-VSSVSMELLLSLGFLPRLVHVLKS---GSLGAQQAAASAICRVSSSPEMKKLIG
         A  D+   +       G L +++     P    ++ A GAL NL     + E +   G +  LV + +S    S G Q+ AA A+  +S S      IG
Subjt:  TASNDTLRRSVI---SEGGLRSILAYLDGPLP--QESAVGALRNL-VSSVSMELLLSLGFLPRLVHVLKS---GSLGAQQAAASAICRVSSSPEMKKLIG

Query:  EAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGSLSSSRKCK-KLMISYGAIGYLKKLLEMD
            +P LI L  +++  V E AA A+ +L     N   +  +E  VP LV L   S    A+  A   L  +   R  +  LMI   +     K + +D
Subjt:  EAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGSLSSSRKCK-KLMISYGAIGYLKKLLEMD

Query:  IPGAKKLLEK
          GA+ +  K
Subjt:  IPGAKKLLEK

O22193 U-box domain-containing protein 43.7e-1225.91Show/hide
Query:  VAALVQLLTATSPCIREKTITIICLLAESGSCENWLV--SEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ
        V  LV+ L ++S   + +    + LLA+  + +N +V  + G +  L+ L+ S  +  +E AV +L  LS++ +  +AI   G + PLI + + G S ++
Subjt:  VAALVQLLTATSPCIREKTITIICLLAESGSCENWLV--SEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQ

Query:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQ-ESAVGALRNLVS-SVSM
          +A TL ++S + E +  + + G I  ++ L+  G   G K+ AA  L NL+   +  +  ++  G +R ++  +D      + AV  L NL +     
Subjt:  AAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQ-ESAVGALRNLVS-SVSM

Query:  ELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVS-SSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAIS
          +   G +P LV V++ GS   ++ AA+A+ ++S +S     ++ +   +P L+ L ++ +   RE A   +S
Subjt:  ELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVS-SSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAIS

Q681N2 U-box domain-containing protein 151.6e-1023.65Show/hide
Query:  RMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQ
        + + +LD LS   +  L++  L+++        +P       SQ E  D   +  L+  L    +E + R++  +  + +E+ +N + +     +  LVQ
Subjt:  RMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQ

Query:  LLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLK
        LL+     I+E  +T +  L+     +  + +EG +P +I ++E+G+  A+E +  +L  LSM  +    I    G+ PL++L + G    +  A   L 
Subjt:  LLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLK

Query:  NISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYL--DGPLPQESAVGALRNLVSSVSMELL--LSL
        N+S     +    + GI++ +++L+     LG  + A   L  L AS+   R+++     + +++ ++    P  +E A   L  L S+ S  +L  L  
Subjt:  NISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYL--DGPLPQESAVGALRNLVSSVSMELL--LSL

Query:  GFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEM
        G    LV +  SG+  AQ+ A + I  +S S ++
Subjt:  GFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEM

Q9FL17 U-box domain-containing protein 402.7e-1023.7Show/hide
Query:  LLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAV
        LL +L+   +     AL S+  + + D+ + +S+     ++AL  L+ +    ++     ++  L+   S +  +V  G++PPLI +++ GS  A+E + 
Subjt:  LLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAV

Query:  ISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSV
          +  L++  +   AI   GG+ PL+ L + G  +++  +A  L ++S V   R  L + G +++++ +V  G ++G        L N+ AS    R ++
Subjt:  ISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSV

Query:  ISEGGLRSILAYLD-----GPLPQESAVGALRNLVSSVSME---LLLSLGFLPRLVHVLKSGSLGAQQAA
        +  GG+  ++  L          +ES V  L  L     +    L ++   +  LV V +SG   A+Q A
Subjt:  ISEGGLRSILAYLD-----GPLPQESAVGALRNLVSSVSME---LLLSLGFLPRLVHVLKSGSLGAQQAA

Q9SNC6 U-box domain-containing protein 133.8e-0923.1Show/hide
Query:  PLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEG
        P  +S  SS  E+   + I +L+ RL  G+ E +  A   +  + K + DN +++     +  LV LL+     I+E ++T +  L+   + +  +VS G
Subjt:  PLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEG

Query:  VLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQTLAEEGIIRVMISLV-DCGILLGS
         +P ++++++ GS  A+E A  +L  LS+  +    I   G + PL+ L   G    +  AA  L N+      +      G+I  +  L+ + G  +  
Subjt:  VLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQTLAEEGIIRVMISLV-DCGILLGS

Query:  KEYAAECLQNLTASNDTLRRSVISEGGLRSILAYL--DGPLPQESAVGALRNLVSSVSMELL--LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPE
        +  A   +  + +S+   +  + S   + S++ ++    P  +E+A   L +L S     L+    LG +  L+ +  +G+   ++ AA  + R+S   E
Subjt:  KEYAAECLQNLTASNDTLRRSVISEGGLRSILAYL--DGPLPQESAVGALRNLVSSVSMELL--LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPE

Query:  MKK
         +K
Subjt:  MKK

Arabidopsis top hitse value%identityAlignment
AT1G01830.1 ARM repeat superfamily protein9.5e-19765.6Show/hide
Query:  MVEEGGVDRGLGGAEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGE
        M EE  V++ +   + +S EEWL +   L+P  L KA  VK F GRWK I+SK+EQ+P+ LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +
Subjt:  MVEEGGVDRGLGGAEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGE

Query:  KFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVL-SVFGRN
        K+EGKLRMQSDLDSLSGKLDLNLRDCG+LIKTGVLGEATLPL +S  S   E+   S+++ELLARLQIGH+E+KH AL+SL+  ++ED+  VL  + GR 
Subjt:  KFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVL-SVFGRN

Query:  NVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQA
        NVAALVQLLTATS  IREK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQA
Subjt:  NVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQA

Query:  AAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELL
        A+A  LKN+SAV E+RQ LAEEGIIRV I L++ GILLGS+E+ AECLQNLTA++D LR +++SEGG+ S+LAYLDGPLPQ+ AV ALRNL+ SV+ E+ 
Subjt:  AAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELL

Query:  LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSV-PNLVQLLDPS
        ++L  LPRL HVLKSGSLGAQQAAASAICR + SPE K+L+GE+ CIP ++KLLE+KSN  RE AAQAI+ L+   +  RE+K+D KSV  NLV LLD +
Subjt:  LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSV-PNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSR
        P NTAKKYAVA L  +S S K KK+M+SYGAIGYLKKL EM++ GA KLLEKLERGKLRS F R
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSR

AT1G01830.2 ARM repeat superfamily protein9.5e-19765.6Show/hide
Query:  MVEEGGVDRGLGGAEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGE
        M EE  V++ +   + +S EEWL +   L+P  L KA  VK F GRWK I+SK+EQ+P+ LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +
Subjt:  MVEEGGVDRGLGGAEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGE

Query:  KFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVL-SVFGRN
        K+EGKLRMQSDLDSLSGKLDLNLRDCG+LIKTGVLGEATLPL +S  S   E+   S+++ELLARLQIGH+E+KH AL+SL+  ++ED+  VL  + GR 
Subjt:  KFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVL-SVFGRN

Query:  NVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQA
        NVAALVQLLTATS  IREK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQA
Subjt:  NVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQA

Query:  AAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELL
        A+A  LKN+SAV E+RQ LAEEGIIRV I L++ GILLGS+E+ AECLQNLTA++D LR +++SEGG+ S+LAYLDGPLPQ+ AV ALRNL+ SV+ E+ 
Subjt:  AAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELL

Query:  LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSV-PNLVQLLDPS
        ++L  LPRL HVLKSGSLGAQQAAASAICR + SPE K+L+GE+ CIP ++KLLE+KSN  RE AAQAI+ L+   +  RE+K+D KSV  NLV LLD +
Subjt:  LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSV-PNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSR
        P NTAKKYAVA L  +S S K KK+M+SYGAIGYLKKL EM++ GA KLLEKLERGKLRS F R
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSR

AT1G01830.3 ARM repeat superfamily protein9.5e-19765.6Show/hide
Query:  MVEEGGVDRGLGGAEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGE
        M EE  V++ +   + +S EEWL +   L+P  L KA  VK F GRWK I+SK+EQ+P+ LSDLSSHPCFSKN LC EQLQ+V K+L E IELAE C  +
Subjt:  MVEEGGVDRGLGGAEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGE

Query:  KFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVL-SVFGRN
        K+EGKLRMQSDLDSLSGKLDLNLRDCG+LIKTGVLGEATLPL +S  S   E+   S+++ELLARLQIGH+E+KH AL+SL+  ++ED+  VL  + GR 
Subjt:  KFEGKLRMQSDLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVL-SVFGRN

Query:  NVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQA
        NVAALVQLLTATS  IREK + +I +LAESG C+ WL+SEGVLPPL+RL+ESGS   KEKA I++QRLSM+ + AR I GHGG+ PLI+LCKTGDSVSQA
Subjt:  NVAALVQLLTATSPCIREKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQA

Query:  AAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELL
        A+A  LKN+SAV E+RQ LAEEGIIRV I L++ GILLGS+E+ AECLQNLTA++D LR +++SEGG+ S+LAYLDGPLPQ+ AV ALRNL+ SV+ E+ 
Subjt:  AAACTLKNISAVPEVRQTLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELL

Query:  LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSV-PNLVQLLDPS
        ++L  LPRL HVLKSGSLGAQQAAASAICR + SPE K+L+GE+ CIP ++KLLE+KSN  RE AAQAI+ L+   +  RE+K+D KSV  NLV LLD +
Subjt:  LSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSV-PNLVQLLDPS

Query:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSR
        P NTAKKYAVA L  +S S K KK+M+SYGAIGYLKKL EM++ GA KLLEKLERGKLRS F R
Subjt:  PQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSR

AT2G45720.1 ARM repeat superfamily protein1.0e-21469.47Show/hide
Query:  ESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGEKFEGKLRMQSDLDSLS
        ++ E+ LLQAQ LVP+AL KA  VK F  RW++I+S+LE++P+ LSDLSSHPCFSK+ LCKEQLQAVL++LKE IELA +CV EK EGKL+MQSDLDSLS
Subjt:  ESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGEKFEGKLRMQSDLDSLS

Query:  GKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIR
         K+DL+L+DCGLL+KTGVLGE T PL     SS  +  +  ++RELLARLQIGH+E+K +AL+ LVEV+KED+  V++  GR NVA+LVQLLTATSP +R
Subjt:  GKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIR

Query:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ
        E  +T+IC LAESG CENWL+SE  LP LIRL+ESGS VAKEKAVISLQR+S+S++T+R+IVGHGGV PLIE+CKTGDSVSQ+A+ACTLKNISAVPEVRQ
Subjt:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ

Query:  TLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELLLSLGFLPRLVHVLKSGS
         LAEEGI++VMI++++CGILLGSKEYAAECLQNLT+SN+TLRRSVISE G++++LAYLDGPLPQES V A+RNLV SVS+E    +  +P LVHVLKSGS
Subjt:  TLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELLLSLGFLPRLVHVLKSGS

Query:  LGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGSLSS
        +GAQQAAAS ICR+++S E K++IGE+ CIPLLI++LEAK++  REVAAQAI+SL+T+ +NCREVKRDEKSV +LV LL+PSP N+AKKYAV+ L +L S
Subjt:  LGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGSLSS

Query:  SRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        SRKCKKLM+S+GA+GYLKKL E+++PG+KKLLE++E+GKL+S FSRK
Subjt:  SRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK

AT2G45720.2 ARM repeat superfamily protein1.0e-21469.47Show/hide
Query:  ESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGEKFEGKLRMQSDLDSLS
        ++ E+ LLQAQ LVP+AL KA  VK F  RW++I+S+LE++P+ LSDLSSHPCFSK+ LCKEQLQAVL++LKE IELA +CV EK EGKL+MQSDLDSLS
Subjt:  ESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGEKFEGKLRMQSDLDSLS

Query:  GKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIR
         K+DL+L+DCGLL+KTGVLGE T PL     SS  +  +  ++RELLARLQIGH+E+K +AL+ LVEV+KED+  V++  GR NVA+LVQLLTATSP +R
Subjt:  GKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIR

Query:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ
        E  +T+IC LAESG CENWL+SE  LP LIRL+ESGS VAKEKAVISLQR+S+S++T+R+IVGHGGV PLIE+CKTGDSVSQ+A+ACTLKNISAVPEVRQ
Subjt:  EKTITIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQ

Query:  TLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELLLSLGFLPRLVHVLKSGS
         LAEEGI++VMI++++CGILLGSKEYAAECLQNLT+SN+TLRRSVISE G++++LAYLDGPLPQES V A+RNLV SVS+E    +  +P LVHVLKSGS
Subjt:  TLAEEGIIRVMISLVDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELLLSLGFLPRLVHVLKSGS

Query:  LGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGSLSS
        +GAQQAAAS ICR+++S E K++IGE+ CIPLLI++LEAK++  REVAAQAI+SL+T+ +NCREVKRDEKSV +LV LL+PSP N+AKKYAV+ L +L S
Subjt:  LGAQQAAASAICRVSSSPEMKKLIGEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGSLSS

Query:  SRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK
        SRKCKKLM+S+GA+GYLKKL E+++PG+KKLLE++E+GKL+S FSRK
Subjt:  SRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEKLERGKLRSLFSRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAAGAGGGAGGTGTCGATCGGGGTTTGGGCGGGGCGGAGGCGGAGTCGGCGGAAGAGTGGCTGCTGCAAGCTCAGAGGCTTGTTCCTGTGGCTCTGGAGAAGGC
TATGGAGGTTAAGGTGTTTCCAGGGAGGTGGAAGATGATTGTTTCGAAGCTCGAGCAGCTTCCCTCGCGGTTATCGGATTTGTCGAGCCATCCTTGCTTTTCGAAGAATG
CCCTTTGCAAGGAGCAATTGCAGGCTGTTTTGAAGAGCTTGAAGGAAGCTATTGAATTGGCGGAGCTTTGTGTTGGGGAGAAATTCGAAGGCAAGCTTCGGATGCAGAGC
GATCTCGACTCGCTGTCTGGGAAATTGGATTTGAATTTGCGGGATTGTGGCCTTTTGATTAAGACGGGTGTGCTCGGTGAGGCTACTCTACCTCTATCCATGTCGGGGTG
TTCGTCCCAGTGGGAGTCTACTGACTATAGCAACATAAGGGAACTGCTTGCAAGGCTGCAGATTGGCCATATGGAGGCCAAACATCGAGCACTTGATAGCCTCGTCGAGG
TCCTGAAAGAGGATGATGATAATGTTTTGTCTGTCTTTGGGCGCAATAATGTTGCTGCTTTAGTCCAATTGCTCACTGCAACTTCTCCATGTATTCGGGAGAAGACGATC
ACCATAATTTGTTTGCTAGCTGAATCAGGAAGTTGCGAGAATTGGCTTGTTTCGGAAGGCGTTCTGCCACCTCTAATAAGGCTTGTTGAGTCTGGCAGTGCTGTGGCTAA
AGAAAAGGCTGTGATTTCGCTGCAAAGGTTGTCAATGTCAGCTGATACCGCTCGTGCAATAGTTGGCCATGGCGGGGTTCGACCTCTTATCGAACTATGCAAGACTGGTG
ATTCTGTATCACAGGCTGCTGCTGCTTGCACGTTGAAGAACATATCAGCTGTGCCCGAGGTTCGACAAACTTTGGCGGAAGAAGGGATCATAAGGGTGATGATTAGTCTG
GTAGATTGTGGAATTCTTTTGGGATCAAAAGAATATGCAGCTGAATGCCTGCAAAATCTCACTGCTAGCAATGACACTCTAAGGAGATCGGTCATCTCCGAAGGCGGTCT
CCGCTCCATATTGGCATATCTCGACGGCCCTCTCCCTCAAGAATCCGCAGTGGGGGCATTGAGGAATTTAGTCAGCTCAGTATCAATGGAACTTCTGTTGTCTCTCGGCT
TCCTCCCCCGACTAGTACATGTGCTAAAATCAGGATCTCTCGGCGCACAGCAGGCAGCAGCATCAGCAATCTGCAGGGTAAGCAGCTCACCAGAGATGAAGAAGCTAATA
GGTGAAGCCGAATGCATCCCGCTCCTCATCAAACTCCTCGAGGCCAAATCAAACAGCGTTCGAGAAGTCGCAGCACAGGCAATCTCAAGCCTCATGACCCTTGCGCAGAA
CTGCAGAGAAGTCAAAAGAGACGAAAAGAGTGTCCCAAATCTCGTCCAACTGCTCGACCCAAGCCCCCAAAACACAGCAAAGAAGTACGCCGTCGCGTGCCTCGGGTCTC
TCTCTTCAAGTAGGAAATGCAAGAAGCTGATGATATCGTATGGGGCGATCGGGTATCTGAAGAAGCTATTGGAGATGGACATACCGGGCGCAAAGAAGCTGCTCGAGAAA
CTGGAAAGGGGGAAGTTGAGAAGTCTGTTCAGCAGGAAATAG
mRNA sequenceShow/hide mRNA sequence
CTCATTCCCATTCCCGAAGATCGCCAAGCTCTCTCTCTCTCTAACATATCGCCAAAGTGAAAGAAATCGCTCTCCACAAAGAACCTGAGAAGAAGACATAATTTCAACAT
TTTCTTCCATTAGAAAGAGAGAGAAAATTTTATAGAAGTTTGATTTTTGAACATGTGTTATTTGTTGTACCCAAGTCTCCTTGTTCGTTTGTTTGTTTCTTCTAAGGATT
TTTAAATTTTTGAATCACTTTCGACCGTTTCCTCTTCGAGAAACCAAAAATCGCTTCTGGGTTTCTTTCAACTTCTTAAAGCTGCAACAGAGCCATCTGGGTCATCTTCC
TTTTCTCCAAAAAACAGAGTTACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGTGTTGATTGAAGGTGAAGGTGAAGATATTTGTTTCTGATTTGAAAGTTCTTG
CAAAGATTATTACAGTTTCGCCGAGTCTTTAAGATCAGAATCATGGTGGAAGAGGGAGGTGTCGATCGGGGTTTGGGCGGGGCGGAGGCGGAGTCGGCGGAAGAGTGGCT
GCTGCAAGCTCAGAGGCTTGTTCCTGTGGCTCTGGAGAAGGCTATGGAGGTTAAGGTGTTTCCAGGGAGGTGGAAGATGATTGTTTCGAAGCTCGAGCAGCTTCCCTCGC
GGTTATCGGATTTGTCGAGCCATCCTTGCTTTTCGAAGAATGCCCTTTGCAAGGAGCAATTGCAGGCTGTTTTGAAGAGCTTGAAGGAAGCTATTGAATTGGCGGAGCTT
TGTGTTGGGGAGAAATTCGAAGGCAAGCTTCGGATGCAGAGCGATCTCGACTCGCTGTCTGGGAAATTGGATTTGAATTTGCGGGATTGTGGCCTTTTGATTAAGACGGG
TGTGCTCGGTGAGGCTACTCTACCTCTATCCATGTCGGGGTGTTCGTCCCAGTGGGAGTCTACTGACTATAGCAACATAAGGGAACTGCTTGCAAGGCTGCAGATTGGCC
ATATGGAGGCCAAACATCGAGCACTTGATAGCCTCGTCGAGGTCCTGAAAGAGGATGATGATAATGTTTTGTCTGTCTTTGGGCGCAATAATGTTGCTGCTTTAGTCCAA
TTGCTCACTGCAACTTCTCCATGTATTCGGGAGAAGACGATCACCATAATTTGTTTGCTAGCTGAATCAGGAAGTTGCGAGAATTGGCTTGTTTCGGAAGGCGTTCTGCC
ACCTCTAATAAGGCTTGTTGAGTCTGGCAGTGCTGTGGCTAAAGAAAAGGCTGTGATTTCGCTGCAAAGGTTGTCAATGTCAGCTGATACCGCTCGTGCAATAGTTGGCC
ATGGCGGGGTTCGACCTCTTATCGAACTATGCAAGACTGGTGATTCTGTATCACAGGCTGCTGCTGCTTGCACGTTGAAGAACATATCAGCTGTGCCCGAGGTTCGACAA
ACTTTGGCGGAAGAAGGGATCATAAGGGTGATGATTAGTCTGGTAGATTGTGGAATTCTTTTGGGATCAAAAGAATATGCAGCTGAATGCCTGCAAAATCTCACTGCTAG
CAATGACACTCTAAGGAGATCGGTCATCTCCGAAGGCGGTCTCCGCTCCATATTGGCATATCTCGACGGCCCTCTCCCTCAAGAATCCGCAGTGGGGGCATTGAGGAATT
TAGTCAGCTCAGTATCAATGGAACTTCTGTTGTCTCTCGGCTTCCTCCCCCGACTAGTACATGTGCTAAAATCAGGATCTCTCGGCGCACAGCAGGCAGCAGCATCAGCA
ATCTGCAGGGTAAGCAGCTCACCAGAGATGAAGAAGCTAATAGGTGAAGCCGAATGCATCCCGCTCCTCATCAAACTCCTCGAGGCCAAATCAAACAGCGTTCGAGAAGT
CGCAGCACAGGCAATCTCAAGCCTCATGACCCTTGCGCAGAACTGCAGAGAAGTCAAAAGAGACGAAAAGAGTGTCCCAAATCTCGTCCAACTGCTCGACCCAAGCCCCC
AAAACACAGCAAAGAAGTACGCCGTCGCGTGCCTCGGGTCTCTCTCTTCAAGTAGGAAATGCAAGAAGCTGATGATATCGTATGGGGCGATCGGGTATCTGAAGAAGCTA
TTGGAGATGGACATACCGGGCGCAAAGAAGCTGCTCGAGAAACTGGAAAGGGGGAAGTTGAGAAGTCTGTTCAGCAGGAAATAGGGAGAAAAAATGAGCAGATTGAAGAC
CATTTTGGCTGTTTTATAAAAGAAGTTGTGTTTCATAAACTAAATGAAACAAGAATGTAGTTTGGAGGATAACAAACCGAAGGTTGCCACTCTCTGGTAAACTATTTCCC
ATCTTCTATGAATATACTTCTATGAATATACTTTCCTTGTATGTTTATGGAGTTCGAGTTCTAATCTAATCTTTTAGGGCCTGTTTGACCCACTAGTTATTATAACCCAA
GTTATAATAACCTGTGAAATACTCTGCACTCCAAACACTGAGTATTATAACCCCAGACTATATTTTAAATAATATTATATGTTTCACATACTATTATTATAATCCTATGT
TATTATAA
Protein sequenceShow/hide protein sequence
MVEEGGVDRGLGGAEAESAEEWLLQAQRLVPVALEKAMEVKVFPGRWKMIVSKLEQLPSRLSDLSSHPCFSKNALCKEQLQAVLKSLKEAIELAELCVGEKFEGKLRMQS
DLDSLSGKLDLNLRDCGLLIKTGVLGEATLPLSMSGCSSQWESTDYSNIRELLARLQIGHMEAKHRALDSLVEVLKEDDDNVLSVFGRNNVAALVQLLTATSPCIREKTI
TIICLLAESGSCENWLVSEGVLPPLIRLVESGSAVAKEKAVISLQRLSMSADTARAIVGHGGVRPLIELCKTGDSVSQAAAACTLKNISAVPEVRQTLAEEGIIRVMISL
VDCGILLGSKEYAAECLQNLTASNDTLRRSVISEGGLRSILAYLDGPLPQESAVGALRNLVSSVSMELLLSLGFLPRLVHVLKSGSLGAQQAAASAICRVSSSPEMKKLI
GEAECIPLLIKLLEAKSNSVREVAAQAISSLMTLAQNCREVKRDEKSVPNLVQLLDPSPQNTAKKYAVACLGSLSSSRKCKKLMISYGAIGYLKKLLEMDIPGAKKLLEK
LERGKLRSLFSRK