; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016781 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016781
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF819)
Genome locationscaffold9:44937239..44941969
RNA-Seq ExpressionSpg016781
SyntenySpg016781
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008537 - Protein of unknown function DUF819


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140757.1 uncharacterized protein LOC101211894 isoform X1 [Cucumis sativus]5.6e-20584.85Show/hide
Query:  SQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLF
        SQL +L SKSPE+Q PC + +K SV FSR+I++   PP+Q LSS S   AEI   RFW+FR +S  NV+SRR VAV+S LKLNLPL+SP+DQWGNWTVLF
Subjt:  SQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLF

Query:  SVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMR
        S+GAFGIWSEKTK+GSALSGALVS LVGLAASNFGII+SDAPAF IVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTT+GTVVAYFLVPMR
Subjt:  SVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMR

Query:  SLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVS
        SLGQDSW+IAAALMGRHIGGAVNYVAISDAL VSPSVLAAGLAADNVICAVYFATLFALASKVP EPT +++G  VGKDAEVE SNKLPVLQSA+A+AVS
Subjt:  SLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVS

Query:  FAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLR
        FAICKVGSYLT Y GIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFFAVVGASGN+WSVI TAPSIF+FA VQI+VHL I IGLGKLLR
Subjt:  FAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLR

Query:  FDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        FDLK LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFG+ VLKYM
Subjt:  FDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_022922913.1 uncharacterized protein LOC111430750 [Cucurbita moschata]4.3e-20584.7Show/hide
Query:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV
        + SQLA   SKSPEVQL CL+  K+S RFSR+IA+  +PPVQ LSS S +AAE    RFWNF   S  NV  RR VAV+S LKLNLPLISPHDQWGNWTV
Subjt:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV

Query:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP
        LFS+GAFGIWSEKTK+GSALSGALVSALVGLAASNFGII+SDAPAFPIVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGTVVAYFLVP
Subjt:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP

Query:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA
        M+SLGQDSW+IAAALMGRHIGGAVNYVAISDAL VS SVLAAGLAADNVICA YFATLFALASKVP EPT  ND T+  KDAE+E S+KLPVLQSATALA
Subjt:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA

Query:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL
        VSFAICK GSYLT Y GIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI TAPSIF+F+LVQIAVHLAI +GLGKL
Subjt:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL

Query:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        L FD KLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_022984975.1 uncharacterized protein LOC111483083 [Cucurbita maxima]6.0e-20785.34Show/hide
Query:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV
        + SQLA   SKSPEVQL CL+  KIS R SR+IA+  +PPVQ LSS S +AAE A  RFWNF   S  NV+ RR VAV+S LKLNLPLISPHDQWGNWTV
Subjt:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV

Query:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP
        LFSVGAFGIWSEKTK+GSALSGALVSALVGLAASNFGII+SDAPAFP VLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGTVVAYFLVP
Subjt:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP

Query:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA
        M+SLGQDSW+IAAALMGRHIGGAVNYVAISDAL VS SVLAAGLAADNVICA YFATLFALASKVP EPT  ND T+ GKDAE+E S+KLPVLQSATALA
Subjt:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA

Query:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL
        VSFAICK GSYLT Y GIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI TAPSIF+F+LVQIAVHLAI +GLGKL
Subjt:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL

Query:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LRFD KLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_023553523.1 uncharacterized protein LOC111810914 [Cucurbita pepo subsp. pepo]2.5e-20584.95Show/hide
Query:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPS-AAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWT
        + SQLA   SKSPEVQL CL+  KIS RFSR+IA+  +PPVQ LSS S S AAE A  RFWNF   S  NV+ RR VAV+S LKLNLPLISPHDQWGNWT
Subjt:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPS-AAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWT

Query:  VLFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV
        VLFS+GAFGIWSEKTK+GSALSGALVSALVGLAASNFGII+SDAPAFPIVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGTVVAYFLV
Subjt:  VLFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV

Query:  PMRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATAL
        PM+SLGQDSW+IAAALMGRHIGGAVNYVAISDAL V+ SVLAAGLAADNVICA YFATLFALASKVP EPT  ND T+ GKDAE+E SNKLPVLQSA AL
Subjt:  PMRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATAL

Query:  AVSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGK
        AVSFAICK GSYLT Y GIQGGSMPAITA+IVVLATIFPKPFAYLAPSG A+AMILMQ+FFAVVGASGN+WSVI TAPSIF+F+LVQIAVHLAI +GLGK
Subjt:  AVSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGK

Query:  LLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LL FD KLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  LLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_038877446.1 uncharacterized membrane protein YjcL-like [Benincasa hispida]6.6e-20686.36Show/hide
Query:  SQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLF
        SQLA L SKSP  Q PCL+  K SV FSR IA+  QP +Q   S S   AEI G RFWNFR +S  NV+ RR VAVKS LKLN+PLISPHDQWGNWTVLF
Subjt:  SQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLF

Query:  SVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMR
        S+GAFGIWSEKTKIGSALSGALVS LVGLAASNFGII+SDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMR
Subjt:  SVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMR

Query:  SLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVS
        SLGQDSW+IAAALMGRHIGGAVNYVAISDAL VSPSVLAAGLAADNVICAVYFATLFALASKVP E TI+++  +VGK AEVE SNKLPVLQSATA+AVS
Subjt:  SLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVS

Query:  FAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLR
        FAICKVGSYLT Y GIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFFAVVGASGNIWSVI TAPSIF+F+ VQIAVHLAI +GLGKLLR
Subjt:  FAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLR

Query:  FDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        FDLKLLLIASNANVGGPTTACGMATAKGWSSM+IPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  FDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

TrEMBL top hitse value%identityAlignment
A0A0A0L600 Uncharacterized protein2.7e-20584.85Show/hide
Query:  SQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLF
        SQL +L SKSPE+Q PC + +K SV FSR+I++   PP+Q LSS S   AEI   RFW+FR +S  NV+SRR VAV+S LKLNLPL+SP+DQWGNWTVLF
Subjt:  SQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLF

Query:  SVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMR
        S+GAFGIWSEKTK+GSALSGALVS LVGLAASNFGII+SDAPAF IVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTT+GTVVAYFLVPMR
Subjt:  SVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMR

Query:  SLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVS
        SLGQDSW+IAAALMGRHIGGAVNYVAISDAL VSPSVLAAGLAADNVICAVYFATLFALASKVP EPT +++G  VGKDAEVE SNKLPVLQSA+A+AVS
Subjt:  SLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVS

Query:  FAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLR
        FAICKVGSYLT Y GIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFFAVVGASGN+WSVI TAPSIF+FA VQI+VHL I IGLGKLLR
Subjt:  FAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLR

Query:  FDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        FDLK LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFG+ VLKYM
Subjt:  FDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X33.5e-20584.7Show/hide
Query:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV
        MASQ+A+L SKSP++QLPC + +K S RF R+I +  +PPV  +SS SP AAEI   RFWNF S S  N   RR +AVKS LKLNLPLISPHDQW NWTV
Subjt:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV

Query:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP
        LFSVGAFGIWSEKTKIGSALSGALVS LVGLAASN GII+SDAPAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGT VAYFLVP
Subjt:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP

Query:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA
        MRSLGQDSW+IAAALMGRHIGGAVNYVAIS AL VSPSVLAAGLAADNVICAVYFATLFALASKVP EPT  +D  NVGKD E E++NKLPVLQSATALA
Subjt:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA

Query:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL
        VSFAICK GSYLT + GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA+ILMQVFF VVGASGNIWSVI TAPSIFMF+LVQIAVHLA+TIGLGKL
Subjt:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL

Query:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFGL  LKYM
Subjt:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X13.5e-20584.7Show/hide
Query:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV
        MASQ+A+L SKSP++QLPC + +K S RF R+I +  +PPV  +SS SP AAEI   RFWNF S S  N   RR +AVKS LKLNLPLISPHDQW NWTV
Subjt:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV

Query:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP
        LFSVGAFGIWSEKTKIGSALSGALVS LVGLAASN GII+SDAPAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGT VAYFLVP
Subjt:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP

Query:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA
        MRSLGQDSW+IAAALMGRHIGGAVNYVAIS AL VSPSVLAAGLAADNVICAVYFATLFALASKVP EPT  +D  NVGKD E E++NKLPVLQSATALA
Subjt:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA

Query:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL
        VSFAICK GSYLT + GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA+ILMQVFF VVGASGNIWSVI TAPSIFMF+LVQIAVHLA+TIGLGKL
Subjt:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL

Query:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFGL  LKYM
Subjt:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1E833 uncharacterized protein LOC1114307502.1e-20584.7Show/hide
Query:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV
        + SQLA   SKSPEVQL CL+  K+S RFSR+IA+  +PPVQ LSS S +AAE    RFWNF   S  NV  RR VAV+S LKLNLPLISPHDQWGNWTV
Subjt:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV

Query:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP
        LFS+GAFGIWSEKTK+GSALSGALVSALVGLAASNFGII+SDAPAFPIVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGTVVAYFLVP
Subjt:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP

Query:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA
        M+SLGQDSW+IAAALMGRHIGGAVNYVAISDAL VS SVLAAGLAADNVICA YFATLFALASKVP EPT  ND T+  KDAE+E S+KLPVLQSATALA
Subjt:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA

Query:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL
        VSFAICK GSYLT Y GIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI TAPSIF+F+LVQIAVHLAI +GLGKL
Subjt:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL

Query:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        L FD KLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1JA22 uncharacterized protein LOC1114830832.9e-20785.34Show/hide
Query:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV
        + SQLA   SKSPEVQL CL+  KIS R SR+IA+  +PPVQ LSS S +AAE A  RFWNF   S  NV+ RR VAV+S LKLNLPLISPHDQWGNWTV
Subjt:  MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTV

Query:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP
        LFSVGAFGIWSEKTK+GSALSGALVSALVGLAASNFGII+SDAPAFP VLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGTVVAYFLVP
Subjt:  LFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVP

Query:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA
        M+SLGQDSW+IAAALMGRHIGGAVNYVAISDAL VS SVLAAGLAADNVICA YFATLFALASKVP EPT  ND T+ GKDAE+E S+KLPVLQSATALA
Subjt:  MRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALA

Query:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL
        VSFAICK GSYLT Y GIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI TAPSIF+F+LVQIAVHLAI +GLGKL
Subjt:  VSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKL

Query:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LRFD KLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  LRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

SwissProt top hitse value%identityAlignment
O31634 Uncharacterized membrane protein YjcL9.1e-4131.03Show/hide
Query:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +N G++  ++P +  V  +++PLA+PLLLF+ ++R++ K +  LL  FL+ SV
Subjt:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV

Query:  GTTIGTVVAYFLVPMRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPTEPTIVNDGTNV
        GT +G+++A+FL+       D  +I   +   +IGG VN+ A++   +     ++A + ADN + A+ F  L ++ +         +P E  +  DG N 
Subjt:  GTTIGTVVAYFLVPMRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPTEPTIVNDGTNV

Query:  GKDAEVEYSNK-LPVLQSATALAVSFAICKVGSYLTNYL----------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASG
        G  AE  +  K + +   A     +FA+  V   ++ Y           G  G     +T++ V++  +FP+ F  L  S E +   L+ +FF V+G   
Subjt:  GKDAEVEYSNK-LPVLQSATALAVSFAICKVGSYLTNYL----------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASG

Query:  NIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG
        ++  ++  AP I +F  +    +LA+++  GKL R  L+ +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G
Subjt:  NIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG

Arabidopsis top hitse value%identityAlignment
AT5G24000.1 Protein of unknown function (DUF819)3.5e-12855.75Show/hide
Query:  SPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLFSVGAFGIWS
        SP +    ++PS+  +  S  I   + PPV    +L+P                  R V SRR   VK   +L  PLISP D W  W  LF+ GAFG+WS
Subjt:  SPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLFSVGAFGIWS

Query:  EKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWRI
        EKTKIGS +SGAL S L+GLAASN  +I  + P++   +EFLLP  +PLLLFRADLRR+I+STG+LLLAFL+GSV T +GTVVA+ LVPMRSLG D+W+I
Subjt:  EKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWRI

Query:  AAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVND-GTNVGKDAEVEYSNKLPVLQSATALAVSFAICKVGS
        AAALMG +IGG++N+VAIS+AL +SPSV+AAG+A DNVICA++F  LFALASK+P E    +    ++ KD ++E  N+  V+ ++ AL+VSF ICK   
Subjt:  AAALMGRHIGGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVND-GTNVGKDAEVEYSNKLPVLQSATALAVSFAICKVGS

Query:  YLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLRFDLKLLLI
         LT    IQG  +PA+TA+ +VLAT FP  F  LAPS E +++ILMQVFF ++GA+G++W+VI TAPSIF+FA +Q+ VHLA+T+ LGKL   D+KLLL+
Subjt:  YLTNYLGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLRFDLKLLLI

Query:  ASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLK
        ASNAN+GGPTTAC MATAKGW+S+V+PGIL+G+FG++IATFLGIG G+ VLK
Subjt:  ASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLK

AT5G52540.1 Protein of unknown function (DUF819)1.6e-15469.88Show/hide
Query:  SSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAP
        S LS S A ++  R    R  S  +    R V V S   L+ PLISP+D+WG WT LF+ GA G+WSEKTK+G+A+SGALVS LVGLAASN GIISS AP
Subjt:  SSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIISSDAP

Query:  AFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGL
        AF +VL FLLPLAVPLLLFRADLRRV++STG LLLAFL+GSV TT+GT +AY+LVPM+SLG DSW+IAAALMGRHIGGAVNYVAIS+AL V+PSVLAAGL
Subjt:  AFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWRIAAALMGRHIGGAVNYVAISDALDVSPSVLAAGL

Query:  AADNVICAVYFATLFALASKVPTE-----PTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPK
        AADNVICAVYF TLFAL SK+P E      TIV+  TN       E  NK+PVL  AT +AVS AICK G+ LT Y GI GGS+PAITAV+V+LAT+FP 
Subjt:  AADNVICAVYFATLFALASKVPTE-----PTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVSFAICKVGSYLTNYLGIQGGSMPAITAVIVVLATIFPK

Query:  PFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGI
         F  LAPSGEAMA+ILMQVFF VVGASGNIWSVI TAPSIF+FALVQI  HLA+ +G+GKLL  +L+LLL+ASNANVGGPTTA GMATAKGW+S+++PGI
Subjt:  PFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGI

Query:  LAGIFGIAIATFLGIGFGLTVLKYM
        LAGIFGIAIATF+GI FG+ VLK+M
Subjt:  LAGIFGIAIATFLGIGFGLTVLKYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCACAGCTCGCAGTTCTTCCCTCGAAATCGCCTGAAGTTCAGCTTCCATGTTTGGCTCCAAGCAAAATCTCAGTTAGATTCTCCAGGAACATCGCGTTGACACG
TCAGCCACCGGTGCAACAGTTATCGTCGTTATCACCATCAGCTGCAGAAATTGCAGGCTCGAGATTCTGGAACTTTCGCAGCGCTAGTATCAGAAATGTTCGATCAAGAC
GATATGTTGCTGTTAAATCTCTTCTGAAATTGAATCTCCCGCTCATTTCTCCGCACGATCAGTGGGGCAACTGGACTGTTTTATTCTCCGTAGGAGCCTTCGGTATCTGG
TCTGAGAAAACGAAGATTGGTAGTGCATTAAGTGGTGCCTTAGTAAGCGCTTTGGTAGGACTTGCAGCTAGTAATTTTGGGATCATTTCATCTGATGCTCCAGCTTTTCC
AATTGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTGATAAAGTCAACTGGGACACTTCTCTTGGCTTTTTTGTTAG
GTTCAGTTGGAACAACAATTGGAACTGTAGTGGCCTACTTTCTTGTACCAATGCGATCTCTTGGTCAAGACAGTTGGAGAATTGCTGCCGCACTAATGGGAAGACACATT
GGTGGAGCTGTCAATTATGTTGCTATATCTGATGCTCTTGATGTATCTCCATCAGTGTTAGCTGCTGGACTTGCTGCCGATAATGTAATTTGTGCAGTGTATTTTGCAAC
ACTGTTTGCATTAGCGTCTAAAGTACCTACTGAACCTACGATAGTGAATGATGGTACGAATGTTGGGAAGGATGCAGAGGTTGAGTACAGCAACAAGCTTCCGGTGTTAC
AATCGGCAACAGCTCTTGCTGTATCATTTGCCATTTGTAAAGTTGGTTCCTATTTGACCAATTATCTTGGAATTCAAGGTGGTAGCATGCCAGCAATTACAGCCGTCATT
GTTGTCTTAGCAACCATTTTTCCTAAGCCGTTTGCTTACCTTGCTCCTTCCGGTGAGGCTATGGCTATGATTCTAATGCAGGTTTTCTTTGCTGTAGTGGGAGCTAGTGG
AAATATATGGAGTGTCATCAAAACTGCACCAAGTATCTTCATGTTTGCTCTTGTTCAGATTGCAGTCCATCTTGCCATAACCATTGGTCTCGGGAAGCTGCTTCGCTTCG
ACCTGAAGTTGTTACTGATAGCATCAAATGCCAATGTTGGAGGGCCCACGACAGCCTGTGGGATGGCCACCGCAAAGGGTTGGAGTTCAATGGTTATCCCTGGAATTCTT
GCTGGAATTTTCGGAATCGCTATCGCAACTTTCCTAGGTATTGGATTTGGATTGACAGTCTTGAAATACATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCACAGCTCGCAGTTCTTCCCTCGAAATCGCCTGAAGTTCAGCTTCCATGTTTGGCTCCAAGCAAAATCTCAGTTAGATTCTCCAGGAACATCGCGTTGACACG
TCAGCCACCGGTGCAACAGTTATCGTCGTTATCACCATCAGCTGCAGAAATTGCAGGCTCGAGATTCTGGAACTTTCGCAGCGCTAGTATCAGAAATGTTCGATCAAGAC
GATATGTTGCTGTTAAATCTCTTCTGAAATTGAATCTCCCGCTCATTTCTCCGCACGATCAGTGGGGCAACTGGACTGTTTTATTCTCCGTAGGAGCCTTCGGTATCTGG
TCTGAGAAAACGAAGATTGGTAGTGCATTAAGTGGTGCCTTAGTAAGCGCTTTGGTAGGACTTGCAGCTAGTAATTTTGGGATCATTTCATCTGATGCTCCAGCTTTTCC
AATTGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTGATAAAGTCAACTGGGACACTTCTCTTGGCTTTTTTGTTAG
GTTCAGTTGGAACAACAATTGGAACTGTAGTGGCCTACTTTCTTGTACCAATGCGATCTCTTGGTCAAGACAGTTGGAGAATTGCTGCCGCACTAATGGGAAGACACATT
GGTGGAGCTGTCAATTATGTTGCTATATCTGATGCTCTTGATGTATCTCCATCAGTGTTAGCTGCTGGACTTGCTGCCGATAATGTAATTTGTGCAGTGTATTTTGCAAC
ACTGTTTGCATTAGCGTCTAAAGTACCTACTGAACCTACGATAGTGAATGATGGTACGAATGTTGGGAAGGATGCAGAGGTTGAGTACAGCAACAAGCTTCCGGTGTTAC
AATCGGCAACAGCTCTTGCTGTATCATTTGCCATTTGTAAAGTTGGTTCCTATTTGACCAATTATCTTGGAATTCAAGGTGGTAGCATGCCAGCAATTACAGCCGTCATT
GTTGTCTTAGCAACCATTTTTCCTAAGCCGTTTGCTTACCTTGCTCCTTCCGGTGAGGCTATGGCTATGATTCTAATGCAGGTTTTCTTTGCTGTAGTGGGAGCTAGTGG
AAATATATGGAGTGTCATCAAAACTGCACCAAGTATCTTCATGTTTGCTCTTGTTCAGATTGCAGTCCATCTTGCCATAACCATTGGTCTCGGGAAGCTGCTTCGCTTCG
ACCTGAAGTTGTTACTGATAGCATCAAATGCCAATGTTGGAGGGCCCACGACAGCCTGTGGGATGGCCACCGCAAAGGGTTGGAGTTCAATGGTTATCCCTGGAATTCTT
GCTGGAATTTTCGGAATCGCTATCGCAACTTTCCTAGGTATTGGATTTGGATTGACAGTCTTGAAATACATGTAA
Protein sequenceShow/hide protein sequence
MASQLAVLPSKSPEVQLPCLAPSKISVRFSRNIALTRQPPVQQLSSLSPSAAEIAGSRFWNFRSASIRNVRSRRYVAVKSLLKLNLPLISPHDQWGNWTVLFSVGAFGIW
SEKTKIGSALSGALVSALVGLAASNFGIISSDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWRIAAALMGRHI
GGAVNYVAISDALDVSPSVLAAGLAADNVICAVYFATLFALASKVPTEPTIVNDGTNVGKDAEVEYSNKLPVLQSATALAVSFAICKVGSYLTNYLGIQGGSMPAITAVI
VVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVIKTAPSIFMFALVQIAVHLAITIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVIPGIL
AGIFGIAIATFLGIGFGLTVLKYM