CuGenDBv2

Tan0020411 (gene) of Snake gourd v1 genome

Gene ID: Tan0020411
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF819)
Genome location: LG01:1864810..1868992
RNA-Seq Expression: Tan0020411
Synteny: Tan0020411
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008537 - Protein of unknown function DUF819
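
The genome location field packs a sequence name and two coordinates into one string. As a minimal sketch, the string can be split and the locus span computed as follows. The function names are hypothetical, and the span assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQ:START..END' string into (seq_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seq_id, start, end = parse_location("LG01:1864810..1868992")
print(seq_id, start, end, span_length(start, end))
```

The span covers the whole gene model on the linkage group, so it can be longer than the CDS or mRNA sequences listed further down.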


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576704.1 hypothetical protein SDJN03_24278, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.8e-211; identity: 87.91%
Query:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI
        SKS EVQL CLSS K+ ARFS SIA+AP+PPVQP+S+SSS+AAE   RRFWNF  TSTGNV LRR VAV+SHL+L+LPLI+PHDQWGNWTVLFSIGAFGI
Subjt:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI

Query:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW
        WSEKTK+GSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGT VAYFLVPM+SLGQDSW
Subjt:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW

Query:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG
        KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICA YFATLFA+ASKVPPEPTT ND TD  KD E E S+KLPVLQSATALAVS AICK G
Subjt:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG

Query:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL
        SYLTKYFGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI+TAPSIF+F+LVQIA+HLAI +GLGKLL FD K LL
Subjt:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL

Query:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_022922913.1 uncharacterized protein LOC111430750 [Cucurbita moschata]; e value: 1.4e-211; identity: 87.91%
Query:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI
        SKS EVQL CLSS K+ ARFS SIA+AP+PPVQP+S+SSS+AAE   RRFWNF  TSTGNV LRR VAV+SHL+L+LPLI+PHDQWGNWTVLFSIGAFGI
Subjt:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI

Query:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW
        WSEKTK+GSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGT VAYFLVPM+SLGQDSW
Subjt:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW

Query:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG
        KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICA YFATLFA+ASKVPPEPTT ND TD  KD E E S+KLPVLQSATALAVS AICK G
Subjt:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG

Query:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL
        SYLTKYFGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI+TAPSIF+F+LVQIA+HLAI +GLGKLL FD K LL
Subjt:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL

Query:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_022984975.1 uncharacterized protein LOC111483083 [Cucurbita maxima]; e value: 1.6e-212; identity: 87.91%
Query:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI
        SKS EVQL CLSS K+ AR S SIA+A +PPVQP+S+SSS+AAE A RRFWNF  TSTGNVQLRR VAV+SHL+L+LPLI+PHDQWGNWTVLFS+GAFGI
Subjt:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI

Query:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW
        WSEKTK+GSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGT VAYFLVPM+SLGQDSW
Subjt:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW

Query:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG
        KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICA YFATLFA+ASKVPPEPTT ND TD GKD E E S+KLPVLQSATALAVS AICK G
Subjt:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG

Query:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL
        SYLTKYFGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI+TAPSIF+F+LVQIA+HLAI +GLGKLLRFD K LL
Subjt:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL

Query:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_023553523.1 uncharacterized protein LOC111810914 [Cucurbita pepo subsp. pepo]; e value: 2.8e-212; identity: 88.16%
Query:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSS-AAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFG
        SKS EVQL CLSS K+ ARFS SIA+AP+PPVQP+S+SSSS AAE A RRFWNF  TSTGNVQLRR VAV+SHL+L+LPLI+PHDQWGNWTVLFSIGAFG
Subjt:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSS-AAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFG

Query:  IWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDS
        IWSEKTK+GSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGT VAYFLVPM+SLGQDS
Subjt:  IWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDS

Query:  WKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKV
        WKIAAALMGRHIGGAVNYVAISDALGV+SSVLAAGLAADNVICA YFATLFA+ASKVPPEPTT ND TD GKD E E SNKLPVLQSA ALAVS AICK 
Subjt:  WKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKV

Query:  GSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSL
        GSYLTKYFGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG A+AMILMQ+FFAVVGASGN+WSVI+TAPSIF+F+LVQIA+HLAI +GLGKLL FD K L
Subjt:  GSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSL

Query:  LIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  LIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

XP_038877446.1 uncharacterized membrane protein YjcL-like [Benincasa hispida]; e value: 6.4e-209; identity: 87.01%
Query:  SQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLF
        SQ A  HSKS   Q PCLSS K    FS  IAMAPQP +QP  +SSS  AE  GRRFWNF  +STGNVQLRR VAVKSHL+L++PLI+PHDQWGNWTVLF
Subjt:  SQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLF

Query:  SIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMR
        SIGAFGIWSEKTKIGSALSGALVS LVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGT VAYFLVPMR
Subjt:  SIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMR

Query:  SLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVS
        SLGQDSWKIAAALMGRHIGGAVNYVAISDALGVS SVLAAGLAADNVICAVYFATLFA+ASKVPPE T  ++  DVGK  E E SNKLPVLQSATA+AVS
Subjt:  SLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVS

Query:  LAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLR
         AICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFFAVVGASGNIWSVINTAPSIF+F+ VQIA+HLAI +GLGKLLR
Subjt:  LAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLR

Query:  FDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        FDLK LLIASNANVGGPTTACGMATAKGWSSM+IPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  FDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
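
In the alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives for one alignment block can be counted from the query line and the middle line alone. The function name is hypothetical, and the strings are assumed to have had the "Query:" label and leading padding removed.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumes the BLAST midline convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces on the midline are often lost in extraction; pad them back.
    midline = midline.ljust(len(query))
    identities = sum(1 for m in midline if m.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the first GenBank hit, copied from the page.
query = "IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM"
midline = "IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM"

ident, pos, length = midline_stats(query, midline)
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%), {pos}/{length} positive")
```

The % identity column reported for each hit covers the whole alignment, so a single block will generally give a different figure.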

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X3; e value: 6.9e-209; identity: 85.78%
Query:  MASQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTV
        MASQ AI  SKS ++QLPC SS K  ARF  SI MAP+PPV P+S SSS AAE   RRFWNF S S+GN  LRR +AVKSHL+L+LPLI+PHDQW NWTV
Subjt:  MASQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTV

Query:  LFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVP
        LFS+GAFGIWSEKTKIGSALSGALVS LVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGTAVAYFLVP
Subjt:  LFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVP

Query:  MRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALA
        MRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVS SVLAAGLAADNVICAVYFATLFA+ASKVP EPT ++D  +VGKD E E++NKLPVLQSATALA
Subjt:  MRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALA

Query:  VSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKL
        VS AICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA+ILMQVFF VVGASGNIWSVINTAPSIFMF+LVQIA+HLA+TIGLGKL
Subjt:  VSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKL

Query:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFGL  LKYM
Subjt:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X4; e value: 6.9e-209; identity: 85.78%
Query:  MASQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTV
        MASQ AI  SKS ++QLPC SS K  ARF  SI MAP+PPV P+S SSS AAE   RRFWNF S S+GN  LRR +AVKSHL+L+LPLI+PHDQW NWTV
Subjt:  MASQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTV

Query:  LFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVP
        LFS+GAFGIWSEKTKIGSALSGALVS LVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGTAVAYFLVP
Subjt:  LFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVP

Query:  MRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALA
        MRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVS SVLAAGLAADNVICAVYFATLFA+ASKVP EPT ++D  +VGKD E E++NKLPVLQSATALA
Subjt:  MRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALA

Query:  VSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKL
        VS AICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA+ILMQVFF VVGASGNIWSVINTAPSIFMF+LVQIA+HLA+TIGLGKL
Subjt:  VSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKL

Query:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFGL  LKYM
Subjt:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X1; e value: 6.9e-209; identity: 85.78%
Query:  MASQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTV
        MASQ AI  SKS ++QLPC SS K  ARF  SI MAP+PPV P+S SSS AAE   RRFWNF S S+GN  LRR +AVKSHL+L+LPLI+PHDQW NWTV
Subjt:  MASQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTV

Query:  LFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVP
        LFS+GAFGIWSEKTKIGSALSGALVS LVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGTAVAYFLVP
Subjt:  LFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVP

Query:  MRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALA
        MRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVS SVLAAGLAADNVICAVYFATLFA+ASKVP EPT ++D  +VGKD E E++NKLPVLQSATALA
Subjt:  MRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALA

Query:  VSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKL
        VS AICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA+ILMQVFF VVGASGNIWSVINTAPSIFMF+LVQIA+HLA+TIGLGKL
Subjt:  VSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKL

Query:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        LRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFGL  LKYM
Subjt:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1E833 uncharacterized protein LOC111430750; e value: 6.7e-212; identity: 87.91%
Query:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI
        SKS EVQL CLSS K+ ARFS SIA+AP+PPVQP+S+SSS+AAE   RRFWNF  TSTGNV LRR VAV+SHL+L+LPLI+PHDQWGNWTVLFSIGAFGI
Subjt:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI

Query:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW
        WSEKTK+GSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGT VAYFLVPM+SLGQDSW
Subjt:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW

Query:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG
        KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICA YFATLFA+ASKVPPEPTT ND TD  KD E E S+KLPVLQSATALAVS AICK G
Subjt:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG

Query:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL
        SYLTKYFGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI+TAPSIF+F+LVQIA+HLAI +GLGKLL FD K LL
Subjt:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL

Query:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

A0A6J1JA22 uncharacterized protein LOC111483083; e value: 7.9e-213; identity: 87.91%
Query:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI
        SKS EVQL CLSS K+ AR S SIA+A +PPVQP+S+SSS+AAE A RRFWNF  TSTGNVQLRR VAV+SHL+L+LPLI+PHDQWGNWTVLFS+GAFGI
Subjt:  SKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGI

Query:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW
        WSEKTK+GSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TTIGT VAYFLVPM+SLGQDSW
Subjt:  WSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSW

Query:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG
        KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICA YFATLFA+ASKVPPEPTT ND TD GKD E E S+KLPVLQSATALAVS AICK G
Subjt:  KIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVG

Query:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL
        SYLTKYFGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMAMILMQ+FFAVVGASGN+WSVI+TAPSIF+F+LVQIA+HLAI +GLGKLLRFD K LL
Subjt:  SYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLL

Query:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG+ VLKYM
Subjt:  IASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM

SwissProt top hits (e value, % identity, alignment)
O31634 Uncharacterized membrane protein YjcL; e value: 1.9e-38; identity: 30.69%
Query:  LIAPHDQWGNWTVLFSIGAFGIWSE-KTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV
        LI+  D W  W  +    A  I  E + K  SA+SGA+++    +  +N G++  ++P +  V  +++PLA+PLLLF+ ++R++ K +  LL  FL+ SV
Subjt:  LIAPHDQWGNWTVLFSIGAFGIWSE-KTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV

Query:  GTTIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIAS--------------KVPPEPTTT
        GT +G+ +A+FL+       D  KI   +   +IGG VN+ A++         ++A + ADN + A+ F  L +I +              KV  +  + 
Subjt:  GTTIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIAS--------------KVPPEPTTT

Query:  NDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVGSYLTKYF------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGAS
        N      K    + S K     +  A A+     KV  Y    F      G  G     +T++ V++  +FP+ F  L  S E +   L+ +FF V+G  
Subjt:  NDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVGSYLTKYF------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGAS

Query:  GNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG
         ++  ++  AP I +F  +    +LA+++  GKL R  L+ +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G
Subjt:  GNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG

Arabidopsis top hits (e value, % identity, alignment)
AT5G24000.1 Protein of unknown function (DUF819); e value: 8.5e-127; identity: 60.87%
Query:  RRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIK
        RR V V S LR   PLI+P D W  W  LF+ GAFG+WSEKTKIGS +SGAL S L+GLAASN  +I  + P++   +EFLLP  +PLLLFRADLRR+I+
Subjt:  RRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIK

Query:  STGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPE-PTT
        STG+LLLAFL+GSV T +GT VA+ LVPMRSLG D+WKIAAALMG +IGG++N+VAIS+AL +S SV+AAG+A DNVICA++F  LFA+ASK+PPE  + 
Subjt:  STGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPE-PTT

Query:  TNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWS
        ++   D+ KD + E  N+  V+ ++ AL+VS  ICK    LT  F IQG  +PA+TA+ +VLAT FP  F  LAPS E +++ILMQVFF ++GA+G++W+
Subjt:  TNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWS

Query:  VINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLK
        VINTAPSIF+FA +Q+ +HLA+T+ LGKL   D+K LL+ASNAN+GGPTTAC MATAKGW+S+V+PGIL+G+FG++IATFLGIG G+ VLK
Subjt:  VINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLK

AT5G52540.1 Protein of unknown function (DUF819); e value: 4.3e-155; identity: 73.42%
Query:  RYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKS
        R V V S   L  PLI+P+D+WG WT LF+ GA G+WSEKTK+G+A+SGALVS LVGLAASN GII+S APAF +VL FLLPLAVPLLLFRADLRRV++S
Subjt:  RYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKS

Query:  TGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVP----PEP
        TG LLLAFL+GSV TT+GTA+AY+LVPM+SLG DSWKIAAALMGRHIGGAVNYVAIS+ALGV+ SVLAAGLAADNVICAVYF TLFA+ SK+P    P P
Subjt:  TGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVP----PEP

Query:  TTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNI
        TT     D   +  +E  NK+PVL  AT +AVSLAICK G+ LTKYFGI GGS+PAITAV+V+LAT+FP  F  LAPSGEAMA+ILMQVFF VVGASGNI
Subjt:  TTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNI

Query:  WSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM
        WSVINTAPSIF+FALVQI  HLA+ +G+GKLL  +L+ LL+ASNANVGGPTTA GMATAKGW+S+++PGILAGIFGIAIATF+GI FG+ VLK+M
Subjt:  WSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGLTVLKYM


Sequences
CDS sequence
ATGGCCTCACAGTTCGCAATTTTTCACTCGAAGTCGCTTGAAGTTCAGCTTCCATGTTTGTCTTCAAGAAAAGTTCCAGCCAGATTCTCCAGTAGCATCGCGATGGCACC
TCAGCCACCGGTGCAACCGATATCGGCCTCATCATCATCAGCTGCTGAATATGCAGGCCGTAGATTCTGGAACTTCCACAGCACTAGTACCGGAAATGTTCAATTGAGAC
GATATGTTGCTGTTAAATCTCATCTGAGATTGGATCTCCCGCTCATTGCTCCGCATGACCAGTGGGGCAACTGGACTGTTTTATTCTCCATAGGAGCCTTCGGTATCTGG
TCCGAGAAAACGAAGATCGGTAGTGCACTAAGTGGTGCCTTAGTGAGCGCATTGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCAGATGCTCCAGCTTTTCC
TATTGTTTTGGAGTTTTTGCTACCGTTGGCAGTTCCTTTGCTGTTATTCAGAGCAGATTTGCGTCGTGTTATAAAGTCGACTGGAACACTTCTCTTGGCCTTTTTGTTAG
GTTCAGTTGGAACAACAATTGGAACTGCAGTGGCCTATTTTCTTGTACCGATGCGATCACTTGGTCAAGACAGTTGGAAAATTGCGGCCGCACTAATGGGAAGACATATT
GGTGGAGCTGTCAATTATGTTGCTATATCCGATGCTCTTGGTGTTTCTTCGTCAGTATTAGCTGCTGGACTTGCTGCAGATAATGTAATTTGTGCAGTGTATTTTGCAAC
ATTGTTCGCAATAGCATCTAAAGTACCTCCTGAACCTACGACAACGAATGATTGTACGGATGTTGGGAAGGATACAGAGACTGAATACAGCAACAAGCTTCCGGTGTTAC
AGTCTGCCACAGCCCTTGCTGTATCACTTGCCATTTGTAAAGTTGGTTCCTACCTTACCAAATATTTTGGAATTCAGGGTGGTAGCATGCCAGCAATTACAGCCGTCATT
GTTGTCTTAGCAACCATTTTTCCTAAGCCGTTTGCTTACCTTGCTCCTTCTGGTGAGGCTATGGCTATGATTCTAATGCAGGTTTTCTTCGCTGTAGTGGGAGCAAGTGG
AAATATATGGAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTGCTCTTGTCCAGATTGCAATCCATCTTGCCATAACCATTGGTCTCGGAAAGCTGCTTCGCTTCG
ACCTGAAATCGTTGCTGATAGCATCGAATGCCAACGTTGGAGGCCCGACGACCGCCTGTGGGATGGCCACAGCAAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTT
GCTGGAATTTTTGGAATCGCTATTGCAACTTTCCTAGGTATTGGATTTGGATTGACGGTCTTAAAATACATGTAA
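
The CDS above can be checked against the protein sequence listed at the end of this record by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; the function name is hypothetical, and the short inputs are the first and last codons of the CDS as printed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines can be pasted in directly. Unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCTCACAGTTCGCA"))  # first six codons of the CDS
print(translate("AAATACATGTAA"))        # last four codons, ending in the stop codon
```

Translating the full CDS this way should reproduce the protein sequence given below if the record is internally consistent.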
mRNA sequence
ATGGCCTCACAGTTCGCAATTTTTCACTCGAAGTCGCTTGAAGTTCAGCTTCCATGTTTGTCTTCAAGAAAAGTTCCAGCCAGATTCTCCAGTAGCATCGCGATGGCACC
TCAGCCACCGGTGCAACCGATATCGGCCTCATCATCATCAGCTGCTGAATATGCAGGCCGTAGATTCTGGAACTTCCACAGCACTAGTACCGGAAATGTTCAATTGAGAC
GATATGTTGCTGTTAAATCTCATCTGAGATTGGATCTCCCGCTCATTGCTCCGCATGACCAGTGGGGCAACTGGACTGTTTTATTCTCCATAGGAGCCTTCGGTATCTGG
TCCGAGAAAACGAAGATCGGTAGTGCACTAAGTGGTGCCTTAGTGAGCGCATTGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCAGATGCTCCAGCTTTTCC
TATTGTTTTGGAGTTTTTGCTACCGTTGGCAGTTCCTTTGCTGTTATTCAGAGCAGATTTGCGTCGTGTTATAAAGTCGACTGGAACACTTCTCTTGGCCTTTTTGTTAG
GTTCAGTTGGAACAACAATTGGAACTGCAGTGGCCTATTTTCTTGTACCGATGCGATCACTTGGTCAAGACAGTTGGAAAATTGCGGCCGCACTAATGGGAAGACATATT
GGTGGAGCTGTCAATTATGTTGCTATATCCGATGCTCTTGGTGTTTCTTCGTCAGTATTAGCTGCTGGACTTGCTGCAGATAATGTAATTTGTGCAGTGTATTTTGCAAC
ATTGTTCGCAATAGCATCTAAAGTACCTCCTGAACCTACGACAACGAATGATTGTACGGATGTTGGGAAGGATACAGAGACTGAATACAGCAACAAGCTTCCGGTGTTAC
AGTCTGCCACAGCCCTTGCTGTATCACTTGCCATTTGTAAAGTTGGTTCCTACCTTACCAAATATTTTGGAATTCAGGGTGGTAGCATGCCAGCAATTACAGCCGTCATT
GTTGTCTTAGCAACCATTTTTCCTAAGCCGTTTGCTTACCTTGCTCCTTCTGGTGAGGCTATGGCTATGATTCTAATGCAGGTTTTCTTCGCTGTAGTGGGAGCAAGTGG
AAATATATGGAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTGCTCTTGTCCAGATTGCAATCCATCTTGCCATAACCATTGGTCTCGGAAAGCTGCTTCGCTTCG
ACCTGAAATCGTTGCTGATAGCATCGAATGCCAACGTTGGAGGCCCGACGACCGCCTGTGGGATGGCCACAGCAAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTT
GCTGGAATTTTTGGAATCGCTATTGCAACTTTCCTAGGTATTGGATTTGGATTGACGGTCTTAAAATACATGTAAAACCATTCGAATTCAGATCATAAAATCGCTTGTTT
CTCAGAAGACATTGGATGAGAGCTTGTTTTCTTAAAGTACTTGCAGGCAAAGCACATTTAGAGCATCTATGTGCACTTCTTTTAAAAGTTCATCTTAGATTTAAATCCTT
CTGTAGTGCTTGTGCTTATGATCTTAGAGCCTTTCTGTTTGGAACCAGATGAACTGAAGGCACACTGTCATTTTTTAAAAAAAATAAAGCATACTGTGTCATGTTGTTTG
TTGTACCAA
Protein sequence
MASQFAIFHSKSLEVQLPCLSSRKVPARFSSSIAMAPQPPVQPISASSSSAAEYAGRRFWNFHSTSTGNVQLRRYVAVKSHLRLDLPLIAPHDQWGNWTVLFSIGAFGIW
SEKTKIGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDSWKIAAALMGRHI
GGAVNYVAISDALGVSSSVLAAGLAADNVICAVYFATLFAIASKVPPEPTTTNDCTDVGKDTETEYSNKLPVLQSATALAVSLAICKVGSYLTKYFGIQGGSMPAITAVI
VVLATIFPKPFAYLAPSGEAMAMILMQVFFAVVGASGNIWSVINTAPSIFMFALVQIAIHLAITIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGIL
AGIFGIAIATFLGIGFGLTVLKYM