CuGenDBv2

Lsi01G001540 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G001540
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein of unknown function, DUF642
Genome location: chr01:1565172..1569220
RNA-Seq Expression: Lsi01G001540
Synteny: Lsi01G001540
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006946 - Domain of unknown function DUF642
                  IPR008979 - Galactose-binding-like domain superfamily
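The genome location field can be turned into a locus length with a little arithmetic. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends. The result is the genomic span of the locus, which may include introns and untranslated regions.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the genomic span in base pairs.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("chr01:1565172..1569220"))  # 4049 under the inclusive assumption
```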


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581425.1 hypothetical protein SDJN03_21427, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.8e-220 | % identity: 95.94
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF K  SLILLVFA LASE VAEDGLVANGDFETLPSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEP+DETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGP +WAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

XP_004134601.1 uncharacterized protein LOC101220961 [Cucumis sativus] | e value: 1.2e-219 | % identity: 95.18
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P F KC SLILLVFA  AS+I+A+DGLVANGDFET+PSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYTYAFEPE+ETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGP FWAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

XP_022926053.1 uncharacterized protein LOC111433290 [Cucurbita moschata] | e value: 1.1e-220 | % identity: 96.19
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF K  SLILLVFA LASEIVAEDGLVANGDFETLPSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEP+DETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGP +WAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

XP_023543516.1 uncharacterized protein LOC111803381 [Cucurbita pepo subsp. pepo] | e value: 6.8e-220 | % identity: 95.94
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF K  SLILLVFA LASEIVAEDGLVANGDFETLPSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEP+DETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGP  WAFGVGLGLWLL WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

XP_038883634.1 uncharacterized protein LOC120074550 [Benincasa hispida] | e value: 4.7e-221 | % identity: 95.94
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF KC SLILL+ A LASEI AEDGLVANGDFET+PSGGFPNDGAIEGPT IPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDR KDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGP +W FGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
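In the alignments above, the middle line of each block marks identical residues with the residue letter, conservative substitutions with "+", and mismatches with a space. The percent identity column can be approximated from that line. The sketch below is illustrative only: `alignment_stats` is a hypothetical helper, and the 21-residue fragment is copied from the last block of the first GenBank hit, so its identity differs from the 95.94 reported for the full alignment.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    A letter in the midline is an identity, '+' is a conservative
    substitution, and a space is a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length,
    }


# Fragment of the query and midline from the first GenBank hit.
query_fragment = "SSFQTVNLNFTAKADRSRIAF"
midline_fragment = "SSFQ+VNLNFTAKADR+RIAF"

stats = alignment_stats(query_fragment, midline_fragment)
print(stats["identities"], stats["positives"], stats["length"])
```

Applying the same count across every block of a hit and dividing by the total alignment length gives a figure comparable to the % identity column, assuming the page computes identity over the aligned region.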

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KNV4 Uncharacterized protein | e value: 5.6e-220 | % identity: 95.18
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P F KC SLILLVFA  AS+I+A+DGLVANGDFET+PSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYTYAFEPE+ETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGP FWAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

A0A1S3AZX3 uncharacterized protein LOC103484373 | e value: 5.6e-220 | % identity: 95.18
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +PKF KC SLILLVFA LAS+I+AEDGLVANGDFET+PSGGFPNDGAIEGPT IPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWD YTYAFEPE+ETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVE+NRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+V+LNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGP+FWAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

A0A6J1EGY8 uncharacterized protein LOC111433290 | e value: 5.1e-221 | % identity: 96.19
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF K  SLILLVFA LASEIVAEDGLVANGDFETLPSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEP+DETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGP +WAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

A0A6J1IL07 uncharacterized protein LOC111478402 | e value: 7.6e-217 | % identity: 94.67
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF K  SLILLVFA L SEIVAEDGLVANGDFETLPSGGFP +G IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALY+VTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYT AFE +DETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSR NGP +WAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

A0A6J1KUY4 uncharacterized protein LOC111496676 | e value: 5.1e-213 | % identity: 92.89
Query:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MAD PKF    SLILLVFA LASEI AEDGLVANGDFET+P+GG+PNDGAIEGPT IPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDE+TSSLPGW VESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTP+KPYTMTFSLGQAGDKCKQPLA+
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYT PDSNSSFQ+VNLNFTAKADR+RIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGP +   GVGLGLWL++WAL+
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G29980.1 Protein of unknown function, DUF642 | e value: 5.0e-168 | % identity: 70.98
Query:  PKFHKCQ--SLILLVFALLASEIVA---------EDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGN
        P  ++C+  S+ L + ++  + +VA         EDGLV NGDFET PS GFP+DG  +GP+ IPSW SNGTVEL+ SGQKQGGMILIVP+GRHAVRLGN
Subjt:  PKFHKCQ--SLILLVFALLASEIVA---------EDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGN

Query:  DAEISQELKVEKGALYSVTFSAARTCAQLESLNVSVPP---------ASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIA
        DAEISQ+L VEKG +YSVTFSAARTCAQLES+NVSV           AS+ +DLQTLYSVQGWDPY +AFE ED+ VRLVF+NPGMEDDPTCGPIIDDIA
Subjt:  DAEISQELKVEKGALYSVTFSAARTCAQLESLNVSVPP---------ASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIA

Query:  IKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMT
        IKK+F PD+PKDNAV NGDFE GPWMFRN SLGVL+PTNLDEE SSLPGW VESNRAVR++DS HFSVP+GKRA+ELLSGKEGIISQMVET  +KPY ++
Subjt:  IKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMT

Query:  FSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGL
        FSLG AGDKCK+PLA+MAFAGDQAQNFHY    +NSSF+   LNFTAKADR+R+AFYSVYYNTRTDDMSSLCGPV+DDVRVWFS S+R        G G 
Subjt:  FSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGL

Query:  GLWLLVWALV
        G W+ V  +V
Subjt:  GLWLLVWALV

AT1G29980.2 Protein of unknown function, DUF642 | e value: 2.1e-166 | % identity: 75.67
Query:  GLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV
        GLV NGDFET PS GFP+DG  +GP+ IPSW SNGTVEL+ SGQKQGGMILIVP+GRHAVRLGNDAEISQ+L VEKG +YSVTFSAARTCAQLES+NVSV
Subjt:  GLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV

Query:  PP---------ASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI
                   AS+ +DLQTLYSVQGWDPY +AFE ED+ VRLVF+NPGMEDDPTCGPIIDDIAIKK+F PD+PKDNAV NGDFE GPWMFRN SLGVL+
Subjt:  PP---------ASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI

Query:  PTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNS
        PTNLDEE SSLPGW VESNRAVR++DS HFSVP+GKRA+ELLSGKEGIISQMVET  +KPY ++FSLG AGDKCK+PLA+MAFAGDQAQNFHY    +NS
Subjt:  PTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNS

Query:  SFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
        SF+   LNFTAKADR+R+AFYSVYYNTRTDDMSSLCGPV+DDVRVWFS S+R        G G G W+ V  +V
Subjt:  SFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV

AT2G34510.1 Protein of unknown function, DUF642 | e value: 6.3e-163 | % identity: 78.21
Query:  EDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNV
        EDGLV NGDFET PS GFP+D  IE  + IPSW S+GTVEL++SGQKQGGMILIVPEGRHAVRLGNDAEISQEL VEKG++YSVTFSAARTCAQLESLNV
Subjt:  EDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNV

Query:  SV-----PPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPT
        SV     P ASQTIDLQT+YSVQGWDPY +AFE   + VRLVF+NPGMEDDPTCGPIIDDIA+KK+F PD+PK NAV NGDFE GPWMFRN +LGVL+PT
Subjt:  SV-----PPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPT

Query:  NLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSF
        NLDEE SSLPGW VESNRAVR+IDS HFSVP+GKRA+ELLSGKEGIISQMVET    PY M+FSLG AGDKCK+PLAVMAFAGDQAQNFHY    +NSSF
Subjt:  NLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSF

Query:  QTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAF
        +   LNFTAKA+R+RIAFYS+YYNTRTDDM+SLCGPV+DDV+VWFS S R G  F  F
Subjt:  QTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAF

AT4G32460.1 Protein of unknown function, DUF642 | e value: 1.1e-95 | % identity: 48.16
Query:  LILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT
        ++LL+ +         DGL+ NGDFE  P         +   TAIP+W  +G VE + SG KQG MIL+VP+G  AVRLGN+A I Q++ V+KG+ YS+T
Subjt:  LILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT

Query:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR
        FSAARTCAQ E LNVSV P    + +QT+YS  GWD Y++AF+ + +   +V  NPG+E+DP CGP+ID +A++ +F P     N + NG FE GPW+  
Subjt:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR

Query:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH
        N S GVLIP N  ++ S LPGW+VES +AV+YIDS HFSVPQG+RA+EL++GKE  ++Q+V T P K Y ++FS+G A + C   + V AFAG       
Subjt:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH

Query:  YTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
        Y        F+  +L F A + R+R+ FYS +Y  R DD SSLCGPV+DDV++
Subjt:  YTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV

AT4G32460.2 Protein of unknown function, DUF642 | e value: 1.1e-95 | % identity: 48.16
Query:  LILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT
        ++LL+ +         DGL+ NGDFE  P         +   TAIP+W  +G VE + SG KQG MIL+VP+G  AVRLGN+A I Q++ V+KG+ YS+T
Subjt:  LILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT

Query:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR
        FSAARTCAQ E LNVSV P    + +QT+YS  GWD Y++AF+ + +   +V  NPG+E+DP CGP+ID +A++ +F P     N + NG FE GPW+  
Subjt:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR

Query:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH
        N S GVLIP N  ++ S LPGW+VES +AV+YIDS HFSVPQG+RA+EL++GKE  ++Q+V T P K Y ++FS+G A + C   + V AFAG       
Subjt:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH

Query:  YTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
        Y        F+  +L F A + R+R+ FYS +Y  R DD SSLCGPV+DDV++
Subjt:  YTGPDSNSSFQTVNLNFTAKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV


Sequences
CDS sequence
ATGGCCGATAGCCCTAAATTTCACAAATGCCAGTCTCTTATTCTTCTTGTTTTTGCTCTTTTGGCTTCTGAAATCGTAGCTGAAGATGGATTGGTGGCCAACGGGGATTT
CGAAACCCTCCCAAGTGGCGGGTTCCCAAACGACGGAGCAATAGAAGGGCCCACAGCAATTCCAAGCTGGACATCAAACGGCACCGTTGAGCTAGTCGAGTCTGGGCAAA
AACAGGGTGGGATGATCCTCATCGTACCCGAAGGTAGACACGCAGTGAGATTAGGGAACGACGCCGAAATAAGCCAAGAACTTAAGGTTGAGAAAGGGGCATTGTATTCA
GTCACCTTCAGTGCCGCTCGCACGTGCGCCCAGCTCGAGTCCCTCAATGTGTCGGTGCCACCTGCATCACAGACCATAGACCTTCAGACTCTATACAGTGTCCAAGGCTG
GGACCCTTACACTTACGCCTTCGAACCCGAGGACGAAACGGTGCGTTTGGTCTTCCGCAATCCCGGCATGGAAGATGACCCCACCTGTGGGCCCATCATTGACGATATTG
CTATCAAGAAAATTTTTATTCCTGATAGACCTAAAGACAATGCGGTGAACAATGGAGATTTTGAATCCGGTCCATGGATGTTCAGAAACGGTTCACTTGGCGTGTTGATC
CCGACCAATTTGGATGAAGAAACGTCGTCGTTACCGGGTTGGATCGTAGAATCAAATCGGGCGGTCCGATACATCGACTCGTACCATTTCAGCGTTCCACAAGGAAAACG
AGCCATCGAATTGCTTTCAGGGAAAGAAGGCATAATCTCTCAAATGGTTGAAACGACGCCGGAAAAGCCGTACACCATGACGTTCTCTTTAGGCCAAGCCGGCGACAAGT
GCAAACAGCCACTTGCCGTAATGGCGTTCGCCGGAGATCAGGCTCAAAACTTTCACTACACCGGCCCCGATTCCAACTCATCCTTTCAAACTGTGAATCTCAATTTCACG
GCCAAGGCCGATAGGTCGAGGATTGCGTTCTACAGTGTTTATTACAATACGAGGACTGACGATATGAGCTCTCTTTGTGGCCCCGTTGTCGATGATGTTAGGGTTTGGTT
TTCGTCCTCTCGCAGAAATGGGCCTAAATTTTGGGCTTTTGGAGTTGGGCTTGGGCTTTGGTTACTTGTTTGGGCCTTGGTTTAG
mRNA sequence
TTTCAATGTCATTTTGTTGGACGCTATAAGTGGAATCTTTTGGTAAGAGACACACTCACTCTCACTCCCCACTCCTAAACACCCGAGGGAGCTCAGAGCAAACCCACTGT
TACTGAAGGGAGAGAGAGAGAGATTTGGGGAAGAAAATGGCCGATAGCCCTAAATTTCACAAATGCCAGTCTCTTATTCTTCTTGTTTTTGCTCTTTTGGCTTCTGAAAT
CGTAGCTGAAGATGGATTGGTGGCCAACGGGGATTTCGAAACCCTCCCAAGTGGCGGGTTCCCAAACGACGGAGCAATAGAAGGGCCCACAGCAATTCCAAGCTGGACAT
CAAACGGCACCGTTGAGCTAGTCGAGTCTGGGCAAAAACAGGGTGGGATGATCCTCATCGTACCCGAAGGTAGACACGCAGTGAGATTAGGGAACGACGCCGAAATAAGC
CAAGAACTTAAGGTTGAGAAAGGGGCATTGTATTCAGTCACCTTCAGTGCCGCTCGCACGTGCGCCCAGCTCGAGTCCCTCAATGTGTCGGTGCCACCTGCATCACAGAC
CATAGACCTTCAGACTCTATACAGTGTCCAAGGCTGGGACCCTTACACTTACGCCTTCGAACCCGAGGACGAAACGGTGCGTTTGGTCTTCCGCAATCCCGGCATGGAAG
ATGACCCCACCTGTGGGCCCATCATTGACGATATTGCTATCAAGAAAATTTTTATTCCTGATAGACCTAAAGACAATGCGGTGAACAATGGAGATTTTGAATCCGGTCCA
TGGATGTTCAGAAACGGTTCACTTGGCGTGTTGATCCCGACCAATTTGGATGAAGAAACGTCGTCGTTACCGGGTTGGATCGTAGAATCAAATCGGGCGGTCCGATACAT
CGACTCGTACCATTTCAGCGTTCCACAAGGAAAACGAGCCATCGAATTGCTTTCAGGGAAAGAAGGCATAATCTCTCAAATGGTTGAAACGACGCCGGAAAAGCCGTACA
CCATGACGTTCTCTTTAGGCCAAGCCGGCGACAAGTGCAAACAGCCACTTGCCGTAATGGCGTTCGCCGGAGATCAGGCTCAAAACTTTCACTACACCGGCCCCGATTCC
AACTCATCCTTTCAAACTGTGAATCTCAATTTCACGGCCAAGGCCGATAGGTCGAGGATTGCGTTCTACAGTGTTTATTACAATACGAGGACTGACGATATGAGCTCTCT
TTGTGGCCCCGTTGTCGATGATGTTAGGGTTTGGTTTTCGTCCTCTCGCAGAAATGGGCCTAAATTTTGGGCTTTTGGAGTTGGGCTTGGGCTTTGGTTACTTGTTTGGG
CCTTGGTTTAGGGTCTTAATTGTCGGTTTTGCCCTTGTGTATGGGCTTAGAGTTGTGGCATGGCAAGCAAGAATCGCTCTCTACCCAAAAGCTCTAGTTTGTGGTACATG
CTTTGCTTTAAATTTTAATTTTGCTAAGTTATGCTAGTCTTGGTAAATGGCAATATTCTAAACCAATTTAAATACGTTTGATCACAATCATTTCTCTTTTAAACCCAGAG
TCCTCCGTGTGAGGAGAGGATTTGTAGAGATCGAAATTTGCAAATATATATATACAATTTTGGAGCAAGAGATTGAAAATGAAAAT
Protein sequence
MADSPKFHKCQSLILLVFALLASEIVAEDGLVANGDFETLPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYS
VTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPEDETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI
PTNLDEETSSLPGWIVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQTVNLNFT
AKADRSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPKFWAFGVGLGLWLLVWALV
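
The protein sequence should follow from the CDS by codon-wise translation. The sketch below checks this for the first 30 and last 15 nucleotides of the CDS listed above. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes; the record does not state which table was used, and `translate` is a hypothetical helper.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 nucleotides of the CDS sequence in this record.
cds_start = "ATGGCCGATAGCCCTAAATTTCACAAATGC"
cds_end = "TGGGCCTTGGTTTAG"

print(translate(cds_start))  # MADSPKFHKC, the first ten residues of the protein
print(translate(cds_end))    # WALV*, the last four residues plus the stop codon
```

Joining the CDS lines into one string and passing it to `translate` should reproduce the full protein sequence followed by a single "*".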