CuGenDBv2

CSPI06G36230 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G36230
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Protein of unknown function, DUF642
Genome location: Chr6:29230668..29234067
RNA-Seq Expression: CSPI06G36230
Synteny: CSPI06G36230
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006946 - Domain of unknown function DUF642
IPR008979 - Galactose-binding-like domain superfamily
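
The genome location field follows a chromosome:start..end pattern. A minimal sketch of how such a string could be split and the gene span computed is shown below. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval in bp.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("Chr6:29230668..29234067")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 3400 bp on Chr6; with half-open coordinates the figure would be 3399.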


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6581425.1 hypothetical protein SDJN03_21427, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.7e-218 | identity: 94.42%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P FRK  SLILLVFAH AS+ +A+DGLVANGDFET+PSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYTYAFEP++ETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGPG+WAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

XP_004134601.1 uncharacterized protein LOC101220961 [Cucumis sativus] | e value: 1.1e-230 | identity: 100%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

XP_008439641.1 PREDICTED: uncharacterized protein LOC103484373 [Cucumis melo] | e value: 2.4e-225 | identity: 97.72%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MAHTP FRKCLSLILLVFAH ASQILA+DGLVANGDFETIPSGGFPNDGAIEGPT IPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWD YTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVE+NRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSV+LNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGP FWAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

XP_022926053.1 uncharacterized protein LOC111433290 [Cucurbita moschata] | e value: 2.9e-218 | identity: 94.42%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P F K  SLILLVFAH AS+I+A+DGLVANGDFET+PSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYTYAFEP++ETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGPG+WAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

XP_038883634.1 uncharacterized protein LOC120074550 [Benincasa hispida] | e value: 2.4e-220 | identity: 95.69%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P FRKCLSLILL+ AH AS+I A+DGLVANGDFETIPSGGFPNDGAIEGPT IPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYTYAFEPE+ETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDR KDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGPG+W FGVGLGLWLLIWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KNV4 Uncharacterized protein | e value: 5.5e-231 | identity: 100%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

A0A1S3AZX3 uncharacterized protein LOC103484373 | e value: 1.2e-225 | identity: 97.72%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MAHTP FRKCLSLILLVFAH ASQILA+DGLVANGDFETIPSGGFPNDGAIEGPT IPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWD YTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVE+NRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSV+LNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGP FWAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

A0A5A7UDL0 DUF642 domain-containing protein | e value: 8.7e-213 | identity: 99.18%
Query:  GLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV
        GLVANGDFETIPSGGFPNDGAIEGPT IPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV
Subjt:  GLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV

Query:  PPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPTNLDEETS
        PPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPTNLDEETS
Subjt:  PPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPTNLDEETS

Query:  SLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNF
        SLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNF
Subjt:  SLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNF

Query:  TAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        TAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGP FWAFGVGLGLWLL+WALV
Subjt:  TAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

A0A6J1EGY8 uncharacterized protein LOC111433290 | e value: 1.4e-218 | identity: 94.42%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P F K  SLILLVFAH AS+I+A+DGLVANGDFET+PSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYTYAFEP++ETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGPG+WAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

A0A6J1IL07 uncharacterized protein LOC111478402 | e value: 7.2e-215 | identity: 93.15%
Query:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P FRK  SLILLVFAH  S+I+A+DGLVANGDFET+PSGGFP +G IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN
        LKVEKGALY+VTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYT AFE ++ETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS  NGPG+WAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G29980.1 Protein of unknown function, DUF642 | e value: 2.1e-166 | identity: 71.86%
Query:  IFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEK
        +F   +S+ +LV          +DGLV NGDFET PS GFP+DG  +GP+ IPSW SNGTVEL+ SGQKQGGMILIVP+GRHAVRLGNDAEISQ+L VEK
Subjt:  IFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEK

Query:  GALYSVTFSAARTCAQLESLNVSVPP---------SSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKD
        G +YSVTFSAARTCAQLES+NVSV           +S+ +DLQTLYSVQGWDPY +AFE E++ VRLVF+NPGMEDDPTCGPIIDDIAIKK+F PD+PKD
Subjt:  GALYSVTFSAARTCAQLESLNVSVPP---------SSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKD

Query:  NAVNNGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQ
        NAV NGDFE GPWMFRN SLGVL+PTNLDEE SSLPGW VESNRAVR++DS HF+VP+GKRA+ELLSGKEGIISQMVET  +KPY ++FSLG AGDKCK+
Subjt:  NAVNNGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQ

Query:  PLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        PLA+MAFAGDQAQNFHY    +NSSF+   LNFTAKADRTR+AFYSVYYNTRTDDMSSLCGPV+DDVRVWFS S R G GF       G W+ +  +V
Subjt:  PLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

AT1G29980.2 Protein of unknown function, DUF642 | e value: 1.8e-165 | identity: 75.13%
Query:  GLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV
        GLV NGDFET PS GFP+DG  +GP+ IPSW SNGTVEL+ SGQKQGGMILIVP+GRHAVRLGNDAEISQ+L VEKG +YSVTFSAARTCAQLES+NVSV
Subjt:  GLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV

Query:  PP---------SSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI
                   +S+ +DLQTLYSVQGWDPY +AFE E++ VRLVF+NPGMEDDPTCGPIIDDIAIKK+F PD+PKDNAV NGDFE GPWMFRN SLGVL+
Subjt:  PP---------SSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI

Query:  PTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNS
        PTNLDEE SSLPGW VESNRAVR++DS HF+VP+GKRA+ELLSGKEGIISQMVET  +KPY ++FSLG AGDKCK+PLA+MAFAGDQAQNFHY    +NS
Subjt:  PTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNS

Query:  SFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
        SF+   LNFTAKADRTR+AFYSVYYNTRTDDMSSLCGPV+DDVRVWFS S R G GF       G W+ +  +V
Subjt:  SFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV

AT2G34510.1 Protein of unknown function, DUF642 | e value: 4.1e-162 | identity: 77.65%
Query:  DDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNV
        +DGLV NGDFET PS GFP+D  IE  + IPSW S+GTVEL++SGQKQGGMILIVPEGRHAVRLGNDAEISQEL VEKG++YSVTFSAARTCAQLESLNV
Subjt:  DDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNV

Query:  SV-----PPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPT
        SV     P +SQTIDLQT+YSVQGWDPY +AFE   + VRLVF+NPGMEDDPTCGPIIDDIA+KK+F PD+PK NAV NGDFE GPWMFRN +LGVL+PT
Subjt:  SV-----PPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPT

Query:  NLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSF
        NLDEE SSLPGW VESNRAVR+IDS HF+VP+GKRA+ELLSGKEGIISQMVET    PY M+FSLG AGDKCK+PLAVMAFAGDQAQNFHY    +NSSF
Subjt:  NLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSF

Query:  QSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAF
        +   LNFTAKA+RTRIAFYS+YYNTRTDDM+SLCGPV+DDV+VWFS S R G  F  F
Subjt:  QSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAF

AT4G32460.1 Protein of unknown function, DUF642 | e value: 1.3e-96 | identity: 48.44%
Query:  LILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT
        ++LL+ + F      +DGL+ NGDFE  P         +   TAIP+W  +G VE + SG KQG MIL+VP+G  AVRLGN+A I Q++ V+KG+ YS+T
Subjt:  LILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT

Query:  FSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR
        FSAARTCAQ E LNVSV P    + +QT+YS  GWD Y++AF+ + +   +V  NPG+E+DP CGP+ID +A++ +F P     N + NG FE GPW+  
Subjt:  FSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR

Query:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH
        N S GVLIP N  ++ S LPGW+VES +AV+YIDS HF+VPQG+RA+EL++GKE  ++Q+V T P K Y ++FS+G A + C   + V AFAG       
Subjt:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH

Query:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
        Y        F+  +L F A + RTR+ FYS +Y  R DD SSLCGPV+DDV++
Subjt:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV

AT4G32460.2 Protein of unknown function, DUF642 | e value: 1.3e-96 | identity: 48.44%
Query:  LILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT
        ++LL+ + F      +DGL+ NGDFE  P         +   TAIP+W  +G VE + SG KQG MIL+VP+G  AVRLGN+A I Q++ V+KG+ YS+T
Subjt:  LILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT

Query:  FSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR
        FSAARTCAQ E LNVSV P    + +QT+YS  GWD Y++AF+ + +   +V  NPG+E+DP CGP+ID +A++ +F P     N + NG FE GPW+  
Subjt:  FSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFR

Query:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH
        N S GVLIP N  ++ S LPGW+VES +AV+YIDS HF+VPQG+RA+EL++GKE  ++Q+V T P K Y ++FS+G A + C   + V AFAG       
Subjt:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH

Query:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
        Y        F+  +L F A + RTR+ FYS +Y  R DD SSLCGPV+DDV++
Subjt:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
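
The % identity values reported for each hit can be related to the middle line of the alignment blocks. The sketch below assumes the usual BLAST pairwise convention, which this page does not explicitly confirm: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from one alignment block.

    Assumption: letters on the midline are identical residues,
    '+' is a conservative substitution, and a space is a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    length = len(query)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "identity_pct": round(100.0 * identities / length, 2),
        "positive_pct": round(100.0 * positives / length, 2),
    }


# First ten columns of the first GenBank hit (query row and middle row).
stats = alignment_stats("MAHTPIFRKC", "MA +P FRK ")
print(stats)
```

For a whole hit, the identity and length counts would be summed over all blocks before taking the ratio. Exact column alignment matters here, so whitespace in the middle line must be preserved when the text is copied.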


Sequences
CDS sequence
ATGGCCCATACGCCTATATTTCGAAAATGCTTATCTCTTATTCTTCTTGTTTTTGCTCATTTCGCTTCGCAAATCCTAGCTGATGATGGATTGGTGGCGAACGGTGATTT
CGAAACAATCCCAAGTGGCGGGTTCCCAAACGACGGAGCAATAGAAGGACCCACCGCAATTCCAAGCTGGACATCAAACGGCACTGTTGAGCTGGTAGAGTCTGGCCAAA
AACAGGGTGGGATGATCCTCATTGTACCTGAAGGTAGACACGCAGTGAGATTAGGGAACGACGCCGAAATAAGCCAAGAACTTAAGGTGGAGAAAGGGGCGCTTTACTCA
GTCACCTTCAGCGCCGCTCGCACCTGCGCCCAGCTCGAGTCCCTCAACGTGTCGGTGCCACCTTCATCACAAACCATAGACCTTCAAACCTTGTACAGCGTCCAAGGCTG
GGACCCTTACACTTACGCCTTCGAACCCGAGGAGGAAACGGTGCGTTTAGTCTTCCGTAACCCCGGCATGGAAGACGACCCCACCTGTGGCCCCATCATTGACGACATTG
CTATCAAGAAAATTTTTATTCCTGATAGACCTAAAGACAATGCGGTGAACAATGGAGATTTCGAATCCGGTCCATGGATGTTCAGAAACGGTTCGCTTGGCGTGTTGATC
CCAACGAATTTGGACGAAGAAACGTCGTCGTTACCGGGTTGGATCGTGGAATCAAACCGGGCGGTCCGATACATTGATTCGTACCATTTTAATGTCCCACAAGGAAAACG
AGCCATTGAATTGCTTTCAGGGAAGGAAGGCATCATCTCTCAAATGGTTGAAACAACGCCTGAAAAGCCGTACACCATGACATTCTCTTTAGGCCAAGCCGGAGATAAAT
GCAAGCAGCCACTTGCCGTAATGGCGTTCGCCGGAGATCAAGCTCAGAACTTCCACTACACCGGCCCCGATTCCAACTCCTCGTTCCAAAGTGTGAATCTGAATTTCACG
GCGAAGGCCGATAGGACGAGGATTGCTTTCTACAGTGTTTATTACAATACAAGGACTGACGATATGAGTTCTCTGTGTGGCCCTGTCGTGGATGATGTTAGAGTTTGGTT
TTCTTCCTCTTGTAGAAATGGGCCTGGATTTTGGGCTTTCGGTGTTGGGCTTGGGCTTTGGTTACTTATCTGGGCCTTGGTTTAG
mRNA sequence
ACTCTCACTCTCACTCTCACTCTCACTCTACAAAACATCCGCGAGAGAGAGCTCAGAGCAAAATCACTGTTCTTCTTCTTCAAGGCAGACACTCAGATTTGGGCGAAAAA
ACAATGGCCCATACGCCTATATTTCGAAAATGCTTATCTCTTATTCTTCTTGTTTTTGCTCATTTCGCTTCGCAAATCCTAGCTGATGATGGATTGGTGGCGAACGGTGA
TTTCGAAACAATCCCAAGTGGCGGGTTCCCAAACGACGGAGCAATAGAAGGACCCACCGCAATTCCAAGCTGGACATCAAACGGCACTGTTGAGCTGGTAGAGTCTGGCC
AAAAACAGGGTGGGATGATCCTCATTGTACCTGAAGGTAGACACGCAGTGAGATTAGGGAACGACGCCGAAATAAGCCAAGAACTTAAGGTGGAGAAAGGGGCGCTTTAC
TCAGTCACCTTCAGCGCCGCTCGCACCTGCGCCCAGCTCGAGTCCCTCAACGTGTCGGTGCCACCTTCATCACAAACCATAGACCTTCAAACCTTGTACAGCGTCCAAGG
CTGGGACCCTTACACTTACGCCTTCGAACCCGAGGAGGAAACGGTGCGTTTAGTCTTCCGTAACCCCGGCATGGAAGACGACCCCACCTGTGGCCCCATCATTGACGACA
TTGCTATCAAGAAAATTTTTATTCCTGATAGACCTAAAGACAATGCGGTGAACAATGGAGATTTCGAATCCGGTCCATGGATGTTCAGAAACGGTTCGCTTGGCGTGTTG
ATCCCAACGAATTTGGACGAAGAAACGTCGTCGTTACCGGGTTGGATCGTGGAATCAAACCGGGCGGTCCGATACATTGATTCGTACCATTTTAATGTCCCACAAGGAAA
ACGAGCCATTGAATTGCTTTCAGGGAAGGAAGGCATCATCTCTCAAATGGTTGAAACAACGCCTGAAAAGCCGTACACCATGACATTCTCTTTAGGCCAAGCCGGAGATA
AATGCAAGCAGCCACTTGCCGTAATGGCGTTCGCCGGAGATCAAGCTCAGAACTTCCACTACACCGGCCCCGATTCCAACTCCTCGTTCCAAAGTGTGAATCTGAATTTC
ACGGCGAAGGCCGATAGGACGAGGATTGCTTTCTACAGTGTTTATTACAATACAAGGACTGACGATATGAGTTCTCTGTGTGGCCCTGTCGTGGATGATGTTAGAGTTTG
GTTTTCTTCCTCTTGTAGAAATGGGCCTGGATTTTGGGCTTTCGGTGTTGGGCTTGGGCTTTGGTTACTTATCTGGGCCTTGGTTTAGGGAATTGGTCGGTTTTACCCTT
GTGTATGGGCTTGGAGTTGTGGGACGGGAAGAAGTAAGAAAGAATGAAGCAGTCCCTCTACCCCCACCAAAAGCAAGCTGTAGTTTGCACTTTACACTGGGTGTTTGCTT
TTAAAATTTAATTTTGGTAAGTTATGTTAGCGTTGGTGAATCACAATATTTTAAAGCAATTTAAATACTTTTGATCACCCACCCTCATTTCACTTTCAAACCCACTCTCC
TTTGAAACTGTATAAGATTTAGACAGTTTCTTTGATTACTTAATCCTGTC
Protein sequence
MAHTPIFRKCLSLILLVFAHFASQILADDGLVANGDFETIPSGGFPNDGAIEGPTAIPSWTSNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYS
VTFSAARTCAQLESLNVSVPPSSQTIDLQTLYSVQGWDPYTYAFEPEEETVRLVFRNPGMEDDPTCGPIIDDIAIKKIFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI
PTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNFT
AKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSCRNGPGFWAFGVGLGLWLLIWALV
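
The CDS and protein sequences listed above are related by translation. A minimal sketch follows, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base. The `translate` function is a hypothetical helper written for illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as multiple lines. Ambiguous bases such as N raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the CDS listed on this page.
print(translate("ATGGCCCATACGCCTATATTTCGAAAATGC"))  # MAHTPIFRKC
print(translate("GCCTTGGTTTAG"))                    # ALV
```

Both fragments agree with the start (MAHTPIFRKC) and end (ALV) of the protein sequence given above, with the final TAG read as the stop codon.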