CuGenDBv2

Carg19265 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg19265
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Protein of unknown function, DUF642
Genome location: Carg_Chr03:9684296..9686951
RNA-Seq Expression: Carg19265
Synteny: Carg19265
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006946 - Domain of unknown function DUF642
                  IPR008979 - Galactose-binding-like domain superfamily
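
The genome location string in the record can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` notation is 1-based and inclusive (the page does not state this). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr03:9684296..9686951")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 2656 bp on Carg_Chr03, which is longer than the mRNA sequence listed at the end of the record.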


Homology
GenBank top hits | e value | % identity | Alignment
KAG6581425.1 hypothetical protein SDJN03_21427, partial [Cucurbita argyrosperma subsp. sororia] | 6.6e-231 | 100
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

XP_022926053.1 uncharacterized protein LOC111433290 [Cucurbita moschata] | 2.1e-229 | 99.49
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF KWASLILLVFAHLASE VAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

XP_022978402.1 uncharacterized protein LOC111478402 [Cucurbita maxima] | 1.1e-225 | 98.22
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKFRKWASLILLVFAHL SE VAEDGLVANGDFETLPSGGFPT+GEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALY+VTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYT AFE DDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSR NGPGYWAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

XP_023543516.1 uncharacterized protein LOC111803381 [Cucurbita pepo subsp. pepo] | 1.1e-228 | 99.24
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKFRKWASLILLVFAHLASE VAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPG WAFGVGLGLWLL WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

XP_038883634.1 uncharacterized protein LOC120074550 [Benincasa hispida] | 8.1e-221 | 95.43
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKFRK  SLILL+ AHLASE  AEDGLVANGDFET+PSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEP+DETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDR KDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGPGYW FGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KNV4 Uncharacterized protein | 2.1e-219 | 94.42
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +P FRK  SLILLVFAH AS+ +A+DGLVANGDFET+PSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWDPYTYAFEP++ETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGPG+WAFGVGLGLWLL+WALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

A0A1S3AZX3 uncharacterized protein LOC103484373 | 1.8e-218 | 94.42
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MA +PKFRK  SLILLVFAHLAS+ +AEDGLVANGDFET+PSGGFP DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPP+SQTIDLQTLYSVQGWD YTYAFEP++ETVRLVFRNPGMEDDPTCGPIIDDIAIKK+FIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVE+NRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSV+LNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGP +WAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

A0A6J1EGY8 uncharacterized protein LOC111433290 | 1.0e-229 | 99.49
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKF KWASLILLVFAHLASE VAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

A0A6J1IL07 uncharacterized protein LOC111478402 | 5.3e-226 | 98.22
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MADSPKFRKWASLILLVFAHL SE VAEDGLVANGDFETLPSGGFPT+GEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALY+VTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYT AFE DDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSR NGPGYWAFGVGLGLWLLVWALV
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

A0A6J1KUY4 uncharacterized protein LOC111496676 | 9.3e-215 | 93.15
Query:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
        MAD PKFR W+SLILLVFA LASE  AEDGLVANGDFET+P+GG+P DG IEGPT+IPSWT NGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE
Subjt:  MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQE

Query:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
        LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEP+DETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN
Subjt:  LKVEKGALYSVTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVN

Query:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV
        NGDFESGPWMFRNGSLGVLIPTNLDE+TSSLPGW VESNRAVRYIDSYHF+VPQGKRAIELLSGKEGIISQMVETTP+KPYTMTFSLGQAGDKCKQPLA+
Subjt:  NGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAV

Query:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        MAFAGDQAQNFHYT PDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSS RNGPGY   GVGLGLWL++WAL+
Subjt:  MAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G29980.1 Protein of unknown function, DUF642 | 1.1e-167 | 71.53
Query:  KWASLILLVFAHLASETVA---------EDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQ
        KW+S+ L + +   +  VA         EDGLV NGDFET PS GFP DG  +GP+ IPSW  NGTVEL+ SGQKQGGMILIVP+GRHAVRLGNDAEISQ
Subjt:  KWASLILLVFAHLASETVA---------EDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQ

Query:  ELKVEKGALYSVTFSAARTCAQLESLNVSVPP---------ASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFI
        +L VEKG +YSVTFSAARTCAQLES+NVSV           AS+ +DLQTLYSVQGWDPY +AFE +D+ VRLVF+NPGMEDDPTCGPIIDDIAIKK+F 
Subjt:  ELKVEKGALYSVTFSAARTCAQLESLNVSVPP---------ASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFI

Query:  PDRPKDNAVNNGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQA
        PD+PKDNAV NGDFE GPWMFRN SLGVL+PTNLDEE SSLPGW VESNRAVR++DS HF+VP+GKRA+ELLSGKEGIISQMVET  +KPY ++FSLG A
Subjt:  PDRPKDNAVNNGDFESGPWMFRNGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQA

Query:  GDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLV
        GDKCK+PLA+MAFAGDQAQNFHY    +NSSF+   LNFTAKADRTR+AFYSVYYNTRTDDMSSLCGPV+DDVRVWFS S+R        G G G W+ V
Subjt:  GDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLV

Query:  WALV
          +V
Subjt:  WALV

AT1G29980.2 Protein of unknown function, DUF642 | 3.0e-165 | 75.13
Query:  GLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV
        GLV NGDFET PS GFP DG  +GP+ IPSW  NGTVEL+ SGQKQGGMILIVP+GRHAVRLGNDAEISQ+L VEKG +YSVTFSAARTCAQLES+NVSV
Subjt:  GLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNVSV

Query:  PP---------ASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI
                   AS+ +DLQTLYSVQGWDPY +AFE +D+ VRLVF+NPGMEDDPTCGPIIDDIAIKK+F PD+PKDNAV NGDFE GPWMFRN SLGVL+
Subjt:  PP---------ASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI

Query:  PTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNS
        PTNLDEE SSLPGW VESNRAVR++DS HF+VP+GKRA+ELLSGKEGIISQMVET  +KPY ++FSLG AGDKCK+PLA+MAFAGDQAQNFHY    +NS
Subjt:  PTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNS

Query:  SFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
        SF+   LNFTAKADRTR+AFYSVYYNTRTDDMSSLCGPV+DDVRVWFS S+R        G G G W+ V  +V
Subjt:  SFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV

AT2G34510.1 Protein of unknown function, DUF642 | 2.0e-161 | 78.69
Query:  EDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNV
        EDGLV NGDFET PS GFP D  IE  + IPSW  +GTVEL++SGQKQGGMILIVPEGRHAVRLGNDAEISQEL VEKG++YSVTFSAARTCAQLESLNV
Subjt:  EDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVTFSAARTCAQLESLNV

Query:  SV-----PPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPT
        SV     P ASQTIDLQT+YSVQGWDPY +AFE   + VRLVF+NPGMEDDPTCGPIIDDIA+KK+F PD+PK NAV NGDFE GPWMFRN +LGVL+PT
Subjt:  SV-----PPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLIPT

Query:  NLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSF
        NLDEE SSLPGW VESNRAVR+IDS HF+VP+GKRA+ELLSGKEGIISQMVET    PY M+FSLG AGDKCK+PLAVMAFAGDQAQNFHY    +NSSF
Subjt:  NLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSF

Query:  QSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNG
        +   LNFTAKA+RTRIAFYS+YYNTRTDDM+SLCGPV+DDV+VWFS S R G
Subjt:  QSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNG

AT4G32460.1 Protein of unknown function, DUF642 | 5.1e-96 | 48.16
Query:  LILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT
        ++LL+ +         DGL+ NGDFE  P        ++   T+IP+W L+G VE + SG KQG MIL+VP+G  AVRLGN+A I Q++ V+KG+ YS+T
Subjt:  LILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT

Query:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFR
        FSAARTCAQ E LNVSV P    + +QT+YS  GWD Y++AF+   +   +V  NPG+E+DP CGP+ID +A++ +F P     N + NG FE GPW+  
Subjt:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFR

Query:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH
        N S GVLIP N  ++ S LPGW+VES +AV+YIDS HF+VPQG+RA+EL++GKE  ++Q+V T P K Y ++FS+G A + C   + V AFAG       
Subjt:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH

Query:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
        Y        F+  +L F A + RTR+ FYS +Y  R DD SSLCGPV+DDV++
Subjt:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV

AT4G32460.2 Protein of unknown function, DUF642 | 5.1e-96 | 48.16
Query:  LILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT
        ++LL+ +         DGL+ NGDFE  P        ++   T+IP+W L+G VE + SG KQG MIL+VP+G  AVRLGN+A I Q++ V+KG+ YS+T
Subjt:  LILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYSVT

Query:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFR
        FSAARTCAQ E LNVSV P    + +QT+YS  GWD Y++AF+   +   +V  NPG+E+DP CGP+ID +A++ +F P     N + NG FE GPW+  
Subjt:  FSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFR

Query:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH
        N S GVLIP N  ++ S LPGW+VES +AV+YIDS HF+VPQG+RA+EL++GKE  ++Q+V T P K Y ++FS+G A + C   + V AFAG       
Subjt:  NGSLGVLIPTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFH

Query:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
        Y        F+  +L F A + RTR+ FYS +Y  R DD SSLCGPV+DDV++
Subjt:  YTGPDSNSSFQSVNLNFTAKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRV
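
The % identity values listed for each hit can be approximated from the alignment blocks themselves. The following is a minimal sketch, assuming the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical. The query line supplies the aligned length because trailing spaces on the middle line may have been stripped.

```python
def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Percent identity over a list of (query_line, middle_line) pairs.

    ASSUMPTION: letters on the middle line mark identical positions;
    '+' and spaces do not count as identities.
    """
    identical = 0
    aligned = 0
    for query, midline in blocks:
        aligned += len(query)
        identical += sum(1 for ch in midline if ch.isalpha())
    if aligned == 0:
        raise ValueError("no aligned positions")
    return 100.0 * identical / aligned


# First ten columns of the Cucurbita moschata hit: one mismatch (R vs. blank).
print(percent_identity([("MADSPKFRKW", "MADSPKF KW")]))  # 90.0
```

Applied to all four blocks of the XP_022926053.1 alignment, this count gives 392 identical positions out of 394, which rounds to the 99.49 shown in the hit line.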


Sequences
CDS sequence
ATGGCGGATAGCCCTAAATTTCGCAAGTGGGCGTCTCTTATTCTGCTTGTTTTTGCTCATTTGGCTTCTGAAACCGTGGCTGAAGATGGGTTGGTGGCAAACGGGGATTT
CGAAACACTCCCATCTGGCGGGTTCCCAACGGACGGAGAAATAGAAGGGCCCACCTCGATTCCAAGCTGGACACTAAACGGCACCGTTGAGCTAGTTGAGTCTGGGCAAA
AACAAGGTGGGATGATCCTCATCGTACCGGAAGGAAGACACGCAGTGAGATTAGGGAACGACGCCGAAATAAGCCAAGAGCTGAAAGTGGAGAAAGGGGCACTGTATTCA
GTCACCTTCAGTGCTGCTCGCACGTGCGCCCAGCTCGAGTCCCTCAACGTGTCGGTGCCACCTGCATCACAGACCATAGACCTTCAGACTTTGTACAGCGTCCAGGGGTG
GGACCCTTACACTTACGCTTTCGAACCCGATGATGAAACGGTGCGTTTGGTCTTCCGCAACCCTGGCATGGAGGATGACCCCACCTGTGGCCCCATCATTGACGATATTG
CTATCAAGAAAGTTTTTATTCCTGATAGACCTAAAGACAATGCGGTGAACAATGGAGATTTCGAGTCCGGTCCATGGATGTTTAGGAACGGTTCACTGGGGGTCTTGATC
CCGACTAATCTGGACGAAGAAACGTCGTCGTTGCCGGGTTGGATAGTGGAATCTAACCGGGCGGTCCGATATATCGACTCGTACCATTTCAACGTCCCACAAGGCAAACG
AGCCATCGAATTGCTTTCAGGGAAAGAAGGCATAATTTCTCAAATGGTGGAAACGACCCCGGAAAAGCCGTACACCATGACGTTCTCTCTAGGCCAGGCCGGCGACAAGT
GCAAGCAGCCACTTGCCGTGATGGCGTTCGCCGGAGATCAGGCTCAGAACTTCCACTACACCGGCCCCGATTCCAACTCCTCGTTTCAAAGCGTGAACCTCAACTTCACG
GCCAAGGCCGACAGAACCAGGATTGCCTTCTACAGTGTTTATTACAATACGAGGACCGACGATATGAGCTCCCTCTGTGGCCCGGTCGTCGATGATGTCAGGGTTTGGTT
TTCTTCCTCTCGTAGAAATGGGCCTGGATATTGGGCTTTCGGAGTTGGACTTGGGCTTTGGTTACTTGTTTGGGCCTTGGTTTAG
mRNA sequence
AGAGAGGTCAGAGCAAAGCCACTGTTACTGAGGTGAGAGAGAGAGATTTGGGGAAGAAAATGGCGGATAGCCCTAAATTTCGCAAGTGGGCGTCTCTTATTCTGCTTGTT
TTTGCTCATTTGGCTTCTGAAACCGTGGCTGAAGATGGGTTGGTGGCAAACGGGGATTTCGAAACACTCCCATCTGGCGGGTTCCCAACGGACGGAGAAATAGAAGGGCC
CACCTCGATTCCAAGCTGGACACTAAACGGCACCGTTGAGCTAGTTGAGTCTGGGCAAAAACAAGGTGGGATGATCCTCATCGTACCGGAAGGAAGACACGCAGTGAGAT
TAGGGAACGACGCCGAAATAAGCCAAGAGCTGAAAGTGGAGAAAGGGGCACTGTATTCAGTCACCTTCAGTGCTGCTCGCACGTGCGCCCAGCTCGAGTCCCTCAACGTG
TCGGTGCCACCTGCATCACAGACCATAGACCTTCAGACTTTGTACAGCGTCCAGGGGTGGGACCCTTACACTTACGCTTTCGAACCCGATGATGAAACGGTGCGTTTGGT
CTTCCGCAACCCTGGCATGGAGGATGACCCCACCTGTGGCCCCATCATTGACGATATTGCTATCAAGAAAGTTTTTATTCCTGATAGACCTAAAGACAATGCGGTGAACA
ATGGAGATTTCGAGTCCGGTCCATGGATGTTTAGGAACGGTTCACTGGGGGTCTTGATCCCGACTAATCTGGACGAAGAAACGTCGTCGTTGCCGGGTTGGATAGTGGAA
TCTAACCGGGCGGTCCGATATATCGACTCGTACCATTTCAACGTCCCACAAGGCAAACGAGCCATCGAATTGCTTTCAGGGAAAGAAGGCATAATTTCTCAAATGGTGGA
AACGACCCCGGAAAAGCCGTACACCATGACGTTCTCTCTAGGCCAGGCCGGCGACAAGTGCAAGCAGCCACTTGCCGTGATGGCGTTCGCCGGAGATCAGGCTCAGAACT
TCCACTACACCGGCCCCGATTCCAACTCCTCGTTTCAAAGCGTGAACCTCAACTTCACGGCCAAGGCCGACAGAACCAGGATTGCCTTCTACAGTGTTTATTACAATACG
AGGACCGACGATATGAGCTCCCTCTGTGGCCCGGTCGTCGATGATGTCAGGGTTTGGTTTTCTTCCTCTCGTAGAAATGGGCCTGGATATTGGGCTTTCGGAGTTGGACT
TGGGCTTTGGTTACTTGTTTGGGCCTTGGTTTAGGCCTCATTTTGGGTTCCGCGGAATTGTCGGCTTCCCACGTTTATCGGCTTGGAGTTGTGGCATGGCAAGAACCCCT
CCCTACCCAAAAGCTTTCTCTTTGTATTACGTGCTTCGCTTTAGAATTTAAAAATGTGATCAATTAGGAGGGTGTATGATATAATATTGTCCACTTTCAGCATAAGTTCT
GGTGGTTTTGCTTTGGGCTTCC
Protein sequence
MADSPKFRKWASLILLVFAHLASETVAEDGLVANGDFETLPSGGFPTDGEIEGPTSIPSWTLNGTVELVESGQKQGGMILIVPEGRHAVRLGNDAEISQELKVEKGALYS
VTFSAARTCAQLESLNVSVPPASQTIDLQTLYSVQGWDPYTYAFEPDDETVRLVFRNPGMEDDPTCGPIIDDIAIKKVFIPDRPKDNAVNNGDFESGPWMFRNGSLGVLI
PTNLDEETSSLPGWIVESNRAVRYIDSYHFNVPQGKRAIELLSGKEGIISQMVETTPEKPYTMTFSLGQAGDKCKQPLAVMAFAGDQAQNFHYTGPDSNSSFQSVNLNFT
AKADRTRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWFSSSRRNGPGYWAFGVGLGLWLLVWALV
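
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (the page does not name a translation table). `translate` and `CODON_TABLE` are hypothetical names, and the code uses only the Python standard library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCGGATAGCCCTAAATTTCGCAAGTGG"))  # MADSPKFRKW
```

Passing the full CDS, with its line breaks, should reproduce the protein sequence above, since the final codon TAG is a stop codon and is not emitted.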