CuGenDBv2

Sgr026603 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr026603
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00153033:1722194..1729156
RNA-Seq Expression: Sgr026603
Synteny: Sgr026603
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136915.2 uncharacterized protein LOC101209598 [Cucumis sativus] (e value 1.9e-92, % identity 80.45)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG
        MEEV+  HESKEKIF++FKEFMAS+AKL+ELGTLGS+LLSG++QGLELLRRP+IN TSKLIENVIE +NT+NLRSY E GCINTHDG QST KLHTCRVG
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG

Query:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL
        LDDHLKKARSL+DELERL +DVNI LETENP C ST+SDED+E   EEA +P KKPDAN+YA+LMGI+KVM+KK+  MQEKIISGLSLKSSSGELETYCL
Subjt:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL

Query:  MWSLRPYIDDEIMRQAWKLI
        MWSL+PYIDDEIMR AWKL+
Subjt:  MWSLRPYIDDEIMRQAWKLI

XP_008455091.1 PREDICTED: uncharacterized protein LOC103495347 isoform X3 [Cucumis melo] (e value 8.0e-91, % identity 80)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG
        MEEVK  HESKEKIF++FKEFMAS+AKL+ELGT GS+LLSG++QGLELLRRP+IN TSKLIENVIET+NT+NLR+Y E GCINTHDG QST KLHTCRVG
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG

Query:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL
        LDDHLKKARSL+DELERL +DVNI LETENP CTST+SDED+E   EEA +P KK DAN+YALLMGI+KVM+KK+  MQEKIISGLSLKSSSGELETYCL
Subjt:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL

Query:  MWSLRPYIDDEIMRQAWKLI
        MWSL+PYIDD IM  AWKL+
Subjt:  MWSLRPYIDDEIMRQAWKLI

XP_022150735.1 uncharacterized protein LOC111018794 [Momordica charantia] (e value 3.8e-93, % identity 81.82)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG
        MEEV   H SKEKIFK+F+EFMAS+AKLEELGTLGS+ LSG QQGLELLRRPAINR+SKLIE+VIETNNT+NL+SYFE GCINTHDGVQST KLHTCR+G
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG

Query:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL
        LDDHLKKARSL+DELE LL++ NIALETEN P  STVSDEDVE   EEA +PD+K DANEYAL MGIIK MVKKD+ MQEKIISGLSLKSSSGELETYC+
Subjt:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL

Query:  MWSLRPYIDDEIMRQAWKLI
        MWSLRPYIDDEI+R+AWKL+
Subjt:  MWSLRPYIDDEIMRQAWKLI

XP_038887975.1 uncharacterized protein LOC120077932 isoform X1 [Benincasa hispida] (e value 1.5e-97, % identity 85)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG
        MEEVK   +SKEKIF++FKEFMAS+AKLEELGTLGS+LLSG+QQGLELLRRPAINRTSKLIENVIETNNT++LRSY E GCINTHD VQST KLH+CRVG
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG

Query:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVESNE-EAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL
        LDDHLKKARS++D+LERLL+DVNIALETENPPC+STVSDED+E NE EA +P+KKPDANEYALLMGIIKVMVKK+  MQEKIISGLSLKSSSGELETYCL
Subjt:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVESNE-EAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL

Query:  MWSLRPYIDDEIMRQAWKLI
        MWSL+PYIDDEIM QAWKL+
Subjt:  MWSLRPYIDDEIMRQAWKLI

XP_038887977.1 uncharacterized protein LOC120077932 isoform X2 [Benincasa hispida] (e value 1.2e-89, % identity 86.43)
Query:  MASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVGLDDHLKKARSLVDELERLLDD
        MAS+AKLEELGTLGS+LLSG+QQGLELLRRPAINRTSKLIENVIETNNT++LRSY E GCINTHD VQST KLH+CRVGLDDHLKKARS++D+LERLL+D
Subjt:  MASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVGLDDHLKKARSLVDELERLLDD

Query:  VNIALETENPPCTSTVSDEDVESNE-EAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCLMWSLRPYIDDEIMRQAWKLI
        VNIALETENPPC+STVSDED+E NE EA +P+KKPDANEYALLMGIIKVMVKK+  MQEKIISGLSLKSSSGELETYCLMWSL+PYIDDEIM QAWKL+
Subjt:  VNIALETENPPCTSTVSDEDVESNE-EAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCLMWSLRPYIDDEIMRQAWKLI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K2B6 Uncharacterized protein (e value 9.2e-93, % identity 80.45)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG
        MEEV+  HESKEKIF++FKEFMAS+AKL+ELGTLGS+LLSG++QGLELLRRP+IN TSKLIENVIE +NT+NLRSY E GCINTHDG QST KLHTCRVG
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG

Query:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL
        LDDHLKKARSL+DELERL +DVNI LETENP C ST+SDED+E   EEA +P KKPDAN+YA+LMGI+KVM+KK+  MQEKIISGLSLKSSSGELETYCL
Subjt:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL

Query:  MWSLRPYIDDEIMRQAWKLI
        MWSL+PYIDDEIMR AWKL+
Subjt:  MWSLRPYIDDEIMRQAWKLI

A0A1S3C0U3 uncharacterized protein LOC103495347 isoform X3 (e value 3.9e-91, % identity 80)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG
        MEEVK  HESKEKIF++FKEFMAS+AKL+ELGT GS+LLSG++QGLELLRRP+IN TSKLIENVIET+NT+NLR+Y E GCINTHDG QST KLHTCRVG
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG

Query:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL
        LDDHLKKARSL+DELERL +DVNI LETENP CTST+SDED+E   EEA +P KK DAN+YALLMGI+KVM+KK+  MQEKIISGLSLKSSSGELETYCL
Subjt:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL

Query:  MWSLRPYIDDEIMRQAWKLI
        MWSL+PYIDD IM  AWKL+
Subjt:  MWSLRPYIDDEIMRQAWKLI

A0A1S4E157 uncharacterized protein LOC103495347 isoform X2 (e value 3.6e-89, % identity 77.88)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGL------ELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKL
        MEEVK  HESKEKIF++FKEFMAS+AKL+ELGT GS+LLSG++QGL      ELLRRP+IN TSKLIENVIET+NT+NLR+Y E GCINTHDG QST KL
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGL------ELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKL

Query:  HTCRVGLDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGE
        HTCRVGLDDHLKKARSL+DELERL +DVNI LETENP CTST+SDED+E   EEA +P KK DAN+YALLMGI+KVM+KK+  MQEKIISGLSLKSSSGE
Subjt:  HTCRVGLDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGE

Query:  LETYCLMWSLRPYIDDEIMRQAWKLI
        LETYCLMWSL+PYIDD IM  AWKL+
Subjt:  LETYCLMWSLRPYIDDEIMRQAWKLI

A0A6J1D9B8 uncharacterized protein LOC111018794 (e value 1.9e-93, % identity 81.82)
Query:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG
        MEEV   H SKEKIFK+F+EFMAS+AKLEELGTLGS+ LSG QQGLELLRRPAINR+SKLIE+VIETNNT+NL+SYFE GCINTHDGVQST KLHTCR+G
Subjt:  MEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVG

Query:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL
        LDDHLKKARSL+DELE LL++ NIALETEN P  STVSDEDVE   EEA +PD+K DANEYAL MGIIK MVKKD+ MQEKIISGLSLKSSSGELETYC+
Subjt:  LDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE-SNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCL

Query:  MWSLRPYIDDEIMRQAWKLI
        MWSLRPYIDDEI+R+AWKL+
Subjt:  MWSLRPYIDDEIMRQAWKLI

A0A6J1FDP7 uncharacterized protein LOC111444445 (e value 6.2e-89, % identity 73.05)
Query:  GDRNSLGSNKPIRSWLIGKFDEGNVSILIEMEEVKLVHESKEKIFKMFKE---FMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIET
        GD   LG NK IR  +IG F E       E +++KL + S+EK F  F +      S+ +LEELGTLGS+LLSGVQQGLELLRRPAINRTSKLIENV+ET
Subjt:  GDRNSLGSNKPIRSWLIGKFDEGNVSILIEMEEVKLVHESKEKIFKMFKE---FMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIET

Query:  NNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVGLDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE---SNEEAIIPDKKPDANEYALLM
        NNT++LRSYFE GCI THDG+QS+ KLHTCRVGLDDHLKK RSL +ELERLLDDVNIALE EN  C STVSDED +   ++EEA IPDKKPDANEYALLM
Subjt:  NNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVGLDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVE---SNEEAIIPDKKPDANEYALLM

Query:  GIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCLMWSLRPYIDDEIMRQAWKLIP
        GIIKVMVKKD  MQEKIISGLSLKSSSGELETY LMWSLRPYIDDEI+++AWKLIP
Subjt:  GIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCLMWSLRPYIDDEIMRQAWKLIP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G49645.1 unknown protein (e value 1.5e-50, % identity 49.77)
Query:  ESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVGLDDHLKKA
        E K+KI ++FK+FM  I +LEELG   +  L   QQGL  L+RP I  +SKLIEN+I+ N T+ L+SY E GCIN HD  QST  LHT   GL DHL KA
Subjt:  ESKEKIFKMFKEFMASIAKLEELGTLGSKLLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVGLDDHLKKA

Query:  RSLVDELERLLDDVNIALETENPPCTSTVSDEDV----------ESNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYC
        ++L+ ELERL D+  +A+E      ++T  D+D           E NE   +P + P+  EYA L+ +I  M+K+++ MQ+KI+  LSLKSSSGELETY 
Subjt:  RSLVDELERLLDDVNIALETENPPCTSTVSDEDV----------ESNEEAIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYC

Query:  LMWSLRPYIDDEIMRQAWKLI
        LMWSLRP+++DEI+ +AWK I
Subjt:  LMWSLRPYIDDEIMRQAWKLI


Sequences
CDS sequence
ATGTTAACGGAACAACGTAGGATAATTCCAAGAACTAGAAAAGCCGGGAAACCAAGAACGAGAGGCATAAATATTAAAAGGGATGATAGACAAGGGACATTGGCATCTGC
AAGAACTCTAGAAACTACAGGGAAATCCGGGTCGTGTGCACTCAGTGGAAACTTCCGTGGTAGGCAGGCGTGCGCAAGCGAAACCAATGGAGCTCGAGTTTACTCCTTCC
CTGTCTTCTCCATCGGGCTCTCCCTCACCGTCTCTCACTCTCTGTTTACAGCGGCTGTGGGCGGCCGTCAGGATCTTTCTCTCTATCTCTCTCTCCTCTCCGCCGTCGCC
AGTCTCTTTGAGTTCCAGAAGTTTGTTTCTCCTTGCCAGTTTTTTTTTTTTTTTGTTCTCTCTTCGCATAATCCGCAAACGAAACTCTCTTCTCTCTCGCCTTCTCCCTG
GTTCAAGTGGCAAACAGCAGGAGACCGAAACTCTCTTGGATCGAACAAGCCAATTCGAAGTTGGTTGATAGGAAAATTCGACGAAGGTAATGTAAGCATACTAATTGAAA
TGGAAGAAGTTAAGCTTGTTCACGAGTCAAAAGAGAAGATTTTCAAGATGTTTAAAGAATTCATGGCTAGCATTGCGAAGCTCGAGGAATTGGGGACTCTAGGAAGCAAG
TTGCTTTCTGGCGTTCAGCAAGGGCTTGAGCTTCTTCGACGACCTGCGATAAATAGAACATCAAAGTTGATTGAGAATGTCATTGAAACTAACAATACGAAGAATCTTAG
ATCGTACTTTGAAGTTGGATGCATCAACACTCATGATGGTGTGCAAAGTACAAACAAGTTGCATACTTGTCGGGTTGGACTGGATGATCATCTGAAAAAAGCAAGGAGCT
TAGTCGACGAACTCGAACGCCTACTCGATGATGTAAATATTGCGTTGGAAACTGAAAATCCACCGTGCACTTCAACTGTTTCAGATGAAGATGTAGAATCGAATGAAGAA
GCCATCATTCCTGATAAGAAACCTGATGCCAATGAATATGCTTTATTAATGGGGATCATCAAAGTTATGGTCAAGAAAGACTTCACAATGCAGGAGAAGATTATTTCTGG
TCTAAGTCTCAAATCATCCTCCGGAGAACTCGAAACATACTGCCTAATGTGGTCGTTACGACCATATATAGACGATGAAATTATGCGTCAAGCTTGGAAACTCATTCCGT
GA
mRNA sequence
ATGTTAACGGAACAACGTAGGATAATTCCAAGAACTAGAAAAGCCGGGAAACCAAGAACGAGAGGCATAAATATTAAAAGGGATGATAGACAAGGGACATTGGCATCTGC
AAGAACTCTAGAAACTACAGGGAAATCCGGGTCGTGTGCACTCAGTGGAAACTTCCGTGGTAGGCAGGCGTGCGCAAGCGAAACCAATGGAGCTCGAGTTTACTCCTTCC
CTGTCTTCTCCATCGGGCTCTCCCTCACCGTCTCTCACTCTCTGTTTACAGCGGCTGTGGGCGGCCGTCAGGATCTTTCTCTCTATCTCTCTCTCCTCTCCGCCGTCGCC
AGTCTCTTTGAGTTCCAGAAGTTTGTTTCTCCTTGCCAGTTTTTTTTTTTTTTTGTTCTCTCTTCGCATAATCCGCAAACGAAACTCTCTTCTCTCTCGCCTTCTCCCTG
GTTCAAGTGGCAAACAGCAGGAGACCGAAACTCTCTTGGATCGAACAAGCCAATTCGAAGTTGGTTGATAGGAAAATTCGACGAAGGTAATGTAAGCATACTAATTGAAA
TGGAAGAAGTTAAGCTTGTTCACGAGTCAAAAGAGAAGATTTTCAAGATGTTTAAAGAATTCATGGCTAGCATTGCGAAGCTCGAGGAATTGGGGACTCTAGGAAGCAAG
TTGCTTTCTGGCGTTCAGCAAGGGCTTGAGCTTCTTCGACGACCTGCGATAAATAGAACATCAAAGTTGATTGAGAATGTCATTGAAACTAACAATACGAAGAATCTTAG
ATCGTACTTTGAAGTTGGATGCATCAACACTCATGATGGTGTGCAAAGTACAAACAAGTTGCATACTTGTCGGGTTGGACTGGATGATCATCTGAAAAAAGCAAGGAGCT
TAGTCGACGAACTCGAACGCCTACTCGATGATGTAAATATTGCGTTGGAAACTGAAAATCCACCGTGCACTTCAACTGTTTCAGATGAAGATGTAGAATCGAATGAAGAA
GCCATCATTCCTGATAAGAAACCTGATGCCAATGAATATGCTTTATTAATGGGGATCATCAAAGTTATGGTCAAGAAAGACTTCACAATGCAGGAGAAGATTATTTCTGG
TCTAAGTCTCAAATCATCCTCCGGAGAACTCGAAACATACTGCCTAATGTGGTCGTTACGACCATATATAGACGATGAAATTATGCGTCAAGCTTGGAAACTCATTCCGT
GA
Protein sequence
MLTEQRRIIPRTRKAGKPRTRGINIKRDDRQGTLASARTLETTGKSGSCALSGNFRGRQACASETNGARVYSFPVFSIGLSLTVSHSLFTAAVGGRQDLSLYLSLLSAVA
SLFEFQKFVSPCQFFFFFVLSSHNPQTKLSSLSPSPWFKWQTAGDRNSLGSNKPIRSWLIGKFDEGNVSILIEMEEVKLVHESKEKIFKMFKEFMASIAKLEELGTLGSK
LLSGVQQGLELLRRPAINRTSKLIENVIETNNTKNLRSYFEVGCINTHDGVQSTNKLHTCRVGLDDHLKKARSLVDELERLLDDVNIALETENPPCTSTVSDEDVESNEE
AIIPDKKPDANEYALLMGIIKVMVKKDFTMQEKIISGLSLKSSSGELETYCLMWSLRPYIDDEIMRQAWKLIP