CuGenDBv2

Clc01G03940 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G03940
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Unknown protein
Genome location: ClcChr01:3751415..3754773
RNA-Seq Expression: Clc01G03940
Synteny: Clc01G03940
Gene Ontology terms: GO:0003729 - mRNA binding (molecular function)
InterPro domains: NA
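
The genome location field above can be split into a chromosome name and coordinates. The following is a minimal sketch; the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("ClcChr01:3751415..3754773")

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3359 bp on ClcChr01, which is longer than the mRNA sequence listed further down and so consistent with the presence of introns.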


Homology

GenBank top hits
KAA0039677.1 uncharacterized protein E6C27_scaffold558G00080 [Cucumis melo var. makuwa]; e value: 4.9e-112; % identity: 92.53
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAALSFGFTATTIHLPS+SI  RSR  P TITC+GWDPEGLFG+P+TGHIARNEFKRRLEKDAEAREAFERHVREEKERR ALRESR+ PD+VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKY+TA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

XP_004147583.2 uncharacterized protein LOC101214026 [Cucumis sativus]; e value: 4.6e-110; % identity: 91.7
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAALSFGFT TTIHLPS+SI  RSR  P TITCVGWDPEGLFG+P+TGHIARNEFKRRLEKDAEAREAFERHVREEKERR  LRESR+IP +VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRLNEEFFS VKLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDM+E+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

XP_008437135.1 PREDICTED: uncharacterized protein LOC103482648 [Cucumis melo]; e value: 1.3e-112; % identity: 92.95
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAALSFGFTATTIHLPS+SI  RSR  P TITC+GWDPEGLFG+P+TGHIARNEFKRRLEKDAEAREAFERHVREEKERR ALRESR+IPD+VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKY+TA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

XP_022958721.1 uncharacterized protein LOC111459862 [Cucurbita moschata]; e value: 4.0e-106; % identity: 87.55
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAAL FGFTATTIH+ S+SILTR+R +P TITCVGWDPEG+FG P+TGHIAR EFKRRLE+DAEAREAFER VREEKERR  LR SR++P++VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRL EEFFS +KLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQ GNQK AAAFMEKVRGAVLKYMTA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

XP_038874696.1 uncharacterized protein LOC120067243 [Benincasa hispida]; e value: 2.5e-116; % identity: 95.02
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAALSFGFTATTIH PSSSILTRSR SP TITC+GWDPEGLFGRP+TGHIARNEFKRRLEKDAEAREAFERHVREEKERR ALRESR+IPD+VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

TrEMBL top hits
A0A0A0KKI9 Uncharacterized protein; e value: 2.2e-110; % identity: 91.7
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAALSFGFT TTIHLPS+SI  RSR  P TITCVGWDPEGLFG+P+TGHIARNEFKRRLEKDAEAREAFERHVREEKERR  LRESR+IP +VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRLNEEFFS VKLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDM+E+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

A0A1S3AT96 uncharacterized protein LOC103482648; e value: 6.3e-113; % identity: 92.95
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAALSFGFTATTIHLPS+SI  RSR  P TITC+GWDPEGLFG+P+TGHIARNEFKRRLEKDAEAREAFERHVREEKERR ALRESR+IPD+VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKY+TA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

A0A5A7TDT6 Uncharacterized protein; e value: 2.4e-112; % identity: 92.53
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAALSFGFTATTIHLPS+SI  RSR  P TITC+GWDPEGLFG+P+TGHIARNEFKRRLEKDAEAREAFERHVREEKERR ALRESR+ PD+VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKY+TA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

A0A6J1H2V9 uncharacterized protein LOC111459862; e value: 2.0e-106; % identity: 87.55
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAAL FGFTATTIH+ S+SILTR+R +P TITCVGWDPEG+FG P+TGHIAR EFKRRLE+DAEAREAFER VREEKERR  LR SR++P++VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRL EEFFS +KLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQ GNQK AAAFMEKVRGAVLKYMTA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

A0A6J1KA41 uncharacterized protein LOC111491485; e value: 2.2e-105; % identity: 87.14
Query:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF
        MAAL FGFTATTIH+ S+SILTR+R  P TITCVGWDPEG+FG P+TGHIAR EFKRRLE+DAEAREAFE  VREEKERR  LR SR++P++VT LIEYF
Subjt:  MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYF

Query:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE
        LDTEAQDIEFEIARLRPRL EEFFS +KLELGELRFAV KTEAMEDRVIELEALQKALEEGIEAYDKMQ ELVKAREGLTKILTSKDVKATLLDMVE+NE
Subjt:  LDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNE

Query:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
        LNRSLLALLDENIANAQ GNQK AAAFMEKVRGAVLKYMTA
Subjt:  LNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMTA

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G62780.1 unknown protein; e value: 2.0e-79; % identity: 65.7
Query:  MAALSFGF--TATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIE
        MA LSFG    ATT+       + R  S    ITC  WDP+G+ G  +TGHIAR EFKRRLE+D+EAREAF++ +REEKERR ALR+SR++PD   +LIE
Subjt:  MAALSFGF--TATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIE

Query:  YFLDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQ
        YFLDTEAQ+IE+EIARLR RLN+EFF+ ++LE+G++RFAVTKT  +EDR+IELE LQKALEEGIEAYDKMQ EL+ A   LTK+LTS D+K TLLDMVE+
Subjt:  YFLDTEAQDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQ

Query:  NELNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMT
        N++NRSLLALLDENIANA  GNQK+A  +MEK+R +VLKY+T
Subjt:  NELNRSLLALLDENIANAQMGNQKQAAAFMEKVRGAVLKYMT


Sequences

CDS sequence
ATGGCCGCTCTTAGCTTCGGCTTCACCGCCACTACCATCCACCTTCCATCAAGTTCGATTCTCACCCGCTCTCGCTCCAGTCCCCACACCATAACTTGCGTCGGA
TGGGACCCAGAAGGCCTCTTCGGACGGCCTGAGACTGGCCACATCGCTCGAAATGAGTTCAAAAGGCGGCTGGAGAAGGACGCCGAAGCTCGTGAGGCTTTCGAG
CGCCACGTCCGGGAAGAGAAAGAACGCCGCACAGCGCTTCGAGAGTCCCGTATGATTCCGGACGATGTGACTGACCTTATCGAGTACTTTCTTGATACTGAAGCT
CAGGATATTGAATTTGAAATCGCCAGGTTGAGACCCAGGCTGAACGAGGAGTTTTTTTCACATGTAAAACTTGAGCTGGGTGAGCTTAGATTTGCTGTTACGAAA
ACTGAGGCCATGGAAGATAGAGTGATTGAGTTGGAAGCATTACAGAAAGCACTTGAAGAAGGAATAGAAGCCTATGACAAAATGCAAGCGGAACTGGTAAAGGCA
AGGGAAGGCTTAACCAAAATTTTAACATCAAAGGATGTAAAAGCTACTTTGCTAGACATGGTTGAGCAAAACGAGCTCAATAGATCATTACTCGCCCTTCTCGAC
GAAAACATAGCCAACGCACAAATGGGTAACCAGAAACAAGCTGCTGCTTTCATGGAGAAGGTTCGAGGGGCAGTTCTCAAGTACATGACAGCTTAG

mRNA sequence
TTGTAACCCACCTCTACGCTGGTTTGGTTGGTTCAATTTTCCCAAAAAAAAAAATTATTATTATTTCCCTTGTTTTTTATTTTCTTTTTTCTATAAAGCCGAGGG
GCATGGTTGTAATTTAGAAAACAATACACTTTGAAAAAAACGAACGGTGCTCTTCTTTCCACGTGTTCGACGGTTACCATTCCCGCCACCGCGGCCGAACGGATA
ATGCTGCAGAGTACAGACTGAGACGAGGACCAAAAACCAGTGGCTAAGAATTCCGACCTTTTCATTTCTCACAGTGCCCATCGATGGCCGCTCTTAGCTTCGGCT
TCACCGCCACTACCATCCACCTTCCATCAAGTTCGATTCTCACCCGCTCTCGCTCCAGTCCCCACACCATAACTTGCGTCGGATGGGACCCAGAAGGCCTCTTCG
GACGGCCTGAGACTGGCCACATCGCTCGAAATGAGTTCAAAAGGCGGCTGGAGAAGGACGCCGAAGCTCGTGAGGCTTTCGAGCGCCACGTCCGGGAAGAGAAAG
AACGCCGCACAGCGCTTCGAGAGTCCCGTATGATTCCGGACGATGTGACTGACCTTATCGAGTACTTTCTTGATACTGAAGCTCAGGATATTGAATTTGAAATCG
CCAGGTTGAGACCCAGGCTGAACGAGGAGTTTTTTTCACATGTAAAACTTGAGCTGGGTGAGCTTAGATTTGCTGTTACGAAAACTGAGGCCATGGAAGATAGAG
TGATTGAGTTGGAAGCATTACAGAAAGCACTTGAAGAAGGAATAGAAGCCTATGACAAAATGCAAGCGGAACTGGTAAAGGCAAGGGAAGGCTTAACCAAAATTT
TAACATCAAAGGATGTAAAAGCTACTTTGCTAGACATGGTTGAGCAAAACGAGCTCAATAGATCATTACTCGCCCTTCTCGACGAAAACATAGCCAACGCACAAA
TGGGTAACCAGAAACAAGCTGCTGCTTTCATGGAGAAGGTTCGAGGGGCAGTTCTCAAGTACATGACAGCTTAGTATTATGGAGTTTTTACAACAAAGGAGGAGA
GTTTTAGCGGATGTTGAGATCGGTTTTGGTCATTCTTGGCCAAAACCCATGAGATTCCTTTTATTTTTGCATTGTAATCAACATTGTGGCATTCGGATTCAAGAT
TTAATGGCCAACTAAAGGCCCAAATGTCGATTTTGAGGTTTGTTGGGCACTTTTCAATCGATTTTTGAGATTTTGATGAATGTTCATGAGATCTTCTGATTTATT
CTTATTTTCTATTCTTTGCTTAAACTCATATGCAATGTATATTCCATTAATGTAACAAATTAGTCATCTTAAGCA

Protein sequence
MAALSFGFTATTIHLPSSSILTRSRSSPHTITCVGWDPEGLFGRPETGHIARNEFKRRLEKDAEAREAFERHVREEKERRTALRESRMIPDDVTDLIEYFLDTEA
QDIEFEIARLRPRLNEEFFSHVKLELGELRFAVTKTEAMEDRVIELEALQKALEEGIEAYDKMQAELVKAREGLTKILTSKDVKATLLDMVEQNELNRSLLALLD
ENIANAQMGNQKQAAAFMEKVRGAVLKYMTA
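
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the function `translate` and the variables `cds_start` and `cds_end` are hypothetical names, and the two fragments are the first 27 and last 18 bases of the CDS listed above, not the full sequence.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# Fragments copied from the start and end of the CDS in this record.
cds_start = "ATGGCCGCTCTTAGCTTCGGCTTCACC"
cds_end = "AAGTACATGACAGCTTAG"

print(translate(cds_start))  # first residues of the protein
print(translate(cds_end))    # last residues before the stop codon
```

To check the whole record, paste the full CDS (line breaks are ignored by `translate`) and compare the result with the protein sequence listed above.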