; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016573 (gene) of Snake gourd v1 genome

Gene IDTan0016573
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMyosin heavy chain, striated muscle
Genome locationLG11:774351..779662
RNA-Seq ExpressionTan0016573
SyntenyTan0016573
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011649890.1 myosin heavy chain, striated muscle [Cucumis sativus]1.9e-11182.05Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQA KIS++EH+H  TIQTMENDL SAKSELKQL ED ER+M+AKGEICSQILEKQRKIASLE+DI  LSQTLELIQQEKVSLGAKIIEKS YYTKV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        AEDISLKFQDQQDWVNANMIR EV E +LVKLE+AK A + E  +DTVGGIS T IY +PNNLV E+++LLGKLESA+AKLS+VSK KCAVVLE SKIEQ
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL
        SIEE+KN +NDFKPELRAMD VTLEEEYKAL SD+A ETEYSQSLQD+IAKLKGIS VIKC CGKEYKAGVGL
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL

XP_022939802.1 uncharacterized protein LOC111445570 isoform X3 [Cucurbita moschata]5.4e-11182.66Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQ SKISVEEH+H  TI+TMENDL +AKSELKQ  ED ER+MRAKGEICSQILE+QRKI SLE DICTLSQTLELIQQEKVSLGAKIIEKSNYY KV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        +EDISLKFQDQQDWVNANMIR E  EH LVK ETAK   E E S DTVGGISGT IYC PNNLVEE+K+    LESAK KLSQVSKMKCAVVLENSKI Q
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGV
        SIEEVKN +NDFKPELRAMD VTLEEE KAL SDKA ETEYS+SLQDQIAKLK ISRVIKC CGKEYKAG+
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGV

XP_022993971.1 uncharacterized protein LOC111489811 isoform X1 [Cucurbita maxima]1.7e-11282.42Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQ SKISVEEH+H  TI+TMENDL +AKSELKQL ED ER+MRAKGEICSQILE+QRKI SLE DICTLSQTLELIQQEKVSLGAKIIEKS YY KV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        +EDISLKFQDQQDWVNANMIR E   HELVK ETAK   E E S DTVGGISGT IYC+ N +VEE+K+LLGKLESAKAKLSQVSKMKCAVVLENSKI Q
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL
        SIEEVKN +NDFKPELRAMD VTLEEE KAL SDKA ETEYS+SLQDQIAKLK IS VIKC CGKEYK G+ L
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL

XP_023550047.1 uncharacterized protein LOC111808353 isoform X1 [Cucurbita pepo subsp. pepo]5.6e-11684.87Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQ SKISVEEH+H  TI+TMENDL +AKSELKQL ED ER+MRAKGEICSQILE+QRKI SLE DICTLSQTLELIQQEKVSLGAKIIEKSNYY KV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        +EDISLKFQDQQDWVNANMIR E  EHELV  ETAK   E E S DTVGGISGT IYC PNNLVEE+K+LLGKLESAKAKLSQVSKMKCAVVLEN KI Q
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGV
        SIEEVKN +NDFKPELRAMD VTLEEE KAL SDKA ETEYSQSLQDQIAKLK ISRVIKC CGKEYKAG+
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGV

XP_038885088.1 myosin heavy chain, striated muscle isoform X1 [Benincasa hispida]5.9e-11885.35Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQA KISVEEH+H  TIQTMENDL SAKSELKQL ED ER+MRAKGEIC QILEKQRKIASLE+DI TLSQTL+LIQQEKVSLGAKIIEKS YY KV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        AE+ISLKFQDQQDWVNANMIRREVGEHELVKLETAK A E E  SDTVGGISGT IYC+PNNLVEE+K+LLGKLESA+AKLSQV K KCA+VLE SKIEQ
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL
        SIEE+KN +NDFKPELRAMD VTLEEE KAL SD+A ETEYS+SLQDQIAKLKGISRVIKC CGKEY AGVGL
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL

TrEMBL top hitse value%identityAlignment
A0A0A0LMF4 Uncharacterized protein2.8e-12169.92Show/hide
Query:  EPNYRLQLLALSTVKDGRVFAVHEDIALTNERYVPPPPSPLDQIQVLTYRNIL-----FAFLWISNTNLDRFCIKLDLDVEDQASKISVEEHIHLITIQT
        E   + Q L L TV +GRVFAVHEDIAL NERY       L  +    ++  L      ++   SNTNLD FCI L LDVEDQA KIS++EH+H  TIQT
Subjt:  EPNYRLQLLALSTVKDGRVFAVHEDIALTNERYVPPPPSPLDQIQVLTYRNIL-----FAFLWISNTNLDRFCIKLDLDVEDQASKISVEEHIHLITIQT

Query:  MENDL------ISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKVAEDISLKFQDQQDWVN
        MENDL       S KSELKQL ED ER+M+AKGEICSQILEKQRKIASLE+DI  LSQTLELIQQEKVSLGAKIIEKS YYTKVAEDISLKFQDQQDWVN
Subjt:  MENDL------ISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKVAEDISLKFQDQQDWVN

Query:  ANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSK------------IEQSIEE
        ANMIR EV E +LVKLE+AK A + E  +DTVGGIS T IY +PNNLV E+++LLGKLESA+AKLS+VSK KCAVVLE SK            IEQSIEE
Subjt:  ANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSK------------IEQSIEE

Query:  VKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL
        +KN +NDFKPELRAMD VTLEEEYKAL SD+A ETEYSQSLQD+IAKLKGIS VIKC CGKEYKAGVGL
Subjt:  VKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL

A0A1S3B2U9 uncharacterized protein LOC103485401 isoform X11.3e-11081.68Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQA KISV+EH+H  TIQTMENDL SAKSELKQL ED ER+M+AKGEICSQILEKQRKIASLE+D+ TLSQTLELIQQEKVSLGAKIIEKS YYTKV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        AEDI+LKFQDQQDWVNANMIR EV E +LVKLE AK A E E  SD VGGISGT IY +P NLV E ++LLGKLESA+AKLS+VSK KCAVVLE SKI+Q
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL
        SIEE+KN +NDFKPELRAMD VTLEEEYKAL SD+A ETEYS+SLQD+IAKLKGIS VIKC CGKEYKAGVGL
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL

A0A6J1DG39 uncharacterized protein LOC1110201955.8e-11174.59Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQASKISVEEH+   TIQTMENDLISAKSELKQL ED E++MRAKGEICSQIL KQRKIASLE+DI TLSQTLELIQQEKVSLGAKIIEKS YYTK 
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        AE+I+LKFQD QDWVNANMIRREV EHELVKL+TA+ A E E SSDT+ GISGT IYC+P NLVEEKK+LLGKLESAKAKLSQV+KMKCAV+LENSKI Q
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDF----------------------------------KPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKE
        SIEEVK+R+N+F                                  +PELR MD+VTL+EEYKAL SDKA ETEYSQSLQDQIAKLKGISRVIKC CG+E
Subjt:  SIEEVKNRINDF----------------------------------KPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKE

Query:  YKAGVGL
        YKAGV L
Subjt:  YKAGVGL

A0A6J1FHU2 uncharacterized protein LOC111445570 isoform X32.6e-11182.66Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQ SKISVEEH+H  TI+TMENDL +AKSELKQ  ED ER+MRAKGEICSQILE+QRKI SLE DICTLSQTLELIQQEKVSLGAKIIEKSNYY KV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        +EDISLKFQDQQDWVNANMIR E  EH LVK ETAK   E E S DTVGGISGT IYC PNNLVEE+K+    LESAK KLSQVSKMKCAVVLENSKI Q
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGV
        SIEEVKN +NDFKPELRAMD VTLEEE KAL SDKA ETEYS+SLQDQIAKLK ISRVIKC CGKEYKAG+
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGV

A0A6J1JXU6 uncharacterized protein LOC111489811 isoform X18.1e-11382.42Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVEDQ SKISVEEH+H  TI+TMENDL +AKSELKQL ED ER+MRAKGEICSQILE+QRKI SLE DICTLSQTLELIQQEKVSLGAKIIEKS YY KV
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        +EDISLKFQDQQDWVNANMIR E   HELVK ETAK   E E S DTVGGISGT IYC+ N +VEE+K+LLGKLESAKAKLSQVSKMKCAVVLENSKI Q
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL
        SIEEVKN +NDFKPELRAMD VTLEEE KAL SDKA ETEYS+SLQDQIAKLK IS VIKC CGKEYK G+ L
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G33500.1 unknown protein4.6e-4443.48Show/hide
Query:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV
        DVED A+K+SVEE + + TI T+E DL  A SE K+L E+ ++  R +GEICS ILEKQRKI+S+E+D   ++Q+LELI QE+ SL AK++ K + Y K 
Subjt:  DVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGAKIIEKSNYYTKV

Query:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ
        AE+   K ++Q+ W  ++M             ET +   + E                  NNL+E         +SA+AKL Q   M+  ++ ENSKI+ 
Subjt:  AEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKIEQ

Query:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLK
        SIE VK++IN+FKPEL ++D+  LEEEY AL SD++ E EY  SLQ Q  KLK
Subjt:  SIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTAAGTTTTGCTCGAAAAAGAAAAGCGTCCATCTCGAAGGAGAAATCCAATTTTTTCCCTCCAAATTTGCCTCTCTCAGTTAGTTTAGAATTTATCGTATCGGA
GTGTGGCGCGCGCGCGTTTTCAATCACATTCCCCTCTTCAAGTTTTCTTGGCACTGTACAGAAACTTGAACCGAATTACAGGTTACAGCTGCTTGCTCTGTCAACTGTCA
AGGATGGAAGAGTATTTGCAGTACATGAAGACATTGCGCTTACAAATGAACGGTACGTACCGCCTCCTCCATCTCCATTGGACCAAATCCAAGTTTTAACTTACAGAAAT
ATATTGTTCGCGTTCTTGTGGATTAGTAACACTAATTTGGATCGATTCTGTATCAAACTCGACTTAGACGTGGAGGATCAAGCTTCGAAGATCTCCGTCGAAGAGCATAT
TCACTTGATTACTATTCAAACAATGGAGAACGATCTCATTTCTGCAAAAAGCGAGTTAAAACAACTCACAGAGGATGTTGAGCGAATAATGAGGGCAAAGGGTGAAATAT
GCTCCCAGATATTAGAAAAGCAAAGAAAAATAGCCTCTTTGGAGGCTGACATATGTACACTTTCACAGACACTCGAGCTCATTCAGCAAGAAAAAGTCAGCTTAGGAGCC
AAAATTATTGAGAAGAGTAATTATTATACCAAAGTTGCTGAGGACATCAGTCTCAAATTTCAAGATCAACAGGATTGGGTAAATGCTAACATGATTCGCAGAGAAGTGGG
AGAGCACGAATTGGTTAAGCTTGAAACTGCTAAGGGAGCAAGGGAAATGGAAGTATCTTCTGATACAGTTGGAGGGATCTCTGGCACTCATATTTACTGCGATCCGAACA
ATCTGGTGGAAGAGAAGAAAGAATTATTGGGCAAGTTGGAATCTGCTAAAGCCAAACTCAGTCAAGTTTCAAAGATGAAATGTGCAGTTGTTTTGGAGAATTCCAAGATT
GAACAGTCAATTGAGGAAGTTAAGAACAGAATAAATGATTTCAAGCCAGAACTCAGAGCAATGGATCTTGTTACATTGGAGGAGGAGTACAAGGCTCTCTTCTCAGATAA
AGCTAGAGAAACTGAGTACTCACAATCCCTTCAAGACCAAATTGCAAAACTGAAGGGAATTTCCCGTGTGATTAAATGTGCTTGTGGAAAGGAATACAAGGCTGGAGTAG
GCTTAGGTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTAAGTTTTGCTCGAAAAAGAAAAGCGTCCATCTCGAAGGAGAAATCCAATTTTTTCCCTCCAAATTTGCCTCTCTCAGTTAGTTTAGAATTTATCGTATCGGA
GTGTGGCGCGCGCGCGTTTTCAATCACATTCCCCTCTTCAAGTTTTCTTGGCACTGTACAGAAACTTGAACCGAATTACAGGTTACAGCTGCTTGCTCTGTCAACTGTCA
AGGATGGAAGAGTATTTGCAGTACATGAAGACATTGCGCTTACAAATGAACGGTACGTACCGCCTCCTCCATCTCCATTGGACCAAATCCAAGTTTTAACTTACAGAAAT
ATATTGTTCGCGTTCTTGTGGATTAGTAACACTAATTTGGATCGATTCTGTATCAAACTCGACTTAGACGTGGAGGATCAAGCTTCGAAGATCTCCGTCGAAGAGCATAT
TCACTTGATTACTATTCAAACAATGGAGAACGATCTCATTTCTGCAAAAAGCGAGTTAAAACAACTCACAGAGGATGTTGAGCGAATAATGAGGGCAAAGGGTGAAATAT
GCTCCCAGATATTAGAAAAGCAAAGAAAAATAGCCTCTTTGGAGGCTGACATATGTACACTTTCACAGACACTCGAGCTCATTCAGCAAGAAAAAGTCAGCTTAGGAGCC
AAAATTATTGAGAAGAGTAATTATTATACCAAAGTTGCTGAGGACATCAGTCTCAAATTTCAAGATCAACAGGATTGGGTAAATGCTAACATGATTCGCAGAGAAGTGGG
AGAGCACGAATTGGTTAAGCTTGAAACTGCTAAGGGAGCAAGGGAAATGGAAGTATCTTCTGATACAGTTGGAGGGATCTCTGGCACTCATATTTACTGCGATCCGAACA
ATCTGGTGGAAGAGAAGAAAGAATTATTGGGCAAGTTGGAATCTGCTAAAGCCAAACTCAGTCAAGTTTCAAAGATGAAATGTGCAGTTGTTTTGGAGAATTCCAAGATT
GAACAGTCAATTGAGGAAGTTAAGAACAGAATAAATGATTTCAAGCCAGAACTCAGAGCAATGGATCTTGTTACATTGGAGGAGGAGTACAAGGCTCTCTTCTCAGATAA
AGCTAGAGAAACTGAGTACTCACAATCCCTTCAAGACCAAATTGCAAAACTGAAGGGAATTTCCCGTGTGATTAAATGTGCTTGTGGAAAGGAATACAAGGCTGGAGTAG
GCTTAGGTGCATGA
Protein sequenceShow/hide protein sequence
MQVSFARKRKASISKEKSNFFPPNLPLSVSLEFIVSECGARAFSITFPSSSFLGTVQKLEPNYRLQLLALSTVKDGRVFAVHEDIALTNERYVPPPPSPLDQIQVLTYRN
ILFAFLWISNTNLDRFCIKLDLDVEDQASKISVEEHIHLITIQTMENDLISAKSELKQLTEDVERIMRAKGEICSQILEKQRKIASLEADICTLSQTLELIQQEKVSLGA
KIIEKSNYYTKVAEDISLKFQDQQDWVNANMIRREVGEHELVKLETAKGAREMEVSSDTVGGISGTHIYCDPNNLVEEKKELLGKLESAKAKLSQVSKMKCAVVLENSKI
EQSIEEVKNRINDFKPELRAMDLVTLEEEYKALFSDKARETEYSQSLQDQIAKLKGISRVIKCACGKEYKAGVGLGA