; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g39740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g39740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMyosin heavy chain, striated muscle
Genome locationchr4:29473937..29478303
RNA-Seq ExpressionMoc04g39740
SyntenyMoc04g39740
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152474.1 uncharacterized protein LOC111020195 [Momordica charantia]2.4e-9965.06Show/hide
Query:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL
        MEEYLQYMKTLRLQMN                             AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ   ++    +
Subjt:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL

Query:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
          GA ++                             SIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
Subjt:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY

Query:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSK--------------------------------------------------PELREMDVVT
        CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSK                                                  PELREMDVVT
Subjt:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSK--------------------------------------------------PELREMDVVT

Query:  LDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDLCG
        LDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDLCG
Subjt:  LDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDLCG

XP_022993973.1 uncharacterized protein LOC111489811 isoform X3 [Cucurbita maxima]2.7e-8264.11Show/hide
Query:  MEEYLQYMKTLRLQMNAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYLCAGACLVCFPMCRSNSCLATAFVMRDVIN
        MEEYLQYMKTLRLQM+AKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQ   ++    +  GA ++                      
Subjt:  MEEYLQYMKTLRLQMNAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYLCAGACLVCFPMCRSNSCLATAFVMRDVIN

Query:  SSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTK
               S YY K +E+I+LKFQD QDWVNANMIR E E HELVK +TA+R SETEGS DT+ GISGT+IYCN   +VEE+KDLLGKLESAKAKLSQV+K
Subjt:  SSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTK

Query:  MKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
        MKCAV+LENS                KPELR MD VTL+EE KALLSDKAGETEYS+SLQDQIAKLK IS VIKCTCG+EYK G+ L
Subjt:  MKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

XP_023550048.1 uncharacterized protein LOC111808353 isoform X2 [Cucurbita pepo subsp. pepo]9.2e-8362.75Show/hide
Query:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL
        MEEYLQYMKTLRLQM+                             AKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQ   ++    +
Subjt:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL

Query:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
          GA ++                             S YY K +E+I+LKFQD QDWVNANMIR E EEHELV  +TA+R SETEGS DT+ GISGT+IY
Subjt:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY

Query:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGV
        C+P NLVEE+KDLLGKLESAKAKLSQV+KMKCAV+LEN KPELR MD VTL+EE KALLSDKAGETEYSQSLQDQIAKLK ISRVIKCTCG+EYKAG+
Subjt:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGV

XP_023550050.1 uncharacterized protein LOC111808353 isoform X3 [Cucurbita pepo subsp. pepo]2.9e-8465.61Show/hide
Query:  MEEYLQYMKTLRLQMNAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYLCAGACLVCFPMCRSNSCLATAFVMRDVIN
        MEEYLQYMKTLRLQM+AKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQ   ++    +  GA ++                      
Subjt:  MEEYLQYMKTLRLQMNAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYLCAGACLVCFPMCRSNSCLATAFVMRDVIN

Query:  SSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTK
               S YY K +E+I+LKFQD QDWVNANMIR E EEHELV  +TA+R SETEGS DT+ GISGT+IYC+P NLVEE+KDLLGKLESAKAKLSQV+K
Subjt:  SSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTK

Query:  MKCAVILEN----------------SKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGV
        MKCAV+LEN                 KPELR MD VTL+EE KALLSDKAGETEYSQSLQDQIAKLK ISRVIKCTCG+EYKAG+
Subjt:  MKCAVILEN----------------SKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGV

XP_038885088.1 myosin heavy chain, striated muscle isoform X1 [Benincasa hispida]5.4e-8360.88Show/hide
Query:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL
        MEEYLQYMKTLR QMN                             AKSELKQL EDAE+MMRAKGEIC QIL KQRKIASLESDI+TLSQ   ++    +
Subjt:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL

Query:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
          GA ++                             S YY K AEEI+LKFQD QDWVNANMIRREV EHELVKL+TA+RASETEG SDT+ GISGT+IY
Subjt:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY

Query:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISR
        CNP NLVEE+KDLLGKLESA+AKLSQV K KCA++LE S                KPELR MD VTL+EE KALLSD+AGETEYS+SLQDQIAKLKGISR
Subjt:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISR

Query:  VIKCTCGEEYKAGVDLC
        VIKCTCG+EY AGV LC
Subjt:  VIKCTCGEEYKAGVDLC

TrEMBL top hitse value%identityAlignment
A0A1S3B3I4 uncharacterized protein LOC103485401 isoform X24.2e-8162Show/hide
Query:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL
        MEEYLQYMKTLR QMN                             AKSELKQL EDAE+MM+AKGEICSQIL KQRKIASLESD+STLSQ   ++    +
Subjt:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL

Query:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
          GA ++                             S YYTK AE+INLKFQD QDWVNANMIR EVEE +LVKL+ A++ASETEG SD + GISGT+IY
Subjt:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY

Query:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
         NPKNLV E +DLLGKLESA+AKLS+V+K KCAV+LE SKPELR MD VTL+EEYKALLSD+AGETEYS+SLQD+IAKLKGIS VIKCTCG+EYKAGV L
Subjt:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

A0A5D3C8F7 Myosin heavy chain, striated muscle7.1e-8160.44Show/hide
Query:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL
        MEEYLQYMKTLR QMN                             AKSELKQL EDAE+MM+AKGEICSQIL KQRKIASLESD+STLSQ   ++    +
Subjt:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL

Query:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDH--DSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQ
          GA                    + +  S+IFDH  DS YYTK AE+INLKFQD QDWVNANMIR EVEE +LVKL+ A++ASETEG SD + GISGT+
Subjt:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDH--DSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQ

Query:  IYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGI
        IY NPKNLV E +DLLGKLESA+AKLS+V+K KCAV+LE S                KPELR MD VTL+EEYKALLSD+AGETEYS+SLQD+IAKLKGI
Subjt:  IYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGI

Query:  SRVIKCTCGEEYKAGV
        S VIKCTCG+EYKAGV
Subjt:  SRVIKCTCGEEYKAGV

A0A6J1DG39 uncharacterized protein LOC1110201951.2e-9965.06Show/hide
Query:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL
        MEEYLQYMKTLRLQMN                             AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ   ++    +
Subjt:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL

Query:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
          GA ++                             SIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
Subjt:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY

Query:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSK--------------------------------------------------PELREMDVVT
        CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSK                                                  PELREMDVVT
Subjt:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSK--------------------------------------------------PELREMDVVT

Query:  LDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDLCG
        LDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDLCG
Subjt:  LDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDLCG

A0A6J1JUE5 uncharacterized protein LOC111489811 isoform X24.2e-8161.33Show/hide
Query:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL
        MEEYLQYMKTLRLQM+                             AKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQ   ++    +
Subjt:  MEEYLQYMKTLRLQMN-----------------------------AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYL

Query:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY
          GA ++                             S YY K +E+I+LKFQD QDWVNANMIR E E HELVK +TA+R SETEGS DT+ GISGT+IY
Subjt:  CAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIY

Query:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
        CN   +VEE+KDLLGKLESAKAKLSQV+KMKCAV+LENSKPELR MD VTL+EE KALLSDKAGETEYS+SLQDQIAKLK IS VIKCTCG+EYK G+ L
Subjt:  CNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

A0A6J1K3U6 uncharacterized protein LOC111489811 isoform X31.3e-8264.11Show/hide
Query:  MEEYLQYMKTLRLQMNAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYLCAGACLVCFPMCRSNSCLATAFVMRDVIN
        MEEYLQYMKTLRLQM+AKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQ   ++    +  GA ++                      
Subjt:  MEEYLQYMKTLRLQMNAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQ---VVSSPYLCAGACLVCFPMCRSNSCLATAFVMRDVIN

Query:  SSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTK
               S YY K +E+I+LKFQD QDWVNANMIR E E HELVK +TA+R SETEGS DT+ GISGT+IYCN   +VEE+KDLLGKLESAKAKLSQV+K
Subjt:  SSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTK

Query:  MKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
        MKCAV+LENS                KPELR MD VTL+EE KALLSDKAGETEYS+SLQDQIAKLK IS VIKCTCG+EYK G+ L
Subjt:  MKCAVILENS----------------KPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G33500.1 unknown protein1.4e-2032.49Show/hide
Query:  MEEYLQYMKTLRLQM-----------------------------NAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQVVSSPYLCAG
        MEEYLQYMKTLR QM                             +A SE K+LKE+ +Q  R +GEICS IL KQRKI+S+ESD   ++Q          
Subjt:  MEEYLQYMKTLRLQM-----------------------------NAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQVVSSPYLCAG

Query:  ACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNP
                      L      RD +++ +    S  Y K AEE   K ++ + W  ++M     ++    K +T     E    SD+ R     Q     
Subjt:  ACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNP

Query:  KNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLK
         NL++E   +   +E+ K K+++             KPEL  +D+  L+EEY ALLSD++GE EY  SLQ Q  KLK
Subjt:  KNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGTACTTGCAGTACATGAAGACATTGCGCCTGCAAATGAACGCAAAAAGTGAGTTAAAACAACTCAAAGAGGATGCTGAGCAAATGATGCGGGCAAAGGGCGA
AATATGCTCCCAGATATTAGCAAAACAAAGAAAAATAGCCTCTTTGGAGTCTGACATATCTACACTTTCACAGGTGGTCTCTTCTCCGTACTTGTGTGCTGGTGCCTGCC
TTGTATGCTTTCCCATGTGTAGGTCTAACAGCTGTTTAGCAACTGCTTTTGTGATGCGTGATGTTATTAACAGTAGTATATTTGACCATGATAGTATTTATTATACCAAA
GCTGCTGAGGAGATCAATCTGAAATTTCAAGATCTACAGGACTGGGTAAATGCTAACATGATTCGCAGAGAAGTGGAAGAGCACGAATTGGTTAAACTTGACACCGCTGA
GCGAGCAAGTGAAACTGAAGGATCTTCTGATACAATTCGAGGGATCTCTGGCACTCAGATTTACTGCAATCCAAAAAATCTGGTGGAAGAGAAGAAAGATCTATTGGGCA
AGTTGGAATCTGCTAAAGCCAAACTTAGTCAAGTTACGAAAATGAAATGTGCAGTTATTTTGGAGAATTCCAAGCCAGAACTCAGGGAAATGGATGTCGTTACATTGGAC
GAAGAGTACAAGGCTCTCTTATCAGATAAAGCTGGAGAAACGGAGTACTCACAGTCCCTTCAAGACCAAATTGCAAAACTGAAGGGAATTTCCCGTGTGATTAAATGCAC
TTGTGGAGAGGAATACAAGGCTGGAGTAGACCTATGTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGTACTTGCAGTACATGAAGACATTGCGCCTGCAAATGAACGCAAAAAGTGAGTTAAAACAACTCAAAGAGGATGCTGAGCAAATGATGCGGGCAAAGGGCGA
AATATGCTCCCAGATATTAGCAAAACAAAGAAAAATAGCCTCTTTGGAGTCTGACATATCTACACTTTCACAGGTGGTCTCTTCTCCGTACTTGTGTGCTGGTGCCTGCC
TTGTATGCTTTCCCATGTGTAGGTCTAACAGCTGTTTAGCAACTGCTTTTGTGATGCGTGATGTTATTAACAGTAGTATATTTGACCATGATAGTATTTATTATACCAAA
GCTGCTGAGGAGATCAATCTGAAATTTCAAGATCTACAGGACTGGGTAAATGCTAACATGATTCGCAGAGAAGTGGAAGAGCACGAATTGGTTAAACTTGACACCGCTGA
GCGAGCAAGTGAAACTGAAGGATCTTCTGATACAATTCGAGGGATCTCTGGCACTCAGATTTACTGCAATCCAAAAAATCTGGTGGAAGAGAAGAAAGATCTATTGGGCA
AGTTGGAATCTGCTAAAGCCAAACTTAGTCAAGTTACGAAAATGAAATGTGCAGTTATTTTGGAGAATTCCAAGCCAGAACTCAGGGAAATGGATGTCGTTACATTGGAC
GAAGAGTACAAGGCTCTCTTATCAGATAAAGCTGGAGAAACGGAGTACTCACAGTCCCTTCAAGACCAAATTGCAAAACTGAAGGGAATTTCCCGTGTGATTAAATGCAC
TTGTGGAGAGGAATACAAGGCTGGAGTAGACCTATGTGGATGA
Protein sequenceShow/hide protein sequence
MEEYLQYMKTLRLQMNAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQVVSSPYLCAGACLVCFPMCRSNSCLATAFVMRDVINSSIFDHDSIYYTK
AAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKPELREMDVVTLD
EEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDLCG