; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g03370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g03370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr7:2954716..2961939
RNA-Seq ExpressionMoc07g03370
SyntenyMoc07g03370
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.4e-10673.88Show/hide
Query:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL+V+QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTS
        KWF+AS EWLAKDESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP V             MVC F S
Subjt:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTS

Query:  SVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD
         V+RKSKGRAHAL+  QSS P TP V         GP+SE P PVIEL+S+G  SREKR R +  A+D
Subjt:  SVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]1.5e-8786.11Show/hide
Query:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL+V+QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV
        KWF+AS EWLAKDESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP V
Subjt:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]3.9e-8886.67Show/hide
Query:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL+V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV
        KWF+AS EWLAKDESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+HFPRGRK+GTLVTDKLLLE GLLDYNP V
Subjt:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]8.3e-14774.37Show/hide
Query:  SDSGEDLALRLESKLEEIENFRFSDDGEDSDTSTSGQGLQYPFKMPEHYLRPLSRGFKIPNNILLRIPEEGERADNPPAGWVTLYLKMFEYGLRFPLHPF
        S+   DLA RLESKLEEIEN R SDDGEDSD STSGQGL+YP ++PEHYL  L RGF IP NILLR+PEEGERADNPP GWVTLY KMFEYGLR PLHPF
Subjt:  SDSGEDLALRLESKLEEIENFRFSDDGEDSDTSTSGQGLQYPFKMPEHYLRPLSRGFKIPNNILLRIPEEGERADNPPAGWVTLYLKMFEYGLRFPLHPF

Query:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKD
         QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL +V+QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+AS EWLAKD
Subjt:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKD

Query:  ESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTSSVRRKSKGRAHAL
        ESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP V             MVCGF S V+RKSKGRAHAL
Subjt:  ESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTSSVRRKSKGRAHAL

Query:  KIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD
        +  QSS P TP V         GP+SE P  VIEL+S+G  SREKR R +  A+D
Subjt:  KIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.6e-15077.48Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFAS EWLAKDESGR F+DVP RFGNLV IK IPEL QA+FDTLK YKDHFPR RKI TLVTDKLLLE GLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNP

Query:  LV------------PMVCGFTSSVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALDVSPLCEVRED
        LV             MVCGFT SV+RKSKGRAHALK V  ++PVTP V +   QG +GPSS VPTPVIELD +G RS EKRSR E  ALDVSPL EVR +
Subjt:  LV------------PMVCGFTSSVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALDVSPLCEVRED

Query:  SPLKKRKKKKKATSSSEVGPRGQLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSEVKDQVSRISATCLDRCLRRASKFVSDPGSVLQRTIDHATEAFI
        SPL++R+KKKK +SSSE G RG LPTSHADLVDDPEARM GTS+V+MRF MEPSSS VKDQVSRISATCLDR LRRASKFVSDPGSVLQRTID+  EAFI
Subjt:  SPLKKRKKKKKATSSSEVGPRGQLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSEVKDQVSRISATCLDRCLRRASKFVSDPGSVLQRTIDHATEAFI

Query:  ASIHSAVMIKAELDGREVLAAMEKKNSSAALEAATTLKGELLKAWSEVDILRAEVEAKAELLKREDERHKAHL
        ASIH AVM+KAELDGRE LAA E++NS AALEAATTLKGELLKA  EVDILRAEV+AK +LLK+E E+HKAHL
Subjt:  ASIHSAVMIKAELDGREVLAAMEKKNSSAALEAATTLKGELLKAWSEVDILRAEVEAKAELLKREDERHKAHL

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138266.9e-10773.88Show/hide
Query:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL+V+QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTS
        KWF+AS EWLAKDESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP V             MVC F S
Subjt:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTS

Query:  SVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD
         V+RKSKGRAHAL+  QSS P TP V         GP+SE P PVIEL+S+G  SREKR R +  A+D
Subjt:  SVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD

A0A6J1DWD2 uncharacterized protein LOC1110246807.2e-8886.11Show/hide
Query:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL+V+QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV
        KWF+AS EWLAKDESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP V
Subjt:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV

A0A6J1DWF1 uncharacterized protein LOC1110251081.9e-8886.67Show/hide
Query:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAELL+V+QLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV
        KWF+AS EWLAKDESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+HFPRGRK+GTLVTDKLLLE GLLDYNP V
Subjt:  KWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV

A0A6J1DXS5 uncharacterized protein LOC1110255024.0e-14774.37Show/hide
Query:  SDSGEDLALRLESKLEEIENFRFSDDGEDSDTSTSGQGLQYPFKMPEHYLRPLSRGFKIPNNILLRIPEEGERADNPPAGWVTLYLKMFEYGLRFPLHPF
        S+   DLA RLESKLEEIEN R SDDGEDSD STSGQGL+YP ++PEHYL  L RGF IP NILLR+PEEGERADNPP GWVTLY KMFEYGLR PLHPF
Subjt:  SDSGEDLALRLESKLEEIENFRFSDDGEDSDTSTSGQGLQYPFKMPEHYLRPLSRGFKIPNNILLRIPEEGERADNPPAGWVTLYLKMFEYGLRFPLHPF

Query:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKD
         QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL +V+QLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+AS EWLAKD
Subjt:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKD

Query:  ESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTSSVRRKSKGRAHAL
        ESGR F+DVP RFGNLV I+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP V             MVCGF S V+RKSKGRAHAL
Subjt:  ESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLV------------PMVCGFTSSVRRKSKGRAHAL

Query:  KIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD
        +  QSS P TP V         GP+SE P  VIEL+S+G  SREKR R +  A+D
Subjt:  KIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALD

A0A6J1DZB3 uncharacterized protein LOC1110256651.7e-15077.48Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFAS EWLAKDESGR F+DVP RFGNLV IK IPEL QA+FDTLK YKDHFPR RKI TLVTDKLLLE GLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNP

Query:  LV------------PMVCGFTSSVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALDVSPLCEVRED
        LV             MVCGFT SV+RKSKGRAHALK V  ++PVTP V +   QG +GPSS VPTPVIELD +G RS EKRSR E  ALDVSPL EVR +
Subjt:  LV------------PMVCGFTSSVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALDVSPLCEVRED

Query:  SPLKKRKKKKKATSSSEVGPRGQLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSEVKDQVSRISATCLDRCLRRASKFVSDPGSVLQRTIDHATEAFI
        SPL++R+KKKK +SSSE G RG LPTSHADLVDDPEARM GTS+V+MRF MEPSSS VKDQVSRISATCLDR LRRASKFVSDPGSVLQRTID+  EAFI
Subjt:  SPLKKRKKKKKATSSSEVGPRGQLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSEVKDQVSRISATCLDRCLRRASKFVSDPGSVLQRTIDHATEAFI

Query:  ASIHSAVMIKAELDGREVLAAMEKKNSSAALEAATTLKGELLKAWSEVDILRAEVEAKAELLKREDERHKAHL
        ASIH AVM+KAELDGRE LAA E++NS AALEAATTLKGELLKA  EVDILRAEV+AK +LLK+E E+HKAHL
Subjt:  ASIHSAVMIKAELDGREVLAAMEKKNSSAALEAATTLKGELLKAWSEVDILRAEVEAKAELLKREDERHKAHL

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic4.2e-0827.37Show/hide
Query:  ILLRIPEEGERADNPPAGWVTLYLKMFEYG--LRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAK-
        + LR+P   ERAD+PPAG+ TLY + F YG  L  P+     E++    +A +Q+       + +L  L  +  R  +    + +  L    E +R+ K 
Subjt:  ILLRIPEEGERADNPPAGWVTLYLKMFEYG--LRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIAK-

Query:  KPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASCE-WLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYK----DHFPRGR
        +  RYY+   KG   I   P+  + +   +FF + E  + +D  G       +    L  ++PIP+   ++F  L   K     HF R R
Subjt:  KPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASCE-WLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYK----DHFPRGR

Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related8.0e-0729.14Show/hide
Query:  PNNILLRIPEEGERADNPPAGWVTLYLKMF-EYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIA
        P  I L  P+  +R   PP G++ LY   F   GL FPL  F  E+  R  +A +Q+          LAIL      ++D       +         R+ 
Subjt:  PNNILLRIPEEGERADNPPAGWVTLYLKMF-EYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDMDEAELLNVEQLLGCFEAKRIA

Query:  KKPGRYYMCARKGAGGIVKGPTS-IKGWVGKWFFASCEWLAKDESGRPFYD
        + PG YY  A K    IV G  S I GW  ++FF      + +     F D
Subjt:  KKPGRYYMCARKGAGGIVKGPTS-IKGWVGKWFFASCEWLAKDESGRPFYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAAGGAGATTGCGTTTGCACCACCAGGGAAGCTTTATGCACAAATTCGTGGGTCTGTGCTGTGGCCGTCGGAGAACGTTGCTACAGGGCTGGAGGAACGAACGTC
GCCGCTGATCGTGTTACTTTGCCGCCGTCGTCGGGGAAGACGCCAGAGGGGGAGGCTGGACCATTTGGGAGTGGAAATTAATCAAAGGAAGCAAAAAGTCGGAAATACCT
ACATAGGAGGCGCCAGGCGCCTGGGAAGGCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCTAACCGATGCA
TACACTTTAGAGTTTGGAAATTATTTAGAAGTCCTTGAGGATCGATACTTGGATACTCCATCCATTTTATATTACTCTGACAGTCGGTGCAATAGTTCGGGGCCTTCGGT
AAATGGTCAAGGGTTGAACGCCAAGCATCGTAGAGAAGTGTCGAGGCTTTGGGTAGATGGTCAGGGGTCGATACGACTCGAAGGAGTAAGACTTGGAGGAATAGGGAAGA
GAACTAGTAATGACTGCTTACTGAGTACTAGTTTTCGTACTTACCCCTCCCTTTTACCCCAAATTTTGCAGGTATTCTTCTTGCAGCATTATAGATCTCTTGAATATTCC
TCATCTTCCTTGGTGGAGTTGATTCGTCATTTGTGGAAAGTGGAGATGAAGATAGAGCTGCTGGTTTTGTCTCTTCAAGCTCCAAGTCTTGAGCATCTCATTTGTCATTT
ATGTCAACAAGTGGTGGATCCGACGGTGCTCACGACCGGCGGTTATATGTCTTTTCTCATGTCGGACCTGTCGGGTTCCGAGCAGATCGAACCCCAACCAGGTCGAACCT
TGGCGTCTATACTTAGCCTTTTTCAATTGGCGAACCCAGTCACCTCGGTGAGGCCGAGGTTCGATCTCGACCTGGCAGAGAAGTTCATCCGATTCCATTTTGGACGCGTG
GCGACTTTTAATTCGAAGAGGGAATATGACCGTTGCAGAAGATGTTTCGACCTGCCAGGTTGTCGGAGCACTCAAGTATTCCGTCGTTACGGATCTCGAGACGATCCTAG
CCGCTCGTTGATTACACGTGTCCGGTGTAGAGGTCAGTCATTCTCTTATTACTCTTCTTTTTCAAATATGGTAGTTTTCTTGACTTCCCCTTCCAGTAGTGATAGCCTAG
GTAGCGCAGGTCGGACTATAAGTAGTTCGCCCCCCAAACCAAGTGATTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCAAGCTGGAGGAGATAGAGAACTTCAGATTT
TCGGACGATGGGGAGGATAGTGACACTTCCACCTCGGGCCAGGGTTTGCAATACCCTTTTAAAATGCCCGAGCACTATCTCAGACCCCTCAGTAGGGGGTTCAAAATTCC
AAACAACATCCTTCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGCGGGATGGGTCACTCTTTACTTAAAAATGTTTGAGTACGGCCTCAGATTTCCTC
TTCATCCTTTTGCCCAAGAGTTCTTAAACCGAACTGGATTGGCTCCTGCTCAAGTGGCCCCAAACGGATGGGGCGTCATTTTTGCGTTGGCCATCCTTTTCTGGTTGCGA
GCTCGGGACATGGACGAGGCCGAGCTGCTGAACGTTGAGCAGCTTCTTGGATGCTTCGAAGCCAAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATGTGCGCAAGGAA
AGGCGCAGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTAGGGAAGTGGTTCTTTGCCTCATGTGAATGGCTGGCAAAGGACGAATCAGGTCGTCCCTTCT
ATGACGTGCCTGTTAGGTTTGGGAACCTAGTGTTGATCAAACCGATTCCCGAGCTCACACAAGCCTCATTTGACACCCTCAAGTTTTACAAAGACCATTTCCCAAGGGGT
CGGAAGATCGGAACCTTGGTGACTGACAAGCTGCTGCTAGAGTTGGGGCTGTTGGACTACAATCCTTTAGTTCCCATGGTGTGTGGATTCACGAGTAGCGTGAGACGCAA
GTCTAAGGGTCGTGCTCACGCCCTTAAGATTGTTCAAAGCTCTGATCCAGTGACTCCTGTCGTGGATCAACCTGCGGTTCAGGGCCAGGCTGGGCCATCCTCTGAAGTTC
CAACTCCAGTGATTGAGCTGGATTCTACTGGGGAGCGGTCCAGGGAGAAGCGCTCGAGGAGCGAATGCGGGGCGCTAGACGTGTCGCCTCTTTGCGAGGTGAGGGAGGAC
TCTCCTCTGAAGAAGAGAAAGAAAAAGAAGAAAGCCACTTCCTCCTCGGAGGTTGGACCTCGTGGCCAACTACCCACGAGCCATGCTGACCTGGTGGACGACCCTGAAGC
TCGGATGGGGGGGACGTCCGACGTGAAGATGCGGTTCCGAATGGAACCGTCAAGCTCTGAGGTGAAGGACCAAGTGTCTCGCATCTCGGCTACATGCTTGGATCGCTGTC
TCAGGAGAGCATCCAAGTTTGTGAGCGATCCAGGGTCCGTGCTGCAACGGACAATTGACCACGCTACCGAGGCGTTTATTGCCTCCATTCATTCGGCGGTCATGATCAAG
GCCGAGCTGGATGGAAGGGAGGTCTTGGCAGCGATGGAGAAGAAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGCGAGCTGCTGAAGGCTTGGAGCGA
GGTGGATATATTGAGGGCCGAGGTGGAAGCCAAGGCCGAGCTGCTAAAGAGGGAGGATGAGAGGCATAAGGCCCACCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAAGGAGATTGCGTTTGCACCACCAGGGAAGCTTTATGCACAAATTCGTGGGTCTGTGCTGTGGCCGTCGGAGAACGTTGCTACAGGGCTGGAGGAACGAACGTC
GCCGCTGATCGTGTTACTTTGCCGCCGTCGTCGGGGAAGACGCCAGAGGGGGAGGCTGGACCATTTGGGAGTGGAAATTAATCAAAGGAAGCAAAAAGTCGGAAATACCT
ACATAGGAGGCGCCAGGCGCCTGGGAAGGCTGCAGAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCTAACCGATGCA
TACACTTTAGAGTTTGGAAATTATTTAGAAGTCCTTGAGGATCGATACTTGGATACTCCATCCATTTTATATTACTCTGACAGTCGGTGCAATAGTTCGGGGCCTTCGGT
AAATGGTCAAGGGTTGAACGCCAAGCATCGTAGAGAAGTGTCGAGGCTTTGGGTAGATGGTCAGGGGTCGATACGACTCGAAGGAGTAAGACTTGGAGGAATAGGGAAGA
GAACTAGTAATGACTGCTTACTGAGTACTAGTTTTCGTACTTACCCCTCCCTTTTACCCCAAATTTTGCAGGTATTCTTCTTGCAGCATTATAGATCTCTTGAATATTCC
TCATCTTCCTTGGTGGAGTTGATTCGTCATTTGTGGAAAGTGGAGATGAAGATAGAGCTGCTGGTTTTGTCTCTTCAAGCTCCAAGTCTTGAGCATCTCATTTGTCATTT
ATGTCAACAAGTGGTGGATCCGACGGTGCTCACGACCGGCGGTTATATGTCTTTTCTCATGTCGGACCTGTCGGGTTCCGAGCAGATCGAACCCCAACCAGGTCGAACCT
TGGCGTCTATACTTAGCCTTTTTCAATTGGCGAACCCAGTCACCTCGGTGAGGCCGAGGTTCGATCTCGACCTGGCAGAGAAGTTCATCCGATTCCATTTTGGACGCGTG
GCGACTTTTAATTCGAAGAGGGAATATGACCGTTGCAGAAGATGTTTCGACCTGCCAGGTTGTCGGAGCACTCAAGTATTCCGTCGTTACGGATCTCGAGACGATCCTAG
CCGCTCGTTGATTACACGTGTCCGGTGTAGAGGTCAGTCATTCTCTTATTACTCTTCTTTTTCAAATATGGTAGTTTTCTTGACTTCCCCTTCCAGTAGTGATAGCCTAG
GTAGCGCAGGTCGGACTATAAGTAGTTCGCCCCCCAAACCAAGTGATTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCAAGCTGGAGGAGATAGAGAACTTCAGATTT
TCGGACGATGGGGAGGATAGTGACACTTCCACCTCGGGCCAGGGTTTGCAATACCCTTTTAAAATGCCCGAGCACTATCTCAGACCCCTCAGTAGGGGGTTCAAAATTCC
AAACAACATCCTTCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGCGGGATGGGTCACTCTTTACTTAAAAATGTTTGAGTACGGCCTCAGATTTCCTC
TTCATCCTTTTGCCCAAGAGTTCTTAAACCGAACTGGATTGGCTCCTGCTCAAGTGGCCCCAAACGGATGGGGCGTCATTTTTGCGTTGGCCATCCTTTTCTGGTTGCGA
GCTCGGGACATGGACGAGGCCGAGCTGCTGAACGTTGAGCAGCTTCTTGGATGCTTCGAAGCCAAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATGTGCGCAAGGAA
AGGCGCAGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTAGGGAAGTGGTTCTTTGCCTCATGTGAATGGCTGGCAAAGGACGAATCAGGTCGTCCCTTCT
ATGACGTGCCTGTTAGGTTTGGGAACCTAGTGTTGATCAAACCGATTCCCGAGCTCACACAAGCCTCATTTGACACCCTCAAGTTTTACAAAGACCATTTCCCAAGGGGT
CGGAAGATCGGAACCTTGGTGACTGACAAGCTGCTGCTAGAGTTGGGGCTGTTGGACTACAATCCTTTAGTTCCCATGGTGTGTGGATTCACGAGTAGCGTGAGACGCAA
GTCTAAGGGTCGTGCTCACGCCCTTAAGATTGTTCAAAGCTCTGATCCAGTGACTCCTGTCGTGGATCAACCTGCGGTTCAGGGCCAGGCTGGGCCATCCTCTGAAGTTC
CAACTCCAGTGATTGAGCTGGATTCTACTGGGGAGCGGTCCAGGGAGAAGCGCTCGAGGAGCGAATGCGGGGCGCTAGACGTGTCGCCTCTTTGCGAGGTGAGGGAGGAC
TCTCCTCTGAAGAAGAGAAAGAAAAAGAAGAAAGCCACTTCCTCCTCGGAGGTTGGACCTCGTGGCCAACTACCCACGAGCCATGCTGACCTGGTGGACGACCCTGAAGC
TCGGATGGGGGGGACGTCCGACGTGAAGATGCGGTTCCGAATGGAACCGTCAAGCTCTGAGGTGAAGGACCAAGTGTCTCGCATCTCGGCTACATGCTTGGATCGCTGTC
TCAGGAGAGCATCCAAGTTTGTGAGCGATCCAGGGTCCGTGCTGCAACGGACAATTGACCACGCTACCGAGGCGTTTATTGCCTCCATTCATTCGGCGGTCATGATCAAG
GCCGAGCTGGATGGAAGGGAGGTCTTGGCAGCGATGGAGAAGAAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGCGAGCTGCTGAAGGCTTGGAGCGA
GGTGGATATATTGAGGGCCGAGGTGGAAGCCAAGGCCGAGCTGCTAAAGAGGGAGGATGAGAGGCATAAGGCCCACCTCTGA
Protein sequenceShow/hide protein sequence
MPKEIAFAPPGKLYAQIRGSVLWPSENVATGLEERTSPLIVLLCRRRRGRRQRGRLDHLGVEINQRKQKVGNTYIGGARRLGRLQKTVFLPTLPLMKRVFQCVLVVLTDA
YTLEFGNYLEVLEDRYLDTPSILYYSDSRCNSSGPSVNGQGLNAKHRREVSRLWVDGQGSIRLEGVRLGGIGKRTSNDCLLSTSFRTYPSLLPQILQVFFLQHYRSLEYS
SSSLVELIRHLWKVEMKIELLVLSLQAPSLEHLICHLCQQVVDPTVLTTGGYMSFLMSDLSGSEQIEPQPGRTLASILSLFQLANPVTSVRPRFDLDLAEKFIRFHFGRV
ATFNSKREYDRCRRCFDLPGCRSTQVFRRYGSRDDPSRSLITRVRCRGQSFSYYSSFSNMVVFLTSPSSSDSLGSAGRTISSSPPKPSDSGEDLALRLESKLEEIENFRF
SDDGEDSDTSTSGQGLQYPFKMPEHYLRPLSRGFKIPNNILLRIPEEGERADNPPAGWVTLYLKMFEYGLRFPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLR
ARDMDEAELLNVEQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASCEWLAKDESGRPFYDVPVRFGNLVLIKPIPELTQASFDTLKFYKDHFPRG
RKIGTLVTDKLLLELGLLDYNPLVPMVCGFTSSVRRKSKGRAHALKIVQSSDPVTPVVDQPAVQGQAGPSSEVPTPVIELDSTGERSREKRSRSECGALDVSPLCEVRED
SPLKKRKKKKKATSSSEVGPRGQLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSEVKDQVSRISATCLDRCLRRASKFVSDPGSVLQRTIDHATEAFIASIHSAVMIK
AELDGREVLAAMEKKNSSAALEAATTLKGELLKAWSEVDILRAEVEAKAELLKREDERHKAHL