CuGenDBv2

Moc10g20180 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g20180
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr10:14893791..14896466
RNA-Seq Expression: Moc10g20180
Synteny: Moc10g20180
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
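The genome location field packs the chromosome and both coordinates into one string. A minimal sketch of splitting it and computing the span length follows; the function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


# The location string listed for Moc10g20180.
print(parse_location("chr10:14893791..14896466"))
```

Under the inclusive-coordinate assumption, the locus spans 2676 bp.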


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]; e value: 4.4e-94; % identity: 72.8
Query:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWV+KWF+ASGEWLAKDE+              V IRPVP+LTQ SFDTLKYYKEHFP+GRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPL-NEVRDE
         VRPIE SR NSELAMVCGF S+V+RKSKG+AHAL+A  SS+PVTPAV GPASEDPA       PVIEL+S    S+EKRPR ++EA+DVSPL  EVR+E
Subjt:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPL-NEVRDE

Query:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQ
         PLKRRRKKKKTTS  +VGARG LP  FAD VDD EA+MGG  DV T+FR+EPSS GV+DQ
Subjt:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]; e value: 1.7e-117; % identity: 78.57
Query:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK
        MFEYGLRLPLHPF+QEFL RTGLAPAQVAPNGWG IFALAILFWLRARD+E AELLDV QLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIKGWV+
Subjt:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK

Query:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTS
        KWF+ASGEWLAKDE+G +FFDV TRFGNLV IRPVP+LTQ SFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNP VRPIE SR NS LAMVC F S
Subjt:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTS

Query:  SVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDESPL
         V+RKSKGRAHAL+A  SS+P TPAV GPASEDPA       PVIEL+S    S+EKRPR ++EA+D     E  D  PL
Subjt:  SVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDESPL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]; e value: 4.0e-95; % identity: 88.14
Query:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK
        MFEYGLRLPLHPF+QEFL RTGLAPAQVAPNGWG IFALAILFWLRARD+E AELLDV QLLACFEAKRIAKKPG +YMCARKGA GIVKGPTSIKGWV+
Subjt:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK

Query:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAM
        KWF+ASGEWLAKDE+G +FFDV TRFGNLV IRPVP+LTQ SFDTLKYYKEHFP+GRKVGTLVTDKLLLESGLLDYNP VRPIE SR NSEL M
Subjt:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e value: 3.2e-161; % identity: 81.54
Query:  MSSSFSSDSLGSDESLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRHLVRGFAIPDNILLRIPEEGEKADNPPEGWVTLYLKMFEY
        MSSS SS +L SD  LARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYL  L RGFAIP+NILLR+PEEGE+ADNPPEGWVTLY KMFEY
Subjt:  MSSSFSSDSLGSDESLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRHLVRGFAIPDNILLRIPEEGEKADNPPEGWVTLYLKMFEY

Query:  GLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVKKWFF
        GLRLPLHPF+QEFL RTGLAPAQVAPNGWG IFALAILFWLRARD+E AEL DV QLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIKGWV+KWF+
Subjt:  GLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVKKWFF

Query:  ASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTSSVRR
        ASGEWLAKDE+G +FFDV TRFGNLV IRPVP+LTQ SFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNP VRPIE SR NSELAMVCGF S V+R
Subjt:  ASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTSSVRR

Query:  KSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALD
        KSKGRAHAL+A  SS+P TPAV GPASEDPA        VIEL+S    S+EKRPR ++EA+D
Subjt:  KSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value: 6.8e-135; % identity: 72.51
Query:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWFFASGEWLAKDE+G  FFDV TRFGNLV I+ +P+L Q +FDTLK+YK+HFP+ RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAV-AGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDE
        LVR IE SR NSELAMVCGFT SV+RKSKGRAHALK V  +EPVTP V    A  +  P+S  PTPVIELD     S EKR R ESEALDVSPLNEVR E
Subjt:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAV-AGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDE

Query:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQVSHISAASLDRCLRMASKFVSDLGSVLQRTIDHAAEAFV
        SPL+RRRKKKKT+S S+ GARG LPT  ADLVDD EA+M G S+V  +F +EPSS GVKDQVS ISA  LDR LR ASKFVSD GSVLQRTID+ AEAF+
Subjt:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQVSHISAASLDRCLRMASKFVSDLGSVLQRTIDHAAEAFV

Query:  ASILSAIAIKAELDGREALAAREKEDLSTALEAATTMKGELLKARFEVDILKAEVEAKTELLRREDERRKA
        ASI  A+ +KAELDGREALAA+E+E+   ALEAATT+KGELLKA+ EVDIL+AEV+AK +LL++E E+ KA
Subjt:  ASILSAIAIKAELDGREALAAREKEDLSTALEAATTMKGELLKARFEVDILKAEVEAKTELLRREDERRKA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298; e value: 2.2e-94; % identity: 72.8
Query:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWV+KWF+ASGEWLAKDE+              V IRPVP+LTQ SFDTLKYYKEHFP+GRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPL-NEVRDE
         VRPIE SR NSELAMVCGF S+V+RKSKG+AHAL+A  SS+PVTPAV GPASEDPA       PVIEL+S    S+EKRPR ++EA+DVSPL  EVR+E
Subjt:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPL-NEVRDE

Query:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQ
         PLKRRRKKKKTTS  +VGARG LP  FAD VDD EA+MGG  DV T+FR+EPSS GV+DQ
Subjt:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQ

A0A6J1CR42 uncharacterized protein LOC111013826; e value: 8.1e-118; % identity: 78.57
Query:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK
        MFEYGLRLPLHPF+QEFL RTGLAPAQVAPNGWG IFALAILFWLRARD+E AELLDV QLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIKGWV+
Subjt:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK

Query:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTS
        KWF+ASGEWLAKDE+G +FFDV TRFGNLV IRPVP+LTQ SFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNP VRPIE SR NS LAMVC F S
Subjt:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTS

Query:  SVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDESPL
         V+RKSKGRAHAL+A  SS+P TPAV GPASEDPA       PVIEL+S    S+EKRPR ++EA+D     E  D  PL
Subjt:  SVRRKSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDESPL

A0A6J1DWF1 uncharacterized protein LOC111025108; e value: 1.9e-95; % identity: 88.14
Query:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK
        MFEYGLRLPLHPF+QEFL RTGLAPAQVAPNGWG IFALAILFWLRARD+E AELLDV QLLACFEAKRIAKKPG +YMCARKGA GIVKGPTSIKGWV+
Subjt:  MFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVK

Query:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAM
        KWF+ASGEWLAKDE+G +FFDV TRFGNLV IRPVP+LTQ SFDTLKYYKEHFP+GRKVGTLVTDKLLLESGLLDYNP VRPIE SR NSEL M
Subjt:  KWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502; e value: 1.6e-161; % identity: 81.54
Query:  MSSSFSSDSLGSDESLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRHLVRGFAIPDNILLRIPEEGEKADNPPEGWVTLYLKMFEY
        MSSS SS +L SD  LARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYL  L RGFAIP+NILLR+PEEGE+ADNPPEGWVTLY KMFEY
Subjt:  MSSSFSSDSLGSDESLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRHLVRGFAIPDNILLRIPEEGEKADNPPEGWVTLYLKMFEY

Query:  GLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVKKWFF
        GLRLPLHPF+QEFL RTGLAPAQVAPNGWG IFALAILFWLRARD+E AEL DV QLLACFEAKRIAKKPG +YMCARKGAGGIVKGPTSIKGWV+KWF+
Subjt:  GLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVKKWFF

Query:  ASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTSSVRR
        ASGEWLAKDE+G +FFDV TRFGNLV IRPVP+LTQ SFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNP VRPIE SR NSELAMVCGF S V+R
Subjt:  ASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTSSVRR

Query:  KSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALD
        KSKGRAHAL+A  SS+P TPAV GPASEDPA        VIEL+S    S+EKRPR ++EA+D
Subjt:  KSKGRAHALKAVNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALD

A0A6J1DZB3 uncharacterized protein LOC111025665; e value: 3.3e-135; % identity: 72.51
Query:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWFFASGEWLAKDE+G  FFDV TRFGNLV I+ +P+L Q +FDTLK+YK+HFP+ RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAV-AGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDE
        LVR IE SR NSELAMVCGFT SV+RKSKGRAHALK V  +EPVTP V    A  +  P+S  PTPVIELD     S EKR R ESEALDVSPLNEVR E
Subjt:  LVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKAVNSSEPVTPAV-AGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDE

Query:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQVSHISAASLDRCLRMASKFVSDLGSVLQRTIDHAAEAFV
        SPL+RRRKKKKT+S S+ GARG LPT  ADLVDD EA+M G S+V  +F +EPSS GVKDQVS ISA  LDR LR ASKFVSD GSVLQRTID+ AEAF+
Subjt:  SPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQFRIEPSSFGVKDQVSHISAASLDRCLRMASKFVSDLGSVLQRTIDHAAEAFV

Query:  ASILSAIAIKAELDGREALAAREKEDLSTALEAATTMKGELLKARFEVDILKAEVEAKTELLRREDERRKA
        ASI  A+ +KAELDGREALAA+E+E+   ALEAATT+KGELLKA+ EVDIL+AEV+AK +LL++E E+ KA
Subjt:  ASILSAIAIKAELDGREALAAREKEDLSTALEAATTMKGELLKARFEVDILKAEVEAKTELLRREDERRKA

SwissProt top hits (e value, % identity, alignment)
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic; e value: 2.1e-06; % identity: 26.94
Query:  ILLRIPEEGEKADNPPEGWVTLYLKMFEYG--LRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAK-
        + LR+P   E+AD+PP G+ TLY + F YG  L LP+   + E++    +A +Q+       + +L  L  +  R  E    + +  L    E +R+ K 
Subjt:  ILLRIPEEGEKADNPPEGWVTLYLKMFEYG--LRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAK-

Query:  KPGWYYMCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFG----NLVLIRPVPKLTQTSFDTLKYYK----EHFPKGR
        +   YY+   KG   I   P+  + +   +FF + E    ++   T   V TR+G     L  + P+P    ++F  L   K    +HF + R
Subjt:  KPGWYYMCARKGAGGIVKGPTSIKGWVKKWFFASGEWLAKDETGHTFFDVSTRFG----NLVLIRPVPKLTQTSFDTLKYYK----EHFPKGR

Arabidopsis top hits (e value, % identity, alignment)
AT2G15420.1 myosin heavy chain-related; e value: 2.0e-07; % identity: 29.14
Query:  PDNILLRIPEEGEKADNPPEGWVTLYLKMF-EYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIA
        P  I L  P+  ++   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL        E    +D           R+ 
Subjt:  PDNILLRIPEEGEKADNPPEGWVTLYLKMF-EYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIA

Query:  KKPGWYYMCARKGAGGIVKGPTS-IKGWVKKWFFASGEWLAKDETGHTFFD
        + PG YY  A K    IV G  S I GW +++FF      + +     F D
Subjt:  KKPGWYYMCARKGAGGIVKGPTS-IKGWVKKWFFASGEWLAKDETGHTFFD
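The alignments above use the usual three-line layout: a Query line, a midline, and a Subjt line. In this convention a letter on the midline marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of counting these for one block is shown below. The helper name `midline_stats` is hypothetical, and the sketch is not claimed to reproduce the % identity values listed for each hit, which the search tool presumably computes over the full alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, aligned_length) for one alignment block.

    `query` and `midline` are the residue columns only, with the
    'Query:  ' prefix and the matching leading indent already removed.
    """
    # Trailing spaces are easily lost in copied text, so pad the midline
    # to the query length before counting.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# First 26 columns of the first GenBank hit's alignment.
query = "MCARKGAGGIVKGPTSIKGWVKKWFF"
midline = "MCARKGA GIVKGPTSIKGWV+KWF+"
identical, positives, length = midline_stats(query, midline)
print(identical, positives, length)
print(round(100 * identical / length, 2))
```

Summing the three counts over every block of a hit, then dividing identical by aligned length, gives an identity estimate for the whole alignment.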


Sequences
CDS sequence
ATGCAAAGATCTGCACAACAGTATTGTACGAACTTAGCTCAAACCCGGTCTCCGGTCCGACCTGAACACAAGAGTGGACATGCACAAGAGAGTAAACACTCTGACGCTCA
ATTCAGTATAGGTGGTGGATCCGACAGTACACACGACCGGCGGTTACTTGTCTTTTCTTACATCGGACCTGTCGAGTTTCACGGTAGATCGGACCCCGAGCAGGTCGTAC
CTCGGCTTTTACACTTAGCCTTTTCAATGGTTGTGACCGGTCTTCTCGTCGGTTCGAGGTCATACCTTACGTTTCCTGAATTTTCGGAGTTCGATCTGAAAGCAGCTCAA
ACCCTCGGTAGGTCAGTCTCTTCCCCTTTTTCACTTTCTTTTTCAAGTGTAGTCCCCATGTCTTCCTCCTTTAGCAGTGATAGCTTAGGATCTGATGAGAGTTTAGCTCG
TAGGTTAGAGTCCGAGCTCGAAGAGATAGAAAACTTTAGGTTTTCCGATGACGGGGAGGACAGCGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATAC
CCGAGCATTACCTCAGACATCTTGTTAGAGGGTTCGCTATCCCAGATAACATCCTCCTTAGGATTCCGGAGGAGGGGGAAAAAGCTGACAATCCTCCAGAGGGATGGGTC
ACTCTTTATCTCAAAATGTTTGAGTACGGCCTCAGACTTCCTCTGCATCCTTTCATTCAAGAGTTCTTGAACCGAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGG
ATGGGGTTTCATTTTTGCCTTGGCCATCCTTTTTTGGTTACGAGCTCGGGACAACGAAGTGGCTGAGCTATTGGATGTTGGGCAGCTCCTCGCATGCTTCGAAGCGAAGA
GGATAGCCAAGAAGCCTGGTTGGTACTATATGTGCGCAAGGAAGGGCGCAGGAGGTATAGTGAAGGGGCCGACCTCCATCAAGGGATGGGTGAAGAAGTGGTTTTTTGCC
TCTGGGGAATGGCTGGCAAAGGATGAGACTGGTCATACCTTCTTTGATGTTTCCACTAGGTTTGGGAACTTAGTGTTGATTAGGCCGGTTCCCAAACTCACTCAAACTTC
CTTTGATACGCTGAAGTACTACAAGGAGCACTTTCCAAAGGGCAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAATCTGGGTTGTTAGACTACAACCCCT
TAGTTCGTCCCATCGAAGTGTCAAGGTCGAACTCCGAACTAGCCATGGTGTGCGGATTTACAAGTAGCGTGAGGCGCAAGTCTAAAGGTCGCGCTCATGCTCTCAAGGCT
GTCAATAGCTCGGAGCCTGTAACTCCTGCTGTGGCAGGGCCGGCCTCAGAAGATCCAGCCCCAACCTCAGAAAATCCAACTCCGGTGATCGAGCTGGACTCGGTTGAGGA
GCACTCCAAGGAGAAGCGCCCAAGAGGCGAGTCTGAGGCATTGGATGTATCTCCCCTCAACGAGGTGAGAGATGAGTCTCCTCTGAAGAGGAGAAGGAAGAAGAAGAAGA
CTACCTCCTTCTCAAAGGTCGGAGCTCGTGGGGCCTTGCCCACGAGATTCGCAGATCTGGTGGACGACCTTGAAGCCAAGATGGGTGGGATGTCCGACGTGCCGACGCAG
TTCCGAATAGAACCGTCCAGCTTCGGGGTGAAGGATCAAGTGTCCCACATCTCGGCCGCGAGCTTGGATCGCTGCCTCAGAATGGCGTCAAAGTTTGTAAGTGACCTAGG
GTCCGTTCTCCAGAGGACCATCGACCACGCCGCTGAGGCGTTCGTTGCTTCCATTCTCTCGGCTATAGCAATAAAGGCCGAGCTGGATGGAAGGGAAGCTTTAGCAGCAA
GGGAGAAGGAGGACTTATCTACTGCCTTGGAGGCTGCTACCACCATGAAGGGTGAGCTGCTGAAGGCTCGCTTTGAGGTGGACATCTTAAAGGCCGAGGTGGAGGCCAAG
ACCGAGTTGCTGAGGAGGGAAGATGAGAGGCGCAAGGCTTAG
mRNA sequence
ATGCAAAGATCTGCACAACAGTATTGTACGAACTTAGCTCAAACCCGGTCTCCGGTCCGACCTGAACACAAGAGTGGACATGCACAAGAGAGTAAACACTCTGACGCTCA
ATTCAGTATAGGTGGTGGATCCGACAGTACACACGACCGGCGGTTACTTGTCTTTTCTTACATCGGACCTGTCGAGTTTCACGGTAGATCGGACCCCGAGCAGGTCGTAC
CTCGGCTTTTACACTTAGCCTTTTCAATGGTTGTGACCGGTCTTCTCGTCGGTTCGAGGTCATACCTTACGTTTCCTGAATTTTCGGAGTTCGATCTGAAAGCAGCTCAA
ACCCTCGGTAGGTCAGTCTCTTCCCCTTTTTCACTTTCTTTTTCAAGTGTAGTCCCCATGTCTTCCTCCTTTAGCAGTGATAGCTTAGGATCTGATGAGAGTTTAGCTCG
TAGGTTAGAGTCCGAGCTCGAAGAGATAGAAAACTTTAGGTTTTCCGATGACGGGGAGGACAGCGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATAC
CCGAGCATTACCTCAGACATCTTGTTAGAGGGTTCGCTATCCCAGATAACATCCTCCTTAGGATTCCGGAGGAGGGGGAAAAAGCTGACAATCCTCCAGAGGGATGGGTC
ACTCTTTATCTCAAAATGTTTGAGTACGGCCTCAGACTTCCTCTGCATCCTTTCATTCAAGAGTTCTTGAACCGAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGG
ATGGGGTTTCATTTTTGCCTTGGCCATCCTTTTTTGGTTACGAGCTCGGGACAACGAAGTGGCTGAGCTATTGGATGTTGGGCAGCTCCTCGCATGCTTCGAAGCGAAGA
GGATAGCCAAGAAGCCTGGTTGGTACTATATGTGCGCAAGGAAGGGCGCAGGAGGTATAGTGAAGGGGCCGACCTCCATCAAGGGATGGGTGAAGAAGTGGTTTTTTGCC
TCTGGGGAATGGCTGGCAAAGGATGAGACTGGTCATACCTTCTTTGATGTTTCCACTAGGTTTGGGAACTTAGTGTTGATTAGGCCGGTTCCCAAACTCACTCAAACTTC
CTTTGATACGCTGAAGTACTACAAGGAGCACTTTCCAAAGGGCAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAATCTGGGTTGTTAGACTACAACCCCT
TAGTTCGTCCCATCGAAGTGTCAAGGTCGAACTCCGAACTAGCCATGGTGTGCGGATTTACAAGTAGCGTGAGGCGCAAGTCTAAAGGTCGCGCTCATGCTCTCAAGGCT
GTCAATAGCTCGGAGCCTGTAACTCCTGCTGTGGCAGGGCCGGCCTCAGAAGATCCAGCCCCAACCTCAGAAAATCCAACTCCGGTGATCGAGCTGGACTCGGTTGAGGA
GCACTCCAAGGAGAAGCGCCCAAGAGGCGAGTCTGAGGCATTGGATGTATCTCCCCTCAACGAGGTGAGAGATGAGTCTCCTCTGAAGAGGAGAAGGAAGAAGAAGAAGA
CTACCTCCTTCTCAAAGGTCGGAGCTCGTGGGGCCTTGCCCACGAGATTCGCAGATCTGGTGGACGACCTTGAAGCCAAGATGGGTGGGATGTCCGACGTGCCGACGCAG
TTCCGAATAGAACCGTCCAGCTTCGGGGTGAAGGATCAAGTGTCCCACATCTCGGCCGCGAGCTTGGATCGCTGCCTCAGAATGGCGTCAAAGTTTGTAAGTGACCTAGG
GTCCGTTCTCCAGAGGACCATCGACCACGCCGCTGAGGCGTTCGTTGCTTCCATTCTCTCGGCTATAGCAATAAAGGCCGAGCTGGATGGAAGGGAAGCTTTAGCAGCAA
GGGAGAAGGAGGACTTATCTACTGCCTTGGAGGCTGCTACCACCATGAAGGGTGAGCTGCTGAAGGCTCGCTTTGAGGTGGACATCTTAAAGGCCGAGGTGGAGGCCAAG
ACCGAGTTGCTGAGGAGGGAAGATGAGAGGCGCAAGGCTTAG
Protein sequence
MQRSAQQYCTNLAQTRSPVRPEHKSGHAQESKHSDAQFSIGGGSDSTHDRRLLVFSYIGPVEFHGRSDPEQVVPRLLHLAFSMVVTGLLVGSRSYLTFPEFSEFDLKAAQ
TLGRSVSSPFSLSFSSVVPMSSSFSSDSLGSDESLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRHLVRGFAIPDNILLRIPEEGEKADNPPEGWV
TLYLKMFEYGLRLPLHPFIQEFLNRTGLAPAQVAPNGWGFIFALAILFWLRARDNEVAELLDVGQLLACFEAKRIAKKPGWYYMCARKGAGGIVKGPTSIKGWVKKWFFA
SGEWLAKDETGHTFFDVSTRFGNLVLIRPVPKLTQTSFDTLKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPLVRPIEVSRSNSELAMVCGFTSSVRRKSKGRAHALKA
VNSSEPVTPAVAGPASEDPAPTSENPTPVIELDSVEEHSKEKRPRGESEALDVSPLNEVRDESPLKRRRKKKKTTSFSKVGARGALPTRFADLVDDLEAKMGGMSDVPTQ
FRIEPSSFGVKDQVSHISAASLDRCLRMASKFVSDLGSVLQRTIDHAAEAFVASILSAIAIKAELDGREALAAREKEDLSTALEAATTMKGELLKARFEVDILKAEVEAK
TELLRREDERRKA
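The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the helper name `translate` is hypothetical, and only short excerpts of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and the final line of the CDS listed above.
print(translate("ATGCAAAGATCTGCACAACAG"))
print(translate("ACCGAGTTGCTGAGGAGGGAAGATGAGAGGCGCAAGGCTTAG"))
```

Pasting the full CDS into `translate` (line breaks are ignored) should give a sequence that can be compared directly with the protein sequence above.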