; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g28580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g28580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr11:20866345..20868856
RNA-Seq ExpressionMoc11g28580
SyntenyMoc11g28580
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]5.5e-11185.43Show/hide
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR
        AVRPIESSRPNSELAMVCGFAS++KRKSKG+AHALEAAQSSKP TP V GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.2e-13993.04Show/hide
Query:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR

Query:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         +KRKSKGRAHALEAAQSSKP TP V GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]4.5e-10596.91Show/hide
Query:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGADGIVKGPTSIKGWVR
Subjt:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR

Query:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]5.0e-18995.47Show/hide
Query:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRLPEEGERADNPPEGWVTLYFKIFEYGLR
        MSSS SSNL SDLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRR FAIPENILLRLPEEGERADNPPEGWVTLYFK+FEYGLR
Subjt:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRLPEEGERADNPPEGWVTLYFKIFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASW
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYAS 
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASW

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSMKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS +KRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSMKRKSK

Query:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATP V GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.9e-16870.6Show/hide
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR IE+SRPNSELAMVCGF  S+KRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVNDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LR ASKFV+DPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVNDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTTKDELLKAHSEVEILKVEVESQAELLKKEEDRRKAQLQAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T K ELLKA  EV+IL+ EV+++ +LLKKE ++ KA L+AAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTTKDELLKAHSEVEILKVEVESQAELLKKEEDRRKAQLQAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKKLEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L
Subjt:  QALEAKDKKLEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.7e-11185.43Show/hide
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR
        AVRPIESSRPNSELAMVCGFAS++KRKSKG+AHALEAAQSSKP TP V GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138263.0e-13993.04Show/hide
Query:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR

Query:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         +KRKSKGRAHALEAAQSSKP TP V GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

A0A6J1DWF1 uncharacterized protein LOC1110251082.2e-10596.91Show/hide
Query:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGADGIVKGPTSIKGWVR
Subjt:  IFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVR

Query:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255022.4e-18995.47Show/hide
Query:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRLPEEGERADNPPEGWVTLYFKIFEYGLR
        MSSS SSNL SDLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRR FAIPENILLRLPEEGERADNPPEGWVTLYFK+FEYGLR
Subjt:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRLPEEGERADNPPEGWVTLYFKIFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASW
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYAS 
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASW

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSMKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS +KRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSMKRKSK

Query:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATP V GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.4e-16870.6Show/hide
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR IE+SRPNSELAMVCGF  S+KRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVNDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LR ASKFV+DPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVNDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTTKDELLKAHSEVEILKVEVESQAELLKKEEDRRKAQLQAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T K ELLKA  EV+IL+ EV+++ +LLKKE ++ KA L+AAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTTKDELLKAHSEVEILKVEVESQAELLKKEEDRRKAQLQAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKKLEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L
Subjt:  QALEAKDKKLEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42060.1 myosin heavy chain-related1.9e-0526.9Show/hide
Query:  SRIPEHYLGSLRRRFAIPENILLRLPEEGERADNPPEGWVTLYFKIF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE
        SR    + G    +   PE +   +PE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLRRRFAIPENILLRLPEEGERADNPPEGWVTLYFKIF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE

Query:  LLDVDQLLACFEAKKIAKKPGRFYMCA--RKGADGIVKGPTS-IKGWVRKWFYASWEWLAKDESGRSFFDV
        ++D+D L     +  I  K  R  +CA  R+G   I  G TS ++ W + +F+A    ++ D++  S  ++
Subjt:  LLDVDQLLACFEAKKIAKKPGRFYMCA--RKGADGIVKGPTS-IKGWVRKWFYASWEWLAKDESGRSFFDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAATCTTGGAGCACCAATAGGGGTCCTCCACGCGTCCAGGGTATTCCCTTCTCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTT
CGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGAC
GGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGAGGTTCGCTATCCCTGAGAACAT
CCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATATTTGAGTACGGCCTCAGACTTCCCCTTCACCCTT
TTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGAT
AGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGA
CGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCTGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCC
CCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTACGCAAGCCTCCTTTGACACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTC
GGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTG
CGGATTTGCAAGCAGCATGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGTCGTGGCAGGGCCTGCCTCGGAAG
ATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGG
GAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCC
TACGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCGAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACC
GCTGCCTAAGGATGGCGTCCAAATTTGTGAACGACCCTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATCCAATCGGCTCTGGCT
GTAAAGGCCGAGTTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAAGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCACGAAGGATGAGCTGCTGAAGGC
TCACTCTGAGGTGGAGATTTTGAAGGTCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCAAGCTGCCCACGCTATCACCA
GGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGAAGCTGGAGCATGCGACTGCCGAGCTGGAGACG
GCGAAGGAGCGGCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCAT
GAAGGGCATTGCTTCCGACATGCCCGACCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAATCTTGGAGCACCAATAGGGGTCCTCCACGCGTCCAGGGTATTCCCTTCTCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTT
CGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGAC
GGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGAGGTTCGCTATCCCTGAGAACAT
CCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATATTTGAGTACGGCCTCAGACTTCCCCTTCACCCTT
TTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGAT
AGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGA
CGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCTGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCC
CCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTACGCAAGCCTCCTTTGACACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTC
GGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTG
CGGATTTGCAAGCAGCATGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGTCGTGGCAGGGCCTGCCTCGGAAG
ATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGG
GAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCC
TACGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCGAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACC
GCTGCCTAAGGATGGCGTCCAAATTTGTGAACGACCCTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATCCAATCGGCTCTGGCT
GTAAAGGCCGAGTTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAAGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCACGAAGGATGAGCTGCTGAAGGC
TCACTCTGAGGTGGAGATTTTGAAGGTCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCAAGCTGCCCACGCTATCACCA
GGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGAAGCTGGAGCATGCGACTGCCGAGCTGGAGACG
GCGAAGGAGCGGCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCAT
GAAGGGCATTGCTTCCGACATGCCCGACCTTTAG
Protein sequenceShow/hide protein sequence
MSNLGAPIGVLHASRVFPSPNIGPLSVWSDLDLAEKFIRLALDTWSYLTFPEFLEFDLKAARTLGRSVSSLSLSNVVAMSSSFSSNLGSDLARRLESELEEIENFRFSDD
GEDSDASTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRLPEEGERADNPPEGWVTLYFKIFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARD
SEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASWEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKV
GTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSMKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVR
EEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVNDPGSVLQRTIDYAAEAFVASIQSALA
VKAELDGREVLAAREKEEFSAALEAASSTTKDELLKAHSEVEILKVEVESQAELLKKEEDRRKAQLQAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKKLEHATAELET
AKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL