CuGenDBv2

Moc02g00960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g00960
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr2:783758..786468
RNA-Seq Expression: Moc02g00960
Synteny: Moc02g00960
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
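
The genome location above follows a chrom:start..end pattern. As a minimal sketch (assuming the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper and not part of CuGenDB), the string can be split into its parts and the genomic span computed:

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr2:783758..786468")
# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2711 bp on chr2. With 0-based, half-open coordinates the span would instead be `end - start`.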


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e-value: 1.4e-135 | % identity: 90.22
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGA GIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTEGGGRPDRGGGRPAFGRGSE
        GVKRKSKGRAHALEAAQSSKP TP VVGPASEDPAPVIELESSGGPSREK PRDQTE           P  G G++
Subjt:  GVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTEGGGRPDRGGGRPAFGRGSE

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | e-value: 6.5e-104 | % identity: 97.4
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGA GIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] | e-value: 2.9e-104 | % identity: 96.39
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGADGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e-value: 1.7e-189 | % identity: 96.29
Query:  MSSSISSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMC RKGA GIVKG TSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTE
        GRAHALEAAQSSKPATP VVGPASEDPA VIELESSGGPSREK PRDQTE
Subjt:  GRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTE

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e-value: 7.8e-134 | % identity: 56.14
Query:  MCTRKGADGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MC RKG  GIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI  +PEL QA+FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCTRKGADGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKHPRDQTEG--GGRPDRGGGR
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EK  R+++E       +   G 
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKHPRDQTEG--GGRPDRGGGR

Query:  PAFGRGSEGGSPSEAKKEEKEGDLPLGGRSL--------RGLACKFRRSGGRSCGQDGRD-VSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFV
            R  +    S + +    G LP     L        RG +    R G        +D VSRISA  LDR LRRASKFVS PGSVLQRTID  AEAF+
Subjt:  PAFGRGSEGGSPSEAKKEEKEGDLPLGGRSL--------RGLACKFRRSGGRSCGQDGRD-VSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDAREVLAAREKEEFSAALEAASSTMKDKLLKAHSEVETLKAE---------------------------------------------
        ASI  A+ VKAELD RE LAA+E+E   AALEAA +T+K +LLKA  EV+ L+AE                                             
Subjt:  ASIQSALAVKAELDAREVLAAREKEEFSAALEAASSTMKDKLLKAHSEVETLKAE---------------------------------------------

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK +Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDPEED--------QVGSTQEGAP
        LDSDYSD EE+        +VG+TQE  P
Subjt:  LDSDYSDPEED--------QVGSTQEGAP

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CR42 uncharacterized protein LOC111013826 | e-value: 6.9e-136 | % identity: 90.22
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGA GIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTEGGGRPDRGGGRPAFGRGSE
        GVKRKSKGRAHALEAAQSSKP TP VVGPASEDPAPVIELESSGGPSREK PRDQTE           P  G G++
Subjt:  GVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTEGGGRPDRGGGRPAFGRGSE

A0A6J1DWD2 uncharacterized protein LOC111024680 | e-value: 3.1e-104 | % identity: 97.4
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGA GIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DWF1 uncharacterized protein LOC111025108 | e-value: 1.4e-104 | % identity: 96.39
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGADGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502 | e-value: 8.3e-190 | % identity: 96.29
Query:  MSSSISSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMC RKGA GIVKG TSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSI PVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTE
        GRAHALEAAQSSKPATP VVGPASEDPA VIELESSGGPSREK PRDQTE
Subjt:  GRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTE

A0A6J1DZB3 uncharacterized protein LOC111025665 | e-value: 3.8e-134 | % identity: 56.14
Query:  MCTRKGADGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MC RKG  GIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI  +PEL QA+FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCTRKGADGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKHPRDQTEG--GGRPDRGGGR
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EK  R+++E       +   G 
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKHPRDQTEG--GGRPDRGGGR

Query:  PAFGRGSEGGSPSEAKKEEKEGDLPLGGRSL--------RGLACKFRRSGGRSCGQDGRD-VSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFV
            R  +    S + +    G LP     L        RG +    R G        +D VSRISA  LDR LRRASKFVS PGSVLQRTID  AEAF+
Subjt:  PAFGRGSEGGSPSEAKKEEKEGDLPLGGRSL--------RGLACKFRRSGGRSCGQDGRD-VSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDAREVLAAREKEEFSAALEAASSTMKDKLLKAHSEVETLKAE---------------------------------------------
        ASI  A+ VKAELD RE LAA+E+E   AALEAA +T+K +LLKA  EV+ L+AE                                             
Subjt:  ASIQSALAVKAELDAREVLAAREKEEFSAALEAASSTMKDKLLKAHSEVETLKAE---------------------------------------------

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK +Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDPEED--------QVGSTQEGAP
        LDSDYSD EE+        +VG+TQE  P
Subjt:  LDSDYSDPEED--------QVGSTQEGAP

SwissProt top hits (e-value, % identity, alignment)
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic | e-value: 6.1e-04 | % identity: 30.3
Query:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA

Query:  ELLDVDQLLACFEAKRIAK-KPGRFYMCTRKG
          + +  L    E +R+ K +  R+Y+   KG
Subjt:  ELLDVDQLLACFEAKRIAK-KPGRFYMCTRKG

Arabidopsis top hits (e-value, % identity, alignment)
AT3G42060.1 myosin heavy chain-related | e-value: 1.3e-04 | % identity: 22.02
Query:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE
        SR    + G        PE +   +PE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE

Query:  LLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDV
        ++D+D            K        +R+G       ++ ++ W + +F+A    ++ D++  S  ++
Subjt:  LLDVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDV


Sequences
CDS sequence
ATGAGCCATCTTGGAGCACCAATGGGGGTCCTCCACGTGTCCAGGGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATCCGACTTGCTTTGGACACGTGGCGACTTCCTATTTATCTGAAGGCAGCTCGAACCCCTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAATTGCCATGTCGT
CCTCTATTAGCAGCAACCTAGGGTCCGATCTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCC
ACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGA
GGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAGTTTCTCTTTC
GGACTGGGTTGGCTCCGGCTCAAGTGGCCCCTAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTG
GACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAAAAGCCTGGTCGGTTCTATATGTGCACAAGGAAAGGCGCAGACGGTATAGTTAAGGGGTCGAC
CTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCTACTAGGTTTGGGAACCTAG
TTTCAATCTGGCCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAAGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAA
CTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAA
GCGCAAGTCTAAAGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACACCTGTCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGC
TGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCACCCCAGGGATCAGACCGAGGGGGGTGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAAGCGAG
GGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCGGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATC
CTGCGGCCAGGATGGGCGGGACGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGA
CCATCGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGCGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTC
TCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATAAGCTGCTGAAGGCTCATTCTGAGGTGGAGACTTTGAAGGCCGAGGCGCTTGAAGCGAAGGATAAGGAGCT
GGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACT
TTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAATGAGGTATGCCGAGAAGTGGGCGTCT
GGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGG
CGCTCCCCCAGCAGGCTCTTAG
mRNA sequence
ATGAGCCATCTTGGAGCACCAATGGGGGTCCTCCACGTGTCCAGGGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATCCGACTTGCTTTGGACACGTGGCGACTTCCTATTTATCTGAAGGCAGCTCGAACCCCTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAATTGCCATGTCGT
CCTCTATTAGCAGCAACCTAGGGTCCGATCTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCC
ACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGA
GGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAGTTTCTCTTTC
GGACTGGGTTGGCTCCGGCTCAAGTGGCCCCTAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTG
GACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAAAAGCCTGGTCGGTTCTATATGTGCACAAGGAAAGGCGCAGACGGTATAGTTAAGGGGTCGAC
CTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCTACTAGGTTTGGGAACCTAG
TTTCAATCTGGCCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAAGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAA
CTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAA
GCGCAAGTCTAAAGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACACCTGTCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGC
TGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCACCCCAGGGATCAGACCGAGGGGGGTGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAAGCGAG
GGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCGGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATC
CTGCGGCCAGGATGGGCGGGACGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGA
CCATCGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGCGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTC
TCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATAAGCTGCTGAAGGCTCATTCTGAGGTGGAGACTTTGAAGGCCGAGGCGCTTGAAGCGAAGGATAAGGAGCT
GGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACT
TTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAATGAGGTATGCCGAGAAGTGGGCGTCT
GGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGG
CGCTCCCCCAGCAGGCTCTTAG
Protein sequence
MSHLGAPMGVLHVSRVFPSPNIGPLSVWSDLDLAEKFIRLALDTWRLPIYLKAARTPGRSVSSLSLSNVIAMSSSISSNLGSDLARRLESELEEIENFRFSDDGEDSDAS
TSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL
DVDQLLACFEAKRIAKKPGRFYMCTRKGADGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIWPVPELTQASFDTLKYYKERFPKGRKVGTLVTDE
LLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKHPRDQTEGGGRPDRGGGRPAFGRGSE
GGSPSEAKKEEKEGDLPLGGRSLRGLACKFRRSGGRSCGQDGRDVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDAREVLAAREKEEF
SAALEAASSTMKDKLLKAHSEVETLKAEALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWAS
GPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPPAGS
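
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper written for illustration, and only the first 30 nt of the CDS are used here to keep the example short:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: this gene is translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_prefix = "ATGAGCCATCTTGGAGCACCAATGGGGGTC"  # first 30 nt of the CDS listed above
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The result can be compared with the first ten residues of the protein sequence; passing the full CDS to the same function allows a check of the whole record.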