CuGenDBv2

Moc01g10830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g10830
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr1:6654796..6658943
RNA-Seq Expression: Moc01g10830
Synteny: Moc01g10830
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
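
The genome location field above uses a `chromosome:start..end` layout. A minimal sketch of how such a string could be split into its parts is given below. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr1:6654796..6658943")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 4148 bp; with a half-open convention the figure would be one less.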


Homology
GenBank top hits (e value, % identity, alignment)
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 2.2e-77 | % identity: 67.9
Query:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR
        MFEYGLRLPLHPFVQ+                                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPT IKGWVR
Subjt:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKERFPR                                     AMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS
         VKRKSKGRAHALEAAQSSKP TPAVVGP SEDPAPVIELESS
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | e value: 1.1e-57 | % identity: 74.38
Query:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR
        MFEYGLRLPLHPFVQ+                                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPT IKGWVR
Subjt:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKERFPR    G
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] | e value: 2.8e-56 | % identity: 73.12
Query:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR
        MFEYGLRLPLHPFVQ+                                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPT IKGWVR
Subjt:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKE FPR    G
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 1.5e-126 | % identity: 74.26
Query:  MSSSFSSNLGSDEELARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVALYFKMFEYG
        MSSS SSNL SD  LARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWV LYFKMFEYG
Subjt:  MSSSFSSNLGSDEELARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVALYFKMFEYG

Query:  LRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVRKWFYA
        LRLPLHPFVQ+                                  EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPT IKGWVRKWFYA
Subjt:  LRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKERFPR                                     AMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS
        SKGRAHALEAAQSSKPATPAVVGP SEDPA VIELESS
Subjt:  SKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 4.9e-77 | % identity: 47.36
Query:  MCARKGAGGIVKGPTFIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-----------------------
        MCARKG GGIVKGPT IKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLV I+ +PEL QA+FDTLK+YK+ FPR                       
Subjt:  MCARKGAGGIVKGPTFIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-----------------------

Query:  --------------AMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPTSEDPAPVIELESS---------------------RVLRGR
                      AMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S                       +RG 
Subjt:  --------------AMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPTSEDPAPVIELESS---------------------RVLRGR

Query:  SA--------------------------------PGIRPRRWTPCPWARRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV
        S                                 P  R R  +       +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  SA--------------------------------PGIRPRRWTPCPWARRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDGRE-----------------------------------AEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQA
        ASI  A+ VKAELDGRE                                   AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIQSALAVKAELDGRE-----------------------------------AEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  LEANEEELKHATAELE
        LE  +  +   T EL+
Subjt:  LEANEEELKHATAELE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 1.1e-77 | % identity: 67.9
Query:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR
        MFEYGLRLPLHPFVQ+                                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPT IKGWVR
Subjt:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKERFPR                                     AMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS
         VKRKSKGRAHALEAAQSSKP TPAVVGP SEDPAPVIELESS
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS

A0A6J1DWD2 uncharacterized protein LOC111024680 | e value: 5.4e-58 | % identity: 74.38
Query:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR
        MFEYGLRLPLHPFVQ+                                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPT IKGWVR
Subjt:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKERFPR    G
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG

A0A6J1DWF1 uncharacterized protein LOC111025108 | e value: 1.3e-56 | % identity: 73.12
Query:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR
        MFEYGLRLPLHPFVQ+                                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPT IKGWVR
Subjt:  MFEYGLRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKE FPR    G
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPRAMVCG

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 7.2e-127 | % identity: 74.26
Query:  MSSSFSSNLGSDEELARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVALYFKMFEYG
        MSSS SSNL SD  LARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWV LYFKMFEYG
Subjt:  MSSSFSSNLGSDEELARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVALYFKMFEYG

Query:  LRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVRKWFYA
        LRLPLHPFVQ+                                  EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPT IKGWVRKWFYA
Subjt:  LRLPLHPFVQD----------------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLV IRPVPELTQASFDTLKYYKERFPR                                     AMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-------------------------------------AMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS
        SKGRAHALEAAQSSKPATPAVVGP SEDPA VIELESS
Subjt:  SKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESS

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 2.3e-77 | % identity: 47.36
Query:  MCARKGAGGIVKGPTFIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-----------------------
        MCARKG GGIVKGPT IKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLV I+ +PEL QA+FDTLK+YK+ FPR                       
Subjt:  MCARKGAGGIVKGPTFIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYYKERFPR-----------------------

Query:  --------------AMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPTSEDPAPVIELESS---------------------RVLRGR
                      AMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S                       +RG 
Subjt:  --------------AMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPTSEDPAPVIELESS---------------------RVLRGR

Query:  SA--------------------------------PGIRPRRWTPCPWARRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV
        S                                 P  R R  +       +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  SA--------------------------------PGIRPRRWTPCPWARRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDGRE-----------------------------------AEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQA
        ASI  A+ VKAELDGRE                                   AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIQSALAVKAELDGRE-----------------------------------AEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  LEANEEELKHATAELE
        LE  +  +   T EL+
Subjt:  LEANEEELKHATAELE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G32010.1 myosin heavy chain-related | e value: 1.0e-03 | % identity: 21.2
Query:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVALYFKMF-EYGLRLPLHPFV-
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVALYFKMF-EYGLRLPLHPFV-

Query:  ------QDEEAEL---------------------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVRKWFYA
              Q   ++L                     L V+ +       ++  K G+ Y+ + +G   +  GP+  + W+  +FYA
Subjt:  ------QDEEAEL---------------------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVRKWFYA
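
In the pairwise alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts these from a midline string. The function name `midline_stats` is hypothetical, and whether the % identity values reported on this page were derived in exactly this way (identities divided by alignment length including gaps) is an assumption.

```python
def midline_stats(midline, aligned_length=None):
    """Summarise a BLAST-style midline.

    aligned_length defaults to len(midline); pass it explicitly when
    trailing blanks may have been stripped from the midline.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = identities + midline.count("+")       # '+' = conservative substitution
    return {
        "identities": identities,
        "positives": positives,
        "aligned_length": aligned_length,
        "pct_identity": 100.0 * identities / aligned_length,
    }


# First 16 columns of the first GenBank alignment midline.
stats = midline_stats("MFEYGLRLPLHPFVQ+")
print(stats)
```

To score a whole hit, the midlines of all its alignment blocks would be concatenated first, with `aligned_length` set to the total number of alignment columns.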


Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGACGAGGAGTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTAGCTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGATGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATG
TGCGCAAGGAAAGGCGCAGGTGGTATAGTTAAGGGGCCGACCTTCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAG
TCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTTAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTAC
AAGGAGCGTTTTCCGAGGGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCT
GCCACCCCTGCTGTGGTAGGGCCAACCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCG
AGGCGGTGGACGCCTTGCCCTTGGGCGAGGAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTA
AGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTG
AAGGCCGAGCTGGATGGGAGGGAAGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATC
ACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAATGAGGAGGAGCTGAAGCATGCGACTGCCGAG
CTGGAGACGGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCATCGATCTCGGTGGTCTGAAGAAGAGGTACCTGAGCAGTGGGCGTCTGGGCCTAGCG
GCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTC
CTCAAGCAGGCTCTTAGGCGACCACCCTTCAAGAGGCTTCGCTGCTTTCTTTTCTTCTTCTTTTTTTGTTTTGTTTGTAAGTGTCAGGGCAGAGCTGCAAGCCAT
TGTCACCTCGCACCTCTTACTTTTGAGGTTCAGAGGCTTGATCTTTTCACATCGTCCCTTTACCTTGAAGGTTTGAATTTTAAGTTCATCAGTGGTTTTGGCATC
GCACCTCGTACCCTTAGATCCATTGAAAATCCTTTTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAG
ATCGGCCAATATGTACGTCCCAGGTCGGACTATGCCCTTGACCTCAAACGGGCCCTCCCAGGCCCTCTTCAGGGGTTAGGCATCTCAATAGAGGTACGGAAAAGC
CGCGTCGGTACAATGCTCCACCTCGGACCACGAACCGAGCTGCTCGCCTTGCCAACTTTCTGCGCTCCTTGGGGTCTTTTGGTGAGTTGCCTCTAA
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGACGAGGAGTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTAGCTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGATGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATG
TGCGCAAGGAAAGGCGCAGGTGGTATAGTTAAGGGGCCGACCTTCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAG
TCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTTAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTAC
AAGGAGCGTTTTCCGAGGGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCT
GCCACCCCTGCTGTGGTAGGGCCAACCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCG
AGGCGGTGGACGCCTTGCCCTTGGGCGAGGAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTA
AGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTG
AAGGCCGAGCTGGATGGGAGGGAAGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATC
ACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAATGAGGAGGAGCTGAAGCATGCGACTGCCGAG
CTGGAGACGGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCATCGATCTCGGTGGTCTGAAGAAGAGGTACCTGAGCAGTGGGCGTCTGGGCCTAGCG
GCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTC
CTCAAGCAGGCTCTTAGGCGACCACCCTTCAAGAGGCTTCGCTGCTTTCTTTTCTTCTTCTTTTTTTGTTTTGTTTGTAAGTGTCAGGGCAGAGCTGCAAGCCAT
TGTCACCTCGCACCTCTTACTTTTGAGGTTCAGAGGCTTGATCTTTTCACATCGTCCCTTTACCTTGAAGGTTTGAATTTTAAGTTCATCAGTGGTTTTGGCATC
GCACCTCGTACCCTTAGATCCATTGAAAATCCTTTTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAG
ATCGGCCAATATGTACGTCCCAGGTCGGACTATGCCCTTGACCTCAAACGGGCCCTCCCAGGCCCTCTTCAGGGGTTAGGCATCTCAATAGAGGTACGGAAAAGC
CGCGTCGGTACAATGCTCCACCTCGGACCACGAACCGAGCTGCTCGCCTTGCCAACTTTCTGCGCTCCTTGGGGTCTTTTGGTGAGTTGCCTCTAA
Protein sequence
MSSSFSSNLGSDEELARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVALYFKMFEYGLRLPL
HPFVQDEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTFIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVLIRPVPELTQASFDTLKYY
KERFPRAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPTSEDPAPVIELESSRVLRGRSAPGIRPRRWTPCPWARRVEPSSSGVRDQVSRISAASLDRCL
RRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEANEEELKHATAE
LETEGASQQWSPIGGIIDLGGLKKRYLSSGRLGLAAPLAPKRWWISTSEIWTLTTPTSKRIRSAPLKRALLKQALRRPPFKRLRCFLFFFFFCFVCKCQGRAASH
CHLAPLTFEVQRLDLFTSSLYLEGLNFKFISGFGIAPRTLRSIENPFGISRIITLQVLRVPRVREDVSFQIGQYVRPRSDYALDLKRALPGPLQGLGISIEVRKS
RVGTMLHLGPRTELLALPTFCAPWGLLVSCL
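
One way to cross-check the CDS against the protein sequence is to translate it. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and that the CDS is in frame from its first base. The function name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as they appear on the page.
    Codons with ambiguous bases are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTCGTCCTCTTTTAGCAGCAAC"))  # MSSSFSSN
```

The printed residues match the start of the protein sequence (MSSSFSSN). Translating the full CDS and comparing it with the full protein sequence would extend the same check to the whole record.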