CuGenDBv2

Moc04g01970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g01970
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr4:1254569..1256821
RNA-Seq Expression: Moc04g01970
Synteny: Moc04g01970
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0043167 - ion binding (molecular function)
InterPro domains: NA
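
The Genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below parses that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state. The names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a string such as 'chr4:1254569..1256821' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end = parse_location("chr4:1254569..1256821")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2253 bp on chr4.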


Homology
GenBank top hits (e value, % identity, alignment)
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 1.0e-69; % identity: 63.93)
Query:  MFEYGLRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPF+QEFLFRTGLAPAQVAPNG                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFAS

Query:  IVKRKSKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG
         VKRKSKGRAHALEAAQSS+  TPAVVGPASE+PAPVIELESSG
Subjt:  IVKRKSKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 1.1e-81; % identity: 68.66)
Query:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--
        VRDQVSRISAASLDRCLRRASKF                 AFVASIQSAL VKAELDGRE LAAR+KEEFSAALE ASSTMKDELLKAHSEVE LKAE  
Subjt:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--

Query:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG
                                                   ALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKG
Subjt:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG

Query:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
        IASDMPDLQIDL GLK+RYAE+WAS P GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] (e value: 7.2e-60; % identity: 54.78)
Query:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--
        V+DQVSRISA  LDRCL+RASKF                 AFVASI SA++VKAELDGREALAA+++E  SAALEAA +T+K ELLKA  EV IL+AE  
Subjt:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--

Query:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG
                                                    LE K+  +   TAEL+ +KERL+NG+LLEESFRQH DFDGFAKDFSDAGFKFLMKG
Subjt:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG

Query:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP
        IA+DMP LQIDL  LKK+Y+E+WAS P+GTPGPQ+LV KYVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 3.4e-118; % identity: 70.5)
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLDYPSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGL+YPSRIPEHYLGSLRRGFAIPENIL+R+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLDYPSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
        LRLPLHPF+QEFLFRTGLAPAQVAPNG                  EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVRKWFYA
Subjt:  LRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFASIVKRK
        SGEWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFASIVKRK

Query:  SKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG
        SKGRAHALEAAQSS+ ATPAVVGPASE+PA VIELESSG
Subjt:  SKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 1.1e-89; % identity: 45.42)
Query:  MCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK--------------------
        MCARKG GGIVKGPTSI+GWV KWF+ASGEWLAKDESGR+FFDVPTR             + + F  ++   D  P  RK                    
Subjt:  MCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK--------------------

Query:  -----PPSTP-----MVCGFASIVKRKSKGRAHALEAAQSSESATPAV--------VGPASENPAPVIELESSGVLRG-------------------RSA
               S P     MVCGF   VKRKSKGRAHAL+    +E  TP V         GP+S  P PVIEL+ SG   G                   R  
Subjt:  -----PPSTP-----MVCGFASIVKRKSKGRAHALEAAQSSESATPAV--------VGPASENPAPVIELESSGVLRG-------------------RSA

Query:  PGIRPRR------WTSRPWARSTV------------------QNRAVKF-------WVRDQVSRISAASLDRCLRRASKF-----------------AFV
          +R RR       +S   AR T+                   N  ++F        V+DQVSRISA  LDR LRRASKF                 AF+
Subjt:  PGIRPRR------WTSRPWARSTV------------------QNRAVKF-------WVRDQVSRISAASLDRCLRRASKF-----------------AFV

Query:  ASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE---------------------------------------------
        ASI  A++VKAELDGREALAA+++E   AALEAA +T+K ELLKA  EV+IL+AE                                             
Subjt:  ASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE---------------------------------------------

Query:  ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRD
         LE K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WAS P+GTP PQ+LVDKYVR+
Subjt:  ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRD

Query:  LDSDYSDLEED--------QVGTTQEGAP--QAGS
        LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  LDSDYSDLEED--------QVGTTQEGAP--QAGS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 4.8e-70; % identity: 63.93)
Query:  MFEYGLRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPF+QEFLFRTGLAPAQVAPNG                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFAS

Query:  IVKRKSKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG
         VKRKSKGRAHALEAAQSS+  TPAVVGPASE+PAPVIELESSG
Subjt:  IVKRKSKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG

A0A6J1D971 uncharacterized protein LOC111018538 (e value: 5.5e-82; % identity: 68.66)
Query:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--
        VRDQVSRISAASLDRCLRRASKF                 AFVASIQSAL VKAELDGRE LAAR+KEEFSAALE ASSTMKDELLKAHSEVE LKAE  
Subjt:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--

Query:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG
                                                   ALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKG
Subjt:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG

Query:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
        IASDMPDLQIDL GLK+RYAE+WAS P GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DF31 uncharacterized protein LOC111019909 (e value: 3.5e-60; % identity: 54.78)
Query:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--
        V+DQVSRISA  LDRCL+RASKF                 AFVASI SA++VKAELDGREALAA+++E  SAALEAA +T+K ELLKA  EV IL+AE  
Subjt:  VRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE--

Query:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG
                                                    LE K+  +   TAEL+ +KERL+NG+LLEESFRQH DFDGFAKDFSDAGFKFLMKG
Subjt:  -------------------------------------------ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKG

Query:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP
        IA+DMP LQIDL  LKK+Y+E+WAS P+GTPGPQ+LV KYVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  IASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 1.6e-118; % identity: 70.5)
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLDYPSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGL+YPSRIPEHYLGSLRRGFAIPENIL+R+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLDYPSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
        LRLPLHPF+QEFLFRTGLAPAQVAPNG                  EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVRKWFYA
Subjt:  LRLPLHPFIQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFASIVKRK
        SGEWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK-------------------PPSTP-----------MVCGFASIVKRK

Query:  SKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG
        SKGRAHALEAAQSS+ ATPAVVGPASE+PA VIELESSG
Subjt:  SKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSG

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 5.5e-90; % identity: 45.42)
Query:  MCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK--------------------
        MCARKG GGIVKGPTSI+GWV KWF+ASGEWLAKDESGR+FFDVPTR             + + F  ++   D  P  RK                    
Subjt:  MCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SNSDFVFVQFQSDQSPSLRK--------------------

Query:  -----PPSTP-----MVCGFASIVKRKSKGRAHALEAAQSSESATPAV--------VGPASENPAPVIELESSGVLRG-------------------RSA
               S P     MVCGF   VKRKSKGRAHAL+    +E  TP V         GP+S  P PVIEL+ SG   G                   R  
Subjt:  -----PPSTP-----MVCGFASIVKRKSKGRAHALEAAQSSESATPAV--------VGPASENPAPVIELESSGVLRG-------------------RSA

Query:  PGIRPRR------WTSRPWARSTV------------------QNRAVKF-------WVRDQVSRISAASLDRCLRRASKF-----------------AFV
          +R RR       +S   AR T+                   N  ++F        V+DQVSRISA  LDR LRRASKF                 AF+
Subjt:  PGIRPRR------WTSRPWARSTV------------------QNRAVKF-------WVRDQVSRISAASLDRCLRRASKF-----------------AFV

Query:  ASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE---------------------------------------------
        ASI  A++VKAELDGREALAA+++E   AALEAA +T+K ELLKA  EV+IL+AE                                             
Subjt:  ASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAE---------------------------------------------

Query:  ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRD
         LE K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WAS P+GTP PQ+LVDKYVR+
Subjt:  ALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRD

Query:  LDSDYSDLEED--------QVGTTQEGAP--QAGS
        LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  LDSDYSDLEED--------QVGTTQEGAP--QAGS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G32010.1 myosin heavy chain-related (e value: 9.8e-07; % identity: 25)
Query:  RLESELEEIENFRFSDDGEDSDASTSGQG-----LDY-PSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFIQ
        R+ ++ +   N    D+ E +D + SG+      +D  P+      +G       +P  + IRIP + +R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQG-----LDY-PSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFIQ

Query:  EFLFRTGLAPAQ--VAPNGEEAEL----------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
         F     +A +Q  VA     A L          L V+ +       ++  K G+ Y+ + +G   +  GP+    W+  +FYA
Subjt:  EFLFRTGLAPAQ--VAPNGEEAEL----------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA

AT2G15420.1 myosin heavy chain-related (e value: 7.5e-07; % identity: 31.25)
Query:  PENILIRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFIQEFLFRTGLAPAQ-----------VAPNGEEAEL-LDVDQLLACFEAKRIAKKPGRF
        P  I +  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q           +A  G E  + +D D         R+ + PG +
Subjt:  PENILIRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFIQEFLFRTGLAPAQ-----------VAPNGEEAEL-LDVDQLLACFEAKRIAKKPGRF

Query:  YMCARKGAGGIVKGPTS-IEGWVRKWFY
        Y  A K    IV G  S I GW R++F+
Subjt:  YMCARKGAGGIVKGPTS-IEGWVRKWFY

AT5G38190.1 INVOLVED IN: biological_process unknown (e value: 8.3e-06; % identity: 26.01)
Query:  RFSDD-GEDSDASTSGQG-----LDY-PSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFIQEFLFRTGLAPA
        R++DD  E +D + SG+      +D  P+      +G       +P  + IRIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +
Subjt:  RFSDD-GEDSDASTSGQG-----LDY-PSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFIQEFLFRTGLAPA

Query:  Q--VAPNGEEAEL----------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
        Q  VA     A L          L V+ +       ++  K G+ Y+ + +G   +   P+    W+  +FYA
Subjt:  Q--VAPNGEEAEL----------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
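
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how identities could be counted from one such block. The function names are hypothetical, and the % identity values listed for each hit were presumably computed by the search tool over the full alignment, so this is only an illustration and may not reproduce them.

```python
def alignment_stats(query, midline):
    """Return (identities, positives, length) for one aligned block.

    `query` is the aligned query string (gaps included) and `midline`
    is the match line printed beneath it.
    """
    # Trailing spaces on the midline are often stripped, so pad it back.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

def percent_identity(query, midline):
    identities, _, length = alignment_stats(query, midline)
    return round(100.0 * identities / length, 2) if length else 0.0

# First 15 columns of the first GenBank alignment shown above.
print(alignment_stats("MFEYGLRLPLHPFIQ", "MFEYGLRLPLHPF+Q"))
print(percent_identity("MFEYGLRLPLHPFIQ", "MFEYGLRLPLHPF+Q"))
```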


Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGACTACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCATTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCATCCAA
GAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGTGAAGAGGCGGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGAT
AGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCGAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTG
GGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTCTAACTCGGATTTTGTCTTCGTGCAGTTTCAATCCGACCAGTCCCCGAGCTTA
CGCAAGCCTCCTTCGACACCCATGGTTTGCGGGTTTGCGAGTATCGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGGAATCTGCCAC
TCCTGCTGTGGTAGGGCCAGCCTCGGAAAATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGA
CGTCTCGCCCTTGGGCGAGGAGCACGGTTCAGAATCGAGCCGTCAAGTTCTGGGTGAGGGACCAGGTGTCCCGCATCTCTGCTGCAAGTTTGGACCGCTGCCTCAGAAGA
GCGTCCAAATTTGCGTTTGTTGCTTCCATTCAATCGGCTCTAGTCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGAAGAAAGAGGAGTTCTCTGCTGC
CTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATG
CGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGAC
GCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCATCTAGGCCTAG
CGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTCCTC
AAGCAGGCTCTTAG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGACTACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCATTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCATCCAA
GAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGTGAAGAGGCGGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGAT
AGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCGAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTG
GGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTCTAACTCGGATTTTGTCTTCGTGCAGTTTCAATCCGACCAGTCCCCGAGCTTA
CGCAAGCCTCCTTCGACACCCATGGTTTGCGGGTTTGCGAGTATCGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGGAATCTGCCAC
TCCTGCTGTGGTAGGGCCAGCCTCGGAAAATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGA
CGTCTCGCCCTTGGGCGAGGAGCACGGTTCAGAATCGAGCCGTCAAGTTCTGGGTGAGGGACCAGGTGTCCCGCATCTCTGCTGCAAGTTTGGACCGCTGCCTCAGAAGA
GCGTCCAAATTTGCGTTTGTTGCTTCCATTCAATCGGCTCTAGTCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGAAGAAAGAGGAGTTCTCTGCTGC
CTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATG
CGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGAC
GCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCATCTAGGCCTAG
CGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTCCTC
AAGCAGGCTCTTAG
Protein sequence
MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLDYPSRIPEHYLGSLRRGFAIPENILIRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFIQ
EFLFRTGLAPAQVAPNGEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDESGRSFFDVPTRSNSDFVFVQFQSDQSPSL
RKPPSTPMVCGFASIVKRKSKGRAHALEAAQSSESATPAVVGPASENPAPVIELESSGVLRGRSAPGIRPRRWTSRPWARSTVQNRAVKFWVRDQVSRISAASLDRCLRR
ASKFAFVASIQSALVVKAELDGREALAARKKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSD
AGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASRPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
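
The protein sequence corresponds to the CDS read three bases at a time. As a minimal sketch of that relationship, the snippet below translates a nucleotide string with the standard genetic code (NCBI translation table 1), stopping at the first stop codon. Using the standard table for this gene is an assumption, and `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCGTCCTCTTTTAGC"))  # first six codons of the CDS
print(translate("GGCTCTTAG"))           # last three codons of the CDS
```

The two calls give `MSSSFS` and `GS`, matching the start and end of the protein sequence listed above.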