CuGenDBv2

MS017759 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS017759
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Unknown protein
Genome location: scaffold373:2268915..2270337
RNA-Seq Expression: MS017759
Synteny: MS017759
Gene Ontology terms:
  GO:0006464 - cellular protein modification process (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016740 - transferase activity (molecular function)
  GO:0140096 - catalytic activity, acting on a protein (molecular function)
InterPro domains: NA
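
The genome location field packs a scaffold name and two coordinates into one string. A minimal sketch of how it could be parsed is shown here; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold373:2268915..2270337")
print(scaffold, span_length(start, end))  # scaffold373 1423
```

Under that assumption the gene spans 1423 bp on scaffold373.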


Homology
GenBank top hits
XP_004145897.1 uncharacterized protein LOC101214743 [Cucumis sativus] | e value: 6.7e-104 | % identity: 78.09
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGP C DISLP EQE++HKE  D K G    G   R+ AAAFLS RQLNALAVV+IFSASGMVCAEDL FV+FS+ YMYF+SRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA +GLFLP+ YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA ND FSTPIRVF+PVFYNSRRIFT+ EWLR+EFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

XP_008437515.1 PREDICTED: uncharacterized protein LOC103482907 [Cucumis melo] | e value: 5.0e-107 | % identity: 81.27
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLP EQE++HKE  D K      GG  R+ AAAFLS RQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA IGLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVF+PVFYNSRRIFT+ EWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

XP_022156297.1 uncharacterized protein LOC111023219 [Momordica charantia] | e value: 6.0e-137 | % identity: 99.6
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
        VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVF+PVFYNSRRIFTIAEWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

XP_023551347.1 uncharacterized protein LOC111809194 [Cucurbita pepo subsp. pepo] | e value: 1.1e-95 | % identity: 71.92
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRK------SGAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFP
        MSGGVGPTCSDISLP+EQE LHKE CD K       G    GG  +K    AAAFLSF+QLNALAVVVIFSASGMVCAEDLAFV+FS+ YMYFISRVAFP
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRK------SGAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFP

Query:  ASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLR
           G  +  VFS  ++++LRLY      +G FLPI YILEGFFE+DKEGIKAASPHVFLLASQ FMEGVA NDRFSTPIRVF+PV YN+RR+FT+ EWLR
Subjt:  ASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLR

Query:  DEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        DEFAKEDKE+SGSVRR ++GR LAV NMA+WSFNLFG LLP+Y+P+AL RYY S+ KSKD
Subjt:  DEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

XP_038906981.1 uncharacterized protein LOC120092829 [Benincasa hispida] | e value: 9.4e-106 | % identity: 80
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSG----AVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGG
        MSGGVGPTCSDISLP EQE+LHKE  D K+      +  GG  R+ AAAFLSFRQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  G  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSG----AVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGG

Query:  EDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAK
         +  VFSP ++K+LRLYV  AA +GLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA  DRFSTPIRVF+PVFYNSRRIFT+AEWLRDEF K
Subjt:  EDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAK

Query:  EDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        EDKE+SGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  EDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
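
The % identity column can be related to the alignment midline (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention, which this page does not spell out: the midline repeats the residue letter where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block from its midline.

    ASSUMPTION: letters in the midline mark identical columns; '+' and
    spaces mark non-identical ones. The query line gives the alignment
    length, since trailing spaces of the midline may have been stripped.
    """
    if not query:
        raise ValueError("empty alignment")
    padded = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# Short excerpt in the style of the blocks above: one '+' in 16 columns.
print(percent_identity("STPIRVFIPVFYNSRR", "STPIRVF+PVFYNSRR"))  # 93.75
```

For a full hit, the Query and midline segments of all blocks would be concatenated before calling the function.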

TrEMBL top hits
A0A0A0KK26 Uncharacterized protein | e value: 3.3e-104 | % identity: 78.09
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGP C DISLP EQE++HKE  D K G    G   R+ AAAFLS RQLNALAVV+IFSASGMVCAEDL FV+FS+ YMYF+SRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA +GLFLP+ YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA ND FSTPIRVF+PVFYNSRRIFT+ EWLR+EFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

A0A1S3AUS6 uncharacterized protein LOC103482907 | e value: 2.4e-107 | % identity: 81.27
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLP EQE++HKE  D K      GG  R+ AAAFLS RQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA IGLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVF+PVFYNSRRIFT+ EWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

A0A5D3C472 Uncharacterized protein | e value: 2.4e-107 | % identity: 81.27
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLP EQE++HKE  D K      GG  R+ AAAFLS RQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA IGLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVF+PVFYNSRRIFT+ EWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

A0A6J1DRN6 uncharacterized protein LOC111023219 | e value: 2.9e-137 | % identity: 99.6
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
        VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVF+PVFYNSRRIFTIAEWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

A0A6J1I7I0 uncharacterized protein LOC111471524 | e value: 7.3e-96 | % identity: 71.65
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRK-------SGAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAF
        MSGGVGPTCSDISLP+EQE LHKE CD K        G    GG  RK    AAAFLSF+QLNALAVVVIFSASGMVCAEDLAFV+FS+ YMYFISRVAF
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRK-------SGAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAF

Query:  PASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWL
        P   G  +  VFS  ++++LRLY      +G FLPI YILEGFFE+DKEGIKAASPHVFLLASQ FMEGVA NDRFSTPIRVF+PV YN+RR+FT+ EWL
Subjt:  PASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWL

Query:  RDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        RDEFAKEDKE+SGSVRR ++GR LAV NMA+WSFNLFG LLP+Y+P+A  RYY S+ KSKD
Subjt:  RDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G27990.1 unknown protein | e value: 1.6e-31 | % identity: 37.56
Query:  QLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFL
        +L  +A +++FSASG+V   D+ F  F+  Y+  +SR+AFP+ G          G SK+ RLYV+    IGLFLP+ Y+L GF   D   +++A+PH+FL
Subjt:  QLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFL

Query:  LASQVFMEGVANN-DRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKEFSGS---VRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYY
        L+ Q+  E V +    FS P+R  +P+ Y   RIF I  W +D +  +    + +   V     GR LA+AN+  +  NL  FL+P +LPRA  +Y+
Subjt:  LASQVFMEGVANN-DRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKEFSGS---VRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYY

AT5G23920.1 unknown protein | e value: 4.4e-37 | % identity: 39.5
Query:  RQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMY-FISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHV
        RQL  L+ +++ +A G+V   ++AFV+    Y+Y F+SR AFP     +   + +P ++K+ + Y +  A IGL  P+ YI +G +  D  G  AA+PH+
Subjt:  RQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMY-FISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHV

Query:  FLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHK
        FLL+ Q F E +  +D++S PI +  PVFYN+RRIF + +W++ EF+  D +  G   RL  GR +A  N  +W +NLFG LLPV+LPR+   Y++  +K
Subjt:  FLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHK

AT5G52420.1 unknown protein | e value: 1.1e-64 | % identity: 48.16
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPT +DI+LP E+E  H+    + +  V   G   KP A F SFRQLN LA++++ SASG+V  +D  F + ++ Y +F+S++ FP         
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE
          +   +K+ R+YV  A  +GL +PI YI EG  EDDK G+ AA+PHVFLLASQ+FMEG+A    FS P R+ +P+ YN+RR+ T+ EW+  EF++ED  
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNS
         + S RR+  G+ LA AN+ +WSFNLFG L+PVYLPRA  RYY S
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNS
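
To pick the closest Arabidopsis homolog from the three hits listed above, the hits can be ranked by e value, where a smaller value indicates a more significant match. This is only an illustrative sketch; the `Hit` type is hypothetical, and the numbers are copied from the hit lines of this record.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str
    e_value: float
    percent_identity: float


# Values copied from the Arabidopsis top hits of this record.
hits = [
    Hit("AT1G27990.1", 1.6e-31, 37.56),
    Hit("AT5G23920.1", 4.4e-37, 39.5),
    Hit("AT5G52420.1", 1.1e-64, 48.16),
]

# Smaller e value first, so the most significant hit leads the list.
ranked = sorted(hits, key=lambda hit: hit.e_value)
print(ranked[0].accession)  # AT5G52420.1
```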


Sequences
CDS sequence
ATGTCCGGCGGGGTGGGCCCTACCTGCAGCGACATCAGCCTCCCGAGCGAACAGGAGACCCTCCACAAGGAGATCTGCGACCGAAAGTCCGGCGCCGTCGTCCATGGCGG
CGCGCGGAGGAAGCCGGCGGCGGCGTTCCTGTCTTTCCGGCAGCTGAACGCGCTGGCGGTGGTGGTGATATTCTCGGCGAGCGGGATGGTGTGCGCGGAGGACTTGGCCT
TCGTCCTCTTCTCCGTTGCTTACATGTACTTCATCTCGAGAGTGGCGTTCCCGGCGAGTGGCGGAGGGGAGGACGGAGGCGTGTTCAGCCCCGGCCAGAGCAAGGTGCTC
CGGCTGTACGTGATGCTGGCGGCGGCGATCGGGCTGTTCCTTCCGATCGGGTACATTCTAGAAGGATTCTTCGAGGACGATAAAGAAGGGATTAAAGCCGCTTCTCCCCA
TGTTTTTCTTCTCGCCAGCCAGGTTTTCATGGAAGGCGTGGCAAACAACGACAGATTCTCGACGCCGATCCGCGTGTTCATACCGGTGTTCTACAACTCGAGGAGGATCT
TCACCATAGCGGAGTGGCTGCGGGACGAGTTCGCGAAGGAAGACAAAGAGTTCAGCGGGTCTGTGCGGCGGCTGCTGGTCGGAAGAGCGCTCGCGGTGGCGAACATGGCG
CTTTGGAGCTTCAATCTCTTCGGGTTCTTGTTGCCTGTGTACCTCCCAAGAGCTCTCAACAGGTACTATAATTCTCTGCACAAGTCCAAAGAT
mRNA sequence
ATGTCCGGCGGGGTGGGCCCTACCTGCAGCGACATCAGCCTCCCGAGCGAACAGGAGACCCTCCACAAGGAGATCTGCGACCGAAAGTCCGGCGCCGTCGTCCATGGCGG
CGCGCGGAGGAAGCCGGCGGCGGCGTTCCTGTCTTTCCGGCAGCTGAACGCGCTGGCGGTGGTGGTGATATTCTCGGCGAGCGGGATGGTGTGCGCGGAGGACTTGGCCT
TCGTCCTCTTCTCCGTTGCTTACATGTACTTCATCTCGAGAGTGGCGTTCCCGGCGAGTGGCGGAGGGGAGGACGGAGGCGTGTTCAGCCCCGGCCAGAGCAAGGTGCTC
CGGCTGTACGTGATGCTGGCGGCGGCGATCGGGCTGTTCCTTCCGATCGGGTACATTCTAGAAGGATTCTTCGAGGACGATAAAGAAGGGATTAAAGCCGCTTCTCCCCA
TGTTTTTCTTCTCGCCAGCCAGGTTTTCATGGAAGGCGTGGCAAACAACGACAGATTCTCGACGCCGATCCGCGTGTTCATACCGGTGTTCTACAACTCGAGGAGGATCT
TCACCATAGCGGAGTGGCTGCGGGACGAGTTCGCGAAGGAAGACAAAGAGTTCAGCGGGTCTGTGCGGCGGCTGCTGGTCGGAAGAGCGCTCGCGGTGGCGAACATGGCG
CTTTGGAGCTTCAATCTCTTCGGGTTCTTGTTGCCTGTGTACCTCCCAAGAGCTCTCAACAGGTACTATAATTCTCTGCACAAGTCCAAAGAT
Protein sequence
MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGGVFSPGQSKVL
RLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFIPVFYNSRRIFTIAEWLRDEFAKEDKEFSGSVRRLLVGRALAVANMA
LWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
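
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table) and uses a hypothetical helper, `translate`. It is demonstrated on the first and last 24 nucleotides of the CDS listed above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (ASSUMED; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGTCCGGCGGGGTGGGCCCTACC"))  # MSGGVGPT
print(translate("AATTCTCTGCACAAGTCCAAAGAT"))  # NSLHKSKD
```

Both fragments agree with the start and end of the protein sequence shown above. Passing the full CDS, line breaks included, would translate the whole gene the same way.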