CuGenDBv2

MC06g1892 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g1892
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC06:26340024..26341988
RNA-Seq Expression: MC06g1892
Synteny: MC06g1892
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
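
The genome location is written as chromosome:start..end. As a minimal sketch, the span of the gene on the chromosome could be computed as below. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC06:26340024..26341988")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end - start + 1
print(chrom, span)
```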


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004145897.1 uncharacterized protein LOC101214743 [Cucumis sativus] (e value: 2.86e-134; identity: 78.49%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGP C DISLP EQE++HKE  D K G    G   R+ AAAFLS RQLNALAVV+IFSASGMVCAEDL FV+FS+ YMYF+SRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA +GLFLP+ YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA ND FSTPIRVFVPVFYNSRRIFT+ EWLR+EFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

XP_008437515.1 PREDICTED: uncharacterized protein LOC103482907 [Cucumis melo] (e value: 2.06e-138; identity: 81.67%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLP EQE++HKE  D K G     G  R+ AAAFLS RQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA IGLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIFT+ EWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

XP_022156297.1 uncharacterized protein LOC111023219 [Momordica charantia] (e value: 1.61e-178; identity: 100%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
        VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKDH
        FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKDH
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKDH

XP_023551347.1 uncharacterized protein LOC111809194 [Cucurbita pepo subsp. pepo] (e value: 2.81e-123; identity: 72.31%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKS------GAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFP
        MSGGVGPTCSDISLP+EQE LHKE CD K       G    GG  +K    AAAFLSF+QLNALAVVVIFSASGMVCAEDLAFV+FS+ YMYFISRVAFP
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKS------GAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFP

Query:  ASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLR
           G  +  VFS  ++++LRLY      +G FLPI YILEGFFE+DKEGIKAASPHVFLLASQ FMEGVA NDRFSTPIRVFVPV YN+RR+FT+ EWLR
Subjt:  ASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLR

Query:  DEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        DEFAKEDKE+SGSVRR ++GR LAV NMA+WSFNLFG LLP+Y+P+AL RYY S+ KSKD
Subjt:  DEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

XP_038906981.1 uncharacterized protein LOC120092829 [Benincasa hispida] (e value: 1.34e-136; identity: 80.39%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGA----VVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGG
        MSGGVGPTCSDISLP EQE+LHKE  D K+      +  GG  R+ AAAFLSFRQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  G  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGA----VVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGG

Query:  EDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAK
         +  VFSP ++K+LRLYV  AA +GLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA  DRFSTPIRVFVPVFYNSRRIFT+AEWLRDEF K
Subjt:  EDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAK

Query:  EDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        EDKE+SGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  EDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KK26 Uncharacterized protein (e value: 1.38e-134; identity: 78.49%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGP C DISLP EQE++HKE  D K G    G   R+ AAAFLS RQLNALAVV+IFSASGMVCAEDL FV+FS+ YMYF+SRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA +GLFLP+ YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA ND FSTPIRVFVPVFYNSRRIFT+ EWLR+EFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

A0A1S3AUS6 uncharacterized protein LOC103482907 (e value: 9.96e-139; identity: 81.67%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLP EQE++HKE  D K G     G  R+ AAAFLS RQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA IGLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIFT+ EWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

A0A5D3C472 Uncharacterized protein (e value: 9.96e-139; identity: 81.67%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLP EQE++HKE  D K G     G  R+ AAAFLS RQLNALAVV+IFSASGMVCAEDLAFV+FS+ YMYFISRVAFP  GG  D  
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
        VF P +++VLRLYV+ AA IGLFLPI YILEGFFE+DKEGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIFT+ EWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        +SGSVRRL+VGRALAVANMALWSFNLFGFLLPVYLPRA  RYY SL+KSKD
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

A0A6J1DRN6 uncharacterized protein LOC111023219 (e value: 7.78e-179; identity: 100%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
        VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKDH
        FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKDH
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKDH

A0A6J1I7I0 uncharacterized protein LOC111471524 (e value: 2.00e-123; identity: 72.03%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKS-------GAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAF
        MSGGVGPTCSDISLP+EQE LHKE CD K        G    GG  RK    AAAFLSF+QLNALAVVVIFSASGMVCAEDLAFV+FS+ YMYFISRVAF
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKS-------GAVVHGGARRK---PAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAF

Query:  PASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWL
        P   G  +  VFS  ++++LRLY      +G FLPI YILEGFFE+DKEGIKAASPHVFLLASQ FMEGVA NDRFSTPIRVFVPV YN+RR+FT+ EWL
Subjt:  PASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWL

Query:  RDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD
        RDEFAKEDKE+SGSVRR ++GR LAV NMA+WSFNLFG LLP+Y+P+A  RYY S+ KSKD
Subjt:  RDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHKSKD

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G27990.1 unknown protein (e value: 1.3e-31; identity: 38.07%)
Query:  QLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFL
        +L  +A +++FSASG+V   D+ F  F+  Y+  +SR+AFP+ G          G SK+ RLYV+    IGLFLP+ Y+L GF   D   +++A+PH+FL
Subjt:  QLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFL

Query:  LASQVFMEGVANN-DRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKEFSGS---VRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYY
        L+ Q+  E V +    FS P+R  VP+ Y   RIF I  W +D +  +    + +   V     GR LA+AN+  +  NL  FL+P +LPRA  +Y+
Subjt:  LASQVFMEGVANN-DRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKEFSGS---VRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYY

AT5G23920.1 unknown protein (e value: 3.4e-37; identity: 39.5%)
Query:  RQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMY-FISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHV
        RQL  L+ +++ +A G+V   ++AFV+    Y+Y F+SR AFP     +   + +P ++K+ + Y +  A IGL  P+ YI +G +  D  G  AA+PH+
Subjt:  RQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMY-FISRVAFPASGGGEDGGVFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHV

Query:  FLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHK
        FLL+ Q F E +  +D++S PI +  PVFYN+RRIF + +W++ EF+  D +  G   RL  GR +A  N  +W +NLFG LLPV+LPR+   Y++  +K
Subjt:  FLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKEFSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNSLHK

AT5G52420.1 unknown protein (e value: 8.6e-65; identity: 48.57%)
Query:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG
        MSGGVGPT +DI+LP E+E  H+    + +  V   G   KP A F SFRQLN LA++++ SASG+V  +D  F + ++ Y +F+S++ FP         
Subjt:  MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGG

Query:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE
          +   +K+ R+YV  A  +GL +PI YI EG  EDDK G+ AA+PHVFLLASQ+FMEG+A    FS P R+ VP+ YN+RR+ T+ EW+  EF++ED  
Subjt:  VFSPGQSKVLRLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKE

Query:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNS
         + S RR+  G+ LA AN+ +WSFNLFG L+PVYLPRA  RYY S
Subjt:  FSGSVRRLLVGRALAVANMALWSFNLFGFLLPVYLPRALNRYYNS


Sequences
CDS sequence
ATGTCCGGCGGGGTGGGCCCTACCTGCAGCGACATCAGCCTCCCGAGCGAACAGGAGACCCTCCACAAGGAGATCTGCGACCGAAAGTCCGGCGCCGTCGTCCATGGCGG
CGCGCGGAGGAAGCCGGCGGCGGCGTTCCTGTCTTTCCGGCAGCTGAACGCGCTGGCGGTGGTGGTGATATTCTCGGCGAGCGGGATGGTGTGCGCGGAAGACTTGGCCT
TCGTCCTCTTCTCCGTTGCTTACATGTACTTCATCTCGAGAGTGGCGTTCCCGGCGAGTGGCGGAGGGGAGGACGGAGGCGTGTTCAGCCCCGGCCAGAGCAAGGTGCTC
CGGCTGTACGTGATGCTGGCGGCGGCGATCGGGCTGTTCCTTCCGATCGGGTACATTCTAGAAGGATTCTTCGAGGACGATAAAGAAGGGATTAAAGCCGCTTCTCCCCA
TGTTTTTCTTCTCGCCAGCCAGGTTTTCATGGAAGGCGTGGCAAACAACGACAGATTCTCGACGCCGATCCGCGTGTTCGTACCGGTGTTCTACAACTCGAGGAGGATCT
TCACCATAGCGGAGTGGCTGCGGGACGAGTTCGCGAAGGAAGACAAAGAGTTCAGCGGGTCTGTGCGGCGGCTGCTGGTCGGAAGAGCGCTCGCGGTGGCGAACATGGCG
CTTTGGAGCTTCAATCTCTTCGGGTTCTTGTTGCCTGTGTATCTCCCAAGAGCTCTCAACAGGTACTATAATTCTCTGCACAAGTCCAAAGATCATTGA
mRNA sequence
GTTTTGGTCTGCCCAGAAAAAAGGATCTCATCTCATTTTTCTCCTGACCAACGACAGTACCCATTCTCACAAATCTCAACCACGTGTTCGTTTCCACCAACCTGACGCCA
CGTCCCCACCGTCACCCTCGATCCCTCACAGCCGGCCACTTGGCTATTTCCCCCACCACACTCTCTTTAAACTCCCCTCCTTCAACCTCCTCCTTCTGAACCCCTCTCTC
TCTCTGACTCTGTAATTTCTCTCTCTCTAGAAAATAAAAATGTCCGGCGGGGTGGGCCCTACCTGCAGCGACATCAGCCTCCCGAGCGAACAGGAGACCCTCCACAAGGA
GATCTGCGACCGAAAGTCCGGCGCCGTCGTCCATGGCGGCGCGCGGAGGAAGCCGGCGGCGGCGTTCCTGTCTTTCCGGCAGCTGAACGCGCTGGCGGTGGTGGTGATAT
TCTCGGCGAGCGGGATGGTGTGCGCGGAAGACTTGGCCTTCGTCCTCTTCTCCGTTGCTTACATGTACTTCATCTCGAGAGTGGCGTTCCCGGCGAGTGGCGGAGGGGAG
GACGGAGGCGTGTTCAGCCCCGGCCAGAGCAAGGTGCTCCGGCTGTACGTGATGCTGGCGGCGGCGATCGGGCTGTTCCTTCCGATCGGGTACATTCTAGAAGGATTCTT
CGAGGACGATAAAGAAGGGATTAAAGCCGCTTCTCCCCATGTTTTTCTTCTCGCCAGCCAGGTTTTCATGGAAGGCGTGGCAAACAACGACAGATTCTCGACGCCGATCC
GCGTGTTCGTACCGGTGTTCTACAACTCGAGGAGGATCTTCACCATAGCGGAGTGGCTGCGGGACGAGTTCGCGAAGGAAGACAAAGAGTTCAGCGGGTCTGTGCGGCGG
CTGCTGGTCGGAAGAGCGCTCGCGGTGGCGAACATGGCGCTTTGGAGCTTCAATCTCTTCGGGTTCTTGTTGCCTGTGTATCTCCCAAGAGCTCTCAACAGGTACTATAA
TTCTCTGCACAAGTCCAAAGATCATTGAGGGATCAGGACAAGAAGACCATGTTTGGTTCTTATTCACAAAATACTTATTAACCCATAGCAATATGTTTGAATAGTTTGGA
GTAATTTTATTTGTAATGGGAGTTTTACTTGCAAGTTTCTAGCTTTACTGCTTAGTTAATTAGATCATGTGTCCATGTAATATGGATAATTATCGATGGAAAAGAAAAGT
TCCTCGTTCAAATGTAGGGTTTTAATTTCCAACCTCTATGAAAGAGTAAATATGTCATTTATATATCACTAAGCTATTTTCATATTTCGTC
Protein sequence
MSGGVGPTCSDISLPSEQETLHKEICDRKSGAVVHGGARRKPAAAFLSFRQLNALAVVVIFSASGMVCAEDLAFVLFSVAYMYFISRVAFPASGGGEDGGVFSPGQSKVL
RLYVMLAAAIGLFLPIGYILEGFFEDDKEGIKAASPHVFLLASQVFMEGVANNDRFSTPIRVFVPVFYNSRRIFTIAEWLRDEFAKEDKEFSGSVRRLLVGRALAVANMA
LWSFNLFGFLLPVYLPRALNRYYNSLHKSKDH
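
The CDS and protein sequences listed for this gene can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the function name `translate` is hypothetical. It translates codon by codon and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last four codons of the CDS listed above.
print(translate("ATGTCCGGCGGGGTGGGC"))
print(translate("AAAGATCATTGA"))
```

Passing the full CDS, with its line breaks left in, should give a string that can be compared directly with the protein sequence above once its line breaks are removed.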