; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g05800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g05800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:4866641..4867705
RNA-Seq ExpressionMoc07g05800
SyntenyMoc07g05800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.3e-6844.76Show/hide
Query:  DEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKA-NGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR
        +E +KV  C+D+ AM YF TGL+D  LT++     P +  ++L +A++ IDG E  +   G      G+ R  K       ++ D+GS        + GR
Subjt:  DEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKA-NGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR

Query:  RDERALSD--RRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERA
         + R   +   R   +++FTP    I++I    +++ +E L    EKL     +R K  YCRFH++H H+TS  + L  Q+E+LI+ GY KK+VG    +
Subjt:  RDERALSD--RRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERA

Query:  EPEGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLV
          E   ++E+R+R+  P R+ DRPAVIN + GGPSGGQSG+KRK LA  A  EVC    + P   I FD  D E VHLPHNDALVIA LIDHV V RVLV
Subjt:  EPEGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLV

Query:  DGGASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE
        DGG SANILS  TY  LGW R  LK++PTPLVGF+GE+V  EG I LPV +G+
Subjt:  DGGASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]2.3e-9778.57Show/hide
Query:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR
        MDEHVKVVSCTD+IAMMYFTTGL+DRNLTIEF SRPP SLNKMLARARQYIDGLE WKA GA+RSSRGKDRD++SSPPKK  +DDQ SSR+A D+++RG+
Subjt:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR

Query:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP
         DER  SDR GPKFDKFTPLNAS+A+IYA  ++TD++ALF A +KL RPSGKRDKRLYCRFHKDH H++S CFHL EQV+DLIRRGYLKKYVGSRERA+P
Subjt:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP

Query:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSG
        EGS REEKRER+  P RKEDRPAVIN +HGGPSG +SG
Subjt:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSG

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.2e-6645.4Show/hide
Query:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKAN-GAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDRRGPKF
        M YF TGL+D  LT++     P +  ++L +A++ IDG E  +   G  RS +  +     S  K + ++ +   RRA++   R R             +
Subjt:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKAN-GAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDRRGPKF

Query:  DKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERTPL
        ++FTP    I++I    +++ +E L    EKL     +R K  YCRFH++H H+TS  + L  Q+EDLI+ GY KK+VG    +  E   ++E+R+R+  
Subjt:  DKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERTPL

Query:  PRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTYTT
        P R+ DRPAVIN + GGPSGGQSG KRK LA  A  EVC    + P   I FD  D   VHLPHNDALVIA LIDHV V RVLVDGGASANILS  TY  
Subjt:  PRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTYTT

Query:  LGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE
        LGW R  LK++PTPLVGF+GE+V  EGCI LPV +G+
Subjt:  LGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE

XP_022154797.1 uncharacterized protein LOC111021964 [Momordica charantia]2.9e-6846.61Show/hide
Query:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWK-ANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDR--RGP
        M YF TGL+D  LT++     PT+  ++L +A++ IDG E  +   G      G+ R  K       ++ D+GS        + GR + R + +R  R  
Subjt:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWK-ANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDR--RGP

Query:  KFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERT
         +++FTP    I++I    +++ +E     LEKL     +R K  YCRFH++H H+TS C+ L  Q+EDLI+ GY KK+VG    +  E   ++E R+R+
Subjt:  KFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERT

Query:  PLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTY
          P R  DRPAVIN + GGPSGGQSG KRK LA  A  EVC    + P   I FD  D E VHLPHNDALVIA LIDHV V RVLVDGGASANILS  TY
Subjt:  PLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTY

Query:  TTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE
          LGW R  LK++PTPLVGF+GE+V  EGCI LPV  G+
Subjt:  TTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]5.0e-15379.89Show/hide
Query:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR
        MDEHVKVVSCTD+IAMMYFTTGL+DRNLTIEF SRPP SLN+M ARARQYIDGLE WKANGA+RSSRG+DRD KS P KK   DD+ SSRRADD+K+R R
Subjt:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR

Query:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP
        RDER  S+RRGPKFDKFTPLNASIA+IYA  +DTD+E LFA+ EKL RPSGKR+KRLYCRFHKDH HDTS CFHL EQVEDLIR GYLKKYVGSRE+AE 
Subjt:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP

Query:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDG
        EGSAREEKRER+  PR KEDRPAVIN +HGGPSG +SGQKRKALA E AHEVCTSYPK PV+ ILFDEQD E VH+PHNDALVIA LIDHVKV RV VDG
Subjt:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDG

Query:  GASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGEGD
        GASANI SFSTYT LGWER+HLK   T LVGFA E+VSTEGCISLPV + EG+
Subjt:  GASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGEGD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.3e-6944.76Show/hide
Query:  DEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKA-NGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR
        +E +KV  C+D+ AM YF TGL+D  LT++     P +  ++L +A++ IDG E  +   G      G+ R  K       ++ D+GS        + GR
Subjt:  DEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKA-NGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR

Query:  RDERALSD--RRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERA
         + R   +   R   +++FTP    I++I    +++ +E L    EKL     +R K  YCRFH++H H+TS  + L  Q+E+LI+ GY KK+VG    +
Subjt:  RDERALSD--RRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERA

Query:  EPEGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLV
          E   ++E+R+R+  P R+ DRPAVIN + GGPSGGQSG+KRK LA  A  EVC    + P   I FD  D E VHLPHNDALVIA LIDHV V RVLV
Subjt:  EPEGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLV

Query:  DGGASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE
        DGG SANILS  TY  LGW R  LK++PTPLVGF+GE+V  EG I LPV +G+
Subjt:  DGGASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE

A0A6J1D5T3 uncharacterized protein LOC1110175481.1e-9778.57Show/hide
Query:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR
        MDEHVKVVSCTD+IAMMYFTTGL+DRNLTIEF SRPP SLNKMLARARQYIDGLE WKA GA+RSSRGKDRD++SSPPKK  +DDQ SSR+A D+++RG+
Subjt:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR

Query:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP
         DER  SDR GPKFDKFTPLNAS+A+IYA  ++TD++ALF A +KL RPSGKRDKRLYCRFHKDH H++S CFHL EQV+DLIRRGYLKKYVGSRERA+P
Subjt:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP

Query:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSG
        EGS REEKRER+  P RKEDRPAVIN +HGGPSG +SG
Subjt:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSG

A0A6J1DD03 uncharacterized protein LOC1110198995.9e-6745.4Show/hide
Query:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKAN-GAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDRRGPKF
        M YF TGL+D  LT++     P +  ++L +A++ IDG E  +   G  RS +  +     S  K + ++ +   RRA++   R R             +
Subjt:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKAN-GAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDRRGPKF

Query:  DKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERTPL
        ++FTP    I++I    +++ +E L    EKL     +R K  YCRFH++H H+TS  + L  Q+EDLI+ GY KK+VG    +  E   ++E+R+R+  
Subjt:  DKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERTPL

Query:  PRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTYTT
        P R+ DRPAVIN + GGPSGGQSG KRK LA  A  EVC    + P   I FD  D   VHLPHNDALVIA LIDHV V RVLVDGGASANILS  TY  
Subjt:  PRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTYTT

Query:  LGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE
        LGW R  LK++PTPLVGF+GE+V  EGCI LPV +G+
Subjt:  LGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE

A0A6J1DMN7 uncharacterized protein LOC1110219641.4e-6846.61Show/hide
Query:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWK-ANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDR--RGP
        M YF TGL+D  LT++     PT+  ++L +A++ IDG E  +   G      G+ R  K       ++ D+GS        + GR + R + +R  R  
Subjt:  MMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWK-ANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDR--RGP

Query:  KFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERT
         +++FTP    I++I    +++ +E     LEKL     +R K  YCRFH++H H+TS C+ L  Q+EDLI+ GY KK+VG    +  E   ++E R+R+
Subjt:  KFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERT

Query:  PLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTY
          P R  DRPAVIN + GGPSGGQSG KRK LA  A  EVC    + P   I FD  D E VHLPHNDALVIA LIDHV V RVLVDGGASANILS  TY
Subjt:  PLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTY

Query:  TTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE
          LGW R  LK++PTPLVGF+GE+V  EGCI LPV  G+
Subjt:  TTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGE

A0A6J1E0L8 uncharacterized protein LOC1110253102.4e-15379.89Show/hide
Query:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR
        MDEHVKVVSCTD+IAMMYFTTGL+DRNLTIEF SRPP SLN+M ARARQYIDGLE WKANGA+RSSRG+DRD KS P KK   DD+ SSRRADD+K+R R
Subjt:  MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGR

Query:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP
        RDER  S+RRGPKFDKFTPLNASIA+IYA  +DTD+E LFA+ EKL RPSGKR+KRLYCRFHKDH HDTS CFHL EQVEDLIR GYLKKYVGSRE+AE 
Subjt:  RDERALSDRRGPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEP

Query:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDG
        EGSAREEKRER+  PR KEDRPAVIN +HGGPSG +SGQKRKALA E AHEVCTSYPK PV+ ILFDEQD E VH+PHNDALVIA LIDHVKV RV VDG
Subjt:  EGSAREEKRERTPLPRRKEDRPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDG

Query:  GASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGEGD
        GASANI SFSTYT LGWER+HLK   T LVGFA E+VSTEGCISLPV + EG+
Subjt:  GASANILSFSTYTTLGWERKHLKRTPTPLVGFAGETVSTEGCISLPVIVGEGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAGCATGTCAAGGTGGTAAGTTGCACCGACAACATCGCCATGATGTACTTCACAACGGGGTTAAGTGACAGAAACCTGACGATCGAGTTCGAAAGTCGTCCGCC
GACCTCACTGAACAAAATGCTCGCCCGAGCTCGACAGTACATTGATGGCCTGGAGAGGTGGAAGGCGAACGGAGCCAAGCGGAGCAGCCGCGGTAAAGATCGGGACCGAA
AGTCCTCTCCTCCCAAGAAGAATCGTGCTGATGATCAGGGCTCGTCTCGACGGGCCGACGACAACAAGAATAGAGGCCGTCGCGACGAGAGAGCGCTTTCAGACCGTCGA
GGGCCGAAGTTTGACAAGTTCACTCCACTGAACGCTTCGATCGCGAAAATCTACGCGGCAGCAAAAGATACCGACCTGGAAGCGCTGTTCGCAGCCCTAGAGAAACTCTG
CCGACCTTCAGGGAAGCGAGACAAGCGACTCTACTGCCGATTCCACAAGGATCACGACCACGACACCTCTTGTTGCTTTCATTTGAATGAGCAAGTCGAGGATTTAATCC
GAAGAGGTTATTTGAAAAAGTACGTCGGCAGTCGAGAACGGGCTGAGCCAGAAGGTTCAGCTCGGGAAGAGAAGCGAGAGAGAACACCGCTCCCCAGGCGGAAGGAAGAT
CGTCCTGCAGTGATAAATATCGTCCATGGGGGCCCGAGTGGGGGACAATCAGGGCAGAAGAGAAAAGCTCTGGCTCATGAGGCAGCACACGAGGTTTGTACCTCGTACCC
CAAGGAGCCTGTGGTGCTGATCTTGTTTGACGAGCAGGATAGCGAAGGAGTGCACCTGCCTCATAACGACGCTCTGGTGATCGCCTCACTGATAGACCACGTGAAGGTCG
GAAGAGTTCTTGTTGATGGCGGAGCGTCAGCTAATATACTGTCCTTCTCGACCTACACGACCCTAGGGTGGGAGAGGAAACATTTGAAGCGCACCCCAACTCCTTTGGTC
GGCTTTGCCGGGGAGACAGTTAGCACAGAAGGATGCATCTCACTCCCTGTCATTGTCGGCGAAGGAGATCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGAGCATGTCAAGGTGGTAAGTTGCACCGACAACATCGCCATGATGTACTTCACAACGGGGTTAAGTGACAGAAACCTGACGATCGAGTTCGAAAGTCGTCCGCC
GACCTCACTGAACAAAATGCTCGCCCGAGCTCGACAGTACATTGATGGCCTGGAGAGGTGGAAGGCGAACGGAGCCAAGCGGAGCAGCCGCGGTAAAGATCGGGACCGAA
AGTCCTCTCCTCCCAAGAAGAATCGTGCTGATGATCAGGGCTCGTCTCGACGGGCCGACGACAACAAGAATAGAGGCCGTCGCGACGAGAGAGCGCTTTCAGACCGTCGA
GGGCCGAAGTTTGACAAGTTCACTCCACTGAACGCTTCGATCGCGAAAATCTACGCGGCAGCAAAAGATACCGACCTGGAAGCGCTGTTCGCAGCCCTAGAGAAACTCTG
CCGACCTTCAGGGAAGCGAGACAAGCGACTCTACTGCCGATTCCACAAGGATCACGACCACGACACCTCTTGTTGCTTTCATTTGAATGAGCAAGTCGAGGATTTAATCC
GAAGAGGTTATTTGAAAAAGTACGTCGGCAGTCGAGAACGGGCTGAGCCAGAAGGTTCAGCTCGGGAAGAGAAGCGAGAGAGAACACCGCTCCCCAGGCGGAAGGAAGAT
CGTCCTGCAGTGATAAATATCGTCCATGGGGGCCCGAGTGGGGGACAATCAGGGCAGAAGAGAAAAGCTCTGGCTCATGAGGCAGCACACGAGGTTTGTACCTCGTACCC
CAAGGAGCCTGTGGTGCTGATCTTGTTTGACGAGCAGGATAGCGAAGGAGTGCACCTGCCTCATAACGACGCTCTGGTGATCGCCTCACTGATAGACCACGTGAAGGTCG
GAAGAGTTCTTGTTGATGGCGGAGCGTCAGCTAATATACTGTCCTTCTCGACCTACACGACCCTAGGGTGGGAGAGGAAACATTTGAAGCGCACCCCAACTCCTTTGGTC
GGCTTTGCCGGGGAGACAGTTAGCACAGAAGGATGCATCTCACTCCCTGTCATTGTCGGCGAAGGAGATCAGTAG
Protein sequenceShow/hide protein sequence
MDEHVKVVSCTDNIAMMYFTTGLSDRNLTIEFESRPPTSLNKMLARARQYIDGLERWKANGAKRSSRGKDRDRKSSPPKKNRADDQGSSRRADDNKNRGRRDERALSDRR
GPKFDKFTPLNASIAKIYAAAKDTDLEALFAALEKLCRPSGKRDKRLYCRFHKDHDHDTSCCFHLNEQVEDLIRRGYLKKYVGSRERAEPEGSAREEKRERTPLPRRKED
RPAVINIVHGGPSGGQSGQKRKALAHEAAHEVCTSYPKEPVVLILFDEQDSEGVHLPHNDALVIASLIDHVKVGRVLVDGGASANILSFSTYTTLGWERKHLKRTPTPLV
GFAGETVSTEGCISLPVIVGEGDQ