; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g14930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g14930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr10:11350641..11356118
RNA-Seq ExpressionMoc10g14930
SyntenyMoc10g14930
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.5e-4932.21Show/hide
Query:  RYEVDLLRDQFQKEIENIKRQCRPVD-PYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---
        R E D LR Q   ++E +K +C   + P    +  E PF+  +++API P+FKAPT                                       TG   
Subjt:  RYEVDLLRDQFQKEIENIKRQCRPVD-PYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---

Query:  -----LEIRSLT------------WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------
             L   S++            + SR   K   +HL T++Q++ E+L EY+ RF +E +KV                                     
Subjt:  -----LEIRSLT------------WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------

Query:  -----YIDGLELWKANGAR------RSNRGKDRDQKSPLPKKQR--VDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYL
              IDG EL +    R      R   GKD +   P  K +      R+  RRA +  +R R             ++RFTP    I+EI    E++ +
Subjt:  -----YIDGLELWKANGAR------RSNRGKDRDQKSPLPKKQR--VDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYL

Query:  EALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGH
        E L   PEKLR    +R K  YC+FH++HGH+T+  + LK Q+E+LI+ GY KK+VG    +  E   +EE++    PPR +   PAVINTI GGPSGG 
Subjt:  EALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGH

Query:  EG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEM
         G                                   +HLPHNDALVIAPLIDHV V RVLVDG  SA+ILS  TY  LGW    LK++  PLVGF+GE 
Subjt:  EG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEM

Query:  VSVEGY
        V  EG+
Subjt:  VSVEGY

XP_022148920.1 uncharacterized protein LOC111017470 [Momordica charantia]3.0e-4742.4Show/hide
Query:  EHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE
        +  K  IDG EL +    R     K  DQK    +K++ D +   + ++ + SR  +        +   ++R+TP    I+EI    E++ +E L   PE
Subjt:  EHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE

Query:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGG---------
        KL+    KR+K  YC+FH+DH H+TT C+ LK Q+E LI+ GY KK+VG   +       ++EKR+ +  P  ++  PAVINTI GGPSGG         
Subjt:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGG---------

Query:  -----HEGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGL
              EG+HLPHNDALVIAPLIDHV V+RVLVDG ASA+ILS  TY  LGW    LK++  PL GF+ E VS+EG   C+ L
Subjt:  -----HEGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGL

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]3.1e-8464.93Show/hide
Query:  SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV------------------------------------------YIDGLELWKANGARRSNRGKD
        +RQLLKLPPSHL TVKQRDNESLTEYIAR MDEHVKV                                          YIDGLELWKA GARRS+RGKD
Subjt:  SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV------------------------------------------YIDGLELWKANGARRSNRGKD

Query:  RDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTT
        RDQ+S  PKK+  DD+SSSR+A D++SR +  E+  SDR GPKFD+FTPLNAS+AEIYA  E+T ++ALF AP+KL RPSGKRDKRLYC+FHKDHGH+++
Subjt:  RDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTT

Query:  RCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG
        RCFHLKEQV+DLIRRGYLKKYVGSRERA+PEG+TREEKRE + PP  KE  PAVINTIHGGPSG   G
Subjt:  RCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.8e-4531.39Show/hide
Query:  RYEVDLLRDQFQKEIENIKRQC-RPVDPYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---
        R E D L+ +F  ++E +K +C +    +   +  E  FS  I++A I P+FK PT                                       TG   
Subjt:  RYEVDLLRDQFQKEIENIKRQC-RPVDPYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---

Query:  LEIRSLTWR-----------------SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------
        L  R L  R                 SR   +  P+HL T++Q++ E+L EY+ RF +E +KV                                     
Subjt:  LEIRSLTWR-----------------SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------

Query:  -----YIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE
              IDG EL +    R     K+ DQ      K + D +S  +  + + SR  +     S  Q   ++ +TP    I EI    E+T +E L   PE
Subjt:  -----YIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE

Query:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIH---------------
        KLR    KR+   YC+FH+DHGH+T+  + LK Q+EDLI+ GY KK+VG + R+       E KR  TPP R  +  PAVIN                  
Subjt:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIH---------------

Query:  ----GGPSGGH---EGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQG
               +  H   EG+HLPHNDALVIAPLID V VRR+LVDG ASA+ILS STY  LGW    LK++  PLVGF+GE +S+EG   C+ L   +R +  
Subjt:  ----GGPSGGH---EGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQG

Query:  CLVVGSSKSLAFGRDVFTA-EALALLHSLHVV
         +   +   +  GR  + A     ++HS   V
Subjt:  CLVVGSSKSLAFGRDVFTA-EALALLHSLHVV

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]7.4e-10260.89Show/hide
Query:  TTGLEIRSLT--WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSR
        TTGL  R+LT  +RSR     PP+           SL E  AR      + YIDGLELWKANGARRS+RG+DRD KSP  KK+  DDRSSSRRA+D+KSR
Subjt:  TTGLEIRSLT--WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSR

Query:  ERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERA
         R  E+  S+R+GPKFD+FTPLNASIAEIYA  EDT +E LFA+PEKLRRPSGKR+KRLYC+FHKDHGHDT+RCFHLKEQVEDLIR GYLKKYVGSRE+A
Subjt:  ERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERA

Query:  EPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLV
        E EG+ REEKRE + PPR KE  PAVINTIHGGPSG   G                                   +H+PHNDALVIAPLIDHVKVRRV V
Subjt:  EPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLV

Query:  DGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQ
        DG ASA+I SFSTYT LGWE RHLK     LVGFA E VS EG   C+ L   + + +
Subjt:  DGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.2e-4932.21Show/hide
Query:  RYEVDLLRDQFQKEIENIKRQCRPVD-PYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---
        R E D LR Q   ++E +K +C   + P    +  E PF+  +++API P+FKAPT                                       TG   
Subjt:  RYEVDLLRDQFQKEIENIKRQCRPVD-PYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---

Query:  -----LEIRSLT------------WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------
             L   S++            + SR   K   +HL T++Q++ E+L EY+ RF +E +KV                                     
Subjt:  -----LEIRSLT------------WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------

Query:  -----YIDGLELWKANGAR------RSNRGKDRDQKSPLPKKQR--VDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYL
              IDG EL +    R      R   GKD +   P  K +      R+  RRA +  +R R             ++RFTP    I+EI    E++ +
Subjt:  -----YIDGLELWKANGAR------RSNRGKDRDQKSPLPKKQR--VDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYL

Query:  EALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGH
        E L   PEKLR    +R K  YC+FH++HGH+T+  + LK Q+E+LI+ GY KK+VG    +  E   +EE++    PPR +   PAVINTI GGPSGG 
Subjt:  EALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGH

Query:  EG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEM
         G                                   +HLPHNDALVIAPLIDHV V RVLVDG  SA+ILS  TY  LGW    LK++  PLVGF+GE 
Subjt:  EG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEM

Query:  VSVEGY
        V  EG+
Subjt:  VSVEGY

A0A6J1D4A4 uncharacterized protein LOC1110174701.5e-4742.4Show/hide
Query:  EHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE
        +  K  IDG EL +    R     K  DQK    +K++ D +   + ++ + SR  +        +   ++R+TP    I+EI    E++ +E L   PE
Subjt:  EHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE

Query:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGG---------
        KL+    KR+K  YC+FH+DH H+TT C+ LK Q+E LI+ GY KK+VG   +       ++EKR+ +  P  ++  PAVINTI GGPSGG         
Subjt:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGG---------

Query:  -----HEGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGL
              EG+HLPHNDALVIAPLIDHV V+RVLVDG ASA+ILS  TY  LGW    LK++  PL GF+ E VS+EG   C+ L
Subjt:  -----HEGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGL

A0A6J1D5T3 uncharacterized protein LOC1110175481.5e-8464.93Show/hide
Query:  SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV------------------------------------------YIDGLELWKANGARRSNRGKD
        +RQLLKLPPSHL TVKQRDNESLTEYIAR MDEHVKV                                          YIDGLELWKA GARRS+RGKD
Subjt:  SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV------------------------------------------YIDGLELWKANGARRSNRGKD

Query:  RDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTT
        RDQ+S  PKK+  DD+SSSR+A D++SR +  E+  SDR GPKFD+FTPLNAS+AEIYA  E+T ++ALF AP+KL RPSGKRDKRLYC+FHKDHGH+++
Subjt:  RDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTT

Query:  RCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG
        RCFHLKEQV+DLIRRGYLKKYVGSRERA+PEG+TREEKRE + PP  KE  PAVINTIHGGPSG   G
Subjt:  RCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG

A0A6J1DHB3 uncharacterized protein LOC1110204791.4e-4531.39Show/hide
Query:  RYEVDLLRDQFQKEIENIKRQC-RPVDPYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---
        R E D L+ +F  ++E +K +C +    +   +  E  FS  I++A I P+FK PT                                       TG   
Subjt:  RYEVDLLRDQFQKEIENIKRQC-RPVDPYRVAEQQEPPFSQAIVDAPIAPRFKAPT---------------------------------------TG---

Query:  LEIRSLTWR-----------------SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------
        L  R L  R                 SR   +  P+HL T++Q++ E+L EY+ RF +E +KV                                     
Subjt:  LEIRSLTWR-----------------SRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKV-------------------------------------

Query:  -----YIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE
              IDG EL +    R     K+ DQ      K + D +S  +  + + SR  +     S  Q   ++ +TP    I EI    E+T +E L   PE
Subjt:  -----YIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPE

Query:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIH---------------
        KLR    KR+   YC+FH+DHGH+T+  + LK Q+EDLI+ GY KK+VG + R+       E KR  TPP R  +  PAVIN                  
Subjt:  KLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIH---------------

Query:  ----GGPSGGH---EGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQG
               +  H   EG+HLPHNDALVIAPLID V VRR+LVDG ASA+ILS STY  LGW    LK++  PLVGF+GE +S+EG   C+ L   +R +  
Subjt:  ----GGPSGGH---EGMHLPHNDALVIAPLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQG

Query:  CLVVGSSKSLAFGRDVFTA-EALALLHSLHVV
         +   +   +  GR  + A     ++HS   V
Subjt:  CLVVGSSKSLAFGRDVFTA-EALALLHSLHVV

A0A6J1E0L8 uncharacterized protein LOC1110253103.6e-10260.89Show/hide
Query:  TTGLEIRSLT--WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSR
        TTGL  R+LT  +RSR     PP+           SL E  AR      + YIDGLELWKANGARRS+RG+DRD KSP  KK+  DDRSSSRRA+D+KSR
Subjt:  TTGLEIRSLT--WRSRQLLKLPPSHLGTVKQRDNESLTEYIARFMDEHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSR

Query:  ERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERA
         R  E+  S+R+GPKFD+FTPLNASIAEIYA  EDT +E LFA+PEKLRRPSGKR+KRLYC+FHKDHGHDT+RCFHLKEQVEDLIR GYLKKYVGSRE+A
Subjt:  ERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALFAAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERA

Query:  EPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLV
        E EG+ REEKRE + PPR KE  PAVINTIHGGPSG   G                                   +H+PHNDALVIAPLIDHVKVRRV V
Subjt:  EPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEG-----------------------------------MHLPHNDALVIAPLIDHVKVRRVLV

Query:  DGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQ
        DG ASA+I SFSTYT LGWE RHLK     LVGFA E VS EG   C+ L   + + +
Subjt:  DGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCAAACCCGCTCCCAACAATCACGATCCCAAGTTTCACCAATCCGCTCTTCGACAAGGGATGCTGCCACTTCCCGTGCCCATCATCGCCTCACGAACTCTCAGGT
GGCCGGGACTCCTGTCGTCGAACGACGACCCCAGGCGGGGGCGGTCAAGAAAAATGGAGGTCAGCCGGCCACATTCGATCCCGTCGCAGTTCGGGACTTCCATCTCACCT
CAGATCAGTTTCCGCCACTACAGCTTCAGAGGAACGGGTTGCTGCCCCCCGCACCTCGTCTCCGCGGCTGGGGGAACACAGGTGCACGCTCCGGAGTGAGTGCTGACGCA
GGTGTGGACCCCGTCATAGTAGCTGACGTGATCGCCGAGCTTACGGAAGTCAAGGCGAGGCTCGAAGCGGTCGAAAGAGGCAACGAGATGTCCGACTCCTCCGTCTCTAG
GGATCCCATTCGAGAGAAAGAGCCGATGCATCCAACTCAAAGAACGGAATATCAGTTCCGATCTCGCAGGGAGGCCCGAGCTGAGGACAATCAGGTGGAGGATCGCCGCC
CGAGGGTCCGACCAATTCGGACTCCCCTGGCATCGTTTGATAGCTACAATGCCCAACAGGGCCGAGGTGAAGGGCCACCAAGGCGACGAGGGGTGGCGCCCAATGATCGG
GAGTATTGGACCGACCACAAGGAGGGAAGCCTAGAGGTCGACGATCGGGAGAGGTCATTCCAGGGTGATCATTCGTTTCGGTACGAAGTGGACCTCCTCCGAGACCAATT
TCAGAAGGAGATAGAAAATATCAAGCGGCAGTGCAGGCCTGTAGATCCCTATCGTGTGGCCGAGCAACAGGAGCCGCCTTTCTCCCAAGCAATCGTGGACGCACCTATCG
CACCAAGGTTCAAAGCTCCTACGACGGGTCTAGAGATCCGATCTCTTACGTGGAGGTCTCGGCAGTTGCTGAAGTTGCCGCCCTCTCACCTCGGAACAGTGAAGCAACGA
GACAATGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGATGAGCATGTCAAGGTGTACATTGATGGCCTGGAGCTGTGGAAGGCCAACGGAGCCAGGCGGAGCAACCG
CGGTAAAGATCGGGACCAAAAGTCTCCTCTTCCCAAGAAGCAACGTGTTGATGATAGGAGCTCGTCTCGGCGGGCCAACGACAACAAGAGCCGAGAACGCCACGGTGAGA
AAGCCCCTTCAGACCGTCAGGGGCCGAAGTTTGACAGGTTCACTCCGCTGAACGCCTCAATCGCGGAGATCTACGCAGCAGCTGAAGATACCTACCTGGAGGCGCTGTTC
GCAGCCCCAGAAAAGCTCCGCCGACCTTCGGGGAAGCGGGACAAGCGGCTCTACTGCAAATTCCACAAGGATCACGGCCATGACACCACCCGTTGCTTTCACTTAAAGGA
GCAAGTTGAGGATCTGATCCGAAGAGGATATTTGAAGAAGTACGTTGGCAGCAGAGAAAGAGCCGAGCCAGAGGGAACAACTCGGGAGGAGAAGCGAGAGGGGACCCCGC
CGCCCAGATGGAAGGAAGGTCCTCCCGCAGTAATAAATACCATTCATGGGGGCCCAAGTGGGGGGCATGAGGGAATGCACCTGCCTCATAACGACGCCCTGGTGATCGCC
CCACTAATAGACCACGTGAAGGTTAGAAGAGTGCTTGTTGATGGCAGAGCGTCGGCTGATATATTGTCCTTCTCGACCTACACGACCCTAGGATGGGAGATGAGACATTT
GAAGCGCAACTTGATGCCTTTGGTCGGCTTTGCCGGGGAGATGGTTAGCGTGGAAGGATATGGAAACTGTGTGGGTCTGGTGGCGATAATGAGAGATAATCAAGGTTGTT
TGGTGGTTGGTTCTTCAAAGTCGTTGGCCTTTGGGAGAGATGTGTTTACTGCAGAAGCCTTAGCTCTTCTCCACAGTTTGCATGTAGTAGTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCAAACCCGCTCCCAACAATCACGATCCCAAGTTTCACCAATCCGCTCTTCGACAAGGGATGCTGCCACTTCCCGTGCCCATCATCGCCTCACGAACTCTCAGGT
GGCCGGGACTCCTGTCGTCGAACGACGACCCCAGGCGGGGGCGGTCAAGAAAAATGGAGGTCAGCCGGCCACATTCGATCCCGTCGCAGTTCGGGACTTCCATCTCACCT
CAGATCAGTTTCCGCCACTACAGCTTCAGAGGAACGGGTTGCTGCCCCCCGCACCTCGTCTCCGCGGCTGGGGGAACACAGGTGCACGCTCCGGAGTGAGTGCTGACGCA
GGTGTGGACCCCGTCATAGTAGCTGACGTGATCGCCGAGCTTACGGAAGTCAAGGCGAGGCTCGAAGCGGTCGAAAGAGGCAACGAGATGTCCGACTCCTCCGTCTCTAG
GGATCCCATTCGAGAGAAAGAGCCGATGCATCCAACTCAAAGAACGGAATATCAGTTCCGATCTCGCAGGGAGGCCCGAGCTGAGGACAATCAGGTGGAGGATCGCCGCC
CGAGGGTCCGACCAATTCGGACTCCCCTGGCATCGTTTGATAGCTACAATGCCCAACAGGGCCGAGGTGAAGGGCCACCAAGGCGACGAGGGGTGGCGCCCAATGATCGG
GAGTATTGGACCGACCACAAGGAGGGAAGCCTAGAGGTCGACGATCGGGAGAGGTCATTCCAGGGTGATCATTCGTTTCGGTACGAAGTGGACCTCCTCCGAGACCAATT
TCAGAAGGAGATAGAAAATATCAAGCGGCAGTGCAGGCCTGTAGATCCCTATCGTGTGGCCGAGCAACAGGAGCCGCCTTTCTCCCAAGCAATCGTGGACGCACCTATCG
CACCAAGGTTCAAAGCTCCTACGACGGGTCTAGAGATCCGATCTCTTACGTGGAGGTCTCGGCAGTTGCTGAAGTTGCCGCCCTCTCACCTCGGAACAGTGAAGCAACGA
GACAATGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGATGAGCATGTCAAGGTGTACATTGATGGCCTGGAGCTGTGGAAGGCCAACGGAGCCAGGCGGAGCAACCG
CGGTAAAGATCGGGACCAAAAGTCTCCTCTTCCCAAGAAGCAACGTGTTGATGATAGGAGCTCGTCTCGGCGGGCCAACGACAACAAGAGCCGAGAACGCCACGGTGAGA
AAGCCCCTTCAGACCGTCAGGGGCCGAAGTTTGACAGGTTCACTCCGCTGAACGCCTCAATCGCGGAGATCTACGCAGCAGCTGAAGATACCTACCTGGAGGCGCTGTTC
GCAGCCCCAGAAAAGCTCCGCCGACCTTCGGGGAAGCGGGACAAGCGGCTCTACTGCAAATTCCACAAGGATCACGGCCATGACACCACCCGTTGCTTTCACTTAAAGGA
GCAAGTTGAGGATCTGATCCGAAGAGGATATTTGAAGAAGTACGTTGGCAGCAGAGAAAGAGCCGAGCCAGAGGGAACAACTCGGGAGGAGAAGCGAGAGGGGACCCCGC
CGCCCAGATGGAAGGAAGGTCCTCCCGCAGTAATAAATACCATTCATGGGGGCCCAAGTGGGGGGCATGAGGGAATGCACCTGCCTCATAACGACGCCCTGGTGATCGCC
CCACTAATAGACCACGTGAAGGTTAGAAGAGTGCTTGTTGATGGCAGAGCGTCGGCTGATATATTGTCCTTCTCGACCTACACGACCCTAGGATGGGAGATGAGACATTT
GAAGCGCAACTTGATGCCTTTGGTCGGCTTTGCCGGGGAGATGGTTAGCGTGGAAGGATATGGAAACTGTGTGGGTCTGGTGGCGATAATGAGAGATAATCAAGGTTGTT
TGGTGGTTGGTTCTTCAAAGTCGTTGGCCTTTGGGAGAGATGTGTTTACTGCAGAAGCCTTAGCTCTTCTCCACAGTTTGCATGTAGTAGTTGAGTAG
Protein sequenceShow/hide protein sequence
MAQTRSQQSRSQVSPIRSSTRDAATSRAHHRLTNSQVAGTPVVERRPQAGAVKKNGGQPATFDPVAVRDFHLTSDQFPPLQLQRNGLLPPAPRLRGWGNTGARSGVSADA
GVDPVIVADVIAELTEVKARLEAVERGNEMSDSSVSRDPIREKEPMHPTQRTEYQFRSRREARAEDNQVEDRRPRVRPIRTPLASFDSYNAQQGRGEGPPRRRGVAPNDR
EYWTDHKEGSLEVDDRERSFQGDHSFRYEVDLLRDQFQKEIENIKRQCRPVDPYRVAEQQEPPFSQAIVDAPIAPRFKAPTTGLEIRSLTWRSRQLLKLPPSHLGTVKQR
DNESLTEYIARFMDEHVKVYIDGLELWKANGARRSNRGKDRDQKSPLPKKQRVDDRSSSRRANDNKSRERHGEKAPSDRQGPKFDRFTPLNASIAEIYAAAEDTYLEALF
AAPEKLRRPSGKRDKRLYCKFHKDHGHDTTRCFHLKEQVEDLIRRGYLKKYVGSRERAEPEGTTREEKREGTPPPRWKEGPPAVINTIHGGPSGGHEGMHLPHNDALVIA
PLIDHVKVRRVLVDGRASADILSFSTYTTLGWEMRHLKRNLMPLVGFAGEMVSVEGYGNCVGLVAIMRDNQGCLVVGSSKSLAFGRDVFTAEALALLHSLHVVVE