; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g11100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g11100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr5:8683393..8685569
RNA-Seq ExpressionMoc05g11100
SyntenyMoc05g11100
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.3e-8945.23Show/hide
Query:  ALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQ
        A P    + E  P  +RRKKKK  +  EVGA  VLPA FADRVDDP ARMGGTSDV ARFR++PSSSGVRDQVSRISAASLDRCLRRASKFV+ PGSVL 
Subjt:  ALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQ

Query:  RTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET-------------------------------
        R IDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVE+                               
Subjt:  RTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------KAELLRKEEGKHKAHLRAAHAITKGLEKEK-----------------
                                                             KAELL++E+ +HKAHLRAAHAITKGLEKEK                 
Subjt:  -----------------------------------------------------KAELLRKEEGKHKAHLRAAHAITKGLEKEK-----------------

Query:  -----------------------------QHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDY
                                     QHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDY
Subjt:  -----------------------------QHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDY

Query:  SDLEED--------QVGTTQEGAP
        SDL+ED        +VGTTQEG P
Subjt:  SDLEED--------QVGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.3e-9872.28Show/hide
Query:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   ++A+ R++PSSSGVRDQVSRISAASLDRCLRRASKFV+ PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD
        ELLKAHSEVE LKAEVE++AELL+KEE + +A LRAAHAIT+GLE+EK                                              QHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]6.4e-12672.39Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRISKEGERADNPPEGWVALYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+ +EGERADNPPEGWV LYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRISKEGERADNPPEGWVALYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNVWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA--------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPN WGVIFALAILFWLRARD+EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA                    
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNVWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA--------------------

Query:  -----------------------CVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNPAP----------------WFA------
                                VSIRPVPELT ASFDTLKYYKERF RGRKVGTLVTD+LLLESGLLDYNPA                  FA      
Subjt:  -----------------------CVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNPAP----------------WFA------

Query:  ---------AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVD
                 AAQSSKPATPAVVGPASEDPA VIELESSGGPSREK PRDQTE VD
Subjt:  ---------AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]1.9e-8585.91Show/hide
Query:  AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTS
        AAQSSKPATPAV GPASEDPAPVIELESSGGPSREK PRDQTE VDALPLGEEVREEVPLKRRRKKKKT +PLEVGA GVLPASFADRVDDPEARMGGTS
Subjt:  AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTS

Query:  DVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELL
        DV ARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFV+DPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFS ALEA     KD+  
Subjt:  DVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELL

Query:  KAHSEVEILKAEVETKAELL
            E+E   AE+ET  E L
Subjt:  KAHSEVEILKAEVETKAELL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.4e-12556.36Show/hide
Query:  FYMCARKGACVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNP-------------------------------APWFAAAQSS
        F +  R G  VSI+ +PEL  A+FDTLK+YK+ F R RK+ TLVTDKLLLESGLLDYNP                               A        +
Subjt:  FYMCARKGACVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNP-------------------------------APWFAAAQSS

Query:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMG
        +P TP V         GP+S  P PVIEL+ SGG S EK  R+++E +D  PL  EVR E PL+RRRKKKKT++  E GA G LP S AD VDDPEARM 
Subjt:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMG

Query:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        GTS+V  RF ++PSSSGV+DQVSRISA  LDR LRRASKFV+DPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K 
Subjt:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD
        ELLKA  EV+IL+AEV+ K +LL+KE  KHKAHLRAAHAITKGLEKEK                                              QHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP--QAGS
        GFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124676.1e-9045.23Show/hide
Query:  ALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQ
        A P    + E  P  +RRKKKK  +  EVGA  VLPA FADRVDDP ARMGGTSDV ARFR++PSSSGVRDQVSRISAASLDRCLRRASKFV+ PGSVL 
Subjt:  ALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQ

Query:  RTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET-------------------------------
        R IDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVE+                               
Subjt:  RTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------KAELLRKEEGKHKAHLRAAHAITKGLEKEK-----------------
                                                             KAELL++E+ +HKAHLRAAHAITKGLEKEK                 
Subjt:  -----------------------------------------------------KAELLRKEEGKHKAHLRAAHAITKGLEKEK-----------------

Query:  -----------------------------QHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDY
                                     QHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDY
Subjt:  -----------------------------QHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDY

Query:  SDLEED--------QVGTTQEGAP
        SDL+ED        +VGTTQEG P
Subjt:  SDLEED--------QVGTTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185381.6e-9872.28Show/hide
Query:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   ++A+ R++PSSSGVRDQVSRISAASLDRCLRRASKFV+ PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD
        ELLKAHSEVE LKAEVE++AELL+KEE + +A LRAAHAIT+GLE+EK                                              QHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255023.1e-12672.39Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRISKEGERADNPPEGWVALYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+ +EGERADNPPEGWV LYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRISKEGERADNPPEGWVALYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNVWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA--------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPN WGVIFALAILFWLRARD+EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA                    
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNVWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA--------------------

Query:  -----------------------CVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNPAP----------------WFA------
                                VSIRPVPELT ASFDTLKYYKERF RGRKVGTLVTD+LLLESGLLDYNPA                  FA      
Subjt:  -----------------------CVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNPAP----------------WFA------

Query:  ---------AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVD
                 AAQSSKPATPAVVGPASEDPA VIELESSGGPSREK PRDQTE VD
Subjt:  ---------AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVD

A0A6J1DXZ1 uncharacterized protein LOC1110256069.1e-8685.91Show/hide
Query:  AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTS
        AAQSSKPATPAV GPASEDPAPVIELESSGGPSREK PRDQTE VDALPLGEEVREEVPLKRRRKKKKT +PLEVGA GVLPASFADRVDDPEARMGGTS
Subjt:  AAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMGGTS

Query:  DVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELL
        DV ARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFV+DPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFS ALEA     KD+  
Subjt:  DVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELL

Query:  KAHSEVEILKAEVETKAELL
            E+E   AE+ET  E L
Subjt:  KAHSEVEILKAEVETKAELL

A0A6J1DZB3 uncharacterized protein LOC1110256656.9e-12656.36Show/hide
Query:  FYMCARKGACVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNP-------------------------------APWFAAAQSS
        F +  R G  VSI+ +PEL  A+FDTLK+YK+ F R RK+ TLVTDKLLLESGLLDYNP                               A        +
Subjt:  FYMCARKGACVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLLLESGLLDYNP-------------------------------APWFAAAQSS

Query:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMG
        +P TP V         GP+S  P PVIEL+ SGG S EK  R+++E +D  PL  EVR E PL+RRRKKKKT++  E GA G LP S AD VDDPEARM 
Subjt:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEARMG

Query:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        GTS+V  RF ++PSSSGV+DQVSRISA  LDR LRRASKFV+DPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K 
Subjt:  GTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD
        ELLKA  EV+IL+AEV+ K +LL+KE  KHKAHLRAAHAITKGLEKEK                                              QHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEK----------------------------------------------QHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP--QAGS
        GFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGCGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTTA
GGATTTCGAAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCGCTCTTTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGTGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAATGAAGA
GGCCGAGCTGTTAGATGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGCCGGTTCTACATGTGCGCAAGGAAAGGCGCATGCGTTTCAA
TCCGACCAGTCCCCGAACTTACGCCAGCCTCCTTCGATACGCTGAAATATTACAAGGAGCGCTTTTCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTG
CTTGAGTCCGGGCTGCTAGATTACAACCCCGCACCATGGTTTGCGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCTGTGGTAGGGCCAGCCTCGGAGGATCCAGCCCC
AGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGTTCCCCAGGGATCAGACCGAGACGGTGGACGCCTTGCCCCTGGGCGAGGAGGTGAGGGAGGAAGTCC
CTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCACCCCCTTGGAGGTCGGAGCTCATGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGG
ATGGGCGGGACGTCTGATGTGATGGCACGGTTCAGAGTTAAGCCGTCTAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTAGACCGCTGCCTAAG
GAGGGCGTCCAAATTTGTAACTGACCCAGGGTCTGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCG
AGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAG
GTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGTTGCTGAGGAAGGAAGAAGGCAAACACAAGGCCCACCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGA
GAAGGAGAAGCAACATCCTGACTTCGACGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCG
ATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGAC
TACTCCGACCTCGAAGAGGATCAGGTCGGCACCACCCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGCGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTTA
GGATTTCGAAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCGCTCTTTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGTGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAATGAAGA
GGCCGAGCTGTTAGATGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGCCGGTTCTACATGTGCGCAAGGAAAGGCGCATGCGTTTCAA
TCCGACCAGTCCCCGAACTTACGCCAGCCTCCTTCGATACGCTGAAATATTACAAGGAGCGCTTTTCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTG
CTTGAGTCCGGGCTGCTAGATTACAACCCCGCACCATGGTTTGCGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCTGTGGTAGGGCCAGCCTCGGAGGATCCAGCCCC
AGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGTTCCCCAGGGATCAGACCGAGACGGTGGACGCCTTGCCCCTGGGCGAGGAGGTGAGGGAGGAAGTCC
CTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCACCCCCTTGGAGGTCGGAGCTCATGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGG
ATGGGCGGGACGTCTGATGTGATGGCACGGTTCAGAGTTAAGCCGTCTAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTAGACCGCTGCCTAAG
GAGGGCGTCCAAATTTGTAACTGACCCAGGGTCTGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCG
AGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAG
GTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGTTGCTGAGGAAGGAAGAAGGCAAACACAAGGCCCACCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGA
GAAGGAGAAGCAACATCCTGACTTCGACGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCG
ATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGAC
TACTCCGACCTCGAAGAGGATCAGGTCGGCACCACCCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRISKEGERADNPPEGWVALYFKMFEYGLRLPLHPFVQ
EFLFRTGLAPAQVAPNVWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACVSIRPVPELTPASFDTLKYYKERFSRGRKVGTLVTDKLL
LESGLLDYNPAPWFAAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKFPRDQTETVDALPLGEEVREEVPLKRRRKKKKTTTPLEVGAHGVLPASFADRVDDPEAR
MGGTSDVMARFRVKPSSSGVRDQVSRISAASLDRCLRRASKFVTDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSE
VEILKAEVETKAELLRKEEGKHKAHLRAAHAITKGLEKEKQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSD
YSDLEEDQVGTTQEGAPQAGS