; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moctig00076g060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoctig00076g060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationtig00000076_pilon:31215..35658
RNA-Seq ExpressionMoctig00076g060
SyntenyMoctig00076g060
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001995 - Peptidase A2A, retrovirus, catalytic
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152029.1 uncharacterized protein LOC111019838 [Momordica charantia]2.9e-6650.86Show/hide
Query:  PKKQRSD-GKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P+K R D  KR+K  YCRFH+DHGH+TS  + LK Q+EDLI+ GY KK++G   +   E   + E+R++S+ P R+EDRPA+INTI GGPSGGQS  KR 
Subjt:  PKKQRSD-GKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
         LAR A  +                                         VRRVL+D GASANILS  TY  LGW R  LK S TPLVGF+GES++ EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAA
        + L VT G+   QVTK+AEFVVID  SAYNAI GRP+IH FRA+PST HQVLKY T  G+  + GEQ  SRECY +A+KG++ CA    A+
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAA

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]6.5e-7450Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P K + +  +R K  YCRFH++HGH+TS Y+ LK Q+EDLI+ GY KK++G   T+  E   ++E+R++S+ P R+ DRPA+INTI GGPSGGQSG KRK
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
         LAR A  E                                         VRRVL+D GASANILS  TY ALGW R  LK SPTPLVGF+GESV  EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAE---
        + L VT+G+   +VT++AEFVV+D  SAYNAI GRP+IH FRAIPST HQVLKY T  G+ T+ GEQ  SRECY + +KGT+ CA  T  +     E   
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAE---

Query:  --PEPSRGTPAEGLKLVPLLGPEKQVSV
          P      P E L+LVPLL  EKQV +
Subjt:  --PEPSRGTPAEGLKLVPLLGPEKQVSV

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia]3.2e-7349.85Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P K + +  +R K  YCRFH++HGH+TS  + LK Q+EDLI+ GY KK++G   T+  E   ++E+R++S+ P R+ DRPA+INTI GGPSGGQSG KRK
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
         LAR A  E                                         VRRVL++ GASANILS  TY ALGW R  L+ SPTPLVGF+GESV  EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVT---DAAEPCAAE
        + L VT+G+   ++T++AEFVV+D  S YNAI GRP+IH FRAIPST HQVLKYPT  G+ T+ GEQ  SRECY AA+KG + CA  T      E  A  
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVT---DAAEPCAAE

Query:  PEPSRGTPAEGLKLVPLLGPEKQVS
        P      P E L+LVPLL PEKQ++
Subjt:  PEPSRGTPAEGLKLVPLLGPEKQVS

XP_022156175.1 uncharacterized protein LOC111023128 [Momordica charantia]4.8e-6946.65Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAR-----EEKREKSQPPRRKE---DRPAIINTINGGPSG
        P K + S  KR+K  Y RFH+DHGHDTS+ F L++Q+E+LIR G+LKKY+G +++   +G  +      + ++KS   ++ E    RP +INTI GGPSG
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAR-----EEKREKSQPPRRKE---DRPAIINTINGGPSG

Query:  GQSGQKRKALAREAAHE----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGE
        GQSG KRKAL RE +HE                                        V+ VLID  AS NILS STY ALGWE+  LK  PTPLVGF+GE
Subjt:  GQSGQKRKALAREAAHE----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGE

Query:  SVSAEGCVSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAV--TDAA
         ++AEGC  L VTIGE D +V KV EFV++D +SAYNAI+GRP IH+ + +PSTYHQV+KYPT  G+  I GEQK SRECY  A+KGT T A +  T ++
Subjt:  SVSAEGCVSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAV--TDAA

Query:  EPCAAEP---EPSRGTPAEGLKLVPLLGPEKQVSVASKLGAEV
        E    +    E   GT  + LK + L   EK VS+ S L A++
Subjt:  EPCAAEP---EPSRGTPAEGLKLVPLLGPEKQVSVASKLGAEV

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]4.7e-7251.59Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P K +R  GKR+KRLYCRFHKDHGHDTSR FHLKEQVEDLIR GYLKKY+G RE AE EGSAREEKRE+SQPPR KEDRPA+INTI+GGPSG +SGQKRK
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
        ALARE AHE                                         VRRV +D GASANI SFSTYTALGWER+HLK   T LVGFA ESVS EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAEPEP
        +SL VTI EG+ QVT+VAEFVVIDRSSAY   +  P                                 S+ C T   +G A     +  AE        
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAEPEP

Query:  SRGTPAEGLKLVPLLGPEKQVSVASKLGA----EVPRSENSNADALA
                  LVPLLGP++QVS+ S+L A    E+ R   SN+D  A
Subjt:  SRGTPAEGLKLVPLLGPEKQVSVASKLGA----EVPRSENSNADALA

TrEMBL top hitse value%identityAlignment
A0A6J1DD03 uncharacterized protein LOC1110198993.2e-7450Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P K + +  +R K  YCRFH++HGH+TS Y+ LK Q+EDLI+ GY KK++G   T+  E   ++E+R++S+ P R+ DRPA+INTI GGPSGGQSG KRK
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
         LAR A  E                                         VRRVL+D GASANILS  TY ALGW R  LK SPTPLVGF+GESV  EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAE---
        + L VT+G+   +VT++AEFVV+D  SAYNAI GRP+IH FRAIPST HQVLKY T  G+ T+ GEQ  SRECY + +KGT+ CA  T  +     E   
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAE---

Query:  --PEPSRGTPAEGLKLVPLLGPEKQVSV
          P      P E L+LVPLL  EKQV +
Subjt:  --PEPSRGTPAEGLKLVPLLGPEKQVSV

A0A6J1DET8 uncharacterized protein LOC1110198381.4e-6650.86Show/hide
Query:  PKKQRSD-GKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P+K R D  KR+K  YCRFH+DHGH+TS  + LK Q+EDLI+ GY KK++G   +   E   + E+R++S+ P R+EDRPA+INTI GGPSGGQS  KR 
Subjt:  PKKQRSD-GKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
         LAR A  +                                         VRRVL+D GASANILS  TY  LGW R  LK S TPLVGF+GES++ EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAA
        + L VT G+   QVTK+AEFVVID  SAYNAI GRP+IH FRA+PST HQVLKY T  G+  + GEQ  SRECY +A+KG++ CA    A+
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAA

A0A6J1DPJ9 uncharacterized protein LOC1110231282.3e-6946.65Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAR-----EEKREKSQPPRRKE---DRPAIINTINGGPSG
        P K + S  KR+K  Y RFH+DHGHDTS+ F L++Q+E+LIR G+LKKY+G +++   +G  +      + ++KS   ++ E    RP +INTI GGPSG
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAR-----EEKREKSQPPRRKE---DRPAIINTINGGPSG

Query:  GQSGQKRKALAREAAHE----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGE
        GQSG KRKAL RE +HE                                        V+ VLID  AS NILS STY ALGWE+  LK  PTPLVGF+GE
Subjt:  GQSGQKRKALAREAAHE----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGE

Query:  SVSAEGCVSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAV--TDAA
         ++AEGC  L VTIGE D +V KV EFV++D +SAYNAI+GRP IH+ + +PSTYHQV+KYPT  G+  I GEQK SRECY  A+KGT T A +  T ++
Subjt:  SVSAEGCVSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAV--TDAA

Query:  EPCAAEP---EPSRGTPAEGLKLVPLLGPEKQVSVASKLGAEV
        E    +    E   GT  + LK + L   EK VS+ S L A++
Subjt:  EPCAAEP---EPSRGTPAEGLKLVPLLGPEKQVSVASKLGAEV

A0A6J1DT04 uncharacterized protein LOC1110228801.6e-7349.85Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P K + +  +R K  YCRFH++HGH+TS  + LK Q+EDLI+ GY KK++G   T+  E   ++E+R++S+ P R+ DRPA+INTI GGPSGGQSG KRK
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
         LAR A  E                                         VRRVL++ GASANILS  TY ALGW R  L+ SPTPLVGF+GESV  EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVT---DAAEPCAAE
        + L VT+G+   ++T++AEFVV+D  S YNAI GRP+IH FRAIPST HQVLKYPT  G+ T+ GEQ  SRECY AA+KG + CA  T      E  A  
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVT---DAAEPCAAE

Query:  PEPSRGTPAEGLKLVPLLGPEKQVS
        P      P E L+LVPLL PEKQ++
Subjt:  PEPSRGTPAEGLKLVPLLGPEKQVS

A0A6J1E0L8 uncharacterized protein LOC1110253102.3e-7251.59Show/hide
Query:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK
        P K +R  GKR+KRLYCRFHKDHGHDTSR FHLKEQVEDLIR GYLKKY+G RE AE EGSAREEKRE+SQPPR KEDRPA+INTI+GGPSG +SGQKRK
Subjt:  PPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETAEPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRK

Query:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC
        ALARE AHE                                         VRRV +D GASANI SFSTYTALGWER+HLK   T LVGFA ESVS EGC
Subjt:  ALAREAAHE-----------------------------------------VRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGC

Query:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAEPEP
        +SL VTI EG+ QVT+VAEFVVIDRSSAY   +  P                                 S+ C T   +G A     +  AE        
Subjt:  VSLSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAEPEP

Query:  SRGTPAEGLKLVPLLGPEKQVSVASKLGA----EVPRSENSNADALA
                  LVPLLGP++QVS+ S+L A    E+ R   SN+D  A
Subjt:  SRGTPAEGLKLVPLLGPEKQVSVASKLGA----EVPRSENSNADALA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGGCGACGAGTAGTGGCACCCGGAGATCGGGAGTATCTGGTTGACGATAAGGAGGAAAGGCCAGAGGTCGACGATCGAGAGAGGTCCTCCCACGCTGAC
CATTTGTTTCGGTCTAAAGTGGACCTCCTCCGAAAGTTCCCTCCTCCCAAGAAGCAACGAAGCGATGGGAAACGAGACAAGCGACTTTACTGCCGATTCCACAAG
GATCACGGCCACGACACTTCACGCTATTTCCACCTGAAGGAGCAGGTTGAGGATCTGATCCGGAGGGGTTATCTCAAAAAATACATCGGCTGGCGTGAAACGGCA
GAACCAGAGGGGTCGGCTCGGGAGGAGAAGCGAGAGAAGTCACAACCACCGAGACGGAAGGAAGATCGTCCCGCCATTATAAATACCATCAATGGGGGCCCGAGT
GGGGGACAGTCGGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACACGAGGTGAGAAGAGTTCTTATCGACAGTGGAGCGTCGGCTAATATCTTATCGTTC
TCGACCTACACGGCCCTGGGGTGGGAGAGGAAGCATTTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTTGCAGGGGAGTCAGTCAGCGCGGAAGGATGTGTCTCG
CTCTCTGTCACCATTGGCGAGGGAGATCAACAAGTAACTAAGGTTGCAGAATTTGTTGTGATAGATCGGAGCTCTGCGTACAACGCCATAATTGGTCGGCCTTTG
ATTCACGATTTCCGTGCAATTCCATCCACTTATCACCAGGTCTTGAAGTACCCCACCTCGACCGGAATTGCGACAATCCTGGGTGAGCAAAAGACGTCCAGAGAA
TGCTACACAGCCGCGATGAAGGGAACAGCCACTTGTGCAGCGGTCACGGACGCGGCAGAGCCATGTGCCGCCGAACCAGAGCCGAGCCGCGGTACCCCAGCCGAA
GGGCTAAAGCTTGTCCCCCTGTTGGGGCCAGAAAAGCAGGTCAGCGTTGCCAGCAAACTAGGGGCCGAGGTGCCGAGGTCTGAAAACTCCAATGCCGACGCACTG
GCTCGCCTAGCCTCGGCATACGAGACCGACCTACCGAGAACAGTTCCAGTTGAAATATTCGCTGAGTCGTCCATCGACCAGCCTGAGGTAATGGAGATCCAGTCA
GCTCAGCCTACACGGATGGACCCGATTAAGGACTTCCTGGTCAGTGGCTCAGTCCCTGTCGATCCGAGCCAGGCCAAAAAGCTCCGACGTCAAGCTGCTCACTAC
TTGATGCAAGAAGGCAAGATCTTCAAGAGAGGATATTCCCTACCATTACTGCAAGTAAAAGTCAGAACGTTCAAGCCTGGCGACCTCGTCCGCAAGAAGGTAATG
CAGCATGTCGGAGCACTCGAGCCGAACTGGGAAGGTCCGTACAAAGTGTTGAAAACACTCCGCCCTGGAGCGTATCTGCTGTCCGACCTCAATGGGAGACAACTC
CCTCACCCATGGAATGCAGAGCATTTGCGAGTTTATTATCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAGGCGACGAGTAGTGGCACCCGGAGATCGGGAGTATCTGGTTGACGATAAGGAGGAAAGGCCAGAGGTCGACGATCGAGAGAGGTCCTCCCACGCTGAC
CATTTGTTTCGGTCTAAAGTGGACCTCCTCCGAAAGTTCCCTCCTCCCAAGAAGCAACGAAGCGATGGGAAACGAGACAAGCGACTTTACTGCCGATTCCACAAG
GATCACGGCCACGACACTTCACGCTATTTCCACCTGAAGGAGCAGGTTGAGGATCTGATCCGGAGGGGTTATCTCAAAAAATACATCGGCTGGCGTGAAACGGCA
GAACCAGAGGGGTCGGCTCGGGAGGAGAAGCGAGAGAAGTCACAACCACCGAGACGGAAGGAAGATCGTCCCGCCATTATAAATACCATCAATGGGGGCCCGAGT
GGGGGACAGTCGGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACACGAGGTGAGAAGAGTTCTTATCGACAGTGGAGCGTCGGCTAATATCTTATCGTTC
TCGACCTACACGGCCCTGGGGTGGGAGAGGAAGCATTTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTTGCAGGGGAGTCAGTCAGCGCGGAAGGATGTGTCTCG
CTCTCTGTCACCATTGGCGAGGGAGATCAACAAGTAACTAAGGTTGCAGAATTTGTTGTGATAGATCGGAGCTCTGCGTACAACGCCATAATTGGTCGGCCTTTG
ATTCACGATTTCCGTGCAATTCCATCCACTTATCACCAGGTCTTGAAGTACCCCACCTCGACCGGAATTGCGACAATCCTGGGTGAGCAAAAGACGTCCAGAGAA
TGCTACACAGCCGCGATGAAGGGAACAGCCACTTGTGCAGCGGTCACGGACGCGGCAGAGCCATGTGCCGCCGAACCAGAGCCGAGCCGCGGTACCCCAGCCGAA
GGGCTAAAGCTTGTCCCCCTGTTGGGGCCAGAAAAGCAGGTCAGCGTTGCCAGCAAACTAGGGGCCGAGGTGCCGAGGTCTGAAAACTCCAATGCCGACGCACTG
GCTCGCCTAGCCTCGGCATACGAGACCGACCTACCGAGAACAGTTCCAGTTGAAATATTCGCTGAGTCGTCCATCGACCAGCCTGAGGTAATGGAGATCCAGTCA
GCTCAGCCTACACGGATGGACCCGATTAAGGACTTCCTGGTCAGTGGCTCAGTCCCTGTCGATCCGAGCCAGGCCAAAAAGCTCCGACGTCAAGCTGCTCACTAC
TTGATGCAAGAAGGCAAGATCTTCAAGAGAGGATATTCCCTACCATTACTGCAAGTAAAAGTCAGAACGTTCAAGCCTGGCGACCTCGTCCGCAAGAAGGTAATG
CAGCATGTCGGAGCACTCGAGCCGAACTGGGAAGGTCCGTACAAAGTGTTGAAAACACTCCGCCCTGGAGCGTATCTGCTGTCCGACCTCAATGGGAGACAACTC
CCTCACCCATGGAATGCAGAGCATTTGCGAGTTTATTATCAATGA
Protein sequenceShow/hide protein sequence
MPRRRVVAPGDREYLVDDKEERPEVDDRERSSHADHLFRSKVDLLRKFPPPKKQRSDGKRDKRLYCRFHKDHGHDTSRYFHLKEQVEDLIRRGYLKKYIGWRETA
EPEGSAREEKREKSQPPRRKEDRPAIINTINGGPSGGQSGQKRKALAREAAHEVRRVLIDSGASANILSFSTYTALGWERKHLKLSPTPLVGFAGESVSAEGCVS
LSVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDFRAIPSTYHQVLKYPTSTGIATILGEQKTSRECYTAAMKGTATCAAVTDAAEPCAAEPEPSRGTPAE
GLKLVPLLGPEKQVSVASKLGAEVPRSENSNADALARLASAYETDLPRTVPVEIFAESSIDQPEVMEIQSAQPTRMDPIKDFLVSGSVPVDPSQAKKLRRQAAHY
LMQEGKIFKRGYSLPLLQVKVRTFKPGDLVRKKVMQHVGALEPNWEGPYKVLKTLRPGAYLLSDLNGRQLPHPWNAEHLRVYYQ