CuGenDBv2

Moc04g17800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g17800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111020110
Genome location: chr4:13081402..13082580
RNA-Seq Expression: Moc04g17800
Synteny: Moc04g17800
Gene Ontology terms: GO:0015074 - DNA integration (biological process); GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
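
The genome location above is a compact `chromosome:start..end` string. As a minimal sketch, it can be split into its parts and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced here, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"Unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr4:13081402..13082580")
length = span_length(start, end)
print(chrom, start, end, length)
```

If the database uses a different coordinate convention (for example 0-based, half-open), the `+ 1` in `span_length` would need to be dropped.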


Homology
GenBank top hits (hit, e value, % identity, alignment)
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]; e value 1.7e-113; identity 80.07%
Query:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHG
        P  +  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVP P QLPFLERGPQAPQLNPNI                           R EHG
Subjt:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHG

Query:  GEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQE
        GE EKSH  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQE
Subjt:  GEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQE

Query:  PSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
        PSPLDTP EGAFCSHQ LPTGATGSTPLAT+EY T MATLPGVRDAH IPSN VNPL C TGR KVGENST
Subjt:  PSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]; e value 3.8e-174; identity 83.42%
Query:  HTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN----
        HTSLPTSSTQAPNATSIPF PLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMIN    
Subjt:  HTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN----

Query:  --------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNISR---------------
                            PASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRVPPP QL FLERGPQAPQLNPNI                 
Subjt:  --------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNISR---------------

Query:  -----------DEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
                   DEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  -----------DEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENST
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST

XP_022156936.1 uncharacterized protein LOC111023763 [Momordica charantia]; e value 5.0e-134; identity 77.61%
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQQLPT NIPQNSEFRAENPQQLPPMINP                        ASVHPYGMPNPSTLFPS PPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        VPPP QLPFLERGPQ PQLNPN                           +  EHGGEQEKSH  RLEP +SIGQKRKGKEVM DPEI EDGSSRRL PKD
Subjt:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
        SSME RDEEQFYSSPLIITP DGNDDFLLVSRG+CSNM E EDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPL TDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENST
        AHTIPSN VNPLGCDTGRSKVGENST
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENST

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]; e value 1.1e-178; identity 83.16%
Query:  MASSSQHTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPF PLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  IN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI-----------
        IN                        PASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+VPPP QLPFLERGPQAPQL+PNI           
Subjt:  IN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI-----------

Query:  ---------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
                       + DEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  ---------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia]; e value 7.9e-140; identity 80.67%
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMIN                        PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        VPPP QLPFLERGPQAPQLNPNI                          + DEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPS LDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENST
        AHTIPSN VNPLGCDTGRSKVGENST
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENST

TrEMBL top hits (hit, e value, % identity, alignment)
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515; e value 4.7e-114; identity 80.44%
Query:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHG
        P  +  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVP P QLPFLERGPQAPQLNPNI                           R EHG
Subjt:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHG

Query:  GEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQE
        GE EKSH  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQE
Subjt:  GEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQE

Query:  PSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
        PSPLDTP EGAFCSHQ LPTGATGSTPLATDEY T MATLPGVRDAH IPSN VNPL C TGR KVGENST
Subjt:  PSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110; e value 1.8e-174; identity 83.42%
Query:  HTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN----
        HTSLPTSSTQAPNATSIPF PLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMIN    
Subjt:  HTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN----

Query:  --------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNISR---------------
                            PASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRVPPP QL FLERGPQAPQLNPNI                 
Subjt:  --------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNISR---------------

Query:  -----------DEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
                   DEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  -----------DEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENST
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST

A0A6J1DWG3 uncharacterized protein LOC111023763; e value 2.4e-134; identity 77.61%
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQQLPT NIPQNSEFRAENPQQLPPMINP                        ASVHPYGMPNPSTLFPS PPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        VPPP QLPFLERGPQ PQLNPN                           +  EHGGEQEKSH  RLEP +SIGQKRKGKEVM DPEI EDGSSRRL PKD
Subjt:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
        SSME RDEEQFYSSPLIITP DGNDDFLLVSRG+CSNM E EDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPL TDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENST
        AHTIPSN VNPLGCDTGRSKVGENST
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENST

A0A6J1DX11 uncharacterized protein LOC111024860; e value 5.5e-179; identity 83.16%
Query:  MASSSQHTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPF PLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  IN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI-----------
        IN                        PASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+VPPP QLPFLERGPQAPQL+PNI           
Subjt:  IN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNI-----------

Query:  ---------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
                       + DEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  ---------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENST

A0A6J1DY94 uncharacterized protein LOC111025316; e value 3.8e-140; identity 80.67%
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMIN                        PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        VPPP QLPFLERGPQAPQLNPNI                          + DEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VPPPAQLPFLERGPQAPQLNPNI--------------------------SRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPS LDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENST
        AHTIPSN VNPLGCDTGRSKVGENST
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENST

SwissProt top hits (hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (hit, e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCCACTCAAGCTCCAAATGCCACAAGCATTCCTTTTTCACCATTGGAGAACTTCCAACACCACATGGGTTC
AGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCACTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTATGTTA
ATTTTCAGCAACTACCCACCTTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGCTAGTGTTCATCCTTATGGT
ATGCCAAATCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGGATGATTTCACCTAGAGTTCCCCCACCAGC
TCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATCGAGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCACAGCCATAGGCTAGAGC
CCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAG
AATAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCGGAAAC
AGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCATTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAA
CTGGATCTACTCCATTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGT
GATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTTAA
mRNA sequence
ATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCCACTCAAGCTCCAAATGCCACAAGCATTCCTTTTTCACCATTGGAGAACTTCCAACACCACATGGGTTC
AGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCACTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTATGTTA
ATTTTCAGCAACTACCCACCTTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGCTAGTGTTCATCCTTATGGT
ATGCCAAATCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGGATGATTTCACCTAGAGTTCCCCCACCAGC
TCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATCGAGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCACAGCCATAGGCTAGAGC
CCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAG
AATAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCGGAAAC
AGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCATTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAA
CTGGATCTACTCCATTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGT
GATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTTAA
Protein sequence
MASSSQHTSLPTSSTQAPNATSIPFSPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPASVHPYG
MPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVPPPAQLPFLERGPQAPQLNPNISRDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSME
NRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGC
DTGRSKVGENST
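
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, not part of any CuGenDB tooling. Only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so wrapped sequence
    lines can be passed in directly.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCTTCTTCTTCTCAACATACTTCACTT"  # first 30 nt of the CDS
cds_end = "GAGAATAGTACTTAA"                    # last 15 nt of the CDS
print(translate(cds_start))  # MASSSQHTSL
print(translate(cds_end))    # ENST
```

Both fragments agree with the start (`MASSSQHTSL`) and end (`ENST`) of the protein sequence above. Passing the full wrapped CDS text to `translate` would extend the same check to the whole sequence.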