; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g27870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g27870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111020110
Genome locationchr6:20941270..20942331
RNA-Seq ExpressionMoc06g27870
SyntenyMoc06g27870
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]1.1e-18994.81Show/hide
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDS+LAAVRGGNPLQTFECPPSQAPTQH IMSPHGYVNFQQLPTFNIPQ SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRVPPPPQLSFLERGPQAPQLN NILSTFNMG LKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRM

Query:  QIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
          PTNMPMD GDEHGGE EKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRR TPKDSSMEN+DEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  QIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        TEDTEN+VVRTDT EPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
Subjt:  TEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

XP_022156936.1 uncharacterized protein LOC111023763 [Momordica charantia]7.2e-14488.85Show/hide
Query:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR
        M+PHGYVNFQQLPT NIPQ SEFRAENPQQLPPMINPGMYQPFMFN VPSYHFPLSQMQI ASVHPYGMPNPSTLFPS PPYYGMGHWVPP NYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR

Query:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD
        VPPPPQL FLERGPQ PQLN N LSTFNMGHLKPLEPPRM IPTNMPMDAG EHGGE EKSH  RLEP +SIGQKRKGKEVM DPEI EDGSSRR  PKD
Subjt:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD

Query:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        SSME +DEEQFYSSPLIITP DGNDDFLLVSRG+CSNM E EDTEN+VVRTDT EPSPLDTPTEGAFCSHQELPTGATGS PL TDE
Subjt:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

XP_022157800.1 protein PAF1 homolog [Momordica charantia]2.1e-12786.31Show/hide
Query:  MVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYG
        MV D +LAAVRGGNPLQTFE PPSQAPTQH +MSPHGYVNFQQLP  NIPQ SEFRAE+PQQ PPMINPGMYQPF+FNPVP YHFPLSQMQIPASVHPYG
Subjt:  MVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYG

Query:  MPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEP
        MPNPSTLFPSLPPYY MG+WVPPQNYGMISPRVPPPPQL FLERGPQAPQLN NILSTFNMG LKPLEPP M IP N+PMD G EHGGE EKSH+ RLEP
Subjt:  MPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEP

Query:  GVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVS
         VSIGQKRKGKEV+ DPE E+DGSSRR TPKDS+MEN+DEEQFYSSPLIITP DGNDDFLLVS
Subjt:  GVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVS

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]8.6e-19092.63Show/hide
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDS+LAAVRGGNPLQTFECPPSQAPTQH IMSPHGYVNFQQLPT NIPQ SEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PPQNYGMISP+VPPPPQL FLERGPQAPQL+ NILST+NMG LKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKP

Query:  LEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRM  PTNMPMDAGDEHGGE EK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRR TPKDS+MEN+DEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        CSNMPETEDTEN+VVRTDT EPSPLDTPTEGAFCSHQELPTGATGS PLATDE
Subjt:  CSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia]1.8e-15092.33Show/hide
Query:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR
        M+PHGYVNFQ LPT NIPQ SEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPP NYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR

Query:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD
        VPPPPQL FLERGPQAPQLN NILSTFNMGHLKPLEP RM IPTNMPMD GDEHGGE EKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRR TPKD
Subjt:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD

Query:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        SSMEN+DEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTEN+VVRTDT EPS LDTPTEGAFCSHQELPTGATGS PLATDE
Subjt:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

TrEMBL top hitse value%identityAlignment
A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201105.5e-19094.81Show/hide
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDS+LAAVRGGNPLQTFECPPSQAPTQH IMSPHGYVNFQQLPTFNIPQ SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRVPPPPQLSFLERGPQAPQLN NILSTFNMG LKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRM

Query:  QIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
          PTNMPMD GDEHGGE EKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRR TPKDSSMEN+DEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  QIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        TEDTEN+VVRTDT EPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
Subjt:  TEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

A0A6J1DU32 protein PAF1 homolog1.0e-12786.31Show/hide
Query:  MVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYG
        MV D +LAAVRGGNPLQTFE PPSQAPTQH +MSPHGYVNFQQLP  NIPQ SEFRAE+PQQ PPMINPGMYQPF+FNPVP YHFPLSQMQIPASVHPYG
Subjt:  MVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYG

Query:  MPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEP
        MPNPSTLFPSLPPYY MG+WVPPQNYGMISPRVPPPPQL FLERGPQAPQLN NILSTFNMG LKPLEPP M IP N+PMD G EHGGE EKSH+ RLEP
Subjt:  MPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEP

Query:  GVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVS
         VSIGQKRKGKEV+ DPE E+DGSSRR TPKDS+MEN+DEEQFYSSPLIITP DGNDDFLLVS
Subjt:  GVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVS

A0A6J1DWG3 uncharacterized protein LOC1110237633.5e-14488.85Show/hide
Query:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR
        M+PHGYVNFQQLPT NIPQ SEFRAENPQQLPPMINPGMYQPFMFN VPSYHFPLSQMQI ASVHPYGMPNPSTLFPS PPYYGMGHWVPP NYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR

Query:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD
        VPPPPQL FLERGPQ PQLN N LSTFNMGHLKPLEPPRM IPTNMPMDAG EHGGE EKSH  RLEP +SIGQKRKGKEVM DPEI EDGSSRR  PKD
Subjt:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD

Query:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        SSME +DEEQFYSSPLIITP DGNDDFLLVSRG+CSNM E EDTEN+VVRTDT EPSPLDTPTEGAFCSHQELPTGATGS PL TDE
Subjt:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

A0A6J1DX11 uncharacterized protein LOC1110248604.2e-19092.63Show/hide
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDS+LAAVRGGNPLQTFECPPSQAPTQH IMSPHGYVNFQQLPT NIPQ SEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PPQNYGMISP+VPPPPQL FLERGPQAPQL+ NILST+NMG LKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKP

Query:  LEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRM  PTNMPMDAGDEHGGE EK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRR TPKDS+MEN+DEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        CSNMPETEDTEN+VVRTDT EPSPLDTPTEGAFCSHQELPTGATGS PLATDE
Subjt:  CSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

A0A6J1DY94 uncharacterized protein LOC1110253168.5e-15192.33Show/hide
Query:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR
        M+PHGYVNFQ LPT NIPQ SEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPP NYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPR

Query:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD
        VPPPPQL FLERGPQAPQLN NILSTFNMGHLKPLEP RM IPTNMPMD GDEHGGE EKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRR TPKD
Subjt:  VPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEHGGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKD

Query:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE
        SSMEN+DEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTEN+VVRTDT EPS LDTPTEGAFCSHQELPTGATGS PLATDE
Subjt:  SSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTEGAFCSHQELPTGATGSIPLATDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAGCTCCAAACGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGTTTC
AGATTCTAAGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCTTATAATGAGCCCACATGGTTATGTTA
ATTTTCAGCAACTACCCACCTTTAATATACCTCAAATCAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATG
TTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCCCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACC
TTATTATGGAATGGGTCATTGGGTACCTCCACAAAATTATGGGATGATTTCACCTAGGGTTCCCCCACCACCTCAACTTTCATTCCTAGAAAGAGGACCTCAAGCACCAC
AATTGAACTCTAACATATTGAGTACCTTCAACATGGGACACCTAAAACCCTTAGAGCCTCCTAGGATGCAAATCCCAACGAATATGCCAATGGATGCAGGAGATGAGCAT
GGAGGAGAGCCAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTGTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGG
GAGTAGTAGGCGTCAGACGCCTAAGGATTCATCTATGGAGAACAAGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTC
TACTGGTTTCACGAGGCCATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGACGTGGTGAGAACAGATACTCATGAACCTTCCCCATTGGATACACCTACAGAA
GGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTATTCCGTTGGCAACGGATGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAGCTCCAAACGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGTTTC
AGATTCTAAGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCTTATAATGAGCCCACATGGTTATGTTA
ATTTTCAGCAACTACCCACCTTTAATATACCTCAAATCAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATG
TTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCCCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACC
TTATTATGGAATGGGTCATTGGGTACCTCCACAAAATTATGGGATGATTTCACCTAGGGTTCCCCCACCACCTCAACTTTCATTCCTAGAAAGAGGACCTCAAGCACCAC
AATTGAACTCTAACATATTGAGTACCTTCAACATGGGACACCTAAAACCCTTAGAGCCTCCTAGGATGCAAATCCCAACGAATATGCCAATGGATGCAGGAGATGAGCAT
GGAGGAGAGCCAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTGTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGG
GAGTAGTAGGCGTCAGACGCCTAAGGATTCATCTATGGAGAACAAGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTC
TACTGGTTTCACGAGGCCATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGACGTGGTGAGAACAGATACTCATGAACCTTCCCCATTGGATACACCTACAGAA
GGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTATTCCGTTGGCAACGGATGAGTAG
Protein sequenceShow/hide protein sequence
MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMVSDSKLAAVRGGNPLQTFECPPSQAPTQHLIMSPHGYVNFQQLPTFNIPQISEFRAENPQQLPPMINPGMYQPFM
FNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPQNYGMISPRVPPPPQLSFLERGPQAPQLNSNILSTFNMGHLKPLEPPRMQIPTNMPMDAGDEH
GGEPEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRQTPKDSSMENKDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENDVVRTDTHEPSPLDTPTE
GAFCSHQELPTGATGSIPLATDE