; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g16440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g16440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:13304484..13306736
RNA-Seq ExpressionMoc09g16440
SyntenyMoc09g16440
Gene Ontology termsGO:0009536 - plastid (cellular component)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD5336140.1 unnamed protein product [Arabidopsis thaliana]6.7e-11475.91Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S                + RL     S PFEILRRVALWRAQYD
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD

Query:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG
        ESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQTRKG
Subjt:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG

Query:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI
        LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Subjt:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI

Query:  KEV
        KE+
Subjt:  KEV

CAD5336141.1 unnamed protein product [Arabidopsis thaliana]6.7e-11475.91Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S                + RL     S PFEILRRVALWRAQYD
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD

Query:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG
        ESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQTRKG
Subjt:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG

Query:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI
        LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Subjt:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI

Query:  KEV
        KE+
Subjt:  KEV

CAD5336145.1 unnamed protein product [Arabidopsis thaliana]6.7e-11475.91Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S                + RL     S PFEILRRVALWRAQYD
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD

Query:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG
        ESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQTRKG
Subjt:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG

Query:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI
        LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Subjt:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI

Query:  KEV
        KE+
Subjt:  KEV

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica]1.4e-10880.75Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S                + RL             RVALWRAQYD
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD

Query:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG
        ESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQTRKG
Subjt:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG

Query:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH
        LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVEC TLDGESPVAESITSL SDPSSMGH
Subjt:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH

OVA05688.1 hypothetical protein BVC80_4285g1 [Macleaya cordata]9.0e-11970.61Show/hide
Query:  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLH
        KRIEEASDSFM APLGSGGYSSVGRAPLLQL              VVGVGGS RVPSSGIPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLH
Subjt:  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLH

Query:  PVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESA
        PV              R  H  L    ++   +  LEWDSNS PFEILRRVALWRAQ                      S GGLRGGGLPCGGCQRFESA
Subjt:  PVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESA

Query:  YLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG------
        YLQLVNLADTK+YDSTQFFRFGSSIYD SFMDVDKIL FSSTLGWHSLKV GEVQTRKGLRWIPRHPETRKGV SDEMLRGVENK RSGDSRIG      
Subjt:  YLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG------

Query:  -----------------EAVECCTLDGESPVAESITSLRSDPSSMGH
                         EAVEC TLDGESPVAESITSLRSDPSSMGH
Subjt:  -----------------EAVECCTLDGESPVAESITSLRSDPSSMGH

TrEMBL top hitse value%identityAlignment
A0A200Q5G5 Uncharacterized protein ycf684.4e-11970.61Show/hide
Query:  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLH
        KRIEEASDSFM APLGSGGYSSVGRAPLLQL              VVGVGGS RVPSSGIPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLH
Subjt:  KRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLH

Query:  PVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESA
        PV              R  H  L    ++   +  LEWDSNS PFEILRRVALWRAQ                      S GGLRGGGLPCGGCQRFESA
Subjt:  PVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESA

Query:  YLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG------
        YLQLVNLADTK+YDSTQFFRFGSSIYD SFMDVDKIL FSSTLGWHSLKV GEVQTRKGLRWIPRHPETRKGV SDEMLRGVENK RSGDSRIG      
Subjt:  YLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG------

Query:  -----------------EAVECCTLDGESPVAESITSLRSDPSSMGH
                         EAVEC TLDGESPVAESITSLRSDPSSMGH
Subjt:  -----------------EAVECCTLDGESPVAESITSLRSDPSSMGH

A0A2N9I678 Uncharacterized protein ycf689.4e-10673.5Show/hide
Query:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGW
        +++  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYD SFMDVDKILPFSSTLGW
Subjt:  EILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGW

Query:  HSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH---------------------
        HSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH                     
Subjt:  HSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH---------------------

Query:  ----------------------------------------------GSRSASETMGDKLHRREGNSPDHLLRPLNDRSVIKEV
                                                      GSRSASETMGDKLHRREGNSPDH LRPLNDRSVIKE+
Subjt:  ----------------------------------------------GSRSASETMGDKLHRREGNSPDHLLRPLNDRSVIKEV

A0A7G2FJL3 Uncharacterized protein ycf683.2e-11475.91Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S                + RL     S PFEILRRVALWRAQYD
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD

Query:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG
        ESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQTRKG
Subjt:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG

Query:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI
        LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Subjt:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI

Query:  KEV
        KE+
Subjt:  KEV

A0A7G2FKR6 Uncharacterized protein ycf683.2e-11475.91Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S                + RL     S PFEILRRVALWRAQYD
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD

Query:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG
        ESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQTRKG
Subjt:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG

Query:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI
        LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Subjt:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI

Query:  KEV
        KE+
Subjt:  KEV

A0A7G2FMH4 Uncharacterized protein ycf683.2e-11475.91Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHRLHPV         G   S                + RL     S PFEILRRVALWRAQYD
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYD

Query:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG
        ESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQTRKG
Subjt:  ESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKG

Query:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI
        LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDH LRPLNDRSVI
Subjt:  LRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHLLRPLNDRSVI

Query:  KEV
        KE+
Subjt:  KEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07706.1 unknown protein2.0e-0754.84Show/hide
Query:  SETMGDKLHRR----------EGNSPDHLLRPLNDRSVIKEVGVQRQPGGL------PRSSH
        SET G ++ R           EGNSPDH LRP N RSVIKEVGVQRQP  L      PRS H
Subjt:  SETMGDKLHRR----------EGNSPDHLLRPLNDRSVIKEVGVQRQPGGL------PRSSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTACTTCACGGGCGAGGTCCCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAGAAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCC
ACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCG
GCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACGAGATCCCTTCAACCCTTTGAGCGAAATGCGGCAA
AAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGTACGAGATCACCCCAAGGACGCCTTCGGTATCCAGGGGTCGCGGACCGACCATAGAACCCTGTT
CAATAAGTGGAACGCATTAGTTGTCCGCTCTCGGTTAGAATGGGATTCCAACTCAGTACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAA
GTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTC
GAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCAATTTTTCCGATTCGGCAGTTCGATCTATGATTTCTCATTCATGGACGTTGA
TAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAGGAAAGGCTTACGGTGGATACCTAGACACCCAGAGACGAGGA
AGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCA
GTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAG
CCCGGATCACCTGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTACTTCACGGGCGAGGTCCCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAGAAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCC
ACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCG
GCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACGAGATCCCTTCAACCCTTTGAGCGAAATGCGGCAA
AAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGTACGAGATCACCCCAAGGACGCCTTCGGTATCCAGGGGTCGCGGACCGACCATAGAACCCTGTT
CAATAAGTGGAACGCATTAGTTGTCCGCTCTCGGTTAGAATGGGATTCCAACTCAGTACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAA
GTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTC
GAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCAATTTTTCCGATTCGGCAGTTCGATCTATGATTTCTCATTCATGGACGTTGA
TAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAGGAAAGGCTTACGGTGGATACCTAGACACCCAGAGACGAGGA
AGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCA
GTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGGGCACGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAG
CCCGGATCACCTGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA
Protein sequenceShow/hide protein sequence
MIYFTGEVPGSSPGWPSYAKKKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQ
KEGKSMDRPHRLHPVVRDHPKDAFGIQGSRTDHRTLFNKWNALVVRSRLEWDSNSVPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRF
ESAYLQLVNLADTKLYDSTQFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTRKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGEAVECCTLDGESP
VAESITSLRSDPSSMGHGSRSASETMGDKLHRREGNSPDHLLRPLNDRSVIKEVGVQRQPGGLPRSSHP