; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0984 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0984
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF1639)
Genome locationMC05:11151007..11153817
RNA-Seq ExpressionMC05g0984
SyntenyMC05g0984
Gene Ontology termsNA
InterPro domainsIPR012438 - Protein of unknown function DUF1639


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155976.1 uncharacterized protein LOC111022959 [Momordica charantia]1.49e-153100Show/hide
Query:  METEVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEK
        METEVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEK
Subjt:  METEVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEK

Query:  EDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSI
        EDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSI
Subjt:  EDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSI

Query:  KKRPRGLKAMGGSMETDSE
        KKRPRGLKAMGGSMETDSE
Subjt:  KKRPRGLKAMGGSMETDSE

XP_022932617.1 uncharacterized protein LOC111439121 isoform X1 [Cucurbita moschata]1.61e-12586.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW KR+RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N    +KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SME+DSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

XP_022932618.1 uncharacterized protein LOC111439121 isoform X2 [Cucurbita moschata]8.83e-12686.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW KR+RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N    +KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SME+DSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

XP_022972122.1 uncharacterized protein LOC111470755 isoform X1 [Cucurbita maxima]1.61e-12586.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW K++RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N     KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SMETDSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

XP_022972123.1 uncharacterized protein LOC111470755 isoform X2 [Cucurbita maxima]8.83e-12686.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW K++RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N     KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SMETDSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

TrEMBL top hitse value%identityAlignment
A0A6J1DQU0 uncharacterized protein LOC1110229597.22e-154100Show/hide
Query:  METEVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEK
        METEVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEK
Subjt:  METEVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEK

Query:  EDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSI
        EDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSI
Subjt:  EDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSI

Query:  KKRPRGLKAMGGSMETDSE
        KKRPRGLKAMGGSMETDSE
Subjt:  KKRPRGLKAMGGSMETDSE

A0A6J1EX97 uncharacterized protein LOC111439121 isoform X17.79e-12686.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW KR+RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N    +KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SME+DSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

A0A6J1F285 uncharacterized protein LOC111439121 isoform X24.27e-12686.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW KR+RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N    +KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SME+DSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

A0A6J1I3X6 uncharacterized protein LOC111470755 isoform X17.79e-12686.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW K++RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N     KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SMETDSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

A0A6J1I8Z1 uncharacterized protein LOC111470755 isoform X24.27e-12686.78Show/hide
Query:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS
        METEVRNQ  RGCKPLEPEVFLQW K++RLRCARIKDPEISERLCGGLRKKIASRADRCVV+ SE+ERTPLQPNRLTR+SEGVT LRNGAGT+     PS
Subjt:  METEVRNQ--RGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVT-LRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
        PEKEDRYYATRGSA A  VDENG+ISTN N     KAE+RGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY
Subjt:  PEKEDRYYATRGSAVA--VDENGKISTNGN---TTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERY

Query:  EVREKKSIKKRPRGLKAMGGSMETDSE
        EVREKKS KKRP GLKAMG SMETDSE
Subjt:  EVREKKSIKKRPRGLKAMGGSMETDSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55340.1 Protein of unknown function (DUF1639)8.9e-5053.33Show/hide
Query:  EVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGG------LRKKIASRADRCVVTASER---ERTPLQPNRLTRSSEGVTLRNGAGTAEIRK
        EV+ QRG    E +  LQWG+R+R+RC ++K     + L  G       ++K+ SRA      +SER    R   +PN++T S   V     A       
Subjt:  EVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGG------LRKKIASRADRCVVTASER---ERTPLQPNRLTRSSEGVTLRNGAGTAEIRK

Query:  PPSPEKEDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEV
          SPEKEDRYY TRGS + +DE+GKI         E +  VWPKL+IALS+KEKEEDF+AMKGCKLPQRPKKRAK++Q++LLLVSPGAWLS++ +ERYEV
Subjt:  PPSPEKEDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEV

Query:  REKKSIKKRPRGLKAMGGSMETDSE
        REKK+ KKRPRGLKAM GSME+DSE
Subjt:  REKKSIKKRPRGLKAMGGSMETDSE

AT1G55340.2 Protein of unknown function (DUF1639)9.8e-4952.25Show/hide
Query:  EVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGG------LRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPS
        EV+ QRG    E +  LQWG+R+R+RC ++K     + L  G       ++K+ SRA      +SER       NR  +S                   S
Subjt:  EVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGG------LRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPS

Query:  PEKEDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK
        PEKEDRYY TRGS + +DE+GKI         E +  VWPKL+IALS+KEKEEDF+AMKGCKLPQRPKKRAK++Q++LLLVSPGAWLS++ +ERYEVREK
Subjt:  PEKEDRYYATRGSAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREK

Query:  KSIKKRPRGLKAMGGSMETDSE
        K+ KKRPRGLKAM GSME+DSE
Subjt:  KSIKKRPRGLKAMGGSMETDSE

AT3G03880.1 Protein of unknown function (DUF1639)1.0e-4553.81Show/hide
Query:  LEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEKEDRYYATRGSAVA
        LE E+FLQWG ++RLRC R K  +IS       R K +SR          ++   LQ +R +R SEG  LR+G      R+ PSPEKE+RYY TRG    
Subjt:  LEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEKEDRYYATRGSAVA

Query:  VDENGKISTNGNTTKAEDRG---FVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSIKK-RPRGLKA
        VD  GK   +GN    +       +WPKLFI LS+KEKEEDFMAMKGCK   RPKKRAK+IQRSLLLVSPG WL+++  +RY+VR KKS KK R RGLKA
Subjt:  VDENGKISTNGNTTKAEDRG---FVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSIKK-RPRGLKA

Query:  MGGSMETDSE
        M G+METDS+
Subjt:  MGGSMETDSE

AT4G20300.1 Protein of unknown function (DUF1639)2.5e-2038.82Show/hide
Query:  RSSEGVTLRNGAGTAEI-------RKPPSPEKEDRYYATRG------------------SAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEE
        RS+ G   RN      I       R PPSP++ ++  + R                   S      + ++  NG   KA      WP+++IALS KEKEE
Subjt:  RSSEGVTLRNGAGTAEI-------RKPPSPEKEDRYYATRG------------------SAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEE

Query:  DFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSIKK
        DF+ MKG KLP RP+KRAK I ++L    PG WLS++++ RYEVREKK++KK
Subjt:  DFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSIKK

AT4G20300.2 Protein of unknown function (DUF1639)4.1e-2340.35Show/hide
Query:  RSSEGVTLRNGAGTAEI-------RKPPSPEKEDRYYATRG------------------SAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEE
        RS+ G   RN      I       R PPSP++ ++  + R                   S      + ++  NG   KA      WP+++IALS KEKEE
Subjt:  RSSEGVTLRNGAGTAEI-------RKPPSPEKEDRYYATRG------------------SAVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEE

Query:  DFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSIKK--RPRGLKAMGGSMETDSE
        DF+ MKG KLP RP+KRAK I ++L    PG WLS++++ RYEVREKK++KK  + RGLK M  +M+TDSE
Subjt:  DFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSIKK--RPRGLKAMGGSMETDSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACAGAGGTGAGGAATCAGAGAGGGTGTAAACCTTTGGAACCGGAGGTTTTCTTACAGTGGGGAAAGAGGAGGCGACTGAGATGTGCTAGAATCAAAGACCCGGA
GATCTCTGAGCGTTTGTGCGGCGGCCTACGGAAGAAAATCGCATCTCGAGCCGATCGCTGTGTGGTTACAGCTTCCGAGAGAGAAAGGACCCCACTCCAACCAAATCGTC
TCACTAGGAGTTCTGAGGGCGTTACGCTGCGTAACGGTGCAGGCACAGCCGAGATCCGGAAACCGCCTTCGCCGGAGAAGGAGGACCGTTACTACGCCACCCGAGGATCC
GCCGTGGCGGTGGATGAGAACGGGAAGATCTCAACCAACGGCAACACCACCAAGGCGGAAGACAGAGGCTTTGTTTGGCCAAAGCTGTTCATCGCTCTGTCAAGCAAAGA
AAAGGAGGAAGACTTCATGGCCATGAAGGGCTGTAAGCTCCCACAAAGGCCCAAAAAGAGGGCCAAGATGATCCAGAGAAGCTTACTCCTGGTGAGCCCTGGGGCATGGC
TGAGCGAAATGAGCCAAGAGAGATATGAAGTGAGGGAAAAGAAGAGTATAAAGAAGAGGCCAAGAGGATTGAAAGCCATGGGAGGAAGCATGGAGACTGATTCAGAATGA
mRNA sequenceShow/hide mRNA sequence
CTTAGATCTTTTTGGCTATTGATCTAGTAATAGGACATTTGAGGTCGAGTCGGTGTACCCATCAATTCTAATCTCGGTATATTAATAAATAACCTATATATTAATTACCT
ATTATTTACTACATCACTATTTAACTTTTAGATTGATTAACGTTGATATACATGTATTTTTATTTGGATACATCATTATTATCTTTTAGGTGATATTTTTGTAATATTTT
AGGCAAATTACAATGGAATTGAGGTAAGAAAAGGACGAAGGCCAGATAATTACAATGGGCCTAAGCTTGGCTCTCACCGAGGCGGCCCAAAACAAGAACAATGTGGGCCC
ATCGAGCCCATGAAGGCACTTCTAATGGGACAGAATGGGAATTTAATTTTTGTTTTGCTTACCTTTTTATCTTCACTGGCGAGGCCGAGGCGATGAAACGAGATTGAAGA
CGAAGGAAGACATTGTAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAACTCGAAGAACACGACGACCGCCATAGCTCAGACTTCTAT
AAGCTCTGCTCTGGAGCTGGAAACTTACTCAACCACAGCCTTTCTCTTGTAATGCGGAAAATTCCAGGCCTGAACTTCGATTATCCGAACAATTTCAGGTCGTAGAGGCT
CGAAATTGATTGAGAGGATCGGAGCCGCAATTTCCATTGGAGATCTGGTTTTGTTTCATCGGACTGGCCGTCTTGTATTCTTGACGAGAATCTTGCTGTTCTCATTATAT
GGAGACAGAGGTGAGGAATCAGAGAGGGTGTAAACCTTTGGAACCGGAGGTTTTCTTACAGTGGGGAAAGAGGAGGCGACTGAGATGTGCTAGAATCAAAGACCCGGAGA
TCTCTGAGCGTTTGTGCGGCGGCCTACGGAAGAAAATCGCATCTCGAGCCGATCGCTGTGTGGTTACAGCTTCCGAGAGAGAAAGGACCCCACTCCAACCAAATCGTCTC
ACTAGGAGTTCTGAGGGCGTTACGCTGCGTAACGGTGCAGGCACAGCCGAGATCCGGAAACCGCCTTCGCCGGAGAAGGAGGACCGTTACTACGCCACCCGAGGATCCGC
CGTGGCGGTGGATGAGAACGGGAAGATCTCAACCAACGGCAACACCACCAAGGCGGAAGACAGAGGCTTTGTTTGGCCAAAGCTGTTCATCGCTCTGTCAAGCAAAGAAA
AGGAGGAAGACTTCATGGCCATGAAGGGCTGTAAGCTCCCACAAAGGCCCAAAAAGAGGGCCAAGATGATCCAGAGAAGCTTACTCCTGGTGAGCCCTGGGGCATGGCTG
AGCGAAATGAGCCAAGAGAGATATGAAGTGAGGGAAAAGAAGAGTATAAAGAAGAGGCCAAGAGGATTGAAAGCCATGGGAGGAAGCATGGAGACTGATTCAGAATGAGG
AGGAAACTGAACTGACCCTTTAATTCTTTTTTTTTTTTTTTTTTTTTGGGGAATGCTGAATTGAATGAATTGGAAAGAAGAGAAGAGAAGAGAAGAGAAGAGGAAGAGGA
AGAGGAAGAGGGCATCTCAATCTCATCCTGCCTCTGCTTGAATTCTGAGGTTAGTGGGGTTTTGGTTTTGGATTTGGTGCTTAAAGAAGTCCACTTTTGAGGGGTTGGTT
AGGCAGAGCTGCAAAAGTGATGAGTTCTGTTTACAAATTTTAAAAGTTGCTCTCCTTTTTGTGTATAAAATGTAATGTAAAAAAAATTGTTATTTATAAATGGAAGGAAG
GTCCCATTCTAAATGGGATTTTGTATTAGGGGGATGACATGATAACCCAACAAAAATTGAAATATGATCCAAAAAAGTTATAATGAAAAAAGAGAAAAAAAAAAGAGAGT
CTTTGGCTTTTGATCTTCAGATTGTGAAAGATAAATTTGTCTTGTTTACTTGTGTGCATGTACATAGAGATAATCTTTT
Protein sequenceShow/hide protein sequence
METEVRNQRGCKPLEPEVFLQWGKRRRLRCARIKDPEISERLCGGLRKKIASRADRCVVTASERERTPLQPNRLTRSSEGVTLRNGAGTAEIRKPPSPEKEDRYYATRGS
AVAVDENGKISTNGNTTKAEDRGFVWPKLFIALSSKEKEEDFMAMKGCKLPQRPKKRAKMIQRSLLLVSPGAWLSEMSQERYEVREKKSIKKRPRGLKAMGGSMETDSE