CuGenDBv2

Moc07g05210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g05210
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111022160
Genome location: chr7:4395485..4401389
RNA-Seq Expression: Moc07g05210
Synteny: Moc07g05210
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia] (e value: 1.4e-87; % identity: 78.03)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLIC                                        WRHHPNFSWGGQGGS+GFNQGQSQQNKQPYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARTDAVIQ
         NNNSNLENMMKEYMARTD VIQ
Subjt:  HNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia] (e value: 4.4e-89; % identity: 61.79)
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT

Query:  PAPVCQVNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++C                                         RHHPNFSWGGQG S+G  QGQ+QQ KQPYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P HNNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] (e value: 3.8e-197; % identity: 69.96)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNK
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLICWRHHPNFSWGGQGGS+GFNQGQSQQNK
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNK

Query:  QPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRS
        QPYVPPTQQ+IPPPQQQYNQRTQTPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRS
Subjt:  QPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        GL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia] (e value: 3.0e-77; % identity: 57.05)
Query:  MVTMNQRLKEMALGIKNPLA-----TPIQPVQSDYCTPA--------PVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQ
        MVTMNQRLKEMAL IK  ++       +  + S  C P         P     +   WRHHPNFSWGGQGGS+GFNQGQSQQNKQ YVP TQQY PPPQQ
Subjt:  MVTMNQRLKEMALGIKNPLA-----TPIQPVQSDYCTPA--------PVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQ

Query:  QYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQI
         YNQR QTPPV NNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EGKE CKAVTLRSGL Y+ PTMPTTDVQI
Subjt:  QYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQI

Query:  PSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLK
         STE                                                                             P+IVEPPTLEQKPLPSHLK
Subjt:  PSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLK

Query:  YAYLGDNDTLPV
        YAYLGDN+TLPV
Subjt:  YAYLGDNDTLPV

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] (e value: 1.4e-138; % identity: 82.48)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLIC                                        WRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
         NNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGD
        NPTTPEK NIRKG+
Subjt:  NPTTPEKENIRKGD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CR45 uncharacterized protein LOC111013464 (e value: 6.9e-88; % identity: 78.03)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLIC                                        WRHHPNFSWGGQGGS+GFNQGQSQQNKQPYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARTDAVIQ
         NNNSNLENMMKEYMARTD VIQ
Subjt:  HNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC111018514 (e value: 2.1e-89; % identity: 61.79)
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT

Query:  PAPVCQVNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++C                                         RHHPNFSWGGQG S+G  QGQ+QQ KQPYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P HNNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

A0A6J1DW02 uncharacterized protein LOC111024897 (e value: 1.8e-197; % identity: 69.96)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNK
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLICWRHHPNFSWGGQGGS+GFNQGQSQQNK
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNK

Query:  QPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRS
        QPYVPPTQQ+IPPPQQQYNQRTQTPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRS
Subjt:  QPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        GL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

A0A6J1DYG0 uncharacterized protein LOC111025764 (e value: 6.6e-139; % identity: 82.48)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLIC                                        WRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLIC----------------------------------------WRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
         NNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  HNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGD
        NPTTPEK NIRKG+
Subjt:  NPTTPEKENIRKGD

A0A6J1E110 uncharacterized protein LOC111025424 (e value: 5.0e-78; % identity: 57.69)
Query:  MVTMNQRLKEMALGIKNPLA-----TPIQPVQSDYCTPA--------PVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQ
        MVTMNQRLKEMAL IK  ++       +  + S  C P         P     +   WRHHPNFSWGGQGGS+GFNQGQSQQNKQ YVP TQQY PPPQQ
Subjt:  MVTMNQRLKEMALGIKNPLA-----TPIQPVQSDYCTPA--------PVCQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQ

Query:  QYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQI
         YNQR QTPPV NNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EGKE CKAVTLRSGL YD PTMPTTDVQI
Subjt:  QYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQI

Query:  PSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLK
         STE                                                                             P+IVEPPTLEQKPLPSHLK
Subjt:  PSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLK

Query:  YAYLGDNDTLPV
        YAYLGDNDTLPV
Subjt:  YAYLGDNDTLPV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATA
GAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGA
CTTAGCTTCACACAACGAACTATGGTGTTCACAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAG
AGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTT
TGCCAAGTCAACGATCTCATTTGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAACGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCC
CTATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCACAATAACAACTCAAATCTTGAGAATATGATGA
AGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTTGAGACCCAATTGGGACAACTTGCCAATGAATTGAAGAATAGACCACAA
GGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAAC
AGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAA
ATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATT
GAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGA
TAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTG
GTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATA
GAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGA
CTTAGCTTCACACAACGAACTATGGTGTTCACAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAG
AGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTT
TGCCAAGTCAACGATCTCATTTGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAACGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCC
CTATGTTCCACCTACACAACAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCACAATAACAACTCAAATCTTGAGAATATGATGA
AGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTTGAGACCCAATTGGGACAACTTGCCAATGAATTGAAGAATAGACCACAA
GGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAAC
AGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAA
ATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATT
GAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGA
TAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTG
GTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequence
MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGII
EHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPV
CQVNDLICWRHHPNFSWGGQGGSNGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQ
GSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVANPI
EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSPSTDTDPSPQPPTS
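
The protein sequence above appears to be the conceptual translation of the CDS sequence. The following is a minimal sketch for checking that correspondence, assuming the standard nuclear genetic code applies to this gene; the function name `translate` is hypothetical and is not part of CuGenDB or any library.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# Assumption: the standard (nuclear) code is appropriate for this record.

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in T, C, A, G order at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS listed on this page.
    print(translate("ATGAGCACAAGATCTTTTCTT"))
```

Pasting the full CDS text into `translate` and comparing the result with the protein sequence (after joining its wrapped lines) would confirm whether the two entries agree.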