CuGenDBv2

Moc05g12210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g12210
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome location: chr5:9537823..9542127
RNA-Seq Expression: Moc05g12210
Synteny: Moc05g12210
Gene Ontology terms: NA
InterPro domains: NA
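
The genome location string can be split into its parts to get the length of the gene region. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state), and the helper name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr5:9537823..9542127")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```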


Homology
GenBank top hits | e value | %identity
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia] | 1.4e-109 | 93.95
Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        GAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQ PA VLALDIATSMQKEMVTMNQRLKEM LGIKNPLA PIQPVQ DYCTPAPVCQVNDLICSF
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE
        CSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYN GWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQYIPP QQQYNQRT+TP VQNNNSNLE
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE

Query:  NMMKEYMARTDAVIQ
        NMMKEYMARTD VIQ
Subjt:  NMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia] | 2.0e-103 | 69.69
Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        GAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQ  A VLALDIATSMQKEM TMNQ LKE+ L  K   + P  P Q +Y   +PVCQ+N+++CS+
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRT---QQYIPPPQQQYNQRTQTP--PVQNN
        CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYN G RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQYNQ  +TP  P  NN
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRT---QQYIPPPQQQYNQRTQTP--PVQNN

Query:  NSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        N++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  NSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] | 2.2e-182 | 63.65
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------
        AFQNFDSGIVNPIPAH NFELKPM+                                                                           
Subjt:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------GAFTKKTFNEIVDILNDLASHNELW
                                                                                   GAFTKKTFNEIVDILNDLASHNELW
Subjt:  ---------------------------------------------------------------------------GAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQ PA VLALDIATSMQKEMVTMNQRLKEM LGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia] | 5.3e-80 | 56.36
Query:  MVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSS
        MVTMNQRLKEM L IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYN GWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPVMFDEFYDLLVTEIEEELDKIAEGPEDMANPIEKIQ
        KE CKAVTLRSGL Y+ PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPVMFDEFYDLLVTEIEEELDKIAEGPEDMANPIEKIQ

Query:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF
                P+IVEPPTLEQKPLPSHLKYA+
Subjt:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] | 8.0e-161 | 93.79
Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQ PA VLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQVNDLICSF
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE
        CSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYN GWRHHPNFSW GQGGS GFNQGQSQQNKQPYVP TQQYIPPPQQ+YNQRTQTPPVQNNNSNLE
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE

Query:  NMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKE
        NMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPENPTTPEK 
Subjt:  NMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKE

Query:  NIRKGD
        NIRKG+
Subjt:  NIRKGD

TrEMBL top hits | e value | %identity
A0A6J1CR45 uncharacterized protein LOC111013464 | 7.0e-110 | 93.95
Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        GAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQ PA VLALDIATSMQKEMVTMNQRLKEM LGIKNPLA PIQPVQ DYCTPAPVCQVNDLICSF
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE
        CSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYN GWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQYIPP QQQYNQRT+TP VQNNNSNLE
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE

Query:  NMMKEYMARTDAVIQ
        NMMKEYMARTD VIQ
Subjt:  NMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC111018514 | 9.7e-104 | 69.69
Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        GAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQ  A VLALDIATSMQKEM TMNQ LKE+ L  K   + P  P Q +Y   +PVCQ+N+++CS+
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRT---QQYIPPPQQQYNQRTQTP--PVQNN
        CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYN G RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQYNQ  +TP  P  NN
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRT---QQYIPPPQQQYNQRTQTP--PVQNN

Query:  NSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        N++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  NSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

A0A6J1DW02 uncharacterized protein LOC111024897 | 1.1e-182 | 63.65
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------
        AFQNFDSGIVNPIPAH NFELKPM+                                                                           
Subjt:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------GAFTKKTFNEIVDILNDLASHNELW
                                                                                   GAFTKKTFNEIVDILNDLASHNELW
Subjt:  ---------------------------------------------------------------------------GAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQ PA VLALDIATSMQKEMVTMNQRLKEM LGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

A0A6J1DYG0 uncharacterized protein LOC111025764 | 3.9e-161 | 93.79
Query:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF
        GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQ PA VLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQVNDLICSF
Subjt:  GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVPARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSF

Query:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE
        CSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYN GWRHHPNFSW GQGGS GFNQGQSQQNKQPYVP TQQYIPPPQQ+YNQRTQTPPVQNNNSNLE
Subjt:  CSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLE

Query:  NMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKE
        NMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPENPTTPEK 
Subjt:  NMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKE

Query:  NIRKGD
        NIRKG+
Subjt:  NIRKGD

A0A6J1E110 uncharacterized protein LOC111025424 | 1.5e-80 | 56.67
Query:  MVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSS
        MVTMNQRLKEM L IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYN GWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPVMFDEFYDLLVTEIEEELDKIAEGPEDMANPIEKIQ
        KE CKAVTLRSGL YD PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPVMFDEFYDLLVTEIEEELDKIAEGPEDMANPIEKIQ

Query:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF
                P+IVEPPTLEQKPLPSHLKYA+
Subjt:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF

SwissProt top hits | e value | %identity
No hits found
Arabidopsis top hits | e value | %identity
No hits found
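
The %identity values listed for each hit summarize the pairwise alignments printed beneath it. As a rough illustration of how such a figure relates to an alignment, the sketch below counts identical columns in one short segment copied from the first GenBank hit. It assumes the BLAST-style convention that the middle row repeats the residue where query and subject match, shows `+` for a conservative substitution, and is blank otherwise. Only the query and middle rows are used, because the Subjt rows on this page repeat the query even at columns the middle row marks as mismatches. The function name `segment_identity` is hypothetical, and the database's own calculation covers the full alignment and may differ in detail.

```python
def segment_identity(query, midline):
    """Return (identical_columns, aligned_columns) for one alignment segment."""
    # Trailing blanks of the middle row are often stripped; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle row is longer than the query row")
    identical = sum(
        1 for q, m in zip(query, midline)
        if q != "-" and m == q  # gaps and blank or '+' columns do not count
    )
    return identical, len(query)


identical, aligned = segment_identity("NMMKEYMARTDAVIQ", "NMMKEYMARTD VIQ")
print(f"{identical}/{aligned} identical columns")
```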

Sequences
CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATCGGAGCCT
TTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGTTCCA
GCTAGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGACATTGGGAATAAAAAATCCATTAGCCACGCC
GATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATA
ACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACACAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGA
CAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACGTACACAGCAATACATCCCGCCACCGCAACAGCAGTACAATCAGAG
AACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGA
ATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAA
GCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAAC
ACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTG
AAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATATGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCC
ACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAG
mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATCGGAGCCT
TTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGTTCCA
GCTAGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGACATTGGGAATAAAAAATCCATTAGCCACGCC
GATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATA
ACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACACAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGA
CAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACGTACACAGCAATACATCCCGCCACCGCAACAGCAGTACAATCAGAG
AACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGA
ATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAA
GCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAAC
ACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTG
AAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATATGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCC
ACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAG
Protein sequence
MGGARRLGSLQKNWFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSM
ADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMIGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQVP
ARVLALDIATSMQKEMVTMNQRLKEMTLGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNTGWRHHPNFSWGG
QGGSSGFNQGQSQQNKQPYVPRTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCK
AVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPVMFDEFYDLLVTEIEEELDKIAEGPEDMANPIEKIQKEECKSLLPSIVEPP
TLEQKPLPSHLKYAFWRSTKRPLDGR
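
The protein sequence can be cross-checked against the CDS by translating it. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1; the page does not say which table was used). It translates only the first 30 and the last 15 nucleotides of the CDS as printed above; the `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases in TCAG order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGGAGGCGCCAGGCGCCTGGGAAGCCTG"  # first 30 nt of the CDS
cds_end = "TTGGATGGACGATAG"                  # last 15 nt of the CDS, including the stop codon
print(translate(cds_start), translate(cds_end))
```

Running the same function over the full CDS, joined into one string, would allow a comparison with the complete protein sequence.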