CuGenDBv2

Moc03g06150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g06150
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome location: chr3:4491012..4495351
RNA-Seq Expression: Moc03g06150
Synteny: Moc03g06150
Gene Ontology terms: NA
InterPro domains: NA
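
The genome location field can be split into its parts for downstream use. The snippet below is a minimal sketch, assuming the `chrom:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr3:4491012..4495351")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 4340 bp on chr3.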


Homology
GenBank top hits | e value | % identity | Alignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia] (e value: 3.8e-118; % identity: 95.96)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia] (e value: 1.2e-116; % identity: 72.76)
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPST---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVPST    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPST---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] (e value: 1.1e-189; % identity: 65.19)
Query:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIE+TLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia] (e value: 1.7e-81; % identity: 56.97)
Query:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ
        KE CKAVTLRSGL Y+ PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ

Query:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF
                P+IVEPPTLEQKPLPSHLKYA+
Subjt:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] (e value: 8.1e-169; % identity: 94.9)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVP TQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGD
        NPTTPEK NIRKG+
Subjt:  NPTTPEKENIRKGD

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CR45 uncharacterized protein LOC111013464 (e value: 1.8e-118; % identity: 95.96)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC111018514 (e value: 5.9e-117; % identity: 72.76)
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPST---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVPST    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPST---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

A0A6J1DW02 uncharacterized protein LOC111024897 (e value: 5.3e-190; % identity: 65.19)
Query:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIE+TLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP TQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

A0A6J1DYG0 uncharacterized protein LOC111025764 (e value: 3.9e-169; % identity: 94.9)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVP TQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRKGD
        NPTTPEK NIRKG+
Subjt:  NPTTPEKENIRKGD

A0A6J1E110 uncharacterized protein LOC111025424 (e value: 4.7e-82; % identity: 57.27)
Query:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ
        KE CKAVTLRSGL YD PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ

Query:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF
                P+IVEPPTLEQKPLPSHLKYA+
Subjt:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAF

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found
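
In the alignments above, the unlabeled middle line repeats the residue where the two sequences agree, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The sketch below counts identities for a single alignment block from that middle line. It is illustrative only: `midline_identity` is a hypothetical helper, and the figure it gives covers one block, not the whole-alignment % identity reported next to each hit.

```python
def midline_identity(query, midline):
    """Return (identical, columns, percent) for one alignment block.

    A column counts as identical when the midline repeats the query
    residue; '+' (conservative change) and ' ' (mismatch or gap) do not.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    columns = len(query)
    return identical, columns, 100.0 * identical / columns

# Last block of the first GenBank hit (XP_022143608.1), copied from above.
query   = "QNNNSNLENMMKEYMARTDAVIQ"
midline = "QNNNSNLENMMKEYMARTD VIQ"
identical, columns, pct = midline_identity(query, midline)
print(identical, columns, round(pct, 2))
```

For this block, 22 of 23 columns are identical.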

Sequences
CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCAGACCC
TTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTTAGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATG
CTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCA
AAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAA
TCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATG
ATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAAT
TTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCATCTACACAGCAATACATCCCACCACCGCAACA
GCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAG
CGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGG
AAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACC
AGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTCGT
TAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCC
ATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAG
mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACAGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCAGACCC
TTCGAAAAACTAGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTTAGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATG
CTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACATAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCA
AAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAA
TCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATG
ATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAAT
TTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCATCTACACAGCAATACATCCCACCACCGCAACA
GCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAG
CGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGG
AAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACC
AGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTCGT
TAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCC
ATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAG
Protein sequence
MGGARRLGSLQKNRFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLGKLKCMSTRSFLLPLDPEIEQTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSM
ADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAP
KKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPN
FSWGGQGGSSGFNQGQSQQNKQPYVPSTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPS
IVEPPTLEQKPLPSHLKYAFWRSTKRPLDGR
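
The protein sequence should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code and using only the first and last few codons of the CDS listed above; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven and last six codons of the CDS shown above.
print(translate("ATGGGAGGCGCCAGGCGCCTG"))
print(translate("CCATTGGATGGACGATAG"))
```

The two fragments translate to `MGGARRL` and `PLDGR`, matching the start and end of the listed protein sequence; running the full CDS through the same function would extend the check to the whole record.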