CuGenDBv2

Moc04g25530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g25530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome location: chr4:18536907..18545047
RNA-Seq Expression: Moc04g25530
Synteny: Moc04g25530
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]; e-value 7.7e-119; identity 95.96%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPH PASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQ NKQPYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]; e-value 1.2e-114; identity 71.43%
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPH PAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+Q  KQPYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ +R+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]; e-value 7.4e-162; identity 61.61%
Query:  EHDLVEASMADIPPRDPVDPPAVNGNMRYHARNDEFNHIQMADNRDVAMREYDATAFQNFDSGI------------------------------------
        E +    SMADIPPRDPVDPPAVNGNMR HARNDEFN+IQMADNRDVAMREY ATAFQNFDSGI                                    
Subjt:  EHDLVEASMADIPPRDPVDPPAVNGNMRYHARNDEFNHIQMADNRDVAMREYDATAFQNFDSGI------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIK
                 IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIK
Subjt:  ---------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIK

Query:  NPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVP
        NPLA  IQPVQSDYCT APVCQVNDLIC                                        WRHHPNFSWGGQGGSSGFNQGQSQ NKQPYVP
Subjt:  NPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVP

Query:  PTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYD
        PTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGL YD
Subjt:  PTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYD

Query:  GPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRK
        GPTMPTTDVQIPST+P VKIPENPTTPEKENIRK
Subjt:  GPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRK

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia]; e-value 5.4e-80; identity 55.49%
Query:  MVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREG
        GFNQGQSQ NKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELP+ EG
Subjt:  GFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKDAMKLPEDFEECSAINSLNPVMFDEFYDLLVIEIEEELDKIAERPEDVAN
        KE CKAVTLRSGL Y+ PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKDAMKLPEDFEECSAINSLNPVMFDEFYDLLVIEIEEELDKIAERPEDVAN

Query:  PIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYARL
                      P+IVEPPTLEQKPLPSHLKYA L
Subjt:  PIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYARL

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]; e-value 1.0e-166; identity 94.23%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLA PIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPH PASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQ NKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELP+REGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRK
        NPTTPEK NIRK
Subjt:  NPTTPEKENIRK

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CR45 uncharacterized protein LOC111013464; e-value 3.7e-119; identity 95.96%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPH PASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQ NKQPYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC111018514; e-value 5.6e-115; identity 71.43%
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPH PAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+Q  KQPYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ +R+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

A0A6J1DW02 uncharacterized protein LOC111024897; e-value 3.6e-162; identity 61.61%
Query:  EHDLVEASMADIPPRDPVDPPAVNGNMRYHARNDEFNHIQMADNRDVAMREYDATAFQNFDSGI------------------------------------
        E +    SMADIPPRDPVDPPAVNGNMR HARNDEFN+IQMADNRDVAMREY ATAFQNFDSGI                                    
Subjt:  EHDLVEASMADIPPRDPVDPPAVNGNMRYHARNDEFNHIQMADNRDVAMREYDATAFQNFDSGI------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIK
                 IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIK
Subjt:  ---------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIK

Query:  NPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVP
        NPLA  IQPVQSDYCT APVCQVNDLIC                                        WRHHPNFSWGGQGGSSGFNQGQSQ NKQPYVP
Subjt:  NPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVP

Query:  PTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYD
        PTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGL YD
Subjt:  PTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYD

Query:  GPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRK
        GPTMPTTDVQIPST+P VKIPENPTTPEKENIRK
Subjt:  GPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRK

A0A6J1DYG0 uncharacterized protein LOC111025764; e-value 4.8e-167; identity 94.23%
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLA PIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPH PASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQ NKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELP+REGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPE

Query:  NPTTPEKENIRK
        NPTTPEK NIRK
Subjt:  NPTTPEKENIRK

A0A6J1E110 uncharacterized protein LOC111025424; e-value 1.5e-80; identity 55.79%
Query:  MVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREG
        GFNQGQSQ NKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELP+ EG
Subjt:  GFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKDAMKLPEDFEECSAINSLNPVMFDEFYDLLVIEIEEELDKIAERPEDVAN
        KE CKAVTLRSGL YD PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKDAMKLPEDFEECSAINSLNPVMFDEFYDLLVIEIEEELDKIAERPEDVAN

Query:  PIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYARL
                      P+IVEPPTLEQKPLPSHLKYA L
Subjt:  PIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYARL

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGGAGGCGCCAGACGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGTCGAGGCATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATA
TGAGGTATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGACGCCACGGCTTTTCAGAACTTTGATTCAGGG
ATAATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCT
AAATGACTTAGCTTCACACAACGAACTATGGTGTTCACAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGC
AAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGGCGTTGGGAATAAAAAATCCATTAGCCATGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCC
CCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAAGCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAA
TAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCC
AACACAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAAT
CTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTTGAGACCCAATTGGGACAGCTCGCCAATGAATT
GAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAGACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGAC
CAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGATGCCATGAAA
TTACCAGAAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTATAGAGATTGAAGAAGAGCTTGATAAGAT
AGCAGAAAGACCAGAAGATGTGGCTAATCCTATTGAAAAGATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCAT
TGCCGTCGCATTTGAAATATGCGCGCCTGGCGCCTGCCCTGTTTTTCCAGCATTTCGAAAACTCTCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCA
TTGGATTTTGCGGTTTTACCTTCATGGCCTCAAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequence
ATGGGAGGCGCCAGACGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGTCGAGGCATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATA
TGAGGTATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGACGCCACGGCTTTTCAGAACTTTGATTCAGGG
ATAATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCT
AAATGACTTAGCTTCACACAACGAACTATGGTGTTCACAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGC
AAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGGCGTTGGGAATAAAAAATCCATTAGCCATGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCC
CCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAAGCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAA
TAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCC
AACACAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAAT
CTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTTGAGACCCAATTGGGACAGCTCGCCAATGAATT
GAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAGACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGAC
CAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGATGCCATGAAA
TTACCAGAAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTATAGAGATTGAAGAAGAGCTTGATAAGAT
AGCAGAAAGACCAGAAGATGTGGCTAATCCTATTGAAAAGATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCAT
TGCCGTCGCATTTGAAATATGCGCGCCTGGCGCCTGCCCTGTTTTTCCAGCATTTCGAAAACTCTCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCA
TTGGATTTTGCGGTTTTACCTTCATGGCCTCAAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequence
MGGARRLGSLQKNWFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLVEASMADIPPRDPVDPPAVNGNMRYHARNDEFNHIQMADNRDVAMREYDATAFQNFDSG
IIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQSDYCTPA
PVCQVNDLICSFCSENHIYDNCPHKPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQHNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSN
LENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKDAMK
LPEDFEECSAINSLNPVMFDEFYDLLVIEIEEELDKIAERPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYARLAPALFFQHFENSQVREVVQHIYNLRAS
LDFAVLPSWPQALAAILGHPSPSTDTDPSPQPPTS