; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g18680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g18680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr3:12393638..12399825
RNA-Seq ExpressionMoc03g18680
SyntenyMoc03g18680
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]7.8e-11895.96Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPA VLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]1.4e-9566.2Show/hide
Query:  LDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT
        LDHP KMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD A VLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLTNELKNRPQDA------MKLQGDFEECSAI
        NQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ  N+LK RPQ +      +  +   E+C A+
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLTNELKNRPQDA------MKLQGDFEECSAI

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]2.0e-15058.9Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKKQRLRKQLEKQKEREGEISPETEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRK+QRLRKQLE QKEREGEISPE+EVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKKQRLRKQLEKQKEREGEISPETEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHP KMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPA VLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFD
        TQLG L NELKNRPQ +     +       E+C A+   + + +D
Subjt:  TQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFD

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia]1.0e-6957.84Show/hide
Query:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAM-------KLQG
        GFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG L N LKNRPQ +        K +G
Subjt:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAM-------KLQG

Query:  DFEECSAINSLNPVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP
          E C A+   + + ++                        T P   +Q     S  P+IVEPPTLEQKPLPSHLKYAYLGDN+TLP
Subjt:  DFEECSAINSLNPVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]2.7e-13487.1Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPA VLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFDE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQL NELKNRPQ +     +       E+C A+   + + +DE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFDE

TrEMBL top hitse value%identityAlignment
A0A6J1CR45 uncharacterized protein LOC1110134643.8e-11895.96Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPA VLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQ
        QNNNSNLENMMKEYMARTD VIQ
Subjt:  QNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC1110185146.9e-9666.2Show/hide
Query:  LDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT
        LDHP KMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD A VLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y  
Subjt:  LDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLTNELKNRPQDA------MKLQGDFEECSAI
        NQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ  N+LK RPQ +      +  +   E+C A+
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLTNELKNRPQDA------MKLQGDFEECSAI

A0A6J1DW02 uncharacterized protein LOC1110248979.9e-15158.9Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKKQRLRKQLEKQKEREGEISPETEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRK+QRLRKQLE QKEREGEISPE+EVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKKQRLRKQLEKQKEREGEISPETEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHP KMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPA VLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFD
        TQLG L NELKNRPQ +     +       E+C A+   + + +D
Subjt:  TQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFD

A0A6J1DYG0 uncharacterized protein LOC1110257641.3e-13487.1Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPA VLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFDE
        QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQL NELKNRPQ +     +       E+C A+   + + +DE
Subjt:  QNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAMKLQGDF------EECSAINSLNPVMFDE

A0A6J1E110 uncharacterized protein LOC1110254241.7e-7058.54Show/hide
Query:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAM-------KLQG
        GFNQGQSQQNKQ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG L N LKNRPQ +        K +G
Subjt:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAM-------KLQG

Query:  DFEECSAINSLNPVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP
          E C A+   + + +D                        T P   +Q     S  P+IVEPPTLEQKPLPSHLKYAYLGDNDTLP
Subjt:  DFEECSAINSLNPVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGAAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAA
GAGAGAGAAGGTGAAATCAGTCCTGAAACTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGT
AATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTT
GATTCAGGGATAATAGAACATTTCTTTAGAGGTTTAGATCATCCTATTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAG
ATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGAAGTTTTGGCTCTG
GACATTGCGACCTCGATGCAAAAAGAAATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACACCGATACAACCTGTG
CAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAATCATATTTATGATAATTGTCCACATAACCCTGCT
TCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAA
GGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGCCGCCGCAACAGCAGTACAATCAG
AGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCA
ATGAGGAATTTCGAGACCCAATTGGGACAGCTCACCAATGAATTAAAGAATAGACCACAAGACGCCATGAAATTACAAGGAGACTTTGAAGAATGCTCTGCTATA
AATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATGGCAGAAGGACCGGAAGATGTGACTAAT
CCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCG
TATCTAGGGGATAACGACACTTTACCAAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGAAGAATTATCGCCCCATGCGCTACAAAGAAGGGAAGGTACTGG
GATTTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACT
GATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGAAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAA
GAGAGAGAAGGTGAAATCAGTCCTGAAACTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGT
AATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTT
GATTCAGGGATAATAGAACATTTCTTTAGAGGTTTAGATCATCCTATTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCAACGAG
ATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGAAGTTTTGGCTCTG
GACATTGCGACCTCGATGCAAAAAGAAATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACACCGATACAACCTGTG
CAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAATCATATTTATGATAATTGTCCACATAACCCTGCT
TCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAA
GGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGCCGCCGCAACAGCAGTACAATCAG
AGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCA
ATGAGGAATTTCGAGACCCAATTGGGACAGCTCACCAATGAATTAAAGAATAGACCACAAGACGCCATGAAATTACAAGGAGACTTTGAAGAATGCTCTGCTATA
AATAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATGGCAGAAGGACCGGAAGATGTGACTAAT
CCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCG
TATCTAGGGGATAACGACACTTTACCAAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGAAGAATTATCGCCCCATGCGCTACAAAGAAGGGAAGGTACTGG
GATTTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACT
GATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MSTRSFLLPLDPEIERTLRKTRKKQRLRKQLEKQKEREGEISPETEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNF
DSGIIEHFFRGLDHPIKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAEVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPV
QSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQ
RTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLTNELKNRPQDAMKLQGDFEECSAINSLNPVMFDEFYDLLVTEIEEELDKMAEGPEDVTN
PIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPRDAPIVEEKNIRRIIAPCATKKGRYWDFSYIGDPSSSRGEPTAARSALAAILGHPSSSTDT
DPSPQPPTS