; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g31380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g31380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr6:23589481..23596870
RNA-Seq ExpressionMoc06g31380
SyntenyMoc06g31380
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]5.8e-9595.14Show/hide
Query:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP
        RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNP
Subjt:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP

Query:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ
        YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQ YVPPTQQYIPP Q+QYNQRT+TP VQNNNSNLENMMKEYMARTD VIQ
Subjt:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]3.1e-8867.44Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        S+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y   +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPT---QQYIPPPQRQYNQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ
        PYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQ YVP T    Q +PPP +QYNQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+Q
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPT---QQYIPPPQRQYNQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ

Query:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        AA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]7.3e-15457.79Show/hide
Query:  RTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNYDSGIVNPIPA
        RTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAATAFQN+DSGIVNPIPA
Subjt:  RTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNYDSGIVNPIPA

Query:  HANFELKPMI------------------------------------------------------------------------------------------
        H NFELKPM+                                                                                          
Subjt:  HANFELKPMI------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------SRAAPKKQDPA
                                                                                                 SRAAPKKQDPA
Subjt:  -----------------------------------------------------------------------------------------SRAAPKKQDPA

Query:  GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWR
        GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                                        WR
Subjt:  GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWR

Query:  HHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQ
        HHPNFSWGGQGGSSGFNQGQSQQNKQ YVPPTQQ+IPPPQ+QYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQ
Subjt:  HHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQ

Query:  GSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        GSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  GSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia]2.0e-8757.82Show/hide
Query:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQ+ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ
        KE CKAVTLRSGL Y+ PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ

Query:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPV
                P+IVEPPTLEQKPLPSHLKYAYLGDN+TLPV
Subjt:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPV

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]1.6e-14593.86Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        SRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
        PYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQ YVPPTQQYIPPPQ++YNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG

Query:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGD
        QLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPENPTTPEK NIRKG+
Subjt:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGD

TrEMBL top hitse value%identityAlignment
A0A6J1CR45 uncharacterized protein LOC1110134642.8e-9595.14Show/hide
Query:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP
        RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA PIQPVQ DYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNP
Subjt:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP

Query:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ
        YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQ YVPPTQQYIPP Q+QYNQRT+TP VQNNNSNLENMMKEYMARTD VIQ
Subjt:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC1110185141.5e-8867.44Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        S+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P Q +Y   +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPT---QQYIPPPQRQYNQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ
        PYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQ YVP T    Q +PPP +QYNQ  +TP  P  NNN++LENM KEYMAR DA+       IQ+Q
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPT---QQYIPPPQRQYNQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ

Query:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        AA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

A0A6J1DW02 uncharacterized protein LOC1110248973.5e-15457.79Show/hide
Query:  RTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNYDSGIVNPIPA
        RTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAATAFQN+DSGIVNPIPA
Subjt:  RTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNYDSGIVNPIPA

Query:  HANFELKPMI------------------------------------------------------------------------------------------
        H NFELKPM+                                                                                          
Subjt:  HANFELKPMI------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------SRAAPKKQDPA
                                                                                                 SRAAPKKQDPA
Subjt:  -----------------------------------------------------------------------------------------SRAAPKKQDPA

Query:  GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWR
        GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLIC                                        WR
Subjt:  GVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWR

Query:  HHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQ
        HHPNFSWGGQGGSSGFNQGQSQQNKQ YVPPTQQ+IPPPQ+QYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQ
Subjt:  HHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQ

Query:  GSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        GSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  GSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

A0A6J1DYG0 uncharacterized protein LOC1110257647.8e-14693.86Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        SRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
        PYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQ YVPPTQQYIPPPQ++YNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG

Query:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGD
        QLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPENPTTPEK NIRKG+
Subjt:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGD

A0A6J1E110 uncharacterized protein LOC1110254243.3e-8858.41Show/hide
Query:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQ+ YNQR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ
        KE CKAVTLRSGL YD PTMPTTDVQI STE                                                                     
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQ

Query:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPV
                P+IVEPPTLEQKPLPSHLKYAYLGDNDTLPV
Subjt:  KEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAG
AAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGAT
CATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTATGATTCAGGGATAGTCAA
CCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGC
AAAAAGAGATGGTTACAATGAACCAAAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCC
CCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAA
TAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCC
AACAGAACAAGCAGCTCTATGTTCCACCTACACAACAATACATCCCACCACCGCAACGGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAAT
CTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATT
GAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGAC
CAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTGAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAA
GAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGA
TGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAAT
ATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCG
CTAGCTGCTATCCTTGGTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAG
AAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGAT
CATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTATGATTCAGGGATAGTCAA
CCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGC
AAAAAGAGATGGTTACAATGAACCAAAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCC
CCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAA
TAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCC
AACAGAACAAGCAGCTCTATGTTCCACCTACACAACAATACATCCCACCACCGCAACGGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAAT
CTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATT
GAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGAC
CAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTGAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAA
GAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGA
TGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAAT
ATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCG
CTAGCTGCTATCCTTGGTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MGGARRLGSLQKNWFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLRTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRD
HARNDEFNHIQMADNRDVAMREYAATAFQNYDSGIVNPIPAHANFELKPMISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDYCTPA
PVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQLYVPPTQQYIPPPQRQYNQRTQTPPVQNNNSN
LENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFE
ECSAINSLNPIMFDEFYDSLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPVREVVQHIYNLRASLDFAVLPSWPPA
LAAILGHPSPSTDTDPSPQPPTS