; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g23780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g23780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr5:16985815..16993258
RNA-Seq ExpressionMoc05g23780
SyntenyMoc05g23780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]4.3e-9494.05Show/hide
Query:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP
        RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPL+ PIQPVQ DYCTPAPVC+ NDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNP
Subjt:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP

Query:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ
        YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPP QQQY QRT+TP VQNNNSNLENMMKEYMARTD VIQ
Subjt:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]4.6e-8867.83Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        S+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   S P  P Q +Y   +PVC+ N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYTQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ
        PYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQY Q  +TP  P  NNN++LENM KEYMAR DA+       IQ+Q
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYTQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ

Query:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        AA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]3.9e-16458.64Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKERQGEISPESEVESTSTSMVDIPPRDPVDPPAVNGNMRDHARHDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKER+GEISPESEVESTSTSM DIPPRDPVDPPAVNGNMRDHAR+DEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKERQGEISPESEVESTSTSMVDIPPRDPVDPPAVNGNMRDHARHDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------
        AFQNFDSGIVNPIPAH NFELKPM+                                                                           
Subjt:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNN
            SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPL+T IQPVQSDYCT APVC+ NDLIC                           
Subjt:  ----SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQY QRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+D+ SVPP+
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia]2.3e-7971.02Show/hide
Query:  MVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  +S                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQQ Y QR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK
        KE CKAVTLRSGL Y+ PTMPTTDVQI STEP   I E PT  +K
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEK

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]2.1e-14991.22Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        SRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPL+TPIQPVQSDYCTPAPVC+ NDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
        PYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+Y QRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG

Query:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR-----KKKIG
        QLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPENPTTPEK NIRKGNED+PSVPP+     KK IG
Subjt:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR-----KKKIG

TrEMBL top hitse value%identityAlignment
A0A6J1CR45 uncharacterized protein LOC1110134642.1e-9494.05Show/hide
Query:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP
        RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPL+ PIQPVQ DYCTPAPVC+ NDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNP
Subjt:  RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNP

Query:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ
        YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPP QQQY QRT+TP VQNNNSNLENMMKEYMARTD VIQ
Subjt:  YSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC1110185142.2e-8867.83Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        S+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   S P  P Q +Y   +PVC+ N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYTQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ
        PYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PPP QQY Q  +TP  P  NNN++LENM KEYMAR DA+       IQ+Q
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPPQQQYTQRTQTP--PVQNNNSNLENMMKEYMARTDAV-------IQSQ

Query:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP
        AA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP MP
Subjt:  AASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMP

A0A6J1DW02 uncharacterized protein LOC1110248971.9e-16458.64Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKERQGEISPESEVESTSTSMVDIPPRDPVDPPAVNGNMRDHARHDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKER+GEISPESEVESTSTSM DIPPRDPVDPPAVNGNMRDHAR+DEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKERQGEISPESEVESTSTSMVDIPPRDPVDPPAVNGNMRDHARHDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------
        AFQNFDSGIVNPIPAH NFELKPM+                                                                           
Subjt:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNN
            SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPL+T IQPVQSDYCT APVC+ NDLIC                           
Subjt:  ----SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQY QRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+D+ SVPP+
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR

A0A6J1DYG0 uncharacterized protein LOC1110257641.0e-14991.22Show/hide
Query:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN
        SRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPL+TPIQPVQSDYCTPAPVC+ NDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFN
Subjt:  SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFN

Query:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
        PYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPPPQQ+Y QRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG
Subjt:  PYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLG

Query:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR-----KKKIG
        QLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKIPENPTTPEK NIRKGNED+PSVPP+     KK IG
Subjt:  QLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDSPSVPPR-----KKKIG

A0A6J1E110 uncharacterized protein LOC1110254248.5e-8062.5Show/hide
Query:  MVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  +S                    D  C+      +  +CP  P        GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNKQ YVP TQQY PPPQQ Y QR QTPPVQNNNSNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKQPYVPPTQQYIPPPQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIP----ENPTTPEKENIRKGNEDSPSV--PPRKKKIGEYELVAMTKCSSEAVG
        KE CKAVTLRSGL YD PTMPTTDVQI STEP +  P    + P     +    G+ D+  V          EY L+ + +   +A+G
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIP----ENPTTPEKENIRKGNEDSPSV--PPRKKKIGEYELVAMTKCSSEAVG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCATTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGACAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GTAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGACATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATATCTAGGG
CAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAGGAGATGGCATTGGGA
ATAAAAAATCCATTATCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCAAGGCCAACGATCTCATTTGTTCATTCTGCAGTGAAAACCA
TATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGAAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACC
ACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGCCA
CCGCAACAGCAGTACACTCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACA
ATCCCAAGCGGCATCAATGAGAAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAAC
GAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTA
AAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACAGCCCGAGTGTTCCTCCACGGAAGAAAAAAATAGGAGAGTATGAACTGGT
AGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAAGAGGAAAAAACTTAGGAG
ACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGA
CCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGTCGTCGCA
TTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACTTATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGC
CTCCAGCGCTAGCTGCTATCCTCGGTCATCCATCTCCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCATTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGACAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GTAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCGAGACATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATATCTAGGG
CAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAGGAGATGGCATTGGGA
ATAAAAAATCCATTATCCACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCAAGGCCAACGATCTCATTTGTTCATTCTGCAGTGAAAACCA
TATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGAAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACC
ACCCTAATTTCTCATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGCCA
CCGCAACAGCAGTACACTCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACA
ATCCCAAGCGGCATCAATGAGAAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAAC
GAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTA
AAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACAGCCCGAGTGTTCCTCCACGGAAGAAAAAAATAGGAGAGTATGAACTGGT
AGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAATGATCCTGGCAGTTTTACCATTCCATGTTCCATAAGAGGAAAAAACTTAGGAG
ACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGA
CCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGTCGTCGCA
TTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCAGTTCGAGAAGTCGTGCAACTTATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGC
CTCCAGCGCTAGCTGCTATCCTCGGTCATCCATCTCCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MGGARRLGSLQKNWFSSNFALNETRLPMHFGGSNRCIRVEEVFHYQFEHDLGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKERQGEISPESEVESTSTSM
VDIPPRDPVDPPAVNGNMRDHARHDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALG
IKNPLSTPIQPVQSDYCTPAPVCKANDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPP
PQQQYTQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIV
KIPENPTTPEKENIRKGNEDSPSVPPRKKKIGEYELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIRGKNLGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEG
PEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLSSHLKYAYLGDNDTLPVREVVQLIYNLRASLDFAVLPSWPPALAAILGHPSPSTDTDPSPQPPTS