CuGenDBv2

Moc01g16970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g16970
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome location: chr1:11352324..11358490
RNA-Seq Expression: Moc01g16970
Synteny: Moc01g16970
Gene Ontology terms: NA
InterPro domains: NA
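The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr1:11352324..11358490")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene span on chr1 is 6167 bp.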


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia] (e-value: 2.7e-114; % identity: 93.72)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK+MALGIKNPL  PIQPVQ DY TPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPHN ASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNK+PYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTNAVIQ
        QNNNSNLENMMKEYMART+ VIQ
Subjt:  QNNNSNLENMMKEYMARTNAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia] (e-value: 8.6e-113; % identity: 71.1)
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LK++AL  K+    P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHN AS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ K+PYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMART-------NAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P  NNN++LENM KEYMAR        NA IQ+QAA MRN E Q+GQ A +LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMART-------NAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] (e-value: 2.3e-158; % identity: 59.93)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVTDIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTS+ DIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRD         
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVTDIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK+MALGIKNPL T IQPVQSDY T APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNK+PYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMART+AVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFE

Query:  TQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV
        TQLG +A ELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDV
Subjt:  TQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia] (e-value: 8.7e-73; % identity: 69.78)
Query:  MVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLK+MAL IK  +                     D  C+      +  +CP           GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNK+ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMART+AVIQSQAASMRNFETQLG +A  LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDV
        KE CKAVTLRSGL Y+ PTMPTTDV
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDV

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] (e-value: 8.0e-151; % identity: 93.38)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLK+M LG+KNPL TPIQPVQSDY TPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHN ASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNK+PYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV
        QNNNSNLENMMKEYMART+AVIQSQAASMRNFETQLGQ+A ELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DV
Subjt:  QNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CR45 uncharacterized protein LOC111013464 (e-value: 1.3e-114; % identity: 93.72)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK+MALGIKNPL  PIQPVQ DY TPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYDNCPHN ASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNK+PYVPPTQQYIPP QQQYNQRT+TP V
Subjt:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTNAVIQ
        QNNNSNLENMMKEYMART+ VIQ
Subjt:  QNNNSNLENMMKEYMARTNAVIQ

A0A6J1DAE9 uncharacterized protein LOC111018514 (e-value: 4.2e-113; % identity: 71.1)
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LK++AL  K+    P  P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPT---QQYIPPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHN AS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ K+PYVP T    Q +PPP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPT---QQYIPPPQQQY

Query:  NQRTQTP--PVQNNNSNLENMMKEYMART-------NAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
        NQ  +TP  P  NNN++LENM KEYMAR        NA IQ+QAA MRN E Q+GQ A +LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  NQRTQTP--PVQNNNSNLENMMKEYMART-------NAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

A0A6J1DW02 uncharacterized protein LOC111024897 (e-value: 1.1e-158; % identity: 59.93)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVTDIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTS+ DIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRD         
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVTDIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRD---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK+MALGIKNPL T IQPVQSDY T APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNK+PYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMART+AVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFE

Query:  TQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV
        TQLG +A ELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDV
Subjt:  TQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV

A0A6J1DYG0 uncharacterized protein LOC111025764 (e-value: 3.9e-151; % identity: 93.38)
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLK+M LG+KNPL TPIQPVQSDY TPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV
        VNDLICSFCSENHIYD CPHN ASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNK+PYVPPTQQYIPPPQQ+YNQRTQTPPV
Subjt:  VNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPV

Query:  QNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV
        QNNNSNLENMMKEYMART+AVIQSQAASMRNFETQLGQ+A ELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DV
Subjt:  QNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV

A0A6J1E110 uncharacterized protein LOC111025424 (e-value: 2.5e-73; % identity: 70.22)
Query:  MVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLK+MAL IK  +                     D  C+      +  +CP           GNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNCPHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREG
        GFNQGQSQQNK+ YVP TQQY PPPQQ YNQR QTPPVQNNNSNLEN MKEYMART+AVIQSQAASMRNFETQLG +A  LKNRPQGSF GHTELPK EG
Subjt:  GFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAASMRNFETQLGQVAIELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDV
        KE CKAVTLRSGL YD PTMPTTDV
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAGTCTCATGCCGACAACTTAGGTCAAGCTGGAAGCCCTATTACTACCCGCTCCCTGATTTCTCAAGTCATTCTAGGACTTGATGAGGAATACAATCCAGTTGTGGC
TATGATCCAAGGACGGGCATGGATAACTTGTGGTCAAAAACAAAATCATCGTTCAACAGCCACCAAAGCTCTAGCAGTAACTCGAGAAGAGGAAATGGTTTCGGAAGAGG
CCAAGGAAGATGGAATGGAAACAATAATCGTCTTATTTGCTAGAAGCCCAAATCTCTTTGCAAAAAATCAGAACAGTGGTCCAAATAATAGTGGCACATCTCCAAGTGCT
ACCCAGGCGTTCATGACTGCACAGAAGGGAAGTTCAAATATGTTTGTTGCTAATCCCGAGTCTGTTGCTGATCCCAACTGGACCATTTGGGAGTGCAAATTAATCAAAAG
AAACAAAAAGACGGAAAAACATCATGGGAGGCGTCAGACGCCTGCGTGGCTGCAGAAAGACTGGTTTTCTTCCAAGTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTT
TTGGTGGTTCCAACCGATGCATACGTGTAGAAGACGTGTTCCACTATCAGTTTGAGCACGATTTAGTCAGTCAAAGGAGTTGGAAATGGGGTAAGTTAAAGTGCATGAGC
ACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGG
TGAAATTAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAGTGACAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATG
CAAGAAATGATGAATTCAACCATATACAGATGGCGGACAACAGAGACATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAAC
GGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCA
AGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAAAGATGGCGTTGGGAATAAAAAATCCATTGC
CCACGCCGATACAACCTGTGCAGTCGGATTATTTCACTCCTGCCCCTGTTTGCCAAGTCAATGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGT
CCACATAACCATGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATG
GGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGGAGCCCTATGTTCCACCTACACAGCAATACATCCCTCCACCGCAACAGCAGTACA
ATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCAATGCAGTGATACAATCCCAAGCGGCATCA
ATGAGGAATTTCGAGACCCAATTGGGACAGGTCGCCATTGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACA
GTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTATAG
mRNA sequence
ATGAAGTCTCATGCCGACAACTTAGGTCAAGCTGGAAGCCCTATTACTACCCGCTCCCTGATTTCTCAAGTCATTCTAGGACTTGATGAGGAATACAATCCAGTTGTGGC
TATGATCCAAGGACGGGCATGGATAACTTGTGGTCAAAAACAAAATCATCGTTCAACAGCCACCAAAGCTCTAGCAGTAACTCGAGAAGAGGAAATGGTTTCGGAAGAGG
CCAAGGAAGATGGAATGGAAACAATAATCGTCTTATTTGCTAGAAGCCCAAATCTCTTTGCAAAAAATCAGAACAGTGGTCCAAATAATAGTGGCACATCTCCAAGTGCT
ACCCAGGCGTTCATGACTGCACAGAAGGGAAGTTCAAATATGTTTGTTGCTAATCCCGAGTCTGTTGCTGATCCCAACTGGACCATTTGGGAGTGCAAATTAATCAAAAG
AAACAAAAAGACGGAAAAACATCATGGGAGGCGTCAGACGCCTGCGTGGCTGCAGAAAGACTGGTTTTCTTCCAAGTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTT
TTGGTGGTTCCAACCGATGCATACGTGTAGAAGACGTGTTCCACTATCAGTTTGAGCACGATTTAGTCAGTCAAAGGAGTTGGAAATGGGGTAAGTTAAAGTGCATGAGC
ACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGG
TGAAATTAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAGTGACAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATG
CAAGAAATGATGAATTCAACCATATACAGATGGCGGACAACAGAGACATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAAC
GGAGCCTTTACAAAGAAGACATTCAACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCA
AGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAAAGATGGCGTTGGGAATAAAAAATCCATTGC
CCACGCCGATACAACCTGTGCAGTCGGATTATTTCACTCCTGCCCCTGTTTGCCAAGTCAATGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGT
CCACATAACCATGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATG
GGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGGAGCCCTATGTTCCACCTACACAGCAATACATCCCTCCACCGCAACAGCAGTACA
ATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCAATGCAGTGATACAATCCCAAGCGGCATCA
ATGAGGAATTTCGAGACCCAATTGGGACAGGTCGCCATTGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACA
GTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTATAG
Protein sequence
MKSHADNLGQAGSPITTRSLISQVILGLDEEYNPVVAMIQGRAWITCGQKQNHRSTATKALAVTREEEMVSEEAKEDGMETIIVLFARSPNLFAKNQNSGPNNSGTSPSA
TQAFMTAQKGSSNMFVANPESVADPNWTIWECKLIKRNKKTEKHHGRRQTPAWLQKDWFSSKFALNETRLPMRFGGSNRCIRVEDVFHYQFEHDLVSQRSWKWGKLKCMS
TRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSVTDIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDIEHFFRGLDHPTKMMLNNAAN
GAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKKMALGIKNPLPTPIQPVQSDYFTPAPVCQVNDLICSFCSENHIYDNC
PHNHASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKEPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTNAVIQSQAAS
MRNFETQLGQVAIELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDV
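
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper, not a CuGenDB function, and unknown codons are reported as `X` by choice.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt and last 12 nt of the CDS listed in this record.
print(translate("ATGAAGTCTCATGCCGACAACTTAGGTCAAGCTGGAAGC"))
print(translate("ACAGATGTATAG"))
```

The two fragments translate to `MKSHADNLGQAGS` and `TDV`, which agree with the start and end of the protein sequence in this record. Passing the full CDS (line breaks included) performs the same check over the whole sequence.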