CuGenDBv2

Moc06g26780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g26780
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome location: chr6:20187492..20190285
RNA-Seq Expression: Moc06g26780
Synteny: Moc06g26780
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia] | 1.5e-113 | 92.38
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMN+RLKEMALGIKNPLA PIQPVQ D+CTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV
        VNDLICSFCSENHIYDNCPHNPASVFYV HG+NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSG+NQGQSQQNKQPYVP TQQYI P QQQYNQRT+T  V
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV

Query:  QNNSSNLENMMKKYMARTDAVIQ
        QNN+SNLENMMK+YMARTD VIQ
Subjt:  QNNSSNLENMMKKYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia] | 2.6e-113 | 70.76
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMN+ LKE+AL  K+    P QP  +    
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRT---QQYISPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q + PP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRT---QQYISPPQQQY

Query:  --NQRTQTSPVQNNSSNLENMMKKYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
          NQRT + P  NN+++LENM K+YMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  --NQRTQTSPVQNNSSNLENMMKKYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] | 1.4e-183 | 63.65
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMVDNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQM DNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMVDNRDVAMREYAAT

Query:  TFQNFDSGI-------------------------------------------------------------------------------------------
         FQNFDSGI                                                                                           
Subjt:  TFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMN+RLKEMALGIKNPLAT IQPVQSD+CT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSG+NQGQSQQNKQPYVP TQQ+I PPQQQYNQRTQT P+QNN+SNLENMMK+YMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+PTVKIPEN TTPEKEN RKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia] | 3.9e-77 | 54.85
Query:  MVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMN+RLKEMAL IK  ++                    D  C+      +  +CP  P        G+NRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        G+NQGQSQQNKQ YVP TQQY  PPQQ YNQR QT PVQNN+SNLEN MK+YMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITSLNPVMFDEFYDLLVTEIEEELDKIAEGPEYVTNPVEKIQ
        KE CKAVTLRSGL Y+ PTMPTTDVQI STEPT                                                                   
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITSLNPVMFDEFYDLLVTEIEEELDKIAEGPEYVTNPVEKIQ

Query:  KEECKSLLPSIVEPPTLEHKPLPSHLKYAF
                  IVEPPTLE KPLPSHLKYA+
Subjt:  KEECKSLLPSIVEPPTLEHKPLPSHLKYAF

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] | 6.0e-163 | 92.04
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMN+RLKEM LG+KNPLATPIQPVQSD+CTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHG+NRNFNPYSNTYNPGWRHHPNFSW GQGGS G+NQGQSQQNKQPYVP TQQYI PPQQ+YNQRTQT PV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV

Query:  QNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPE
        QNN+SNLENMMK+YMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST PTVKIPE
Subjt:  QNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPE

Query:  NSTTPEKENTRKGD
        N TTPEK N RKG+
Subjt:  NSTTPEKENTRKGD

TrEMBL top hits | e value | % identity
A0A6J1CR45 uncharacterized protein LOC111013464 | 7.3e-114 | 92.38
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMN+RLKEMALGIKNPLA PIQPVQ D+CTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV
        VNDLICSFCSENHIYDNCPHNPASVFYV HG+NRNFNPYSNTYNPGWRHHPNFSWGGQGGSSG+NQGQSQQNKQPYVP TQQYI P QQQYNQRT+T  V
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV

Query:  QNNSSNLENMMKKYMARTDAVIQ
        QNN+SNLENMMK+YMARTD VIQ
Subjt:  QNNSSNLENMMKKYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC111018514 | 1.2e-113 | 70.76
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMN+ LKE+AL  K+    P QP  +    
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRT---QQYISPPQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q + PP QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRT---QQYISPPQQQY

Query:  --NQRTQTSPVQNNSSNLENMMKKYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
          NQRT + P  NN+++LENM K+YMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  --NQRTQTSPVQNNSSNLENMMKKYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

A0A6J1DW02 uncharacterized protein LOC111024897 | 6.6e-184 | 63.65
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMVDNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQM DNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMVDNRDVAMREYAAT

Query:  TFQNFDSGI-------------------------------------------------------------------------------------------
         FQNFDSGI                                                                                           
Subjt:  TFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMN+RLKEMALGIKNPLAT IQPVQSD+CT APVCQVNDLIC                           
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSG+NQGQSQQNKQPYVP TQQ+I PPQQQYNQRTQT P+QNN+SNLENMMK+YMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+PTVKIPEN TTPEKEN RKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECS

A0A6J1DYG0 uncharacterized protein LOC111025764 | 2.9e-163 | 92.04
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMN+RLKEM LG+KNPLATPIQPVQSD+CTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV
        VNDLICSFCSENHIYD CPHNPASVFYVGHG+NRNFNPYSNTYNPGWRHHPNFSW GQGGS G+NQGQSQQNKQPYVP TQQYI PPQQ+YNQRTQT PV
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPV

Query:  QNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPE
        QNN+SNLENMMK+YMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST PTVKIPE
Subjt:  QNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPE

Query:  NSTTPEKENTRKGD
        N TTPEK N RKG+
Subjt:  NSTTPEKENTRKGD

A0A6J1E110 uncharacterized protein LOC111025424 | 1.1e-77 | 55.15
Query:  MVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMN+RLKEMAL IK  ++                    D  C+      +  +CP  P        G+NRNFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG
        G+NQGQSQQNKQ YVP TQQY  PPQQ YNQR QT PVQNN+SNLEN MK+YMARTDAVIQSQAASMRNFETQLG LAN LKNRPQGSF GHTELPK EG
Subjt:  GYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREG

Query:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITSLNPVMFDEFYDLLVTEIEEELDKIAEGPEYVTNPVEKIQ
        KE CKAVTLRSGL YD PTMPTTDVQI STEPT                                                                   
Subjt:  KEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEECSAITSLNPVMFDEFYDLLVTEIEEELDKIAEGPEYVTNPVEKIQ

Query:  KEECKSLLPSIVEPPTLEHKPLPSHLKYAF
                  IVEPPTLE KPLPSHLKYA+
Subjt:  KEECKSLLPSIVEPPTLEHKPLPSHLKYAF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCAAGAAATGATGAATTCAACCATATTCAGATGGTGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGACTTTTCAGAACTTTGATTCAGGGATAATA
GAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGA
CTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAG
AGATGGTTACAATGAACAAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACACCGATACAACCTGTGCAGTCGGATTTTTGCACTCCTGCCCCTGTT
TGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAGCAATAGGAA
CTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTATAATCAAGGGCAGAGCCAGCAGA
ACAAACAGCCCTATGTTCCACGTACACAACAATACATCTCGCCGCCACAACAGCAGTACAATCAGAGAACACAGACTTCACCAGTTCAAAATAACAGCTCAAATCTTGAG
AATATGATGAAGAAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAA
TAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAGCGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAA
TGCCAACAACAGATGTACAGATTCCATCCACTGAACCAACTGTAAAGATACCAGAAAATTCAACAACACCAGAAAAAGAAAATACTAGAAAAGGAGACTTTGAAGAGTGC
TCTGCTATAACTAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAATATGTGAC
CAATCCTGTTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCATAAGCCATTGCCGTCGCATTTGAAATATGCAT
TTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAG
mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCAAGAAATGATGAATTCAACCATATTCAGATGGTGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGACTTTTCAGAACTTTGATTCAGGGATAATA
GAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGA
CTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAG
AGATGGTTACAATGAACAAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACACCGATACAACCTGTGCAGTCGGATTTTTGCACTCCTGCCCCTGTT
TGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAGCAATAGGAA
CTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTATAATCAAGGGCAGAGCCAGCAGA
ACAAACAGCCCTATGTTCCACGTACACAACAATACATCTCGCCGCCACAACAGCAGTACAATCAGAGAACACAGACTTCACCAGTTCAAAATAACAGCTCAAATCTTGAG
AATATGATGAAGAAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAA
TAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAGCGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAA
TGCCAACAACAGATGTACAGATTCCATCCACTGAACCAACTGTAAAGATACCAGAAAATTCAACAACACCAGAAAAAGAAAATACTAGAAAAGGAGACTTTGAAGAGTGC
TCTGCTATAACTAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAATATGTGAC
CAATCCTGTTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCATAAGCCATTGCCGTCGCATTTGAAATATGCAT
TTTGGAGAAGCACAAAAAGGCCATTGGATGGACGATAG
Protein sequence
MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMVDNRDVAMREYAATTFQNFDSGII
EHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNKRLKEMALGIKNPLATPIQPVQSDFCTPAPV
CQVNDLICSFCSENHIYDNCPHNPASVFYVGHGSNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGYNQGQSQQNKQPYVPRTQQYISPPQQQYNQRTQTSPVQNNSSNLE
NMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPTVKIPENSTTPEKENTRKGDFEEC
SAITSLNPVMFDEFYDLLVTEIEEELDKIAEGPEYVTNPVEKIQKEECKSLLPSIVEPPTLEHKPLPSHLKYAFWRSTKRPLDGR