; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g19980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g19980
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr10:14753637..14756429
RNA-Seq ExpressionMoc10g19980
SyntenyMoc10g19980
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]1.1e-8174.22Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEM LGIKNPLA  IQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP
        VNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYNP                        +  +Q  +  T   I P  +         TP
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP

Query:  PVHNNNSNLENMMKEYMARTDAVIQ
         V NNNSNLENMMKEYMARTD VIQ
Subjt:  PVHNNNSNLENMMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]1.0e-9061.13Show/hide
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+ L  K   +    P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIH--PTTATTA
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNP                        +  +Q  + ST+++    P      
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIH--PTTATTA

Query:  VQSENT-TPPVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
         Q++ T + P HNNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  VQSENT-TPPVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]2.2e-16562.12Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------
        AFQNFDSGIVNPIPAH NFELKPM+                                                                           
Subjt:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------

Query:  ------------------------------------------------------------GLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                                    GLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------------GLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEM LGIKNPLAT IQPVQSDYCT APVCQVNDLIC             H+P +  + G G +
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPEPTEQAALCSTYTAIHPTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHT
          FN   +  N +P          T  H             TPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHT
Subjt:  RNFNPYSNTYNPEPTEQAALCSTYTAIHPTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHT

Query:  ELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKKILEK
        ELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEK+ + K
Subjt:  ELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKKILEK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]4.2e-5237.14Show/hide
Query:  PPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVN----PIPAHANFELKPMIGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLAS
        PP  N ++R     +E    +  +N      E    A++ F   I+N     IPA    E     G D  TKMMLN AANG FT K+FNEIV+IL+ L+ 
Subjt:  PPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVN----PIPAHANFELKPMIGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLAS

Query:  HNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYV
        HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK ME    N  A S     +    P+PV Q+ +  C +C + H  +NCP NP+S++YV
Subjt:  HNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYV

Query:  GHGNNRNFNPYSNTYNPEPTEQ-----AALCSTYTAIH-------------------PTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQS--
        G  N + FNPYSNTYNP   +      +   S+ T  H                   P T     Q +N   P   N SN+E +MKE + + DA ++   
Subjt:  GHGNNRNFNPYSNTYNPEPTEQ-----AALCSTYTAIH-------------------PTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQS--

Query:  -------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTP
                              ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP      PS E      +    P
Subjt:  -------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTP

Query:  EKKILEKVMRTP
        + KI+E  +  P
Subjt:  EKKILEKVMRTP

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]2.9e-13079.3Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLAT IQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNP                        +  +Q  +  T   I P            TP
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP

Query:  PVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKI
        PV NNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKI
Subjt:  PVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKI

Query:  PENPTTPEKKILEK
        PENPTTPEK  + K
Subjt:  PENPTTPEKKILEK

TrEMBL top hitse value%identityAlignment
A0A6J1CR45 uncharacterized protein LOC1110134645.5e-8274.22Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEM LGIKNPLA  IQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP
        VNDLICSFCSENHIYDNCPHNPASVFYV HGNNRNFNPYSNTYNP                        +  +Q  +  T   I P  +         TP
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP

Query:  PVHNNNSNLENMMKEYMARTDAVIQ
         V NNNSNLENMMKEYMARTD VIQ
Subjt:  PVHNNNSNLENMMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC1110185144.9e-9161.13Show/hide
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AGVLALDIATSMQKEM TMNQ LKE+ L  K   +    P Q +Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIH--PTTATTA
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YVGHG NR FNPYSNTYNP                        +  +Q  + ST+++    P      
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIH--PTTATTA

Query:  VQSENT-TPPVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM
         Q++ T + P HNNN++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+GP M
Subjt:  VQSENT-TPPVHNNNSNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTM

Query:  P
        P
Subjt:  P

A0A6J1DW02 uncharacterized protein LOC1110248971.0e-16562.12Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------
        AFQNFDSGIVNPIPAH NFELKPM+                                                                           
Subjt:  AFQNFDSGIVNPIPAHANFELKPMI---------------------------------------------------------------------------

Query:  ------------------------------------------------------------GLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                                    GLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------------GLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEM LGIKNPLAT IQPVQSDYCT APVCQVNDLIC             H+P +  + G G +
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPEPTEQAALCSTYTAIHPTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHT
          FN   +  N +P          T  H             TPP+ NNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHT
Subjt:  RNFNPYSNTYNPEPTEQAALCSTYTAIHPTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHT

Query:  ELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKKILEK
        ELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEK+ + K
Subjt:  ELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKKILEK

A0A6J1DY39 uncharacterized protein LOC1110256532.0e-5237.14Show/hide
Query:  PPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVN----PIPAHANFELKPMIGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLAS
        PP  N ++R     +E    +  +N      E    A++ F   I+N     IPA    E     G D  TKMMLN AANG FT K+FNEIV+IL+ L+ 
Subjt:  PPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVN----PIPAHANFELKPMIGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLAS

Query:  HNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYV
        HN  WCS++SR   K+ DPAGVLALD  TSMQK++ T+ Q LK ME    N  A S     +    P+PV Q+ +  C +C + H  +NCP NP+S++YV
Subjt:  HNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYV

Query:  GHGNNRNFNPYSNTYNPEPTEQ-----AALCSTYTAIH-------------------PTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQS--
        G  N + FNPYSNTYNP   +      +   S+ T  H                   P T     Q +N   P   N SN+E +MKE + + DA ++   
Subjt:  GHGNNRNFNPYSNTYNPEPTEQ-----AALCSTYTAIH-------------------PTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDAVIQS--

Query:  -------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTP
                              ++R  E QLGQL NE++ RPQGS P  TE P+R GKE C ++  RSGL Y+GP MP      PS E      +    P
Subjt:  -------------------QAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTP

Query:  EKKILEKVMRTP
        + KI+E  +  P
Subjt:  EKKILEKVMRTP

A0A6J1DY39 uncharacterized protein LOC1110256531.9e-0245.16Show/hide
Query:  ARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMIGLDHPTKMMLNN
        A N   N I +AD RD AMR+YAA   ++ +S ++N  PA A FE KPM+        MLNN
Subjt:  ARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHANFELKPMIGLDHPTKMMLNN

A0A6J1DYG0 uncharacterized protein LOC1110257641.4e-13079.3Show/hide
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLAT IQPVQSDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPVQSDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP
        VNDLICSFCSENHIYD CPHNPASVFYVGHGNNRNFNPYSNTYNP                        +  +Q  +  T   I P            TP
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNP------------------------EPTEQAALCSTYTAIHPTTATTAVQSENTTP

Query:  PVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKI
        PV NNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIPST P VKI
Subjt:  PVHNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKI

Query:  PENPTTPEKKILEK
        PENPTTPEK  + K
Subjt:  PENPTTPEKKILEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCGAGAAATGATGAATTCAACTATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTC
AACCCTATTCCAGCCCATGCAAACTTTGAGCTTAAACCAATGATAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGAC
ATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGG
CTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGAATTGGGAATAAAAAATCCATTAGCCACGTCGATACAACCTGTG
CAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAATTGTCCACATAACCCTGCTTCCGT
TTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATC
CCACCACCGCAACAACAGCAGTACAATCAGAGAACACAACTCCACCAGTTCACAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCA
GTGATACAATCTCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATT
ACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAAC
CAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGCTTGAATCCTATTATGTTTGAT
GAGTTTTATGACTCGCTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAA
GTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGAT
AG
mRNA sequenceShow/hide mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
AGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCGAGAAATGATGAATTCAACTATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTC
AACCCTATTCCAGCCCATGCAAACTTTGAGCTTAAACCAATGATAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGAC
ATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGG
CTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGAATTGGGAATAAAAAATCCATTAGCCACGTCGATACAACCTGTG
CAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAATTGTCCACATAACCCTGCTTCCGT
TTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATC
CCACCACCGCAACAACAGCAGTACAATCAGAGAACACAACTCCACCAGTTCACAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCA
GTGATACAATCTCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATT
ACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAAC
CAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGCTTGAATCCTATTATGTTTGAT
GAGTTTTATGACTCGCTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAA
GTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGAT
AG
Protein sequenceShow/hide protein sequence
MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIV
NPIPAHANFELKPMIGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMELGIKNPLATSIQPV
QSDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPEPTEQAALCSTYTAIHPTTATTAVQSENTTPPVHNNNSNLENMMKEYMARTDA
VIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKKILEKVMRTPRVFLHSLNPIMFD
EFYDSLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAFWRSTKRPLDGR