; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g17870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g17870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022160
Genome locationchr4:13113608..13116337
RNA-Seq ExpressionMoc04g17870
SyntenyMoc04g17870
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia]2.5e-10992.99Show/hide
Query:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC
        AFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAG+LALDIATSMQKEMVTMNQRLKEMALGIK+PLA PIQPVQ D+CTPAPVCQVNDLICSFC
Subjt:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC

Query:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN
        SENHIYDN PHNPASVFYV HGNN+NFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIP  QQQYNQRT+TP VQNN+SNLEN
Subjt:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN

Query:  MMKEYMARTDAVIQ
        MMKEYMARTD VIQ
Subjt:  MMKEYMARTDAVIQ

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]1.5e-10659.09Show/hide
Query:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC
        AFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AG+LALDIATSMQKEM TMNQ LKE+AL  KS    P QP  +     +PVCQ+N+++CS+C
Subjt:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC

Query:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPSPQQQYNQRTQTP--PVQNNS
        S+NH+Y+N PHNPAS +YVGHG N+ FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +P P QQYNQ  +TP  P  NN+
Subjt:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPSPQQQYNQRTQTP--PVQNNS

Query:  SNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAE
        ++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQG F            E+C A+T  + + +                   E
Subjt:  SNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAE

Query:  GPEDVTNPIEKIQKEECKSLLPSIILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIESQRRLNLKMKE
        GP+                 +P  +L K+KKAIGWTIADI+GISPAFCM+KILLEE A+NSIE+QRRLN KMKE
Subjt:  GPEDVTNPIEKIQKEECKSLLPSIILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIESQRRLNLKMKE

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]4.2e-14159.55Show/hide
Query:  MADIPPRDPVDPPAVNGNMRDHARNDEFNHIQM------AMREYAATAFQNFDSRIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
        MADIPPRDPVDPPAVNGNMRDHARNDEFN+IQM      AMREYAATAFQNFDS IVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
Subjt:  MADIPPRDPVDPPAVNGNMRDHARNDEFNHIQM------AMREYAATAFQNFDSRIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI

Query:  ANAFRLP---------------------------------------------------------------------------------------------
        ANAFRLP                                                                                             
Subjt:  ANAFRLP---------------------------------------------------------------------------------------------

Query:  -----------------------AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQ
                               AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAG+LALDIATSMQKEMVTMNQRLKEMALGIK+PLAT IQ
Subjt:  -----------------------AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQ

Query:  PVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPS
        PVQSD+CT APVCQVNDLIC                                        WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IP 
Subjt:  PVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPS

Query:  PQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFD
        PQQQYNQRTQTPP+QNN+SNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQG F            E+C A+T  + + +D
Subjt:  PQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFD

XP_022158979.1 uncharacterized protein LOC111025424 [Momordica charantia]3.7e-7657.32Show/hide
Query:  MVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++       +D       C V  L    C                     GNN+NFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF-----------
        GFNQGQSQQNKQ YVP TQQY P PQQ YNQR QTPPVQNN+SNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQG F           
Subjt:  GFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF-----------

Query:  -EECSAITNLNLVMFD-EFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEEC---KSLLPSI---------------ILEKHKKAIGWTIADIRGISPA
         E C A+T  + + ++         +I      + E P     P+    K         LP I               ILEKHKKAIGWTIADIRGIS  
Subjt:  -EECSAITNLNLVMFD-EFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEEC---KSLLPSI---------------ILEKHKKAIGWTIADIRGISPA

Query:  FCMHKILLEEDAKNSIESQRRLNLKMKE
        FCMHKILLEEDAKNSIESQRRLN KMKE
Subjt:  FCMHKILLEEDAKNSIESQRRLNLKMKE

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia]3.6e-14878.59Show/hide
Query:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC
        AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAG+LALDIA+SMQKE VTMNQRLKEM LG+K+PLATPIQPVQSD+CTPAPVCQVNDLICSFC
Subjt:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC

Query:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN
        SENHIYD  PHNPASVFYVGHGNN+NFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIP PQQ+YNQRTQTPPVQNN+SNLEN
Subjt:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN

Query:  MMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKI
        MMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQG F            E+C A+T  + + +DE                 + PE+ T P EK 
Subjt:  MMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKI

Query:  QKEECKSLLPSI---ILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIES
           +     PS+   ILEKHKKAIGWTIADIRGISPAFCMHKILLEED KNSIES
Subjt:  QKEECKSLLPSI---ILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIES

TrEMBL top hitse value%identityAlignment
A0A6J1CR45 uncharacterized protein LOC1110134641.2e-10992.99Show/hide
Query:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC
        AFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAG+LALDIATSMQKEMVTMNQRLKEMALGIK+PLA PIQPVQ D+CTPAPVCQVNDLICSFC
Subjt:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC

Query:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN
        SENHIYDN PHNPASVFYV HGNN+NFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIP  QQQYNQRT+TP VQNN+SNLEN
Subjt:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN

Query:  MMKEYMARTDAVIQ
        MMKEYMARTD VIQ
Subjt:  MMKEYMARTDAVIQ

A0A6J1DAE9 uncharacterized protein LOC1110185147.4e-10759.09Show/hide
Query:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC
        AFTKKTFNEIVDIL DLASHNELWCSQRS+ APKKQD AG+LALDIATSMQKEM TMNQ LKE+AL  KS    P QP  +     +PVCQ+N+++CS+C
Subjt:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC

Query:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPSPQQQYNQRTQTP--PVQNNS
        S+NH+Y+N PHNPAS +YVGHG N+ FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +P P QQYNQ  +TP  P  NN+
Subjt:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPSPQQQYNQRTQTP--PVQNNS

Query:  SNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAE
        ++LENM KEYMAR DA+       IQ+QAA MRN E Q+GQ AN+LK RPQG F            E+C A+T  + + +                   E
Subjt:  SNLENMMKEYMARTDAV-------IQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAE

Query:  GPEDVTNPIEKIQKEECKSLLPSIILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIESQRRLNLKMKE
        GP+                 +P  +L K+KKAIGWTIADI+GISPAFCM+KILLEE A+NSIE+QRRLN KMKE
Subjt:  GPEDVTNPIEKIQKEECKSLLPSIILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIESQRRLNLKMKE

A0A6J1DW02 uncharacterized protein LOC1110248972.1e-14159.55Show/hide
Query:  MADIPPRDPVDPPAVNGNMRDHARNDEFNHIQM------AMREYAATAFQNFDSRIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
        MADIPPRDPVDPPAVNGNMRDHARNDEFN+IQM      AMREYAATAFQNFDS IVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI
Subjt:  MADIPPRDPVDPPAVNGNMRDHARNDEFNHIQM------AMREYAATAFQNFDSRIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQI

Query:  ANAFRLP---------------------------------------------------------------------------------------------
        ANAFRLP                                                                                             
Subjt:  ANAFRLP---------------------------------------------------------------------------------------------

Query:  -----------------------AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQ
                               AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAG+LALDIATSMQKEMVTMNQRLKEMALGIK+PLAT IQ
Subjt:  -----------------------AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQ

Query:  PVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPS
        PVQSD+CT APVCQVNDLIC                                        WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IP 
Subjt:  PVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPS

Query:  PQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFD
        PQQQYNQRTQTPP+QNN+SNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQG F            E+C A+T  + + +D
Subjt:  PQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFD

A0A6J1DYG0 uncharacterized protein LOC1110257641.7e-14878.59Show/hide
Query:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC
        AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAG+LALDIA+SMQKE VTMNQRLKEM LG+K+PLATPIQPVQSD+CTPAPVCQVNDLICSFC
Subjt:  AFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFC

Query:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN
        SENHIYD  PHNPASVFYVGHGNN+NFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIP PQQ+YNQRTQTPPVQNN+SNLEN
Subjt:  SENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLEN

Query:  MMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKI
        MMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQG F            E+C A+T  + + +DE                 + PE+ T P EK 
Subjt:  MMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF------------EECSAITNLNLVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKI

Query:  QKEECKSLLPSI---ILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIES
           +     PS+   ILEKHKKAIGWTIADIRGISPAFCMHKILLEED KNSIES
Subjt:  QKEECKSLLPSI---ILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIES

A0A6J1E110 uncharacterized protein LOC1110254241.0e-7657.62Show/hide
Query:  MVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSS
        MVTMNQRLKEMAL IK  ++       +D       C V  L    C                     GNN+NFNPYSNTYNPGWRHHPNFSWGGQGGSS
Subjt:  MVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYVGHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSS

Query:  GFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF-----------
        GFNQGQSQQNKQ YVP TQQY P PQQ YNQR QTPPVQNN+SNLEN MKEYMARTDAVIQSQAASMRNFETQLG LAN LKNRPQG F           
Subjt:  GFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGDF-----------

Query:  -EECSAITNLNLVMFD-EFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEEC---KSLLPSI---------------ILEKHKKAIGWTIADIRGISPA
         E C A+T  + + +D         +I      + E P     P+    K         LP I               ILEKHKKAIGWTIADIRGIS  
Subjt:  -EECSAITNLNLVMFD-EFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEEC---KSLLPSI---------------ILEKHKKAIGWTIADIRGISPA

Query:  FCMHKILLEEDAKNSIESQRRLNLKMKE
        FCMHKILLEEDAKNSIESQRRLN KMKE
Subjt:  FCMHKILLEEDAKNSIESQRRLNLKMKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCAATGCGAGA
ATATGCCGCCACGGCTTTTCAGAACTTTGATTCAAGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTG
GACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTAAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGCCTTTACAAAGAAGACATTCAACGAG
ATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGACTTTTGGCTCTGGACAT
TGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAGTCCATTAGCCACACCGATACAACCTGTGCAGTCGGATT
TTTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTTTCCACATAACCCTGCTTCTGTTTTTTATGTA
GGACATGGGAACAATAAGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAA
TCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGTCGCCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAA
ATAACAGTTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAG
CTCGCCAATGAATTGAAGAATAGACCACAAGGAGATTTTGAAGAATGCTCTGCTATAACTAACTTGAATCTTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGA
GATTGAAGAAGAGCTTGATAAGATGGCAGAAGGACCGGAAGATGTGACTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAATTTTGG
AGAAGCACAAAAAGGCCATTGGATGGACGATAGCAGATATCCGAGGGATAAGCCCGGCCTTTTGCATGCACAAAATTCTATTGGAAGAAGATGCTAAGAACTCTATTGAG
AGTCAAAGGCGGTTGAACCTGAAGATGAAAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCAATGCGAGA
ATATGCCGCCACGGCTTTTCAGAACTTTGATTCAAGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTG
GACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTAAAATCATTCATTCAAATTGCAAATGCATTTCGATTACCTGCCTTTACAAAGAAGACATTCAACGAG
ATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGACTTTTGGCTCTGGACAT
TGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAGTCCATTAGCCACACCGATACAACCTGTGCAGTCGGATT
TTTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTTTCCACATAACCCTGCTTCTGTTTTTTATGTA
GGACATGGGAACAATAAGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAA
TCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGTCGCCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAA
ATAACAGTTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAG
CTCGCCAATGAATTGAAGAATAGACCACAAGGAGATTTTGAAGAATGCTCTGCTATAACTAACTTGAATCTTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGA
GATTGAAGAAGAGCTTGATAAGATGGCAGAAGGACCGGAAGATGTGACTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAATTTTGG
AGAAGCACAAAAAGGCCATTGGATGGACGATAGCAGATATCCGAGGGATAAGCCCGGCCTTTTGCATGCACAAAATTCTATTGGAAGAAGATGCTAAGAACTCTATTGAG
AGTCAAAGGCGGTTGAACCTGAAGATGAAAGAATGA
Protein sequenceShow/hide protein sequence
MADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMAMREYAATAFQNFDSRIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPAFTKKTFNE
IVDILNDLASHNELWCSQRSRAAPKKQDPAGLLALDIATSMQKEMVTMNQRLKEMALGIKSPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNFPHNPASVFYV
GHGNNKNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPSPQQQYNQRTQTPPVQNNSSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQ
LANELKNRPQGDFEECSAITNLNLVMFDEFYDLLVTEIEEELDKMAEGPEDVTNPIEKIQKEECKSLLPSIILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNSIE
SQRRLNLKMKE