; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014740 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014740
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00001047:552403..555819
RNA-Seq ExpressionSgr014740
SyntenySgr014740
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602701.1 hypothetical protein SDJN03_07934, partial [Cucurbita argyrosperma subsp. sororia]1.1e-8481.06Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKKVHGSY
        NA K S  CSSSMGYQKPPLKKVHGSY
Subjt:  NATKTSSQCSSSMGYQKPPLKKVHGSY

KAG7033387.1 hypothetical protein SDJN02_07443, partial [Cucurbita argyrosperma subsp. argyrosperma]4.0e-8480.62Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKKVHGSY
        NA K S  CSSSMG+QKPPLKKVHGSY
Subjt:  NATKTSSQCSSSMGYQKPPLKKVHGSY

XP_022133535.1 uncharacterized protein LOC111006095 [Momordica charantia]4.4e-9987.22Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTH
        MAIEA CP+ISV  MSPRISFSHDFCQ+EAIPVEQRP  SRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHG+ILPLEIKKKPE+PPV +DQSSS  
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTH

Query:  APLTRTQSLDVNAEKCLKEDRSSKETKAANSDSEEKQS--SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNAT
        APL RT+SLD + EKCLK+DRSSKE KAANSDSEEKQS  SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDG +HKQSSHRN++
Subjt:  APLTRTQSLDVNAEKCLKEDRSSKETKAANSDSEEKQS--SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNAT

Query:  KTS---SQCSSSMGYQKPPLKKVHGSY
        KTS   SQCSSSMGYQKPPLKKVHGSY
Subjt:  KTS---SQCSSSMGYQKPPLKKVHGSY

XP_022953381.1 uncharacterized protein LOC111455949 isoform X1 [Cucurbita moschata]2.3e-8480.62Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +R+DQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKKVHGSY
        NA K S  CSSSMGYQKPPLKKVHGSY
Subjt:  NATKTSSQCSSSMGYQKPPLKKVHGSY

XP_022990991.1 uncharacterized protein LOC111487716 isoform X1 [Cucurbita maxima]5.2e-8480.62Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQR N SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKKVHGSY
        NA K S  CSSSMGYQKPPLKKVHGSY
Subjt:  NATKTSSQCSSSMGYQKPPLKKVHGSY

TrEMBL top hitse value%identityAlignment
A0A6J1BVI8 uncharacterized protein LOC1110060952.1e-9987.22Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTH
        MAIEA CP+ISV  MSPRISFSHDFCQ+EAIPVEQRP  SRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHG+ILPLEIKKKPE+PPV +DQSSS  
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTH

Query:  APLTRTQSLDVNAEKCLKEDRSSKETKAANSDSEEKQS--SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNAT
        APL RT+SLD + EKCLK+DRSSKE KAANSDSEEKQS  SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDG +HKQSSHRN++
Subjt:  APLTRTQSLDVNAEKCLKEDRSSKETKAANSDSEEKQS--SKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNAT

Query:  KTS---SQCSSSMGYQKPPLKKVHGSY
        KTS   SQCSSSMGYQKPPLKKVHGSY
Subjt:  KTS---SQCSSSMGYQKPPLKKVHGSY

A0A6J1GMV2 uncharacterized protein LOC111455949 isoform X11.1e-8480.62Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +R+DQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKKVHGSY
        NA K S  CSSSMGYQKPPLKKVHGSY
Subjt:  NATKTSSQCSSSMGYQKPPLKKVHGSY

A0A6J1GN78 uncharacterized protein LOC111455949 isoform X22.6e-8180.18Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQRPN SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +R+DQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKK
        NA K S  CSSSMGYQKPPLKK
Subjt:  NATKTSSQCSSSMGYQKPPLKK

A0A6J1JKF9 uncharacterized protein LOC111487716 isoform X25.8e-8180.18Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQR N SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKK
        NA K S  CSSSMGYQKPPLKK
Subjt:  NATKTSSQCSSSMGYQKPPLKK

A0A6J1JPH3 uncharacterized protein LOC111487716 isoform X12.5e-8480.62Show/hide
Query:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST
        MAIEA  P+I V  +SPRISFSHDF   EAIPVEQR N SRS+SS  NSS DFDFCIRECS QESSSADEIFSHGKILPLEIKKK EEP +RVDQSS S 
Subjt:  MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSS-ST

Query:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR
        H+ PLTR +SLD NAEKCLK+DRS KE K A +SDSEEKQSS  KSFW FKRSSSCGSGYT SLCPLPLLSRSNSTGSAPNIKRT LSKDGV  KQSSHR
Subjt:  HA-PLTRTQSLDVNAEKCLKEDRSSKETK-AANSDSEEKQSS--KSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHR

Query:  NATKTSSQCSSSMGYQKPPLKKVHGSY
        NA K S  CSSSMGYQKPPLKKVHGSY
Subjt:  NATKTSSQCSSSMGYQKPPLKKVHGSY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48780.1 unknown protein5.5e-1538.97Show/hide
Query:  RISFSHDFCQTEAIP---VEQRPNSSRSNSSGLNSSIDFDFCIRECSDQ-ESSSADEIFSHGKILPLEIKK---------KPEEPPVRVDQSSSTHAPLT
        RISFS D  Q++  P   +E      R  +   +S+ DF+F I    D  +SS ADEIF+ G ILP  +           K E PP+    SS + +PL+
Subjt:  RISFSHDFCQTEAIP---VEQRPNSSRSNSSGLNSSIDFDFCIRECSDQ-ESSSADEIFSHGKILPLEIKK---------KPEEPPVRVDQSSSTHAPLT

Query:  RTQSLDVNAEKCLKEDRSSKETKAANSDSEEKQSSKSFWRFKRSSSCGSGYTHSL-CPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNATKTSSQ
               ++EK      ++     ANSDSE ++SSKSFW FKRSSS       SL C  P L+RSNSTGS  N KR  L +D  NH+ SS       SS 
Subjt:  RTQSLDVNAEKCLKEDRSSKETKAANSDSEEKQSSKSFWRFKRSSSCGSGYTHSL-CPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNATKTSSQ

Query:  CSSSMGYQKPPLK
        C++   YQ  P K
Subjt:  CSSSMGYQKPPLK

AT1G67050.1 unknown protein8.4e-4050Show/hide
Query:  ATMSPRISFSHDFCQTEAIPVEQRP-NSSRSNSSGLNSSIDFDFCI------RECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTR
        + MSPRISFS DFCQ++AIP+E+RP  SS S  S LNSSIDFDFCI       E  DQ S SADE+FS+GKILP EIKKKPE      +       P +R
Subjt:  ATMSPRISFSHDFCQTEAIPVEQRP-NSSRSNSSGLNSSIDFDFCI------RECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTR

Query:  TQSLDVNAEKCLKEDRSSKETKAANSDSEEKQSSKSFWRFKRSSS--CGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNH---KQSSHRNATKT
         Q    N E+  +ED     T       EEK ++KSFW FKRSSS  CGS Y  SLCPLPLL+RSNSTGS  + ++   S+    H   +QSS  +++ +
Subjt:  TQSLDVNAEKCLKEDRSSKETKAANSDSEEKQSSKSFWRFKRSSS--CGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNH---KQSSHRNATKT

Query:  SSQCSSSMGYQKPPLKKVHGSY
        +S   S+ G+ KPPLKK +G Y
Subjt:  SSQCSSSMGYQKPPLKKVHGSY

AT3G18300.1 unknown protein6.7e-1335.27Show/hide
Query:  RISFSHDFCQTE-AIPVEQRPNSSRSNSSGL--NSSIDFDFCIRECSDQ-ESSSADEIFSHGKILPL------------EIKKKPEEPPVRVDQSSSTHA
        R SF+ D  Q++   P+EQ+P+      + L  +S+ DF+F I    D  +SS ADEIF+ G ILP+            +   K E PP+    + S++ 
Subjt:  RISFSHDFCQTE-AIPVEQRPNSSRSNSSGL--NSSIDFDFCIRECSDQ-ESSSADEIFSHGKILPL------------EIKKKPEEPPVRVDQSSSTHA

Query:  PLTRTQSLDVNAEKCLKEDRSSKETK--AANSDSEEKQSSKSFWRFKRSSSCGSGYTHSL-CPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNAT
        P       + + +  +KE R S   +   ANSDSE ++SSKSFW FKRSSS       SL C  P L+RSNSTGS    KR  L      +K SS R+  
Subjt:  PLTRTQSLDVNAEKCLKEDRSSKETK--AANSDSEEKQSSKSFWRFKRSSSCGSGYTHSL-CPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNAT

Query:  KTSSQCSSSMGYQKPPLKKVHGSYRLNCGDRKQHHLHKSTG
               SS  + +PP      SY+     R Q H  K+ G
Subjt:  KTSSQCSSSMGYQKPPLKKVHGSYRLNCGDRKQHHLHKSTG

AT5G38320.1 unknown protein1.5e-0732.3Show/hide
Query:  ATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRE--CSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLD
        A  SPRISFS+DFC  E+IP+EQR + S  + S        +F I     S + S SA+E F+ GKILP+E+KK PE  P+   ++      L R + + 
Subjt:  ATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRE--CSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLD

Query:  V-NAEKCLK-EDRSSKETKAANSDSEEKQSSKSFWRFKRSSSCGSGYTHSLCPLPLLSRSN
        + + E  L+ E+   +E +          +  +  + + SSS  S +  S  P P+L  ++
Subjt:  V-NAEKCLK-EDRSSKETKAANSDSEEKQSSKSFWRFKRSSSCGSGYTHSLCPLPLLSRSN

AT5G38320.2 unknown protein1.8e-1048.05Show/hide
Query:  ATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRE--CSDQESSSADEIFSHGKILPLEIKKKPE
        A  SPRISFS+DFC  E+IP+EQR + S  + S        +F I     S + S SA+E F+ GKILP+E+KK PE
Subjt:  ATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRE--CSDQESSSADEIFSHGKILPLEIKKKPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATCGAGGCAGCTTGTCCGGAGATTTCTGTTGCGACAATGAGCCCTAGAATTTCATTTTCTCACGACTTCTGCCAGACTGAAGCTATTCCGGTAGAACAACGGCC
TAATTCCTCCCGATCCAATTCTTCCGGTTTGAATTCCAGCATTGATTTCGACTTCTGCATTCGTGAGTGTTCCGATCAGGAGTCGTCTTCCGCGGATGAAATTTTCTCCC
ACGGAAAAATTCTGCCGCTCGAAATCAAGAAGAAACCTGAAGAGCCTCCTGTGCGAGTCGATCAGTCTTCTTCTACTCATGCTCCATTGACGCGAACACAATCTCTTGAT
GTTAACGCCGAAAAATGTTTGAAAGAAGATAGATCGTCAAAGGAAACCAAGGCAGCGAATAGCGACTCCGAAGAGAAGCAAAGTTCCAAGTCCTTTTGGCGTTTCAAAAG
AAGCAGCAGCTGTGGCTCTGGATACACTCATAGCTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCAACTGGCTCTGCACCGAACATTAAGCGAACGCCATTGTCCA
AGGACGGTGTAAATCACAAGCAGAGCTCCCATAGAAATGCCACCAAAACTTCATCACAGTGTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGT
TCGTACCGCTTAAACTGCGGTGACCGGAAACAGCATCATCTCCACAAATCGACCGGCCGCGACAAAACTCCGGCGTGCCGCCGAACCGCTTCCGACAAAGATCTCATCAC
GCACCGTCTTAACCTCCCTGCAGATTCAGATGTGGCCTCCCCGGAGAACCACGCACCTGCACCGGCACCGCCGTGGGACAGAGGTGGCCGAGACATAGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATCGAGGCAGCTTGTCCGGAGATTTCTGTTGCGACAATGAGCCCTAGAATTTCATTTTCTCACGACTTCTGCCAGACTGAAGCTATTCCGGTAGAACAACGGCC
TAATTCCTCCCGATCCAATTCTTCCGGTTTGAATTCCAGCATTGATTTCGACTTCTGCATTCGTGAGTGTTCCGATCAGGAGTCGTCTTCCGCGGATGAAATTTTCTCCC
ACGGAAAAATTCTGCCGCTCGAAATCAAGAAGAAACCTGAAGAGCCTCCTGTGCGAGTCGATCAGTCTTCTTCTACTCATGCTCCATTGACGCGAACACAATCTCTTGAT
GTTAACGCCGAAAAATGTTTGAAAGAAGATAGATCGTCAAAGGAAACCAAGGCAGCGAATAGCGACTCCGAAGAGAAGCAAAGTTCCAAGTCCTTTTGGCGTTTCAAAAG
AAGCAGCAGCTGTGGCTCTGGATACACTCATAGCTTATGTCCTTTGCCGCTTCTATCACGAAGCAATTCAACTGGCTCTGCACCGAACATTAAGCGAACGCCATTGTCCA
AGGACGGTGTAAATCACAAGCAGAGCTCCCATAGAAATGCCACCAAAACTTCATCACAGTGTTCGTCTTCAATGGGATATCAGAAACCTCCATTGAAGAAGGTACATGGT
TCGTACCGCTTAAACTGCGGTGACCGGAAACAGCATCATCTCCACAAATCGACCGGCCGCGACAAAACTCCGGCGTGCCGCCGAACCGCTTCCGACAAAGATCTCATCAC
GCACCGTCTTAACCTCCCTGCAGATTCAGATGTGGCCTCCCCGGAGAACCACGCACCTGCACCGGCACCGCCGTGGGACAGAGGTGGCCGAGACATAGCGTAA
Protein sequenceShow/hide protein sequence
MAIEAACPEISVATMSPRISFSHDFCQTEAIPVEQRPNSSRSNSSGLNSSIDFDFCIRECSDQESSSADEIFSHGKILPLEIKKKPEEPPVRVDQSSSTHAPLTRTQSLD
VNAEKCLKEDRSSKETKAANSDSEEKQSSKSFWRFKRSSSCGSGYTHSLCPLPLLSRSNSTGSAPNIKRTPLSKDGVNHKQSSHRNATKTSSQCSSSMGYQKPPLKKVHG
SYRLNCGDRKQHHLHKSTGRDKTPACRRTASDKDLITHRLNLPADSDVASPENHAPAPAPPWDRGGRDIA