CuGenDBv2

Moc02g18740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g18740
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein MKS1-like
Genome location: chr2:13952889..13953533
RNA-Seq Expression: Moc02g18740
Synteny: Moc02g18740
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR008889 - VQ
                  IPR039607 - VQ motif-containing protein 8/17/18/20/21/25
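
The genome location above is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the field could be parsed and the span length computed as follows. The function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(loc: str) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(loc)
    return end - start + 1


if __name__ == "__main__":
    print(span_length("chr2:13952889..13953533"))
```

Under the inclusive assumption the span comes to 645 bp; a half-open convention would give 644 instead, so the convention is worth confirming against the database documentation before relying on the number.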


Homology
GenBank top hits | e value | % identity
KAG6571354.1 Protein MKS1, partial [Cucurbita argyrosperma subsp. sororia] | 5.4e-70 | 73.64
Query:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADSS
        PG+P     RKKEIQLQGPRPPQLRV+ +SRKIKKPPPHPQ    AGRPPLP   P Q+PQPLIIYDISPKVIHV ENNFMSVVQRLTGLSSS AAA   
Subjt:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADSS

Query:  AAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASPSAL
          GDLSPAARLA IEKASPRPEREREREIDVS+ MM++ ++SVEL Q PGILSPAPASLAPIP+GFFSPA            ELSPHWP      SPSAL
Subjt:  AAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASPSAL

Query:  LSAPLVSPISPPNFYNNLFD
         SAPLVSPIS PN +NNLFD
Subjt:  LSAPLVSPISPPNFYNNLFD

KAG6605314.1 Protein MKS1, partial [Cucurbita argyrosperma subsp. sororia] | 3.3e-67 | 72.12
Query:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL
        M+P G    G+ N  RKKEIQLQGPRPPQLRV+ ESRKIKKPPPHPQ    +GRPPL PPA +Q+PQPLIIYDISPKV+HVAENNFMSVVQRLTGLSS  
Subjt:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL

Query:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPA----------AFELSPHWPASSFL
          A SS  GDLSPAAR ATIEKASPR EREREREIDVSD MM++ +V VELGQ PGILSPAPASLAPI SGFFSPA            ELSPHWP     
Subjt:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPA----------AFELSPHWPASSFL

Query:  ASPSALLSAPLVSPISPPNFYNNLFD
         SPSAL  APLVSPIS PN +N+LFD
Subjt:  ASPSALLSAPLVSPISPPNFYNNLFD

XP_022143600.1 protein MKS1-like [Momordica charantia] | 3.5e-109 | 100
Query:  MDPLGPGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS
        MDPLGPGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS
Subjt:  MDPLGPGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS

Query:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFLASPSALLSAPLVSPI
        SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFLASPSALLSAPLVSPI
Subjt:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFLASPSALLSAPLVSPI

Query:  SPPNFYNNLFDLLS
        SPPNFYNNLFDLLS
Subjt:  SPPNFYNNLFDLLS

XP_022932014.1 protein MKS1-like [Cucurbita moschata] | 7.8e-69 | 72.77
Query:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPA-QYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS
        PG+P     RKKEIQLQGPRPPQLRVS +SRKIKKPPPHPQ    A RPPLPP A A Q+PQPLIIYDISPKVIHV ENNFMSVVQRLTGLSSS A+A  
Subjt:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPA-QYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS

Query:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASPSA
           GDLSPAARLA IEKASPRPEREREREIDVS+ MM++ ++SVEL Q PGILSPAPASLAPIP+GFFSPA            ELSPHWP      SPSA
Subjt:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASPSA

Query:  LLSAPLVSPISPPNFYNNLFDLLS
        L SAPLVSPIS PN +N+LFD  S
Subjt:  LLSAPLVSPISPPNFYNNLFDLLS

XP_038901805.1 protein MKS1-like [Benincasa hispida] | 5.1e-68 | 73.01
Query:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL
        M+P G    G+    RKKEIQLQGPRPPQLRV+ ESRKIKKPPPHPQ     GRPPL PP PAQ+PQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSS 
Subjt:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL

Query:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPA----------AFELSPHWPASSFL
        AAA  +  GDLSPAARLATIEKASPR   EREREI+VSD MM++A+VSVELGQ+PGILSPAPA+LAPIP+G+FSPA            ELSPHWP     
Subjt:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPA----------AFELSPHWPASSFL

Query:  ASPSALLSAPLVSPISPPNFYNNLFD
         SPSAL SAPLVSPIS PN +NNLFD
Subjt:  ASPSALLSAPLVSPISPPNFYNNLFD

TrEMBL top hits | e value | % identity
A0A5A7TBE4 Protein MKS1-like | 5.7e-65 | 70.35
Query:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL
        M+P G    G+    RKKEIQLQGPRPPQLRVS ESRKIKKPPPHPQ     GRPPL PP P+Q+PQPLIIYDISPKVIHVAENNFMSVVQRLTG  SS 
Subjt:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL

Query:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFL
        A  D    GDLSPAARLATIEKASPR   EREREI+VSD MM++A+VSVELGQ+PGILSPAP +LAPIP+G+FSPA            ELSPHWP     
Subjt:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFL

Query:  ASPSALLSAPLVSPISPPNFYNNLFD
         SPSAL S PL+SPIS PN +NNLFD
Subjt:  ASPSALLSAPLVSPISPPNFYNNLFD

A0A6J1CP82 protein MKS1-like | 1.7e-109 | 100
Query:  MDPLGPGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS
        MDPLGPGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS
Subjt:  MDPLGPGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS

Query:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFLASPSALLSAPLVSPI
        SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFLASPSALLSAPLVSPI
Subjt:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFLASPSALLSAPLVSPI

Query:  SPPNFYNNLFDLLS
        SPPNFYNNLFDLLS
Subjt:  SPPNFYNNLFDLLS

A0A6J1EVV6 protein MKS1-like | 3.8e-69 | 72.77
Query:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPA-QYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS
        PG+P     RKKEIQLQGPRPPQLRVS +SRKIKKPPPHPQ    A RPPLPP A A Q+PQPLIIYDISPKVIHV ENNFMSVVQRLTGLSSS A+A  
Subjt:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPA-QYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS

Query:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASPSA
           GDLSPAARLA IEKASPRPEREREREIDVS+ MM++ ++SVEL Q PGILSPAPASLAPIP+GFFSPA            ELSPHWP      SPSA
Subjt:  SAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASPSA

Query:  LLSAPLVSPISPPNFYNNLFDLLS
        L SAPLVSPIS PN +N+LFD  S
Subjt:  LLSAPLVSPISPPNFYNNLFDLLS

A0A6J1G715 protein MKS1-like | 5.7e-65 | 71.24
Query:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL
        M+P G    G+ N  RKKEIQLQGPRPPQLRV+ ESRKIKKPPPHPQ    +GRPPL PPA +Q+PQPLIIYDISPKV+HVAENNFMSVVQRLTGLSS  
Subjt:  MDPLGPGFPGA-NPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSL

Query:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPA----------AFELSPHWPASSFL
          A SS  GDLSPAAR ATIEKASPR   EREREIDVSD MM++ +V VELGQ PGILSPAPASLAPI SGFFSPA            ELSPHWP     
Subjt:  AAADSSAAGDLSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPA----------AFELSPHWPASSFL

Query:  ASPSALLSAPLVSPISPPNFYNNLFD
         SPSAL  APLVSPIS PN +N+LFD
Subjt:  ASPSALLSAPLVSPISPPNFYNNLFD

A0A6J1I2Q6 protein MKS1-like | 8.7e-66 | 70.85
Query:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPA-QYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS
        PG+P     RKKEIQLQGPRPPQLRVS +SRKIKKPPPHPQ     GRPPLPP A A Q+PQPLIIYDISPKVIHV ENNFMSVVQRLTGLSSS      
Subjt:  PGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQ----AGRPPLPPPAPA-QYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS

Query:  SAAGDLSPAARLATIEKASPRP--EREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASP
           GDLSPAARLA IEKASPRP  EREREREIDVS+ MM++ ++SVEL Q PGILSPAPASLAPIP+GFFSPA            ELSPHWP      SP
Subjt:  SAAGDLSPAARLATIEKASPRP--EREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAA----------FELSPHWPASSFLASP

Query:  SALLSAPLVSPISPPNFYNNLFD
        SAL SAPLVSPIS PN ++ LF+
Subjt:  SALLSAPLVSPISPPNFYNNLFD

SwissProt top hits | e value | % identity
F4HWF9 Nuclear speckle RNA-binding protein B | 8.0e-08 | 35.56
Query:  GPRPPQLRVSHESRK-IKKP---PPHPQAGRPPLPPPAPAQY---PQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADSSAA------------
        GP+P  L+V  +S K IKKP   PPHPQ   P      P+Q    P P+ IY ++P++IH   NNFM++VQRLTG +S+   + SS++            
Subjt:  GPRPPQLRVSHESRK-IKKP---PPHPQAGRPPLPPPAPAQY---PQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADSSAA------------

Query:  ----GDLSPAARLATIEKASPRPERER----EREIDVSDMMMEMADVSVELG------QVPGILSPAPASLAPIPSGFFS
            G +SPAAR A  EKA+   E       E  +D               G         GILSP P SL  +   FFS
Subjt:  ----GDLSPAARLATIEKASPRPERER----EREIDVSDMMMEMADVSVELG------QVPGILSPAPASLAPIPSGFFS

Q8LGD5 Protein MKS1 | 1.9e-25 | 41.48
Query:  MDPLGPGFPGANPS----RKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLA
        MDP    F G NPS    +K+++Q+ GPRP  L V  +S KIKKPP HP A  P    P P    +P++IY +SPKV+H   + FM+VVQRLTG+SS + 
Subjt:  MDPLGPGFPGANPS----RKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLA

Query:  AADSSAAGDLSPAARLATIEKASPRPERE---REREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFE---LSPHWPASSFLASPSA
          +S   GD+SPAARLA+ E ASPR  +E   R+  ++++  M E A+     G  PGILSP+PA L    +G FSP   +    SP  P   F  SP+ 
Subjt:  AADSSAAGDLSPAARLATIEKASPRPERE---REREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFE---LSPHWPASSFLASPSA

Query:  LLSAPLVSP-----ISPP---NFYNNLFD
         +S P  SP     ++ P   +F+++++D
Subjt:  LLSAPLVSP-----ISPP---NFYNNLFD

Arabidopsis top hits | e value | % identity
AT1G21320.1 nucleotide binding;nucleic acid binding | 5.7e-09 | 35.56
Query:  GPRPPQLRVSHESRK-IKKP---PPHPQAGRPPLPPPAPAQY---PQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADSSAA------------
        GP+P  L+V  +S K IKKP   PPHPQ   P      P+Q    P P+ IY ++P++IH   NNFM++VQRLTG +S+   + SS++            
Subjt:  GPRPPQLRVSHESRK-IKKP---PPHPQAGRPPLPPPAPAQY---PQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADSSAA------------

Query:  ----GDLSPAARLATIEKASPRPERER----EREIDVSDMMMEMADVSVELG------QVPGILSPAPASLAPIPSGFFS
            G +SPAAR A  EKA+   E       E  +D               G         GILSP P SL  +   FFS
Subjt:  ----GDLSPAARLATIEKASPRPERER----EREIDVSDMMMEMADVSVELG------QVPGILSPAPASLAPIPSGFFS

AT1G21326.1 VQ motif-containing protein | 3.9e-10 | 34.65
Query:  GPRPPQLRVSHESRK-IKKP---PPHPQAGRPPLPPPAPAQY---PQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS---------------
        GPRP  L+V  +S K IKKP   PPHPQ   P      P+Q    P P+IIY +SP++IH   NNFM++VQRLTG +S+   + S               
Subjt:  GPRPPQLRVSHESRK-IKKP---PPHPQAGRPPLPPPAPAQY---PQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADS---------------

Query:  -SAAGDLSPAARLATIEKASPRPERER----EREIDV----------SDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFL
         ++ G +SPAAR A  EKA+   E       E  +D            +   +            GILSP P SL  +   FFS       P   +S F 
Subjt:  -SAAGDLSPAARLATIEKASPRPERER----EREIDV----------SDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFL

Query:  ASPSALLSAP--LVSPISPPNFYNNLFD
           S L S+P  + SP S  + +NN FD
Subjt:  ASPSALLSAP--LVSPISPPNFYNNLFD

AT3G18360.1 VQ motif-containing protein | 1.9e-04 | 39.44
Query:  PRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSS
        P PP L+V+ +S  IKKPP        P    + A+   P+IIY  +P++IH    +FM++VQ+LTG++ S
Subjt:  PRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSS

AT3G18690.1 MAP kinase substrate 1 | 1.3e-26 | 41.48
Query:  MDPLGPGFPGANPS----RKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLA
        MDP    F G NPS    +K+++Q+ GPRP  L V  +S KIKKPP HP A  P    P P    +P++IY +SPKV+H   + FM+VVQRLTG+SS + 
Subjt:  MDPLGPGFPGANPS----RKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLA

Query:  AADSSAAGDLSPAARLATIEKASPRPERE---REREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFE---LSPHWPASSFLASPSA
          +S   GD+SPAARLA+ E ASPR  +E   R+  ++++  M E A+     G  PGILSP+PA L    +G FSP   +    SP  P   F  SP+ 
Subjt:  AADSSAAGDLSPAARLATIEKASPRPERE---REREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFE---LSPHWPASSFLASPSA

Query:  LLSAPLVSP-----ISPP---NFYNNLFD
         +S P  SP     ++ P   +F+++++D
Subjt:  LLSAPLVSP-----ISPP---NFYNNLFD


Sequences
CDS sequence
ATGGACCCGCTGGGCCCGGGCTTTCCCGGCGCCAATCCGTCGCGGAAGAAGGAGATCCAGCTGCAGGGTCCACGGCCGCCGCAGCTCCGGGTCAGCCACGAATCC
CGCAAAATCAAGAAGCCGCCGCCGCACCCCCAGGCCGGCCGCCCCCCTCTGCCCCCGCCGGCGCCGGCCCAGTACCCCCAGCCGTTGATCATCTACGACATCTCC
CCCAAAGTCATCCACGTGGCGGAGAACAATTTCATGTCGGTGGTACAGCGCCTGACGGGGTTGTCCTCGTCGTTGGCCGCCGCAGACTCGTCCGCCGCCGGGGAC
CTCTCGCCGGCGGCCAGGCTGGCGACAATCGAGAAGGCGAGCCCCCGGCCGGAGCGGGAAAGGGAAAGGGAAATCGACGTCAGCGATATGATGATGGAGATGGCG
GACGTTTCGGTGGAATTGGGGCAGGTGCCAGGAATTCTATCTCCGGCGCCGGCGAGTTTGGCGCCGATTCCGTCGGGATTCTTCTCGCCGGCGGCGTTTGAGCTG
AGCCCGCATTGGCCGGCTTCCAGTTTCTTGGCCAGCCCTTCGGCTCTCCTCTCAGCGCCTTTGGTTTCCCCAATTTCTCCCCCAAATTTCTACAACAATCTCTTT
GACTTGCTCAGCTAA
mRNA sequence
ATGGACCCGCTGGGCCCGGGCTTTCCCGGCGCCAATCCGTCGCGGAAGAAGGAGATCCAGCTGCAGGGTCCACGGCCGCCGCAGCTCCGGGTCAGCCACGAATCC
CGCAAAATCAAGAAGCCGCCGCCGCACCCCCAGGCCGGCCGCCCCCCTCTGCCCCCGCCGGCGCCGGCCCAGTACCCCCAGCCGTTGATCATCTACGACATCTCC
CCCAAAGTCATCCACGTGGCGGAGAACAATTTCATGTCGGTGGTACAGCGCCTGACGGGGTTGTCCTCGTCGTTGGCCGCCGCAGACTCGTCCGCCGCCGGGGAC
CTCTCGCCGGCGGCCAGGCTGGCGACAATCGAGAAGGCGAGCCCCCGGCCGGAGCGGGAAAGGGAAAGGGAAATCGACGTCAGCGATATGATGATGGAGATGGCG
GACGTTTCGGTGGAATTGGGGCAGGTGCCAGGAATTCTATCTCCGGCGCCGGCGAGTTTGGCGCCGATTCCGTCGGGATTCTTCTCGCCGGCGGCGTTTGAGCTG
AGCCCGCATTGGCCGGCTTCCAGTTTCTTGGCCAGCCCTTCGGCTCTCCTCTCAGCGCCTTTGGTTTCCCCAATTTCTCCCCCAAATTTCTACAACAATCTCTTT
GACTTGCTCAGCTAA
Protein sequence
MDPLGPGFPGANPSRKKEIQLQGPRPPQLRVSHESRKIKKPPPHPQAGRPPLPPPAPAQYPQPLIIYDISPKVIHVAENNFMSVVQRLTGLSSSLAAADSSAAGD
LSPAARLATIEKASPRPEREREREIDVSDMMMEMADVSVELGQVPGILSPAPASLAPIPSGFFSPAAFELSPHWPASSFLASPSALLSAPLVSPISPPNFYNNLF
DLLS
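
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies and that translation stops at the first in-frame stop codon. The `translate` helper is hypothetical and written here for illustration only; it is not the method used by the database.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 15 nucleotides of the CDS shown above.
    print(translate("ATGGACCCGCTGGGC"))
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick way to confirm that the two records agree.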