CuGenDBv2

ClCG06G018240 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG06G018240
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: SAM domain-containing protein
Genome location: CG_Chr06:31195922..31197281
RNA-Seq Expression: ClCG06G018240
Synteny: ClCG06G018240
Gene Ontology terms: NA
InterPro domains: NA
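The genome location above uses a `chromosome:start..end` notation. A minimal sketch of how such a string could be split into its parts and turned into a span length follows. It assumes the coordinates are 1-based and inclusive, which the page does not state; the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    covers end - start + 1 bases.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("CG_Chr06:31195922..31197281")
print(chrom, start, end, span)  # CG_Chr06 31195922 31197281 1360
```

Under that inclusive-coordinate assumption the gene spans 1360 bases on CG_Chr06.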


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008464381.1 PREDICTED: uncharacterized protein LOC103502284 [Cucumis melo] | e-value: 2.4e-127 | % identity: 92.66
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS
        MADIQPPDPQINGAPPPLV+SSETVGSKRQRRPSVRLGDIGGDQPYDSY RRTNKPWKF+LDNRKDSSAA+AKNSKTRPLTNFSS    ++EDKENNLD+
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS

Query:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
        VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
Subjt:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS

Query:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW
        RVSEGR+HQEGLEFSGPSDTDARNWKCGTSGDRNGNG     GEDGVR WLNGLGLGR+
Subjt:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW

XP_022921618.1 protein bicaudal C homolog 1-A-like [Cucurbita moschata] | e-value: 7.4e-121 | % identity: 87.5
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPG--------AVESE
        MADIQPPDPQINGAP P V+SSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFA DNRKDSSAASAKNSKTRPLTN  S G          E E
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPG--------AVESE

Query:  DKENNLDSVAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQ
        DKENNLDSVAIGSWRLKDSKRRGSAATKR RTNWVS KHDEGGGGGEADEKYSAGEDV+EGYRDYD++NSESPLKEQSSMHSLENLAMEG+G +REMLYQ
Subjt:  DKENNLDSVAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQ

Query:  GNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGE--DGVRAWLNGLGLGRW
        GNRR IRSRVSEGREHQ+GLEFSGPSDTDARNWKCGTSGDRNGNGG   DGVR WLNGLGLGR+
Subjt:  GNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGE--DGVRAWLNGLGLGRW

XP_023516987.1 uncharacterized protein LOC111780732 [Cucurbita pepo subsp. pepo] | e-value: 1.3e-120 | % identity: 87.5
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPG--------AVESE
        MADIQPPDPQINGAP P V+SSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFA DNRKDSSAASAKNSKTRPLTN SS G          E E
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPG--------AVESE

Query:  DKENNLDSVAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQ
        DKENNLDSVAIGSWRLKDSKRRGSAATKR RTNWVS KH EGGGGGEADEKYSAGEDV+EGYRDYD+ENSESPLKEQSSMHSLENLAMEG+G +REMLYQ
Subjt:  DKENNLDSVAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQ

Query:  GNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGE--DGVRAWLNGLGLGRW
        GNRR IRSRVSEGREHQ+G+EFSGPSDTDARNWKCGTSGDRNGNGG   DGVR WLNGLGLGR+
Subjt:  GNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGE--DGVRAWLNGLGLGRW

XP_031738774.1 uncharacterized protein LOC101208783 [Cucumis sativus] | e-value: 4.5e-126 | % identity: 91.51
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS
        MADIQPPDPQINGAPPPLV+SSETVGSKRQRRPSVRLGDIGGDQPYDSY RRTNKPWKF+L+NRKDSSAA+AKNSKTRPLTNFSS    ++EDKENNLD+
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS

Query:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
        VAIGSWRLKDSKRRGSAATKRARTNW SSKHDEGGGGGEAD+KYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
Subjt:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS

Query:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW
        RVSEGR+HQEGLEFSGPSDTDARNWKCGTSGDRNGNG     GEDGVR WLNGLGLGR+
Subjt:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW

XP_038879043.1 uncharacterized protein LOC120071081 [Benincasa hispida] | e-value: 2.9e-125 | % identity: 91.98
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS
        MADIQPPDPQINGAPPPLV+SSETVGSKRQRRPSVRLGDIGGDQPYDSY RRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSS G VE EDKENNLDS
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS

Query:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLY--QGNRRSI
        VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENL MEGQGNDREMLY  QGNRRSI
Subjt:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLY--QGNRRSI

Query:  RSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGG------EDGVRAWLNGLGLGRW
        RSRVSEGREHQ   EFSGPSDTD RNWKCGTSGDRNGNGG      +DGVR WLNGLGLGR+
Subjt:  RSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGG------EDGVRAWLNGLGLGRW

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LRM8 SAM domain-containing protein | e-value: 2.2e-126 | % identity: 91.51
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS
        MADIQPPDPQINGAPPPLV+SSETVGSKRQRRPSVRLGDIGGDQPYDSY RRTNKPWKF+L+NRKDSSAA+AKNSKTRPLTNFSS    ++EDKENNLD+
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS

Query:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
        VAIGSWRLKDSKRRGSAATKRARTNW SSKHDEGGGGGEAD+KYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
Subjt:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS

Query:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW
        RVSEGR+HQEGLEFSGPSDTDARNWKCGTSGDRNGNG     GEDGVR WLNGLGLGR+
Subjt:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW

A0A1S3CLT3 uncharacterized protein LOC103502284 | e-value: 1.2e-127 | % identity: 92.66
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS
        MADIQPPDPQINGAPPPLV+SSETVGSKRQRRPSVRLGDIGGDQPYDSY RRTNKPWKF+LDNRKDSSAA+AKNSKTRPLTNFSS    ++EDKENNLD+
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS

Query:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
        VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
Subjt:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS

Query:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW
        RVSEGR+HQEGLEFSGPSDTDARNWKCGTSGDRNGNG     GEDGVR WLNGLGLGR+
Subjt:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW

A0A5A7UW08 Protein bicaudal C-1-like protein | e-value: 1.2e-127 | % identity: 92.66
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS
        MADIQPPDPQINGAPPPLV+SSETVGSKRQRRPSVRLGDIGGDQPYDSY RRTNKPWKF+LDNRKDSSAA+AKNSKTRPLTNFSS    ++EDKENNLD+
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS

Query:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
        VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
Subjt:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS

Query:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW
        RVSEGR+HQEGLEFSGPSDTDARNWKCGTSGDRNGNG     GEDGVR WLNGLGLGR+
Subjt:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW

A0A6J1E4E0 protein bicaudal C homolog 1-A-like | e-value: 3.6e-121 | % identity: 87.5
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPG--------AVESE
        MADIQPPDPQINGAP P V+SSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFA DNRKDSSAASAKNSKTRPLTN  S G          E E
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPG--------AVESE

Query:  DKENNLDSVAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQ
        DKENNLDSVAIGSWRLKDSKRRGSAATKR RTNWVS KHDEGGGGGEADEKYSAGEDV+EGYRDYD++NSESPLKEQSSMHSLENLAMEG+G +REMLYQ
Subjt:  DKENNLDSVAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQ

Query:  GNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGE--DGVRAWLNGLGLGRW
        GNRR IRSRVSEGREHQ+GLEFSGPSDTDARNWKCGTSGDRNGNGG   DGVR WLNGLGLGR+
Subjt:  GNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGE--DGVRAWLNGLGLGRW

E5GBM6 SAM domain-containing protein | e-value: 1.2e-127 | % identity: 92.66
Query:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS
        MADIQPPDPQINGAPPPLV+SSETVGSKRQRRPSVRLGDIGGDQPYDSY RRTNKPWKF+LDNRKDSSAA+AKNSKTRPLTNFSS    ++EDKENNLD+
Subjt:  MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDS

Query:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
        VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS
Subjt:  VAIGSWRLKDSKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRS

Query:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW
        RVSEGR+HQEGLEFSGPSDTDARNWKCGTSGDRNGNG     GEDGVR WLNGLGLGR+
Subjt:  RVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNG-----GEDGVRAWLNGLGLGRW

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G48800.1 Sterile alpha motif (SAM) domain-containing protein | e-value: 1.3e-27 | % identity: 37.99
Query:  MADIQPPD-PQINGAPPPLVISSE-------TVGSKRQRRPSVRLGDIGGDQ------------PYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPL
        MA++QP D  Q NG    +  S+          GSKR RRPSVRLG+IGGDQ             YDS  R++         NRKD+S    K+S+TR L
Subjt:  MADIQPPD-PQINGAPPPLVISSE-------TVGSKRQRRPSVRLGDIGGDQ------------PYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPL

Query:  TNFSSP----GAVESEDKENNLDSVAIGSWRLKDSKRRG-SAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSL
        TN SS     G ++ E +E N+DS  +GSWR+K  KR G SAA KR R+NWVS         G+ DEK S GE+++ G+RD+  E+SESP+KE       
Subjt:  TNFSSP----GAVESEDKENNLDSVAIGSWRLKDSKRRG-SAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSL

Query:  ENLAMEGQGNDREMLYQGNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGEDGVRAWLNGLGLGRW
        E+L  +G G      + G RR   +  S  RE +  ++                       GG++GV+ WL  LGLGR+
Subjt:  ENLAMEGQGNDREMLYQGNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGEDGVRAWLNGLGLGRW

AT5G23680.1 Sterile alpha motif (SAM) domain-containing protein | e-value: 2.9e-14 | % identity: 32.44
Query:  MADIQPPD-PQINGA--PPPLVISSE----------TVGSKRQRRPSVRLGDIGGDQ-------PYDSYARRTNKPWK------FALDNRKDSSAASAK-
        MA++Q  +  QING   PP ++ S E          +VGSKR RRPSVRLGDIGGDQ        YDS   R  K W+          NRK+ +  S K 
Subjt:  MADIQPPD-PQINGA--PPPLVISSE----------TVGSKRQRRPSVRLGDIGGDQ-------PYDSYARRTNKPWK------FALDNRKDSSAASAK-

Query:  --NSKTRPLTNFSSPGAVESEDKENNLDSVAIGSWRLKD-----------SKRRGSAATKRARTNWVS-----SKHDEGGGGGEADEKYSAGEDVDEGYR
          +S+TR +TN SS G   +   +   D V+IGSWR+K            +    +A+ KR R+NW +      + DE   G E +E+       +EG+R
Subjt:  --NSKTRPLTNFSSPGAVESEDKENNLDSVAIGSWRLKD-----------SKRRGSAATKRARTNWVS-----SKHDEGGGGGEADEKYSAGEDVDEGYR

Query:  DYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGEDGVRAWLNGLGLGRW
        D+  E+SESP+KE+                               R  E RE    +E  G       +W       ++G  G++GV+ WL  LGLGR+
Subjt:  DYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRSRVSEGREHQEGLEFSGPSDTDARNWKCGTSGDRNGNGGEDGVRAWLNGLGLGRW
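
The alignments above carry a middle line between each Query/Subjt pair. As a rough illustration of how a % identity figure relates to such a middle line, the sketch below counts the columns where the middle line repeats the query residue and divides by the number of aligned columns. It assumes the BLAST-style convention in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap; the page does not document its convention, and the helper name `percent_identity` is hypothetical. The input strings are a made-up toy example, not taken from the hits listed here.

```python
def percent_identity(query, midline):
    """Percent of aligned columns whose midline letter equals the query residue.

    Assumption: BLAST-style midline (letter = identical, '+' = similar,
    space = mismatch or gap). Columns are counted over the query row,
    including gap columns ('-').
    """
    if not query:
        raise ValueError("empty alignment")
    # Pad the midline in case trailing spaces were stripped during copy/paste.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if q != "-" and q == m
    )
    return round(100.0 * identical / len(query), 2)


# Toy alignment: 9 columns, 7 identical, 1 similar ('+'), 1 gap.
print(percent_identity("MADIQ-PPD", "MA+IQ PPD"))  # 77.78
```

For a multi-block alignment such as those shown on this page, the Query and middle lines of all blocks would be concatenated before calling the function.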


Sequences
CDS sequence
ATGGCCGATATTCAACCCCCCGACCCCCAGATCAACGGCGCCCCTCCCCCTCTCGTCATCTCCTCCGAAACCGTTGGATCCAAACGCCAGCGAAGGCCCAGCGTCCGATT
GGGGGATATCGGCGGCGACCAACCCTACGATTCCTATGCCCGCCGCACCAATAAACCCTGGAAGTTTGCCTTGGATAATCGCAAGGACTCCTCCGCTGCCTCTGCCAAGA
ACTCCAAAACTCGCCCCTTGACCAACTTCAGCTCGCCCGGGGCTGTAGAGAGCGAAGACAAAGAGAACAATTTGGATAGCGTTGCAATTGGCAGCTGGAGACTAAAGGAT
TCCAAGCGGAGAGGCTCCGCCGCCACCAAGAGGGCCAGAACTAATTGGGTATCCTCCAAGCACGACGAAGGCGGCGGAGGTGGGGAAGCAGACGAAAAATACAGCGCCGG
CGAAGACGTGGATGAGGGGTATCGGGATTACGACATTGAGAACTCGGAAAGTCCCCTGAAAGAACAAAGCTCAATGCATTCGCTTGAGAATTTGGCGATGGAAGGTCAGG
GGAATGATAGGGAGATGCTTTATCAGGGGAATAGAAGATCAATAAGGTCTAGGGTTTCGGAGGGGAGAGAACACCAAGAGGGGCTCGAATTTTCAGGGCCTTCGGATACT
GATGCCAGGAATTGGAAGTGCGGGACTAGCGGTGATAGGAATGGGAATGGTGGGGAAGATGGGGTTAGGGCTTGGCTCAACGGTTTAGGGTTAGGTCGTTGGGTCACGGA
GAAAAATGTATTGCGCAATCCAGAAGTTAGGGAAGGGATTTTCATGACTTTTGAATTTCAGCCATGGCAATATTCTGCATTTTGGGTTCTCCTCAAAATAGTGTACAACT
GCGTTAGTTATTTGGCCAAGTTTTTCCACTCTGGGAACTGGAAAAGCTGA
mRNA sequence
ATGGCCGATATTCAACCCCCCGACCCCCAGATCAACGGCGCCCCTCCCCCTCTCGTCATCTCCTCCGAAACCGTTGGATCCAAACGCCAGCGAAGGCCCAGCGTCCGATT
GGGGGATATCGGCGGCGACCAACCCTACGATTCCTATGCCCGCCGCACCAATAAACCCTGGAAGTTTGCCTTGGATAATCGCAAGGACTCCTCCGCTGCCTCTGCCAAGA
ACTCCAAAACTCGCCCCTTGACCAACTTCAGCTCGCCCGGGGCTGTAGAGAGCGAAGACAAAGAGAACAATTTGGATAGCGTTGCAATTGGCAGCTGGAGACTAAAGGAT
TCCAAGCGGAGAGGCTCCGCCGCCACCAAGAGGGCCAGAACTAATTGGGTATCCTCCAAGCACGACGAAGGCGGCGGAGGTGGGGAAGCAGACGAAAAATACAGCGCCGG
CGAAGACGTGGATGAGGGGTATCGGGATTACGACATTGAGAACTCGGAAAGTCCCCTGAAAGAACAAAGCTCAATGCATTCGCTTGAGAATTTGGCGATGGAAGGTCAGG
GGAATGATAGGGAGATGCTTTATCAGGGGAATAGAAGATCAATAAGGTCTAGGGTTTCGGAGGGGAGAGAACACCAAGAGGGGCTCGAATTTTCAGGGCCTTCGGATACT
GATGCCAGGAATTGGAAGTGCGGGACTAGCGGTGATAGGAATGGGAATGGTGGGGAAGATGGGGTTAGGGCTTGGCTCAACGGTTTAGGGTTAGGTCGTTGGGTCACGGA
GAAAAATGTATTGCGCAATCCAGAAGTTAGGGAAGGGATTTTCATGACTTTTGAATTTCAGCCATGGCAATATTCTGCATTTTGGGTTCTCCTCAAAATAGTGTACAACT
GCGTTAGTTATTTGGCCAAGTTTTTCCACTCTGGGAACTGGAAAAGCTGA
Protein sequence
MADIQPPDPQINGAPPPLVISSETVGSKRQRRPSVRLGDIGGDQPYDSYARRTNKPWKFALDNRKDSSAASAKNSKTRPLTNFSSPGAVESEDKENNLDSVAIGSWRLKD
SKRRGSAATKRARTNWVSSKHDEGGGGGEADEKYSAGEDVDEGYRDYDIENSESPLKEQSSMHSLENLAMEGQGNDREMLYQGNRRSIRSRVSEGREHQEGLEFSGPSDT
DARNWKCGTSGDRNGNGGEDGVRAWLNGLGLGRWVTEKNVLRNPEVREGIFMTFEFQPWQYSAFWVLLKIVYNCVSYLAKFFHSGNWKS
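
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that builds the standard genetic code table and translates codon by codon until the first stop codon. It assumes the standard nuclear code applies to this gene and that the sequence is read in frame from its first base; `translate` is a hypothetical helper written for this illustration, not part of CuGenDB.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGCCGATATTCAACCCCCCGACCCCCAG"))  # MADIQPPDPQ
print(translate("AACTGGAAAAGCTGA"))                 # NWKS
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the listed protein sequence gives a quick consistency check between the two records.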