; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005330 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005330
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSerine/arginine repetitive matrix protein 1-like
Genome locationChr07:1602223..1603098
RNA-Seq ExpressionHG10005330
SyntenyHG10005330
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031591.1 serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa]5.9e-12887.72Show/hide
Query:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE
        RALDRGQPH LDKN RSAKPPSPGWFDTDVEN+P N  PEEETVKEVLSETPIAKPCSV+QT+PK K PE KVK+S MDGSL KGEESIVSVSETSQVTE
Subjt:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE

Query:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH
        WC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA+KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +H
Subjt:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH

Query:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        EQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

KAE8645764.1 hypothetical protein Csa_020528 [Cucumis sativus]9.1e-12987.76Show/hide
Query:  SRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVT
        SRALDRGQPH LDKNHRSAKP SPGWFDTDVEN+P N  PEEETVKEVLSETPIAKPCSV QT+ K K PE +VK+SEMDGSLGKGEESIVSVSETSQVT
Subjt:  SRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVT

Query:  EWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGR
        EWC NMSESVSMATTISEQ+EGDEASSK SR+IGRN +PKIRRKRP SG+ SYRREQRDKC +KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNGG+
Subjt:  EWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGR

Query:  HEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        HEQQSGVSHGRRSRSPATRTV+ETNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  HEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

XP_008455342.1 PREDICTED: uncharacterized protein LOC103495529 isoform X2 [Cucumis melo]1.2e-12888.07Show/hide
Query:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE
        RALDRGQPH LDKNHRSAKPPSPGWFDTDVEN+P N  PEEETVKEVLSETPIAKPCSV+QT+ K K PE KVK+SEMDGSL KGEESIVSVSETSQVTE
Subjt:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE

Query:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH
        WC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA+KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +H
Subjt:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH

Query:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        EQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

XP_031744655.1 uncharacterized protein LOC105436079 [Cucumis sativus]4.7e-13387.97Show/hide
Query:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE
        MGCCCSRALDRGQPH LDKNHRSAKP SPGWFDTDVEN+P N  PEEETVKEVLSETPIAKPCSV QT+ K K PE +VK+SEMDGSLGKGEESIVSVSE
Subjt:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE

Query:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK
        TSQVTEWC NMSESVSMATTISEQ+EGDEASSK SR+IGRN +PKIRRKRP SG+ SYRREQRDKC +KRPAELLPEKKSRVNCRY+HGTTESREARTRK
Subjt:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK

Query:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        LNGG+HEQQSGVSHGRRSRSPATRTV+ETNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

XP_038887224.1 uncharacterized protein LOC120077414 [Benincasa hispida]1.7e-13890.38Show/hide
Query:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE
        MGCCCSRALDRGQPHILDK+HR AKPPSPGWFDTDV NIP NG PEEETVKEVLSETPIAKPC+VQQT PKNKS ELKVK+SEMDGS  K EESIVSVSE
Subjt:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE

Query:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK
        TSQVTEWC N+SES+SMATTISEQ+EGDEASSK SREIGRNT+PKIRRKRPYSGD SYRREQRDKCA+KRPAELLPEKKSRVNCRYTHGTTESREARTRK
Subjt:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK

Query:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSS MKMTGQ GDQ E +TTEK  +GKMEKPMDDAIQPPNESIENPLVSLECFIFL
Subjt:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0K701 Uncharacterized protein2.3e-13387.97Show/hide
Query:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE
        MGCCCSRALDRGQPH LDKNHRSAKP SPGWFDTDVEN+P N  PEEETVKEVLSETPIAKPCSV QT+ K K PE +VK+SEMDGSLGKGEESIVSVSE
Subjt:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE

Query:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK
        TSQVTEWC NMSESVSMATTISEQ+EGDEASSK SR+IGRN +PKIRRKRP SG+ SYRREQRDKC +KRPAELLPEKKSRVNCRY+HGTTESREARTRK
Subjt:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK

Query:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        LNGG+HEQQSGVSHGRRSRSPATRTV+ETNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

A0A1S3C0P3 uncharacterized protein LOC103495529 isoform X25.8e-12988.07Show/hide
Query:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE
        RALDRGQPH LDKNHRSAKPPSPGWFDTDVEN+P N  PEEETVKEVLSETPIAKPCSV+QT+ K K PE KVK+SEMDGSL KGEESIVSVSETSQVTE
Subjt:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE

Query:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH
        WC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA+KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +H
Subjt:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH

Query:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        EQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

A0A5D3C531 Serine/arginine repetitive matrix protein 1-like2.9e-12887.72Show/hide
Query:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE
        RALDRGQPH LDKN RSAKPPSPGWFDTDVEN+P N  PEEETVKEVLSETPIAKPCSV+QT+PK K PE KVK+S MDGSL KGEESIVSVSETSQVTE
Subjt:  RALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTE

Query:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH
        WC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA+KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +H
Subjt:  WCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRH

Query:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        EQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  EQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

A0A6J1GNG7 uncharacterized protein LOC1114555168.6e-12583.56Show/hide
Query:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE
        MGCCCSR L+ GQPHILDK+  SAK PSPG   TDV NIPG G PEEETVKEVLSETPIAKPC++QQT+PKNKS ELKVKSSEMDGSL K EE  +SVSE
Subjt:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE

Query:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK
         SQVTEWC NMSESVSMATTISEQ+EGDEASSKQSRE+GRN +PKIRRKRPYSGDPSYRREQRDKCA+KRPAELL EKKSRV CRYTHGTTESREARTRK
Subjt:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK

Query:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMD-DAIQPPNESIENPLVSLECFIFL
        LNGG+ EQ+SGV+HGRRSRSPATRTVRETNKTGNMKSS +K+TGQ G+Q EAVTTEKRD+GK++K MD  A QPPNESIENPLVSLECFIFL
Subjt:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMD-DAIQPPNESIENPLVSLECFIFL

A0A6J1HYF2 uncharacterized protein LOC1114680621.5e-12483.56Show/hide
Query:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE
        MGCCCSR L+ GQPHILDK+  SAK PSPG   TDV NIPG G PEEETVKEVLSETPIAKPC++QQT+PKNKS ELKVKSSEMDGSL K EE  +SVSE
Subjt:  MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSE

Query:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK
         SQVTEWC NMSESVSMATTISEQ+EGDEASSKQSRE+GRN +PKIRRKRPYSGDPSYRR+QRDKCA+KRPAELL EKKSRV CRYTHGTTESREARTRK
Subjt:  TSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRK

Query:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDD-AIQPPNESIENPLVSLECFIFL
        LNGG+ EQ+SGVSHGRRSRSPAT+TVRETNKTGNMKSS +K TGQ G+Q EAVTTEKRD+GK++K MD  AIQPPNESIENPLVSLECFIFL
Subjt:  LNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDD-AIQPPNESIENPLVSLECFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61170.1 unknown protein1.5e-0427.78Show/hide
Query:  MGCCC--SRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSE--MDGSLGKGEESI-
        MG CC  S   DR   ++ DKN             T VE        EE  VKEVLSET +  P S  +T+      + K++  E    G L    + + 
Subjt:  MGCCC--SRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSE--MDGSLGKGEESI-

Query:  -----VSVSETSQVTEWCG-NMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHG
             V   E S+V+E C  ++SESVS +T +    + +    KQ +   + +  K R +   +  P+ R +Q             P K++   C     
Subjt:  -----VSVSETSQVTEWCG-NMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHG

Query:  TTESREARTRKLNGGRH---EQQSGVSHGRRSRSPAT-RTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSL
                    NG R+    +  G   GRRSRSPAT R+V ++N++  +  +  +   Q+  +   V  +   +G  ++   +      E +ENPLVSL
Subjt:  TTESREARTRKLNGGRH---EQQSGVSHGRRSRSPAT-RTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSL

Query:  ECFIFL
        ECFIFL
Subjt:  ECFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGTTGTTGTAGCAGAGCCTTGGACAGAGGTCAGCCTCATATCCTGGACAAGAATCACCGTTCTGCAAAACCTCCATCACCGGGATGGTTCGACACTGATGTCGA
AAACATCCCAGGCAACGGAATGCCGGAAGAAGAAACCGTCAAAGAAGTCCTCTCAGAGACTCCCATTGCTAAGCCATGTAGCGTACAACAAACAGCCCCTAAGAACAAGT
CTCCCGAGCTGAAAGTAAAATCGAGTGAAATGGATGGTTCGCTTGGCAAGGGGGAAGAAAGTATAGTTTCAGTTTCGGAGACATCTCAGGTTACAGAATGGTGCGGCAAT
ATGAGCGAAAGTGTGTCAATGGCGACCACAATTTCGGAGCAAAAGGAAGGGGATGAAGCATCAAGTAAACAGAGCAGAGAAATTGGTCGAAATACAAGACCAAAGATTCG
CAGAAAGCGTCCATATTCCGGCGACCCATCGTACCGAAGAGAACAGAGAGACAAATGTGCATCAAAGAGACCTGCTGAACTTTTACCAGAGAAGAAGTCTCGTGTTAATT
GCAGGTACACACATGGAACGACAGAATCAAGAGAAGCGAGGACCAGGAAGCTGAATGGAGGGCGGCACGAGCAACAATCTGGAGTCAGCCATGGCCGCCGTTCGAGGTCA
CCAGCTACTCGAACAGTTAGAGAAACGAATAAGACAGGGAATATGAAAAGCAGTGTCATGAAAATGACCGGACAAACTGGGGACCAGCAAGAGGCAGTGACCACCGAGAA
AAGAGATGATGGAAAGATGGAGAAGCCAATGGATGATGCAATTCAGCCCCCTAATGAATCCATTGAAAACCCACTTGTCTCACTTGAATGTTTCATCTTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGTTGTTGTAGCAGAGCCTTGGACAGAGGTCAGCCTCATATCCTGGACAAGAATCACCGTTCTGCAAAACCTCCATCACCGGGATGGTTCGACACTGATGTCGA
AAACATCCCAGGCAACGGAATGCCGGAAGAAGAAACCGTCAAAGAAGTCCTCTCAGAGACTCCCATTGCTAAGCCATGTAGCGTACAACAAACAGCCCCTAAGAACAAGT
CTCCCGAGCTGAAAGTAAAATCGAGTGAAATGGATGGTTCGCTTGGCAAGGGGGAAGAAAGTATAGTTTCAGTTTCGGAGACATCTCAGGTTACAGAATGGTGCGGCAAT
ATGAGCGAAAGTGTGTCAATGGCGACCACAATTTCGGAGCAAAAGGAAGGGGATGAAGCATCAAGTAAACAGAGCAGAGAAATTGGTCGAAATACAAGACCAAAGATTCG
CAGAAAGCGTCCATATTCCGGCGACCCATCGTACCGAAGAGAACAGAGAGACAAATGTGCATCAAAGAGACCTGCTGAACTTTTACCAGAGAAGAAGTCTCGTGTTAATT
GCAGGTACACACATGGAACGACAGAATCAAGAGAAGCGAGGACCAGGAAGCTGAATGGAGGGCGGCACGAGCAACAATCTGGAGTCAGCCATGGCCGCCGTTCGAGGTCA
CCAGCTACTCGAACAGTTAGAGAAACGAATAAGACAGGGAATATGAAAAGCAGTGTCATGAAAATGACCGGACAAACTGGGGACCAGCAAGAGGCAGTGACCACCGAGAA
AAGAGATGATGGAAAGATGGAGAAGCCAATGGATGATGCAATTCAGCCCCCTAATGAATCCATTGAAAACCCACTTGTCTCACTTGAATGTTTCATCTTTCTGTAG
Protein sequenceShow/hide protein sequence
MGCCCSRALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETVKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGN
MSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRS
PATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL