; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG00G001050 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG00G001050
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionKelch repeat-containing protein
Genome locationCG_Chr00:3678480..3682930
RNA-Seq ExpressionClCG00G001050
SyntenyClCG00G001050
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652542.1 hypothetical protein Csa_013076 [Cucumis sativus]6.1e-10064.94Show/hide
Query:  LLSMDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSL
        LLSMDG+  DPFDSL KLC++S SQEDILR CSFAG P S+DAS F  SQ LH +P+ VS  SPPESLEQREQ+ S DA QPPPEQSG +RP+ VDDPS+
Subjt:  LLSMDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSL

Query:  QDTAAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKLLVDLEFGKESKLIDESVRF
        QD AAAV GGCV +V  GVDLGKN++LGF  EVQS +Q+  IEIIGV R KVSES  DGEAESA KRLKLSNEALG       D        LI++SV  
Subjt:  QDTAAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKLLVDLEFGKESKLIDESVRF

Query:  GEESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEE
        GEESGKIDDGK+SNGEETHCNKNK+   EKK  NSQPE P  Y G +NRD  R  +    NG  SK K DDATS G+SG+  SIIM+ILKILSQ E S+E
Subjt:  GEESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEE

Query:  DKKLGNMSFLEIALHRGMTFPWPCWWTE
        D+KL +M+ +E+A+ RGMTFP PCWW E
Subjt:  DKKLGNMSFLEIALHRGMTFPWPCWWTE

KAG6577658.1 hypothetical protein SDJN03_25232, partial [Cucurbita argyrosperma subsp. sororia]2.4e-1931.36Show/hide
Query:  IVSDSQEDILRRCSFAGDPKSIDA---SDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQ-----SGRQRPMAVDDPSLQDTAAAVEGGC
        +V  S  ++ RRC F G PKS D+   SD S S   H+  + + M S P+SLE+    +   +Y+  PE+     S  +R +A D    + +A  V+ G 
Subjt:  IVSDSQEDILRRCSFAGDPKSIDA---SDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQ-----SGRQRPMAVDDPSLQDTAAAVEGGC

Query:  VRDVARGVDLGKNNDLGFRFEVQSKQQSAE----IEIIGVCR--------RKVSESDAD-GEAESALKRLKLSNEALGT-------VEKLLVD-LEFGKE
             + +DLG++ D+G   EVQS  ++ E    IE+I  CR        R++   ++D     S+ K+L+LS EALG        V+K  VD    G  
Subjt:  VRDVARGVDLGKNNDLGFRFEVQSKQQSAE----IEIIGVCR--------RKVSESDAD-GEAESALKRLKLSNEALGT-------VEKLLVD-LEFGKE

Query:  SKLIDESVRFGEESG------KIDDGKISNGE-----ETHC---NKNKEKFAEKKTGNSQPEEPSYLG---ALNRDPWRHVVPLTTNGSNSKKTDDATSP
        SK+   SV   ++SG       ++DGK+SNG+     E HC    ++ EK  ++   NSQ ++ S L    A        V+P + +G     + + ++ 
Subjt:  SKLIDESVRFGEESG------KIDDGKISNGE-----ETHC---NKNKEKFAEKKTGNSQPEEPSYLG---ALNRDPWRHVVPLTTNGSNSKKTDDATSP

Query:  GDSGTPTSIIMKILKILSQEESEE----DKKLGNMSFLEIALHRGMTFPWPCWW
         +       +++ILKIL+ E+  E    D+ L N+S L+I   RGMTFP P WW
Subjt:  GDSGTPTSIIMKILKILSQEESEE----DKKLGNMSFLEIALHRGMTFPWPCWW

KAG6584353.1 hypothetical protein SDJN03_20285, partial [Cucurbita argyrosperma subsp. sororia]1.1e-9666.04Show/hide
Query:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILH-LHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQD
        MDGNGDDPFDSLTK C++SDSQEDILRRCSFAG+P+S+D S  + SQI H L P+ V M S PESLEQREQIT  +AYQ PPEQSG QRPMA DDPSLQD
Subjt:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILH-LHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQD

Query:  TA-------AAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGT-------VEKLLVDLEFG
         A       A VEGGCVRDVA  VDLGKN+DLGFR EVQS QQ+ EIE++GV RRKVSESDA GE ESA KRL  SNEALGT       VE L VDLE G
Subjt:  TA-------AAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGT-------VEKLLVDLEFG

Query:  KESKLIDESVRFGE-------ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGS-NSKKTDDATSPGDSGTP
          SKL+D +V   E       ESGK+DDGK+SNGEETHCNKNKEKFAEK+  NSQPEEP +Y G + RD WR V+P T NGS NS++TD         TP
Subjt:  KESKLIDESVRFGE-------ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGS-NSKKTDDATSPGDSGTP

Query:  TSIIMKILKILSQEESEEDKK
         SIIM+ILK+L++EE EEDK+
Subjt:  TSIIMKILKILSQEESEEDKK

KAG7019938.1 hypothetical protein SDJN02_18905 [Cucurbita argyrosperma subsp. argyrosperma]8.2e-11365.73Show/hide
Query:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILH-LHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQD
        MDGNGDDPFDSLTK C++SDSQEDILRRCSFAG+P+S+D S  + SQI H L P+ V M S PESLEQREQITS +AYQ PPEQSG QRPMA DDPSLQD
Subjt:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILH-LHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQD

Query:  TA-------AAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGT-------VEKLLVDLEFG
         A       A VEGGCVRDVA  VDLGKN+DLGFR EVQS Q + EIE++GV RRKVSESDA GE ESA KRL  SNEALGT       VE L VDLE G
Subjt:  TA-------AAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGT-------VEKLLVDLEFG

Query:  KESKLIDESVRFGE-------ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGS-NSKKTDDATSPGDSGTP
          SKL+D +V   E       ESGK+DDGK+SNGEETHCNKNKEKFAEK+  NSQPEEP +Y G + RD WR V+P T NGS NS++TD         TP
Subjt:  KESKLIDESVRFGE-------ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGS-NSKKTDDATSPGDSGTP

Query:  TSIIMKILKILSQEESEEDKKLGNMSFLEIALHRGMTFPWPCWWTEGEDFSSKKNS
         SIIM+ILK+L++EE EEDK+  +MS LE+  HRGMTFP PCWW EG++FS KK S
Subjt:  TSIIMKILKILSQEESEEDKKLGNMSFLEIALHRGMTFPWPCWWTEGEDFSSKKNS

TYJ98997.1 hypothetical protein E5676_scaffold248G002010 [Cucumis melo var. makuwa]3.2e-10967.66Show/hide
Query:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQDT
        MDG+G DPFDSL KLC++S SQEDILR  SFAG PKS+DAS F ASQ L  HP+ VS  SPPESLEQREQ+ + DAY+PPPEQSG +RP+AVDDPS+QD 
Subjt:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQDT

Query:  AAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKL-LVDLEFGKESKLIDESVRFGE
        AAAV GGCV +V  GVDLGKN++LGF  EVQS QQ+ EIEIIGV R KVSES  DGEAESA KRLKLSNEALG    + LV LE G+ES LI++SV  GE
Subjt:  AAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKL-LVDLEFGKESKLIDESVRFGE

Query:  ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEEDK
        ESGKIDDGK+SNGEETHCNKNKEK  EKK  NS PEEP  Y G LN DPWR  +  T NG  SK K DDA+S G+SG   SIIM+ILKI+SQ E S+ED+
Subjt:  ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEEDK

Query:  KLGNMSFLEIALHRGMTFPWPCWWTE--GEDFSSKKN
        KL NM  +E+A+ RGMTFP PCWW E  G +   KKN
Subjt:  KLGNMSFLEIALHRGMTFPWPCWWTE--GEDFSSKKN

TrEMBL top hitse value%identityAlignment
A0A0A0L1F4 Uncharacterized protein7.7e-1631.04Show/hide
Query:  IVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQ-----SGRQRPMAVDDPSLQDTAAAVEGGCVRD
        IV+ S  ++LRRC F G+  S   SD S S+   +  + + M S P+S+E+++  +   ++Q  PE+     S  +R +A D   ++ +   V+ G    
Subjt:  IVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQ-----SGRQRPMAVDDPSLQDTAAAVEGGCVRD

Query:  VARGVDLGKNNDLGFRFEVQSKQQSAE----IEIIGVCRRKVSESDADG---------EAESALKRLKLSNEALGTVEKLLVDLEFGKESKLIDESVRFG
          + VDLG++ D+G   EVQS  ++ E    I +IGVC    SE    G           ES+ K+L+LS EALG        L  G      D S ++ 
Subjt:  VARGVDLGKNNDLGFRFEVQSKQQSAE----IEIIGVCRRKVSESDADG---------EAESALKRLKLSNEALGTVEKLLVDLEFGKESKLIDESVRFG

Query:  EESGKIDDGKISNGE--ETHCNKNKEKFAEKKTGNSQPEEPSYLG-------ALNRDPWRHVVPLTTNGSNSKKTDDATSPGDSGTPTSIIMKILKILSQ
         +    DDGKISN E  E  CN  KEK AE +   S     S+         A        V+P + +G  +   ++ ++  +       ++KIL IL  
Subjt:  EESGKIDDGKISNGE--ETHCNKNKEKFAEKKTGNSQPEEPSYLG-------ALNRDPWRHVVPLTTNGSNSKKTDDATSPGDSGTPTSIIMKILKILSQ

Query:  EESE----EDKKLGNMSFLEIALHRGMTFPWPCWW
         +      +D+ L  +S LEIA  RGMTFP P WW
Subjt:  EESE----EDKKLGNMSFLEIALHRGMTFPWPCWW

A0A0A0LT00 Uncharacterized protein9.5e-9964.62Show/hide
Query:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQDT
        MDG+  DPFDSL KLC++S SQEDILR CSFAG P S+DAS F  SQ LH +P+ VS  SPPESLEQREQ+ S DA QPPPEQSG +RP+ VDDPS+QD 
Subjt:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQDT

Query:  AAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKLLVDLEFGKESKLIDESVRFGEE
        AAAV GGCV +V  GVDLGKN++LGF  EVQS +Q+  IEIIGV R KVSES  DGEAESA KRLKLSNEALG       D        LI++SV  GEE
Subjt:  AAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKLLVDLEFGKESKLIDESVRFGEE

Query:  SGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEEDKK
        SGKIDDGK+SNGEETHCNKNK+   EKK  NSQPE P  Y G +NRD  R  +    NG  SK K DDATS G+SG+  SIIM+ILKILSQ E S+ED+K
Subjt:  SGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEEDKK

Query:  LGNMSFLEIALHRGMTFPWPCWWTE
        L +M+ +E+A+ RGMTFP PCWW E
Subjt:  LGNMSFLEIALHRGMTFPWPCWWTE

A0A2N9G3N6 Uncharacterized protein4.6e-2130.99Show/hide
Query:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSI------DASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQS------GRQR
        +D   +DPF S+T+LC VS SQE+ LR CSFA +          D+SD + S  + +  + + M SPPES E+++     D +Q P E S       +Q+
Subjt:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSI------DASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQS------GRQR

Query:  PMAVDDPSLQDTAAAVEGGCVRDVARGVDLGKNNDLGF-----RFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLK-----LSNEALGTVEKL
        PMAVD P   D  A    GC  D +  VDLGK++DLGF        V  +    E E +G+ RR+ S  ++   AES  K+LK     L++EA       
Subjt:  PMAVDDPSLQDTAAAVEGGCVRDVARGVDLGKNNDLGF-----RFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLK-----LSNEALGTVEKL

Query:  LVDL--EFGKESKLI--------DESVRF-GEESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEPSYLGALNRDPWRHVVPLTTNGSNSKKTDDA
        L D   E  + +++I        DE ++  GE S K          E   ++++E + E + G    E+       N      V+P++ +     + +  
Subjt:  LVDL--EFGKESKLI--------DESVRF-GEESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEPSYLGALNRDPWRHVVPLTTNGSNSKKTDDA

Query:  TSPGDSGTPTSIIMKILKILSQEESEEDKKLGNMSFLEIALHRGMTFPWPCWWTE
         + G        I+ +LK L     EED  L ++S  ++   +GMTFP PCWW E
Subjt:  TSPGDSGTPTSIIMKILKILSQEESEEDKKLGNMSFLEIALHRGMTFPWPCWWTE

A0A5D3BIQ1 Uncharacterized protein1.6e-10967.66Show/hide
Query:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQDT
        MDG+G DPFDSL KLC++S SQEDILR  SFAG PKS+DAS F ASQ L  HP+ VS  SPPESLEQREQ+ + DAY+PPPEQSG +RP+AVDDPS+QD 
Subjt:  MDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQDT

Query:  AAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKL-LVDLEFGKESKLIDESVRFGE
        AAAV GGCV +V  GVDLGKN++LGF  EVQS QQ+ EIEIIGV R KVSES  DGEAESA KRLKLSNEALG    + LV LE G+ES LI++SV  GE
Subjt:  AAAVEGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKL-LVDLEFGKESKLIDESVRFGE

Query:  ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEEDK
        ESGKIDDGK+SNGEETHCNKNKEK  EKK  NS PEEP  Y G LN DPWR  +  T NG  SK K DDA+S G+SG   SIIM+ILKI+SQ E S+ED+
Subjt:  ESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEP-SYLGALNRDPWRHVVPLTTNGSNSK-KTDDATSPGDSGTPTSIIMKILKILSQ-EESEEDK

Query:  KLGNMSFLEIALHRGMTFPWPCWWTE--GEDFSSKKN
        KL NM  +E+A+ RGMTFP PCWW E  G +   KKN
Subjt:  KLGNMSFLEIALHRGMTFPWPCWWTE--GEDFSSKKN

A0A7N2REC3 Uncharacterized protein1.0e-1530.23Show/hide
Query:  DDPFDSLTKLCIVSDSQEDILRRCSFA-------GDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQ-----ITSRDAYQPPPEQS------GRQ
        DDPF S+T+LC +S SQE+ LR C FA           + D+ D +    + +  + + M S PE+ ++ E+      T +D +Q P E S       +Q
Subjt:  DDPFDSLTKLCIVSDSQEDILRRCSFA-------GDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQ-----ITSRDAYQPPPEQS------GRQ

Query:  RPMAVDDPSLQDTAAAVEGGCVRDVARGVDLGKNNDLGFRFEVQ----SKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVE---KLLV
         P+A DDP++ D   A E  C  D    VDLGK++DLGF    +    SK   +E ++        +  D +   E A++  K   EA  ++E   K  +
Subjt:  RPMAVDDPSLQDTAAAVEGGCVRDVARGVDLGKNNDLGFRFEVQ----SKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVE---KLLV

Query:  DLEFGKESKLIDESVRFGEESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEPSYLGALNRDPWRHVVPLTTNGSNSKKTDDATSPGDSGTPTSII
        +  +G ++ ++D      EE    DD +   GEE    KN+EKF EK  G+ +                 V+P +     S+      + G  G   + I
Subjt:  DLEFGKESKLIDESVRFGEESGKIDDGKISNGEETHCNKNKEKFAEKKTGNSQPEEPSYLGALNRDPWRHVVPLTTNGSNSKKTDDATSPGDSGTPTSII

Query:  MKILKILSQEESEEDKK--LGNMSFLEIALHRGMTFPWPCWWTE
        + +LK+L Q   +E+K   L ++S  EI + RG+TFP PCWW E
Subjt:  MKILKILSQEESEEDKK--LGNMSFLEIALHRGMTFPWPCWWTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAACCTTCTTTCGATGGATGGTAATGGCGACGATCCCTTCGATTCACTCACCAAACTCTGCATCGTTTCTGACTCTCAAGAAGATATCTTACGTCGCTGCTCTTT
CGCCGGCGACCCTAAATCCATCGATGCCTCTGATTTTTCCGCCTCCCAAATCCTTCACCTTCATCCCTCTCATGTTTCCATGCCTTCTCCTCCAGAGAGTTTGGAACAGA
GAGAACAAATTACCTCTCGTGATGCCTACCAGCCTCCACCGGAACAGTCCGGCAGACAGCGGCCCATGGCTGTAGATGATCCTTCTCTTCAAGATACGGCGGCAGCCGTT
GAAGGCGGATGTGTCCGCGATGTTGCTAGGGGAGTCGATTTGGGGAAGAATAATGACCTAGGGTTTCGTTTTGAAGTTCAATCGAAACAACAGTCGGCTGAAATCGAAAT
AATCGGCGTTTGTAGAAGAAAGGTTTCAGAGTCTGATGCTGATGGAGAGGCTGAATCTGCATTGAAGAGGTTGAAATTGTCGAATGAAGCTTTGGGCACAGTAGAAAAAT
TACTCGTCGATCTCGAATTCGGTAAAGAATCGAAACTGATTGATGAATCCGTACGATTCGGTGAAGAATCCGGCAAAATAGATGATGGAAAGATCTCCAATGGCGAAGAA
ACTCACTGTAACAAGAACAAAGAAAAATTCGCAGAGAAGAAAACCGGAAACTCCCAACCTGAAGAACCGAGCTACTTAGGAGCATTGAATCGCGATCCATGGAGGCACGT
TGTGCCATTAACAACGAATGGATCAAATTCGAAGAAGACAGATGATGCAACTTCCCCTGGTGATTCTGGTACGCCCACATCCATTATTATGAAAATCTTAAAGATTCTCT
CACAAGAAGAAAGTGAGGAAGACAAGAAATTAGGCAATATGAGCTTTTTGGAAATCGCATTGCATCGTGGAATGACATTTCCTTGGCCGTGTTGGTGGACGGAGGGGGAG
GATTTCAGCTCCAAGAAGAACAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAACCTTCTTTCGATGGATGGTAATGGCGACGATCCCTTCGATTCACTCACCAAACTCTGCATCGTTTCTGACTCTCAAGAAGATATCTTACGTCGCTGCTCTTT
CGCCGGCGACCCTAAATCCATCGATGCCTCTGATTTTTCCGCCTCCCAAATCCTTCACCTTCATCCCTCTCATGTTTCCATGCCTTCTCCTCCAGAGAGTTTGGAACAGA
GAGAACAAATTACCTCTCGTGATGCCTACCAGCCTCCACCGGAACAGTCCGGCAGACAGCGGCCCATGGCTGTAGATGATCCTTCTCTTCAAGATACGGCGGCAGCCGTT
GAAGGCGGATGTGTCCGCGATGTTGCTAGGGGAGTCGATTTGGGGAAGAATAATGACCTAGGGTTTCGTTTTGAAGTTCAATCGAAACAACAGTCGGCTGAAATCGAAAT
AATCGGCGTTTGTAGAAGAAAGGTTTCAGAGTCTGATGCTGATGGAGAGGCTGAATCTGCATTGAAGAGGTTGAAATTGTCGAATGAAGCTTTGGGCACAGTAGAAAAAT
TACTCGTCGATCTCGAATTCGGTAAAGAATCGAAACTGATTGATGAATCCGTACGATTCGGTGAAGAATCCGGCAAAATAGATGATGGAAAGATCTCCAATGGCGAAGAA
ACTCACTGTAACAAGAACAAAGAAAAATTCGCAGAGAAGAAAACCGGAAACTCCCAACCTGAAGAACCGAGCTACTTAGGAGCATTGAATCGCGATCCATGGAGGCACGT
TGTGCCATTAACAACGAATGGATCAAATTCGAAGAAGACAGATGATGCAACTTCCCCTGGTGATTCTGGTACGCCCACATCCATTATTATGAAAATCTTAAAGATTCTCT
CACAAGAAGAAAGTGAGGAAGACAAGAAATTAGGCAATATGAGCTTTTTGGAAATCGCATTGCATCGTGGAATGACATTTCCTTGGCCGTGTTGGTGGACGGAGGGGGAG
GATTTCAGCTCCAAGAAGAACAGTTGA
Protein sequenceShow/hide protein sequence
MPNLLSMDGNGDDPFDSLTKLCIVSDSQEDILRRCSFAGDPKSIDASDFSASQILHLHPSHVSMPSPPESLEQREQITSRDAYQPPPEQSGRQRPMAVDDPSLQDTAAAV
EGGCVRDVARGVDLGKNNDLGFRFEVQSKQQSAEIEIIGVCRRKVSESDADGEAESALKRLKLSNEALGTVEKLLVDLEFGKESKLIDESVRFGEESGKIDDGKISNGEE
THCNKNKEKFAEKKTGNSQPEEPSYLGALNRDPWRHVVPLTTNGSNSKKTDDATSPGDSGTPTSIIMKILKILSQEESEEDKKLGNMSFLEIALHRGMTFPWPCWWTEGE
DFSSKKNS