; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002622 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002622
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold6:405717..413614
RNA-Seq ExpressionSpg002622
SyntenySpg002622
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039949.1 hypothetical protein E6C27_scaffold122G002290 [Cucumis melo var. makuwa]2.5e-2728.92Show/hide
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMIR------DFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSLVSLTDSIRNVK----GI
        VP GL+K GW +F +M+         +F    Y NQ       KE+    +D  + S+    +YVE V          SS+S  S T S  ++K     +
Subjt:  VPAGLNKKGWYVFWEMIR------DFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSLVSLTDSIRNVK----GI

Query:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQ
          E   +RK     +ID E+SI++SR   H  W  +   L++      S     PF  DKAL+   D  L      N  W   G  ++K E WS   H+ 
Subjt:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQ

Query:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV
         K I SYGGW   R +PL++W+ ++F  IG+  GG +  +  ++N L+ +EA I+V++N+ GF+PA I +
Subjt:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]2.9e-2828.81Show/hide
Query:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII
        SC I+ + F +    W R+  +   E    K   I ++   L+W +     +L  P  + FF EK  EE   + + K ++   +  E       G +  I
Subjt:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII

Query:  QVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVL
         VP G  K GW  F  ++     K         R++L+ +         S       SY E V +G SS +   S T++I+             K+    
Subjt:  QVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVL

Query:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLP
          D ER+ V++R   H  W+ +   L     ++V   PF  DKALI+    +       N  W   G  ++K E WS K H+ PK I SYGGW+ +R +P
Subjt:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLP

Query:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF
        L+ W+ +SF  IG   GG V ++  T  L D  EA I+++ N+ GFIPA I +    +H F
Subjt:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF

KAA0041398.1 hypothetical protein E6C27_scaffold206G00440 [Cucumis melo var. makuwa]2.2e-2826.98Show/hide
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMI--------RDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSLVSLTDSIRNVKG-INEEAYWV
        VP GL+K GW +F +M+        ++   + Y  +++       KE+    +D  + S++   +Y EVV    SS+S  S   +  ++K  +      +
Subjt:  VPAGLNKKGWYVFWEMI--------RDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSLVSLTDSIRNVKG-INEEAYWV

Query:  RKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSY
        RK     +ID E++I++SR   H  W  +   L    +  +S     PF  DKAL+   D  L      N  W   G  ++K E WS  +H+  K I SY
Subjt:  RKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSY

Query:  GGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDA
        GGW   R +PL++W+ ++F  IG+  GG +  +  ++N ++ +EA I+V++N+ GF+PA I +     H F ++   + +   L  RN +   S KK  A
Subjt:  GGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDA

Query:  NDFS
         +F+
Subjt:  NDFS

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]8.5e-2828.74Show/hide
Query:  IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIR
        I ++   L+W +     +L  P  + FF EK  E++  + + K ++   +  E       G +  I VP G  K GW  F  ++     K  S      R
Subjt:  IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIR

Query:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF
        +LL+ +    + D+ S+S++  +    SY E V +G SS +   S T +I+     N               + ER++V++R   H  W+ +   L    
Subjt:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF

Query:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL
         ++V   PF  DKALI     +       N  W   G  ++K E W+ K H+ PK I SYGGW+ +R +PL+ W+ +SF  IG   GG + ++  T  L 
Subjt:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL

Query:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR
        D  EA I ++ N+ GFIPA I +    +H F ++
Subjt:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]3.2e-2744.37Show/hide
Query:  NGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKH
        +GKW+KFG+ HLK E W++ IH +P  ++ YGGW++I+NLPL+ W + +FEAIGK  GGL SI+   LNL+   +A I+V++N CGF+PA I V    + 
Subjt:  NGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKH

Query:  EFSLRYGDINSLENRNLNFDSRKKLDANDFSNSLDLIRVRQV
           L +GDI++    N     +  L  +DF+N +DLIR+ +V
Subjt:  EFSLRYGDINSLENRNLNFDSRKKLDANDFSNSLDLIRVRQV

TrEMBL top hitse value%identityAlignment
A0A5A7TEP0 DUF4283 domain-containing protein1.1e-2826.98Show/hide
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMI--------RDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSLVSLTDSIRNVKG-INEEAYWV
        VP GL+K GW +F +M+        ++   + Y  +++       KE+    +D  + S++   +Y EVV    SS+S  S   +  ++K  +      +
Subjt:  VPAGLNKKGWYVFWEMI--------RDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSLVSLTDSIRNVKG-INEEAYWV

Query:  RKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSY
        RK     +ID E++I++SR   H  W  +   L    +  +S     PF  DKAL+   D  L      N  W   G  ++K E WS  +H+  K I SY
Subjt:  RKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSY

Query:  GGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDA
        GGW   R +PL++W+ ++F  IG+  GG +  +  ++N ++ +EA I+V++N+ GF+PA I +     H F ++   + +   L  RN +   S KK  A
Subjt:  GGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDA

Query:  NDFS
         +F+
Subjt:  NDFS

A0A5A7TFK7 DUF4283 domain-containing protein1.4e-2828.81Show/hide
Query:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII
        SC I+ + F +    W R+  +   E    K   I ++   L+W +     +L  P  + FF EK  EE   + + K ++   +  E       G +  I
Subjt:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII

Query:  QVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVL
         VP G  K GW  F  ++     K         R++L+ +         S       SY E V +G SS +   S T++I+             K+    
Subjt:  QVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVL

Query:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLP
          D ER+ V++R   H  W+ +   L     ++V   PF  DKALI+    +       N  W   G  ++K E WS K H+ PK I SYGGW+ +R +P
Subjt:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLP

Query:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF
        L+ W+ +SF  IG   GG V ++  T  L D  EA I+++ N+ GFIPA I +    +H F
Subjt:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF

A0A5A7U495 DUF4283 domain-containing protein3.5e-2728.18Show/hide
Query:  IPLSFSFLQWFEKVLVEILQN-PVSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIR
        I ++   L+W +     +L     + FF EK  E++  + + K ++   +  E       G +  I VP G  K GW  F  ++     K  S +    R
Subjt:  IPLSFSFLQWFEKVLVEILQN-PVSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIR

Query:  SLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFFKSSV
        +LL+ +         S       SY E V +G SS +   S T +I+     N               + ER++V++R   H  W+ +   L     ++V
Subjt:  SLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFFKSSV

Query:  LVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE
           PF  DKALI     +       N  W   G  ++K E W+ K H+ PK I SYGGW+ +R +PL+ W+ +SF  IG   GG + ++  T  L D  E
Subjt:  LVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE

Query:  AFIEVEKNFCGFIPADINVKIGNKHEFSLR
        A I ++ N+ GFIPA I +    +H F ++
Subjt:  AFIEVEKNFCGFIPADINVKIGNKHEFSLR

A0A5D3CFS8 DUF4283 domain-containing protein4.1e-2828.74Show/hide
Query:  IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIR
        I ++   L+W +     +L  P  + FF EK  E++  + + K ++   +  E       G +  I VP G  K GW  F  ++     K  S      R
Subjt:  IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFYSYENQPIR

Query:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF
        +LL+ +    + D+ S+S++  +    SY E V +G SS +   S T +I+     N               + ER++V++R   H  W+ +   L    
Subjt:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSLVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF

Query:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL
         ++V   PF  DKALI     +       N  W   G  ++K E W+ K H+ PK I SYGGW+ +R +PL+ W+ +SF  IG   GG + ++  T  L 
Subjt:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL

Query:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR
        D  EA I ++ N+ GFIPA I +    +H F ++
Subjt:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR

A0A5D3DLT1 DUF4283 domain-containing protein1.2e-2728.92Show/hide
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-IPLSFSFLQWFEKVLVEILQNP-VSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMIR------DFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSLVSLTDSIRNVK----GI
        VP GL+K GW +F +M+         +F    Y NQ       KE+    +D  + S+    +YVE V          SS+S  S T S  ++K     +
Subjt:  VPAGLNKKGWYVFWEMIR------DFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSLVSLTDSIRNVK----GI

Query:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQ
          E   +RK     +ID E+SI++SR   H  W  +   L++      S     PF  DKAL+   D  L      N  W   G  ++K E WS   H+ 
Subjt:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSKIHSQ

Query:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV
         K I SYGGW   R +PL++W+ ++F  IG+  GG +  +  ++N L+ +EA I+V++N+ GF+PA I +
Subjt:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTCTTTTTGTATTTGGAGAGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCCATTGTCCTT
TTCCTTTTTACAGTGGTTTGAAAAAGTGTTAGTTGAGATTTTGCAAAATCCTGTTTCTTCTTTCTTTCATGAAAAAATCATGGAAGAATTTGGAGTCATTAGGTTGATTA
AGTTCTTTTCGGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCAGGCTTGAATAAGAAAGGATGGTATGTT
TTTTGGGAAATGATTAGGGATTTCATTTTTAAATTTTATTCTTATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGTGTCTTCCGGTTTTTGATAAAGTTTC
AGCAAGTCAAGCCTTTCCCAATTCATATGTTGAGGTGGTAAAGCGAGGTGGTTCTTCAAAAAGTTTAGTTTCTTTGACCGATTCAATAAGAAATGTCAAGGGTATTAATG
AAGAAGCTTACTGGGTTCGCAAGAACTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGGATGTCAAGATT
GCCCTTGAGAATTTCTTTAAATCTTCTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGGAATTTTCTGCAAATGGCAAGTG
GAAGAAATTTGGAAACTTACATTTGAAATTGGAATTTTGGTCCTCTAAAATTCATTCACAGCCGAAGTCTATAAAAAGCTATGGAGGCTGGCTTGCAATTAGAAATCTTC
CATTAAATTTATGGCATCGTGACTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGCTCTGAAGCCTTC
ATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGCATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAA
CAGAAATTTGAATTTTGATTCAAGAAAAAAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAAGTGATTTTGGATGAAGAATCTGAGATTG
TTAATAATGAGGATAGGATGAGTGAGCTGGCAACTATCTCTCGGTATCAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGAATGTCTCGGCACAAGATAAATGTATT
AAGTGCAGTGGCTGTATTATTCCTTCAACCAAGTTGATTAATGATGATAGCAGTTTTTTGAATAATGAAGATTTGAATGGGGGTTTGGTTCTTTCAAAGGATGCATCGGT
GCAAGATGTAGGTTGTAAAGGTTTTAATGCCAGCATTAATGAGCCGACATTAGCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTTCAGTCCTCAGGAAGCCC
AACAGTTTCAGGTTTTTGAACTTCCTTCTAAGAATGATAATGTCGTTAAGGGTATTTCAGTTATTAATGATGCATTAATTGATGAAGCTTTGCATGAGTCCCAGGATGTA
TTATTAACGCCTATTCATGACCCAACTTCAGGTTTGAAGAGTAATAATGCTGCTGTTTTGGAAGAAAATGAATCGATTGTTTCTTCAAAGGCATTAAAGAAACAATATGA
ATCTTTTCCTCTATATTATTCTCGAAGGAAATATGAAAAGTCAGCAATTTTGGACTCAATTCCCATTAATTCCAATTTCAACCCTGACGTTATTGAAGAATCTTGTTCTC
AATTTTTGCTCCCTGCTTTGAATCAGCCTAGGTGTTGTCAATCCGATCTTAATGAGTTATCAAATTCCACATCCTCCAATAAGTACATTCTTTCAAATATTCAATCTGAC
CCTTCTTTATCAAAGGGAGTTTTTCTTCCTTCATCCAAAGGTGAAAACAAAGTTGATCACTCATATTTATCTCCTATTGATTCCGATGATGATTCAGTGGTGAGTATTAG
TAGTGTTGAGGCTGAAAGTCAACAATTGAATGATGAAAACAACGAATTGGAGGAAGACTCTTTTGCATTGGCTTTTAATCGGATTTTCCAGAACAATGAAGCTGTTTCTG
AACTATTGGAAAGAACCTTGGAGGGTTGGGTTAGACTGGTGATTTTGGATGAAGAATCGGATATTGTTAATGAAGGGGGATGGATGAGTGAGCTGCCATTTAATTCTTGT
TATCAGGAGCAAGTGAATGGGGAGTTGGTTATCTCAAAGGACGCCTCGGTGCATGAAGAAAGAAGTTATTGCGTTGGCTGTAATGATCCTCCACCCAAGATAATTAATGA
TGATAGCTATAATTTGATTAGTGATTTACAGCAGATTTTTAGTGAGAAAGATCAATTTAATGAGGTGTTGGGCTCTCCAAAGGTGCTTCATTGCATGATAAGGGATCTTC
AAATATCAGTGCTGGAAATGGGTTGTTTCAGGCCAAGTGGAGATGATAGCGGACAAGTTCAGCTCTTCAGATTGATAAGAAATCCTTGGAGGATAACTACTAGAATATCT
TTTGTTTACACTGGCCAAATAGCCCATGGATCTTCAGTTTCAGCAACTATGGAAATGGTGGAAGGCAGGTTGAAGAGTTTAGAAACGATTTCTTCCATCTACTTGGGTTG
CCAGAAAATTGTCGATGCTACTGTAGCTAGCGCCGCCATCCACCTGTCGTCCGTCCAGTCAGTTCGACGCCTTCTATTCGTCGTTCGCCTGCCATTCTTCGTTGGTCGTT
TCGTTGTCCAGTCCAGTCTATTAGCCGCTCGGCCGACAGCTGCTCCTTATTTGTGGGTCCCTTTGGTTTGTGGTTGGGTTCTGGTTTATGGTTCAATGGAAGTGCTGGAA
GAGATGAGTTGTTTTTGGCTTGAAAACTCATTGGTTGAACTTTTGCAATATCCTGTTCACTCATTTTTTAACAAGAAATTTAAGGATGATTTTGGGACCATTCGTCTATC
TAAATTTAGATCTGTCGAGGGTCAGTATTTGGAATGTGTCGTATGGCCTTTTTATGGAGGAAGGAAGAGTGTTCATGTTCCAACTAGTTTTGAAAAGAAAGGTTGGTTTG
TTTTTTGGGAAATGGTAAGAGATTTTTTCCTTAAATATGGAGAACCAGAGTATTCTGAGTTGGATTTATCCGCTCAAATTGAGAAAGTTGATACCGCTCAGGAAGATGTG
GTGAATCCCTCTGAGATTTCTTCCTTTTGGGTGAGGAAAGAGAGAGAAGTAATGGATTTGAATTTCAATTCCATTTTGGTGGTTTCTAGGTTGTTTGCTCATTACTCCTG
GAAAGATGTCCAAAGCGCTTTAGAAAATCATTTTCGGACTAGAATTTCAATGAATCCTTTTATGGATGATAAAGCGTTGATCAAGTTTGATGAGGGTCTTTCTGATTTAA
ATTTGGATGGAAAGTGGTGTCTCGTTGGTGACCTTCATCTGAAAATTGAGCGTTGGTCCGACGAAAAGCATTCTCACCCGGAATTCACCAAGAGCTATGGAGGTCGGATA
TCTATAAAGAATTTACCTTTGATTTATTGGAAACGACCATGTTTTGAAGCTATTGGGCAGCATTTTGGTGGATTGATTAGTATTTCCTCTCAAACACATAACTTAATTGA
TTGTTCAGTAGCAGTAATTGAAGTGGAGGAAAACATTTGTGGTTTCATTCCAGCTGAGATTGATGTTAAAGGTGGGAATATCAGAAATTTTTCTCTCCGTTTTGGTGACA
TAGTACTGTTGAGTATCCAGACCCCAAACCTGCAAGATAAACGAAGAACAAGCAAAACACACTCGACGGGGCAAAGAGAACACTGTAATATTGAGGAAACAGTGATAAAC
AACCCTATCACAACAGAACAACGATTACAGATAAAGATAATAACCAACACAACTCCCAGGGAATTCGAGAGCCCTGGTAACCTCTCCACAGAAAGTTCTCTCACTTTCTA
A
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTCTTTTTGTATTTGGAGAGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCCATTGTCCTT
TTCCTTTTTACAGTGGTTTGAAAAAGTGTTAGTTGAGATTTTGCAAAATCCTGTTTCTTCTTTCTTTCATGAAAAAATCATGGAAGAATTTGGAGTCATTAGGTTGATTA
AGTTCTTTTCGGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCAGGCTTGAATAAGAAAGGATGGTATGTT
TTTTGGGAAATGATTAGGGATTTCATTTTTAAATTTTATTCTTATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGTGTCTTCCGGTTTTTGATAAAGTTTC
AGCAAGTCAAGCCTTTCCCAATTCATATGTTGAGGTGGTAAAGCGAGGTGGTTCTTCAAAAAGTTTAGTTTCTTTGACCGATTCAATAAGAAATGTCAAGGGTATTAATG
AAGAAGCTTACTGGGTTCGCAAGAACTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGGATGTCAAGATT
GCCCTTGAGAATTTCTTTAAATCTTCTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGGAATTTTCTGCAAATGGCAAGTG
GAAGAAATTTGGAAACTTACATTTGAAATTGGAATTTTGGTCCTCTAAAATTCATTCACAGCCGAAGTCTATAAAAAGCTATGGAGGCTGGCTTGCAATTAGAAATCTTC
CATTAAATTTATGGCATCGTGACTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGCTCTGAAGCCTTC
ATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGCATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAA
CAGAAATTTGAATTTTGATTCAAGAAAAAAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAAGTGATTTTGGATGAAGAATCTGAGATTG
TTAATAATGAGGATAGGATGAGTGAGCTGGCAACTATCTCTCGGTATCAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGAATGTCTCGGCACAAGATAAATGTATT
AAGTGCAGTGGCTGTATTATTCCTTCAACCAAGTTGATTAATGATGATAGCAGTTTTTTGAATAATGAAGATTTGAATGGGGGTTTGGTTCTTTCAAAGGATGCATCGGT
GCAAGATGTAGGTTGTAAAGGTTTTAATGCCAGCATTAATGAGCCGACATTAGCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTTCAGTCCTCAGGAAGCCC
AACAGTTTCAGGTTTTTGAACTTCCTTCTAAGAATGATAATGTCGTTAAGGGTATTTCAGTTATTAATGATGCATTAATTGATGAAGCTTTGCATGAGTCCCAGGATGTA
TTATTAACGCCTATTCATGACCCAACTTCAGGTTTGAAGAGTAATAATGCTGCTGTTTTGGAAGAAAATGAATCGATTGTTTCTTCAAAGGCATTAAAGAAACAATATGA
ATCTTTTCCTCTATATTATTCTCGAAGGAAATATGAAAAGTCAGCAATTTTGGACTCAATTCCCATTAATTCCAATTTCAACCCTGACGTTATTGAAGAATCTTGTTCTC
AATTTTTGCTCCCTGCTTTGAATCAGCCTAGGTGTTGTCAATCCGATCTTAATGAGTTATCAAATTCCACATCCTCCAATAAGTACATTCTTTCAAATATTCAATCTGAC
CCTTCTTTATCAAAGGGAGTTTTTCTTCCTTCATCCAAAGGTGAAAACAAAGTTGATCACTCATATTTATCTCCTATTGATTCCGATGATGATTCAGTGGTGAGTATTAG
TAGTGTTGAGGCTGAAAGTCAACAATTGAATGATGAAAACAACGAATTGGAGGAAGACTCTTTTGCATTGGCTTTTAATCGGATTTTCCAGAACAATGAAGCTGTTTCTG
AACTATTGGAAAGAACCTTGGAGGGTTGGGTTAGACTGGTGATTTTGGATGAAGAATCGGATATTGTTAATGAAGGGGGATGGATGAGTGAGCTGCCATTTAATTCTTGT
TATCAGGAGCAAGTGAATGGGGAGTTGGTTATCTCAAAGGACGCCTCGGTGCATGAAGAAAGAAGTTATTGCGTTGGCTGTAATGATCCTCCACCCAAGATAATTAATGA
TGATAGCTATAATTTGATTAGTGATTTACAGCAGATTTTTAGTGAGAAAGATCAATTTAATGAGGTGTTGGGCTCTCCAAAGGTGCTTCATTGCATGATAAGGGATCTTC
AAATATCAGTGCTGGAAATGGGTTGTTTCAGGCCAAGTGGAGATGATAGCGGACAAGTTCAGCTCTTCAGATTGATAAGAAATCCTTGGAGGATAACTACTAGAATATCT
TTTGTTTACACTGGCCAAATAGCCCATGGATCTTCAGTTTCAGCAACTATGGAAATGGTGGAAGGCAGGTTGAAGAGTTTAGAAACGATTTCTTCCATCTACTTGGGTTG
CCAGAAAATTGTCGATGCTACTGTAGCTAGCGCCGCCATCCACCTGTCGTCCGTCCAGTCAGTTCGACGCCTTCTATTCGTCGTTCGCCTGCCATTCTTCGTTGGTCGTT
TCGTTGTCCAGTCCAGTCTATTAGCCGCTCGGCCGACAGCTGCTCCTTATTTGTGGGTCCCTTTGGTTTGTGGTTGGGTTCTGGTTTATGGTTCAATGGAAGTGCTGGAA
GAGATGAGTTGTTTTTGGCTTGAAAACTCATTGGTTGAACTTTTGCAATATCCTGTTCACTCATTTTTTAACAAGAAATTTAAGGATGATTTTGGGACCATTCGTCTATC
TAAATTTAGATCTGTCGAGGGTCAGTATTTGGAATGTGTCGTATGGCCTTTTTATGGAGGAAGGAAGAGTGTTCATGTTCCAACTAGTTTTGAAAAGAAAGGTTGGTTTG
TTTTTTGGGAAATGGTAAGAGATTTTTTCCTTAAATATGGAGAACCAGAGTATTCTGAGTTGGATTTATCCGCTCAAATTGAGAAAGTTGATACCGCTCAGGAAGATGTG
GTGAATCCCTCTGAGATTTCTTCCTTTTGGGTGAGGAAAGAGAGAGAAGTAATGGATTTGAATTTCAATTCCATTTTGGTGGTTTCTAGGTTGTTTGCTCATTACTCCTG
GAAAGATGTCCAAAGCGCTTTAGAAAATCATTTTCGGACTAGAATTTCAATGAATCCTTTTATGGATGATAAAGCGTTGATCAAGTTTGATGAGGGTCTTTCTGATTTAA
ATTTGGATGGAAAGTGGTGTCTCGTTGGTGACCTTCATCTGAAAATTGAGCGTTGGTCCGACGAAAAGCATTCTCACCCGGAATTCACCAAGAGCTATGGAGGTCGGATA
TCTATAAAGAATTTACCTTTGATTTATTGGAAACGACCATGTTTTGAAGCTATTGGGCAGCATTTTGGTGGATTGATTAGTATTTCCTCTCAAACACATAACTTAATTGA
TTGTTCAGTAGCAGTAATTGAAGTGGAGGAAAACATTTGTGGTTTCATTCCAGCTGAGATTGATGTTAAAGGTGGGAATATCAGAAATTTTTCTCTCCGTTTTGGTGACA
TAGTACTGTTGAGTATCCAGACCCCAAACCTGCAAGATAAACGAAGAACAAGCAAAACACACTCGACGGGGCAAAGAGAACACTGTAATATTGAGGAAACAGTGATAAAC
AACCCTATCACAACAGAACAACGATTACAGATAAAGATAATAACCAACACAACTCCCAGGGAATTCGAGAGCCCTGGTAACCTCTCCACAGAAAGTTCTCTCACTTTCTA
A
Protein sequenceShow/hide protein sequence
MEVISCCIQNRSFCIWREGNIHFVEDTCNKRLIPLSFSFLQWFEKVLVEILQNPVSSFFHEKIMEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYV
FWEMIRDFIFKFYSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSLVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKI
ALENFFKSSVLVNPFMDDKALIHAADGGLEFSANGKWKKFGNLHLKLEFWSSKIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAF
IEVEKNFCGFIPADINVKIGNKHEFSLRYGDINSLENRNLNFDSRKKLDANDFSNSLDLIRVRQVILDEESEIVNNEDRMSELATISRYQEAFNEDLDISKNVSAQDKCI
KCSGCIIPSTKLINDDSSFLNNEDLNGGLVLSKDASVQDVGCKGFNASINEPTLALSPSLNDNEFNEFSPQEAQQFQVFELPSKNDNVVKGISVINDALIDEALHESQDV
LLTPIHDPTSGLKSNNAAVLEENESIVSSKALKKQYESFPLYYSRRKYEKSAILDSIPINSNFNPDVIEESCSQFLLPALNQPRCCQSDLNELSNSTSSNKYILSNIQSD
PSLSKGVFLPSSKGENKVDHSYLSPIDSDDDSVVSISSVEAESQQLNDENNELEEDSFALAFNRIFQNNEAVSELLERTLEGWVRLVILDEESDIVNEGGWMSELPFNSC
YQEQVNGELVISKDASVHEERSYCVGCNDPPPKIINDDSYNLISDLQQIFSEKDQFNEVLGSPKVLHCMIRDLQISVLEMGCFRPSGDDSGQVQLFRLIRNPWRITTRIS
FVYTGQIAHGSSVSATMEMVEGRLKSLETISSIYLGCQKIVDATVASAAIHLSSVQSVRRLLFVVRLPFFVGRFVVQSSLLAARPTAAPYLWVPLVCGWVLVYGSMEVLE
EMSCFWLENSLVELLQYPVHSFFNKKFKDDFGTIRLSKFRSVEGQYLECVVWPFYGGRKSVHVPTSFEKKGWFVFWEMVRDFFLKYGEPEYSELDLSAQIEKVDTAQEDV
VNPSEISSFWVRKEREVMDLNFNSILVVSRLFAHYSWKDVQSALENHFRTRISMNPFMDDKALIKFDEGLSDLNLDGKWCLVGDLHLKIERWSDEKHSHPEFTKSYGGRI
SIKNLPLIYWKRPCFEAIGQHFGGLISISSQTHNLIDCSVAVIEVEENICGFIPAEIDVKGGNIRNFSLRFGDIVLLSIQTPNLQDKRRTSKTHSTGQREHCNIEETVIN
NPITTEQRLQIKIITNTTPREFESPGNLSTESSLTF