CuGenDBv2

Spg029622 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029622
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DUF4283 domain-containing protein
Genome location: scaffold2:21654718..21658287
RNA-Seq Expression: Spg029622
Synteny: Spg029622
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283
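The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, it could be split into its parts as shown below. The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold2:21654718..21658287'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold2:21654718..21658287")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 3570 positions on scaffold2.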


Homology
GenBank top hits (hit | e-value | % identity | alignment)
KAA0039949.1 hypothetical protein E6C27_scaffold122G002290 [Cucumis melo var. makuwa] | e-value: 4.9e-28 | identity: 28.92%
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMIR------DFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSSVSLTDSIRNVK----GI
        VP GL+K GW +F +M+         +F    Y NQ       KE+    +D  + S+    +YVE V          SS+S  S T S  ++K     +
Subjt:  VPAGLNKKGWYVFWEMIR------DFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSSVSLTDSIRNVK----GI

Query:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQ
          E   +RK     +ID E+SI++SR   H  W  +   L++      S     PF  DKAL+   D  L      N  W   G  ++K E WS   H+ 
Subjt:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQ

Query:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV
         K I SYGGW   R +PL++W+ ++F  IG+  GG +  +  ++N L+ +EA I+V++N+ GF+PA I +
Subjt:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa] | e-value: 1.3e-28 | identity: 28.81%
Query:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII
        SC I+ + F +    W R+  +   E    K   I ++   L+W +     +L  P  + FF EK  EE   + + K ++   +  E       G +  I
Subjt:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII

Query:  QVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVL
         VP G  K GW  F  ++     K         R++L+ +         S       SY E V +G SS + S S T++I+             K+    
Subjt:  QVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVL

Query:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLP
          D ER+ V++R   H  W+ +   L     ++V   PF  DKALI+    +       N  W   G  ++K E WS + H+ PK I SYGGW+ +R +P
Subjt:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLP

Query:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF
        L+ W+ +SF  IG   GG V ++  T  L D  EA I+++ N+ GFIPA I +    +H F
Subjt:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF

KAA0041398.1 hypothetical protein E6C27_scaffold206G00440 [Cucumis melo var. makuwa] | e-value: 4.4e-29 | identity: 27.54%
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMI-------RDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSSVSLTDSIRNVKG-INEEAYWVR
        VP GL+K GW +F +M+       +  I   H Y          KE+    +D  + S++   +Y EVV    SS+S  S   +  ++K  +      +R
Subjt:  VPAGLNKKGWYVFWEMI-------RDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSSVSLTDSIRNVKG-INEEAYWVR

Query:  KNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYG
        K     +ID E++I++SR   H  W  +   L    +  +S     PF  DKAL+   D  L      N  W   G  ++K E WS  +H+  K I SYG
Subjt:  KNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYG

Query:  GWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDAN
        GW   R +PL++W+ ++F  IG+  GG +  +  ++N ++ +EA I+V++N+ GF+PA I +     H F ++   + +   L  RN +   S KK  A 
Subjt:  GWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDAN

Query:  DFS
        +F+
Subjt:  DFS

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa] | e-value: 8.3e-28 | identity: 28.44%
Query:  ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIR
        I ++   L+W +     +L  P  + FF EK  E++  + + K ++   +  E       G +  I VP G  K GW  F  ++     K  S      R
Subjt:  ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIR

Query:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF
        +LL+ +    + D+ S+S++  +    SY E V +G SS + + S T +I+     N               + ER++V++R   H  W+ +   L    
Subjt:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF

Query:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL
         ++V   PF  DKALI     +       N  W   G  ++K E W+ + H+ PK I SYGGW+ +R +PL+ W+ +SF  IG   GG + ++  T  L 
Subjt:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL

Query:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR
        D  EA I ++ N+ GFIPA I +    +H F ++
Subjt:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida] | e-value: 1.8e-27 | identity: 44.37%
Query:  NGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKH
        +GKW+KFG+ HLK E W++ IH +P  ++ YGGW++I+NLPL+ W + +FEAIGK  GGL SI+   LNL+   +A I+V++N CGF+PA I V    + 
Subjt:  NGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKH

Query:  EFSLRYGDINSLENRNLNFDSRKKLDANDFSNSLDLIRVRQV
           L +GDI++    N     +  L  +DF+N +DLIR+ +V
Subjt:  EFSLRYGDINSLENRNLNFDSRKKLDANDFSNSLDLIRVRQV

TrEMBL top hits (hit | e-value | % identity | alignment)
A0A5A7TEP0 DUF4283 domain-containing protein | e-value: 2.1e-29 | identity: 27.54%
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMI-------RDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSSVSLTDSIRNVKG-INEEAYWVR
        VP GL+K GW +F +M+       +  I   H Y          KE+    +D  + S++   +Y EVV    SS+S  S   +  ++K  +      +R
Subjt:  VPAGLNKKGWYVFWEMI-------RDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSSVSLTDSIRNVKG-INEEAYWVR

Query:  KNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYG
        K     +ID E++I++SR   H  W  +   L    +  +S     PF  DKAL+   D  L      N  W   G  ++K E WS  +H+  K I SYG
Subjt:  KNCDVLEIDLERSIVVSRLMAHYSWKDVKIALE---NFFKSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYG

Query:  GWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDAN
        GW   R +PL++W+ ++F  IG+  GG +  +  ++N ++ +EA I+V++N+ GF+PA I +     H F ++   + +   L  RN +   S KK  A 
Subjt:  GWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR---YGDINSLENRNLNF-DSRKKLDAN

Query:  DFS
        +F+
Subjt:  DFS

A0A5A7TFK7 DUF4283 domain-containing protein | e-value: 6.2e-29 | identity: 28.81%
Query:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII
        SC I+ + F +    W R+  +   E    K   I ++   L+W +     +L  P  + FF EK  EE   + + K ++   +  E       G +  I
Subjt:  SCCIQNRSFCI----W-REGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRII

Query:  QVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVL
         VP G  K GW  F  ++     K         R++L+ +         S       SY E V +G SS + S S T++I+             K+    
Subjt:  QVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVL

Query:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLP
          D ER+ V++R   H  W+ +   L     ++V   PF  DKALI+    +       N  W   G  ++K E WS + H+ PK I SYGGW+ +R +P
Subjt:  EIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIH--AADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLP

Query:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF
        L+ W+ +SF  IG   GG V ++  T  L D  EA I+++ N+ GFIPA I +    +H F
Subjt:  LNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEF

A0A5A7TRS9 DUF4283 domain-containing protein | e-value: 2.6e-27 | identity: 26.86%
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK  ++                         G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMI-------RDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSSVSLTDSIRNVK----GINEEAY
        VP GL+K GW +F +M+       +  I   H Y          KE+    +D  + S++   +Y EVV    SS+S  S   +  ++K     +  E  
Subjt:  VPAGLNKKGWYVFWEMI-------RDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSSVSLTDSIRNVK----GINEEAY

Query:  WVRKNCDVLEIDLERSIVVSRLMAHYSWKDV--KIALENFFKSSVL-VNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQPKSIK
         +RK     +ID E++I++SR   H  W  +  ++  +   K S     PF  DKAL+   D  L      N  W   G  ++K E WS   H+  K I 
Subjt:  WVRKNCDVLEIDLERSIVVSRLMAHYSWKDV--KIALENFFKSSVL-VNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQPKSIK

Query:  SYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR
        SYGGW   R +PL++W+ ++F  IG+  GG +  +  ++N ++ +EA I+V++N+ GF+PA I +     H F ++
Subjt:  SYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR

A0A5D3CFS8 DUF4283 domain-containing protein | e-value: 4.0e-28 | identity: 28.44%
Query:  ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIR
        I ++   L+W +     +L  P  + FF EK  E++  + + K ++   +  E       G +  I VP G  K GW  F  ++     K  S      R
Subjt:  ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFIFKFHSYENQPIR

Query:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF
        +LL+ +    + D+ S+S++  +    SY E V +G SS + + S T +I+     N               + ER++V++R   H  W+ +   L    
Subjt:  SLLSKEECLPVFDKVSASQAFPN----SYVEVVKRGGSS-KSSVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF

Query:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL
         ++V   PF  DKALI     +       N  W   G  ++K E W+ + H+ PK I SYGGW+ +R +PL+ W+ +SF  IG   GG + ++  T  L 
Subjt:  KSSVLVNPFMDDKALI--HAADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLL

Query:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR
        D  EA I ++ N+ GFIPA I +    +H F ++
Subjt:  DCSEAFIEVEKNFCGFIPADINVKIGNKHEFSLR

A0A5D3DLT1 DUF4283 domain-containing protein | e-value: 2.4e-28 | identity: 28.92%
Query:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ
        C ++ + F +      RE N+   E    K   I ++   L+W +     +L  P  + FF EK   +F  + + K  +   +  E       G +  I 
Subjt:  CCIQNRSFCI-----WREGNIHFVEDTCNKRL-ILLSFSFLQWFEKVLAEILQNP-VSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQ

Query:  VPAGLNKKGWYVFWEMIR------DFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSSVSLTDSIRNVK----GI
        VP GL+K GW +F +M+         +F    Y NQ       KE+    +D  + S+    +YVE V          SS+S  S T S  ++K     +
Subjt:  VPAGLNKKGWYVFWEMIR------DFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKR------GGSSKSSVSLTDSIRNVK----GI

Query:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQ
          E   +RK     +ID E+SI++SR   H  W  +   L++      S     PF  DKAL+   D  L      N  W   G  ++K E WS   H+ 
Subjt:  NEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFF---KSSVLVNPFMDDKALIHAADGGLE--FSANGKWKKFGNLHLKLEFWSSEIHSQ

Query:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV
         K I SYGGW   R +PL++W+ ++F  IG+  GG +  +  ++N L+ +EA I+V++N+ GF+PA I +
Subjt:  PKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINV

SwissProt top hits (hit | e-value | % identity | alignment)
No hits found
Arabidopsis top hits (hit | e-value | % identity | alignment)
No hits found

Sequences
CDS sequence
ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTCTTTTTGTATTTGGAGAGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCTATTGTCCTT
TTCCTTTTTACAGTGGTTTGAAAAAGTGTTAGCTGAGATTTTGCAAAATCCTGTTTCTTCTTTCTTTCATGAAAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTA
AGTTCTTCTCGGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCAGGCTTGAATAAGAAAGGATGGTATGTT
TTTTGGGAAATGATTAGGGATTTCATTTTTAAATTTCATTCTTATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGTGTCTTCCGGTTTTTGATAAAGTTTC
AGCAAGTCAAGCCTTTCCCAATTCATATGTTGAGGTGGTAAAGCGAGGTGGTTCTTCAAAAAGTTCAGTTTCTTTGACCGATTCAATAAGAAATGTCAAGGGTATTAATG
AAGAAGCTTACTGGGTTCGCAAGAACTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGGATGTCAAGATT
GCCCTTGAGAATTTCTTTAAATCTTCTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGGAATTTTCTGCAAATGGCAAGTG
GAAGAAATTTGGAAACTTACATTTGAAATTGGAATTTTGGTCCTCTGAAATTCATTCACAGCCGAAGTCTATAAAAAGCTATGGAGGCTGGCTTGCAATTAGAAATCTTC
CATTAAATTTATGGCATCGTGACTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGCTCTGAAGCCTTC
ATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGCATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAA
CAGAAATTTGAATTTTGATTCAAGAAAAAAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAAGTGATTTTGGATGAAGAATCTGAGATTG
TTAATAATGAGGATAGGATGAGTGAGCTGCCAACTATCTCTCGGTATCAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGAATGTCTCGGCACAAGATAAATGTATT
AAGTGCAGTGGCTGTATTATTCCTTCAACCAAGTTGATTAATGATGATAGCAGTTTTTTGAATAATGAAGATTTGAATGGGGGTTTGGTTCTTTCAAAGGATGCATCGGT
GCAAGATGTAGGTATTAATTGCAGTGGTTGCTTTATTCCTTCAACCAAGATGATTAATGATGATAGCTGTTTTTTGAATAATGAAGTGCAGCAGATTTTAAAAGAGAGAG
GCCACGTTAATGAGATGTTGGGTTCTTCAAAAGGTGCTTCATTGCATGACAAGAGTATTAATAATGCTGGTTGTAAAGGTTTTTATGCCAGCATTAATGAGCCGACATTA
GCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTTCAGTCCTCAGGAAGCCCAACAGTTTCAGGTTTTTGAACTTCCTTCTAAGAATGATAATGTCGTTAAGGG
TATTTCAGTTATTAATGATGCATTAATTGATGAAGCTTTGCATGAGTCCCAGGATGTATTATTAACGCCTATTCATGACCCAACTTCAGGTTTGAAGAGTAATAATGCTG
CTGTTTTGGAAGAAAATGAATCGATTGTTTCTTCAAAGGCATTAAAGAAACAATATGAATCTTTTCCTCTATATTATTCTCGAAGGAAATATGAAAAGTCAGCAATTTTG
GACTCAATTCCCATTAATTCCAATTTCAACCCTGACGTTATTGAAGAATCTTGTTCTCAATTTTTGCTCCCTGCTTTGAATCAGCCTAGGTGCTGTCAAGCCGATCTTAA
TGAGTTATCAAATTCCACATCCTCCAATAAGTACATTCTTTCAAATATTCAATCTGACCCTTCTTTATCAAAGGGAGTTTTTCTTCCTTCATCCAAAGGTGAAAACAAAG
TTGATCACTCATATTTATCTCCTATTGATTCCGATGATGATTCAGTGGTGAGTATTAGTAGTGTTGAGGCTGAAAGTCAACAATTGAATGATGAAAACAACGAATTGGAG
GAAGACTCTTTTGCATTGGCTTTTAATCGGATTTTCCAGAACAATGAAGCTGTTTCTGAACTATTGGAAAGAACCTTGGAGGGTTGGGTTAGACTGGTGATTTTGGATGA
AGAATCGGATATTGTTAATGAAGGGGGATGGATGAGTGAGCTGCCATTTAATTCTTGTTATCAGGAGGAAGTGAATGGGGAGTTGGTTATCTCAAAGGACGCCTCGGTGC
ATGAAGAAAGAAGTTATTGCGTTGGCTGTAATGATCCTCCACCCAAGATAATTAATGATGATAGCTGTAAGGTGATTAGTGATTTACAGCAGATTTTTAGTGAGAAAGAT
CAATTTAATGAGGTGTTGGGCTCTCCAAAAGGTGCTTCATTGCATGATAAGGGTATTAATTATGATAGCTGTAATTTAATTAATGATTTACAACAGATTTCTAGTGAGAG
AGATCAATTTAATGAGGTGTTGGGCTCTCCAAAAGGTGCTTCATTGCATGAAAAGGGTATTAATTATGCTGGTTCAGGCAGCATTAAAGGCATATTCTTGGAAAAATCTC
TTTCATTGGCTGTTAAGTCCAACATTAATGCTGATTATTTGGTGGCTGAATGTTCTCACTTAATTGTTGCAAAAGGCTTCAGGATCTTCAAATATCAGTGCTGGAAATGG
GTTGTTTCAGGCCAAGTGGAGATGATAGCGGACAAGTTCAGCTCTTCAGATTGA
mRNA sequence
ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTCTTTTTGTATTTGGAGAGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCTATTGTCCTT
TTCCTTTTTACAGTGGTTTGAAAAAGTGTTAGCTGAGATTTTGCAAAATCCTGTTTCTTCTTTCTTTCATGAAAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTA
AGTTCTTCTCGGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCAGGCTTGAATAAGAAAGGATGGTATGTT
TTTTGGGAAATGATTAGGGATTTCATTTTTAAATTTCATTCTTATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGTGTCTTCCGGTTTTTGATAAAGTTTC
AGCAAGTCAAGCCTTTCCCAATTCATATGTTGAGGTGGTAAAGCGAGGTGGTTCTTCAAAAAGTTCAGTTTCTTTGACCGATTCAATAAGAAATGTCAAGGGTATTAATG
AAGAAGCTTACTGGGTTCGCAAGAACTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGGATGTCAAGATT
GCCCTTGAGAATTTCTTTAAATCTTCTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGGAATTTTCTGCAAATGGCAAGTG
GAAGAAATTTGGAAACTTACATTTGAAATTGGAATTTTGGTCCTCTGAAATTCATTCACAGCCGAAGTCTATAAAAAGCTATGGAGGCTGGCTTGCAATTAGAAATCTTC
CATTAAATTTATGGCATCGTGACTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGCTCTGAAGCCTTC
ATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGCATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAA
CAGAAATTTGAATTTTGATTCAAGAAAAAAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAAGTGATTTTGGATGAAGAATCTGAGATTG
TTAATAATGAGGATAGGATGAGTGAGCTGCCAACTATCTCTCGGTATCAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGAATGTCTCGGCACAAGATAAATGTATT
AAGTGCAGTGGCTGTATTATTCCTTCAACCAAGTTGATTAATGATGATAGCAGTTTTTTGAATAATGAAGATTTGAATGGGGGTTTGGTTCTTTCAAAGGATGCATCGGT
GCAAGATGTAGGTATTAATTGCAGTGGTTGCTTTATTCCTTCAACCAAGATGATTAATGATGATAGCTGTTTTTTGAATAATGAAGTGCAGCAGATTTTAAAAGAGAGAG
GCCACGTTAATGAGATGTTGGGTTCTTCAAAAGGTGCTTCATTGCATGACAAGAGTATTAATAATGCTGGTTGTAAAGGTTTTTATGCCAGCATTAATGAGCCGACATTA
GCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTTCAGTCCTCAGGAAGCCCAACAGTTTCAGGTTTTTGAACTTCCTTCTAAGAATGATAATGTCGTTAAGGG
TATTTCAGTTATTAATGATGCATTAATTGATGAAGCTTTGCATGAGTCCCAGGATGTATTATTAACGCCTATTCATGACCCAACTTCAGGTTTGAAGAGTAATAATGCTG
CTGTTTTGGAAGAAAATGAATCGATTGTTTCTTCAAAGGCATTAAAGAAACAATATGAATCTTTTCCTCTATATTATTCTCGAAGGAAATATGAAAAGTCAGCAATTTTG
GACTCAATTCCCATTAATTCCAATTTCAACCCTGACGTTATTGAAGAATCTTGTTCTCAATTTTTGCTCCCTGCTTTGAATCAGCCTAGGTGCTGTCAAGCCGATCTTAA
TGAGTTATCAAATTCCACATCCTCCAATAAGTACATTCTTTCAAATATTCAATCTGACCCTTCTTTATCAAAGGGAGTTTTTCTTCCTTCATCCAAAGGTGAAAACAAAG
TTGATCACTCATATTTATCTCCTATTGATTCCGATGATGATTCAGTGGTGAGTATTAGTAGTGTTGAGGCTGAAAGTCAACAATTGAATGATGAAAACAACGAATTGGAG
GAAGACTCTTTTGCATTGGCTTTTAATCGGATTTTCCAGAACAATGAAGCTGTTTCTGAACTATTGGAAAGAACCTTGGAGGGTTGGGTTAGACTGGTGATTTTGGATGA
AGAATCGGATATTGTTAATGAAGGGGGATGGATGAGTGAGCTGCCATTTAATTCTTGTTATCAGGAGGAAGTGAATGGGGAGTTGGTTATCTCAAAGGACGCCTCGGTGC
ATGAAGAAAGAAGTTATTGCGTTGGCTGTAATGATCCTCCACCCAAGATAATTAATGATGATAGCTGTAAGGTGATTAGTGATTTACAGCAGATTTTTAGTGAGAAAGAT
CAATTTAATGAGGTGTTGGGCTCTCCAAAAGGTGCTTCATTGCATGATAAGGGTATTAATTATGATAGCTGTAATTTAATTAATGATTTACAACAGATTTCTAGTGAGAG
AGATCAATTTAATGAGGTGTTGGGCTCTCCAAAAGGTGCTTCATTGCATGAAAAGGGTATTAATTATGCTGGTTCAGGCAGCATTAAAGGCATATTCTTGGAAAAATCTC
TTTCATTGGCTGTTAAGTCCAACATTAATGCTGATTATTTGGTGGCTGAATGTTCTCACTTAATTGTTGCAAAAGGCTTCAGGATCTTCAAATATCAGTGCTGGAAATGG
GTTGTTTCAGGCCAAGTGGAGATGATAGCGGACAAGTTCAGCTCTTCAGATTGA
Protein sequence
MEVISCCIQNRSFCIWREGNIHFVEDTCNKRLILLSFSFLQWFEKVLAEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYV
FWEMIRDFIFKFHSYENQPIRSLLSKEECLPVFDKVSASQAFPNSYVEVVKRGGSSKSSVSLTDSIRNVKGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKI
ALENFFKSSVLVNPFMDDKALIHAADGGLEFSANGKWKKFGNLHLKLEFWSSEIHSQPKSIKSYGGWLAIRNLPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAF
IEVEKNFCGFIPADINVKIGNKHEFSLRYGDINSLENRNLNFDSRKKLDANDFSNSLDLIRVRQVILDEESEIVNNEDRMSELPTISRYQEAFNEDLDISKNVSAQDKCI
KCSGCIIPSTKLINDDSSFLNNEDLNGGLVLSKDASVQDVGINCSGCFIPSTKMINDDSCFLNNEVQQILKERGHVNEMLGSSKGASLHDKSINNAGCKGFYASINEPTL
ALSPSLNDNEFNEFSPQEAQQFQVFELPSKNDNVVKGISVINDALIDEALHESQDVLLTPIHDPTSGLKSNNAAVLEENESIVSSKALKKQYESFPLYYSRRKYEKSAIL
DSIPINSNFNPDVIEESCSQFLLPALNQPRCCQADLNELSNSTSSNKYILSNIQSDPSLSKGVFLPSSKGENKVDHSYLSPIDSDDDSVVSISSVEAESQQLNDENNELE
EDSFALAFNRIFQNNEAVSELLERTLEGWVRLVILDEESDIVNEGGWMSELPFNSCYQEEVNGELVISKDASVHEERSYCVGCNDPPPKIINDDSCKVISDLQQIFSEKD
QFNEVLGSPKGASLHDKGINYDSCNLINDLQQISSERDQFNEVLGSPKGASLHEKGINYAGSGSIKGIFLEKSLSLAVKSNINADYLVAECSHLIVAKGFRIFKYQCWKW
VVSGQVEMIADKFSSSD
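
The protein sequence above corresponds to the CDS listed earlier. As a minimal sketch of how that relationship could be checked, the snippet below translates a nucleotide string with the standard genetic code (NCBI translation table 1). Using table 1 is an assumption here, since the record does not say which table was used, and `translate` is a hypothetical helper name introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is ignored so a multi-line sequence can be pasted as-is.
    Codons containing ambiguous bases are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS above, and its last four codons.
print(translate("ATGGAAGTTATCAGC"))
print(translate("TCTTCAGATTGA"))
```

The first call covers the opening codons of the CDS and the second its final codons, so their results can be compared against the start (`MEVIS`) and end (`SSD`) of the protein sequence shown above.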