; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037551 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037551
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold11:32808887..32815515
RNA-Seq ExpressionSpg037551
SyntenySpg037551
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018455.1 hypothetical protein SDJN02_20323, partial [Cucurbita argyrosperma subsp. argyrosperma]4.3e-8277.93Show/hide
Query:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC
        MEIKHKGK+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASI GGN NVST YHKTP+FSCDCFYC
Subjt:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRI RQ+T+KS PVV RP PV DECVV+P SPER+SAAA V +E + + V+E    
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----

Query:  ---EVVGGDHDQKGLATKVILD
           + V GDH QKGLATKV+ D
Subjt:  ---EVVGGDHDQKGLATKVILD

XP_022955886.1 uncharacterized protein LOC111457737 [Cucurbita moschata]5.0e-8378.38Show/hide
Query:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC
        MEIKHKGK+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASINGGN NVST YHKTP+FSCDCFYC
Subjt:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRI RQ+T+KS PVV RP PV DECVV+P SPER+SAAA V +E + + V+E    
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----

Query:  ---EVVGGDHDQKGLATKVILD
           + V GDH QKGLATKV+ D
Subjt:  ---EVVGGDHDQKGLATKVILD

XP_022979917.1 uncharacterized protein LOC111479467 [Cucurbita maxima]7.0e-8579.73Show/hide
Query:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC
        MEIKHKGK+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASINGGN NVSTGYHKTPMFSCDCFYC
Subjt:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESA-AAVKDEVESAAVEE----
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRIGRQ+T+KS PVV RPPPVA ECVV+P SPER+SA A+V +E + + V+E    
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESA-AAVKDEVESAAVEE----

Query:  ---EVVGGDHDQKGLATKVILD
           + V GDH QKGLATKV+ D
Subjt:  ---EVVGGDHDQKGLATKVILD

XP_023528171.1 uncharacterized protein LOC111791162 [Cucurbita pepo subsp. pepo]5.4e-8579.28Show/hide
Query:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC
        MEIKHKGK+HPS     PSSSSSVFKLLPAA+LALV+VLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASI+GGN NVSTGYHKTPMFSCDCFYC
Subjt:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRIGRQ+T+KS PVV RPPP+ADECVV+P SPER+SAAA V +E + + V+E    
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----

Query:  ---EVVGGDHDQKGLATKVILD
           + V GDH QKGLATKV+ D
Subjt:  ---EVVGGDHDQKGLATKVILD

XP_038904588.1 uncharacterized protein LOC120090946 [Benincasa hispida]2.1e-7674.11Show/hide
Query:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNV-----STGYHKTPMFSC
        MEIKHKGKIHPS     PSSSSSVFKLLP A+LALVS+LSLD+REVLAYMIARSIQSSAFTS+  SRKKST+KASINGGNGNV     +T YHKTPMFSC
Subjt:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNV-----STGYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPER-----ESAAAVKDEVES
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ T K+ PV+  PPPVADECV +P   ER        + VK+  ES
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPER-----ESAAAVKDEVES

Query:  AAVEEEVVGGDHDQKGLATKVILD
          VEE   GGDH +KGL TKV+ D
Subjt:  AAVEEEVVGGDHDQKGLATKVILD

TrEMBL top hitse value%identityAlignment
A0A0A0KTW2 Uncharacterized protein6.9e-7071.17Show/hide
Query:  MEIKHKGKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTS-SHDSRKKSTKKASINGGNGNV--------STGYHKTPMFS
        MEIKHK KIHPS  PSSSSSVFKLLPAA+LAL S+LSLD+REVLAYMIARSIQSSAFTS +  SRKKSTKK  IN GN NV        +T YHKTP+FS
Subjt:  MEIKHKGKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTS-SHDSRKKSTKKASINGGNGNV--------STGYHKTPMFS

Query:  CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST--EKSAPVVPRPPPVADECVVLPPSPERESAAAVKDEVESAA
        CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ +   K+ PVV  P  V DECV +P SP  E   +V  EVE + 
Subjt:  CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST--EKSAPVVPRPPPVADECVVLPPSPERESAAAVKDEVESAA

Query:  VEEEVVGGDHDQKGLATKVILD
           E VGG   QKGLATKV+ D
Subjt:  VEEEVVGGDHDQKGLATKVILD

A0A1S3BXS0 uncharacterized protein LOC1034945673.8e-6869.64Show/hide
Query:  MEIKHKGKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINGGNGNV-------STGYHKTPMFSC
        MEIKHK KIHPS  PSSSSSVFKLLPAA+LAL S+LSLD+REVLAYMIARSIQSSA  TS+  SRKKSTKK SIN GN NV       +T YHKTP+FSC
Subjt:  MEIKHKGKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINGGNGNV-------STGYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST-----EKSAPVVPRPPPVADECVVLPPSPERESAAAVKDEVES
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ +      K+ PV+  P  VADECV +  SP  E   ++  EVE 
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST-----EKSAPVVPRPPPVADECVVLPPSPERESAAAVKDEVES

Query:  AAVEEEVVGGDHDQKGLATKVILD
             E VGG   QKGLATKV+ D
Subjt:  AAVEEEVVGGDHDQKGLATKVILD

A0A5A7TRT9 Uncharacterized protein1.7e-6869.64Show/hide
Query:  MEIKHKGKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINGGNGNV-------STGYHKTPMFSC
        MEIKHK KIHPS  PSSSSSVFKLLPAA+LAL S+LSLD+REVLAYMIARSIQSSA  TS+  SRKKSTKK SIN GN NV       +T YHKTP+FSC
Subjt:  MEIKHKGKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINGGNGNV-------STGYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST-----EKSAPVVPRPPPVADECVVLPPSPERESAAAVKDEVES
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ +      K+ PV+  P  VADECV +  SP  E   ++  EVE 
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST-----EKSAPVVPRPPPVADECVVLPPSPERESAAAVKDEVES

Query:  AAVEEEVVGGDHDQKGLATKVILD
             E VGG   QKGLATKV+ D
Subjt:  AAVEEEVVGGDHDQKGLATKVILD

A0A6J1GV32 uncharacterized protein LOC1114577372.4e-8378.38Show/hide
Query:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC
        MEIKHKGK+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASINGGN NVST YHKTP+FSCDCFYC
Subjt:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRI RQ+T+KS PVV RP PV DECVV+P SPER+SAAA V +E + + V+E    
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAA-VKDEVESAAVEE----

Query:  ---EVVGGDHDQKGLATKVILD
           + V GDH QKGLATKV+ D
Subjt:  ---EVVGGDHDQKGLATKVILD

A0A6J1IQ09 uncharacterized protein LOC1114794673.4e-8579.73Show/hide
Query:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC
        MEIKHKGK+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASINGGN NVSTGYHKTPMFSCDCFYC
Subjt:  MEIKHKGKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESA-AAVKDEVESAAVEE----
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRIGRQ+T+KS PVV RPPPVA ECVV+P SPER+SA A+V +E + + V+E    
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESA-AAVKDEVESAAVEE----

Query:  ---EVVGGDHDQKGLATKVILD
           + V GDH QKGLATKV+ D
Subjt:  ---EVVGGDHDQKGLATKVILD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein1.6e-2341.72Show/hide
Query:  EIKHKGKIHPSP---SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIA-RSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYCYT
        ++  KG +HPSP    S+  +  LLP A+ +L +VLS ++REVLAY+I+  S       +S  ++ K+ KKA  +          + +P+F CDCF CYT
Subjt:  EIKHKGKIHPSP---SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIA-RSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYCYT

Query:  AYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQST
        +YW RWDSSP+R+LIH+ I+AFED L   +  KK+  G+  RR R G+ S+
Subjt:  AYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQST

AT1G24270.1 unknown protein1.3e-2544.85Show/hide
Query:  MEIKHKGKIHPSP----SSSS------SVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSC
        M++  KGK+HPSP    SSSS      SVFKLL +A+L LVSVLS ++ EVLAY+I RS+ ++   S    R                    HK P+  C
Subjt:  MEIKHKGKIHPSP----SSSS------SVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE-----KPKKSAGRGKRRDRIGRQSTEKS
         CF CYT+YW +WDSS NRELI+Q IEAFEDHLT  E       KK+  R K+ +    Q   KS
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE-----KPKKSAGRGKRRDRIGRQSTEKS

AT1G62422.1 unknown protein9.0e-2240.94Show/hide
Query:  KGKIHPSP----SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYCYTAYWC
        KG +HPSP     +      LLP A+L+LV+ LS+++REVLAY+I+ S  S+       SR K  K+ +            H +P+F CDCF CYT+YW 
Subjt:  KGKIHPSP----SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYCYTAYWC

Query:  RWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGR-GKRRDRIGRQSTEK
        RWD+SP R+LIH+ I+A+ED L   +K K    R GK   R+    T +
Subjt:  RWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGR-GKRRDRIGRQSTEK

AT5G13090.1 unknown protein1.9e-2740.81Show/hide
Query:  MEIKHKGKIHPSP---------SSSS-----------SVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVST
        M++K KGK++PSP         SSSS           SV KLLPA +L LVSVLS +EREVLAY+I R    S     + S K  TKK S      N S+
Subjt:  MEIKHKGKIHPSP---------SSSS-----------SVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVST

Query:  GYHKTPMFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAAV
          HK P+F C+CF CYT YW RWDSSPNRELIH+ IEAFE+H        +S + RGK++++ GR+ T+  +    R     D+       P  E+  + 
Subjt:  GYHKTPMFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAAV

Query:  KDEVESAA--VEEEVVGGDHDQK
           V S     E EV  G+ + +
Subjt:  KDEVESAA--VEEEVVGGDHDQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTAAGCACAAGGGTAAAATACACCCCTCGCCGTCCTCCTCCTCCTCCGTCTTCAAGCTCCTTCCGGCCGCTGTTTTAGCTCTAGTTTCAGTCCTCTCTCTCGA
CGAACGCGAGGTCTTGGCCTACATGATCGCCAGGTCCATTCAGTCCTCCGCATTCACTTCTTCTCACGATTCCAGGAAGAAATCCACGAAAAAAGCTTCGATCAATGGCG
GAAACGGTAATGTTAGTACTGGTTATCACAAAACTCCGATGTTCAGTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAATCGCGAA
CTAATCCACCAGGCGATCGAGGCGTTTGAAGATCACTTGACGAGCGGCGAGAAGCCGAAGAAGAGCGCCGGCAGAGGCAAGAGGAGAGACAGAATCGGCCGTCAAAGTAC
TGAGAAGTCTGCGCCTGTTGTTCCACGACCGCCGCCAGTGGCCGATGAGTGCGTCGTTCTTCCGCCGTCGCCGGAGCGTGAATCGGCGGCGGCAGTGAAGGACGAAGTTG
AGAGTGCTGCGGTGGAGGAGGAGGTGGTGGGCGGCGACCATGATCAGAAGGGTTTGGCGACGAAGGTAATTTTGGACCACTCCGATATGCAAGGAGCTGACGAAGACAGC
GCGATACCAGCCCAAGGAGTTTTGCTCATATTCCCTCCGATTACTGACTTAAGCATCGGAGGAGGTGTGGCAAGCACCACACCGGTGTGCAGGTTTCCCTTGTTTTGTAG
GCCACGTCTTCCCCGGATTCAAACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGGTGAAGAAGAAAAACTAGATTCCAGATGCAAAACGAACAGTGCCGACCCGAGGC
CGAGGCCGAGGCCGACCAGGCCGAGATTTACTACCAGTAGCTTCCACAGAGATGGAGCACCACTGCCCCACCACGGGAAGGCACAGCCCGACCATGCAGAGGATGACCAG
GCCAAGGCCGACCGGGCCGAGGCTCGAGTGGCAATGCCTATACTCCCAAGGCCCTCGTTGGAGTGGCAACGCCTATACTCCTCGAGGAGTGGCAATGCCTATACTTCCAA
GGCCCTCGTTGGAGTGGCAACTCCTATACTCCCCGAGGAGTGGCAACGCCTATACTCCCAAGGCCCTCGTTGGAGTGGCAACGCCTATACTCCCCGAGTCAGGGTCAGGG
CTCCAATCAGAGTCGTACCGATAGAAGAAGGTGTGGAGGCATTAAGGCTTCAAGGAAGGATGATGCTGAAATTGTTGTGGAACAGGACCGAGGCCGACCAGGCCACGTCC
ATACCTTGTTTATGCAAGAGTGTCATTCTCCATACTTTAAGGCATAAGCTCAAGGTCGACCAGGATAACGAAAATTCGCACATAAGTCAGCACAATAGGAAGATCTTTAC
TCTTAAATCGGGCTTAAGGAGCTCAACCTATAGCAATAAGTCACACCTGAATCCCAATTCAAGACCCAAGGAGGCAGCCAGTAAACGGGATCCGGAGGAAGAACAGGCCA
GAGGGTCGGGCCAAAGCCGAAGGGATCGGGCTTTTGGCCCGACCACTAGGCCCGCTTGCACGGGCCGAGTTCTTCCGCCTCTGTTCGATCCCTGGTGCCCCCAGTTGCCT
CGGTTCCACTCGGTTCAGCCCAAATCGTCTCCGAACGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATTAAGCACAAGGGTAAAATACACCCCTCGCCGTCCTCCTCCTCCTCCGTCTTCAAGCTCCTTCCGGCCGCTGTTTTAGCTCTAGTTTCAGTCCTCTCTCTCGA
CGAACGCGAGGTCTTGGCCTACATGATCGCCAGGTCCATTCAGTCCTCCGCATTCACTTCTTCTCACGATTCCAGGAAGAAATCCACGAAAAAAGCTTCGATCAATGGCG
GAAACGGTAATGTTAGTACTGGTTATCACAAAACTCCGATGTTCAGTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAATCGCGAA
CTAATCCACCAGGCGATCGAGGCGTTTGAAGATCACTTGACGAGCGGCGAGAAGCCGAAGAAGAGCGCCGGCAGAGGCAAGAGGAGAGACAGAATCGGCCGTCAAAGTAC
TGAGAAGTCTGCGCCTGTTGTTCCACGACCGCCGCCAGTGGCCGATGAGTGCGTCGTTCTTCCGCCGTCGCCGGAGCGTGAATCGGCGGCGGCAGTGAAGGACGAAGTTG
AGAGTGCTGCGGTGGAGGAGGAGGTGGTGGGCGGCGACCATGATCAGAAGGGTTTGGCGACGAAGGTAATTTTGGACCACTCCGATATGCAAGGAGCTGACGAAGACAGC
GCGATACCAGCCCAAGGAGTTTTGCTCATATTCCCTCCGATTACTGACTTAAGCATCGGAGGAGGTGTGGCAAGCACCACACCGGTGTGCAGGTTTCCCTTGTTTTGTAG
GCCACGTCTTCCCCGGATTCAAACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGGTGAAGAAGAAAAACTAGATTCCAGATGCAAAACGAACAGTGCCGACCCGAGGC
CGAGGCCGAGGCCGACCAGGCCGAGATTTACTACCAGTAGCTTCCACAGAGATGGAGCACCACTGCCCCACCACGGGAAGGCACAGCCCGACCATGCAGAGGATGACCAG
GCCAAGGCCGACCGGGCCGAGGCTCGAGTGGCAATGCCTATACTCCCAAGGCCCTCGTTGGAGTGGCAACGCCTATACTCCTCGAGGAGTGGCAATGCCTATACTTCCAA
GGCCCTCGTTGGAGTGGCAACTCCTATACTCCCCGAGGAGTGGCAACGCCTATACTCCCAAGGCCCTCGTTGGAGTGGCAACGCCTATACTCCCCGAGTCAGGGTCAGGG
CTCCAATCAGAGTCGTACCGATAGAAGAAGGTGTGGAGGCATTAAGGCTTCAAGGAAGGATGATGCTGAAATTGTTGTGGAACAGGACCGAGGCCGACCAGGCCACGTCC
ATACCTTGTTTATGCAAGAGTGTCATTCTCCATACTTTAAGGCATAAGCTCAAGGTCGACCAGGATAACGAAAATTCGCACATAAGTCAGCACAATAGGAAGATCTTTAC
TCTTAAATCGGGCTTAAGGAGCTCAACCTATAGCAATAAGTCACACCTGAATCCCAATTCAAGACCCAAGGAGGCAGCCAGTAAACGGGATCCGGAGGAAGAACAGGCCA
GAGGGTCGGGCCAAAGCCGAAGGGATCGGGCTTTTGGCCCGACCACTAGGCCCGCTTGCACGGGCCGAGTTCTTCCGCCTCTGTTCGATCCCTGGTGCCCCCAGTTGCCT
CGGTTCCACTCGGTTCAGCCCAAATCGTCTCCGAACGCCTAA
Protein sequenceShow/hide protein sequence
MEIKHKGKIHPSPSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINGGNGNVSTGYHKTPMFSCDCFYCYTAYWCRWDSSPNRE
LIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTEKSAPVVPRPPPVADECVVLPPSPERESAAAVKDEVESAAVEEEVVGGDHDQKGLATKVILDHSDMQGADEDS
AIPAQGVLLIFPPITDLSIGGGVASTTPVCRFPLFCRPRLPRIQTNSLLVSREGQGEEEKLDSRCKTNSADPRPRPRPTRPRFTTSSFHRDGAPLPHHGKAQPDHAEDDQ
AKADRAEARVAMPILPRPSLEWQRLYSSRSGNAYTSKALVGVATPILPEEWQRLYSQGPRWSGNAYTPRVRVRAPIRVVPIEEGVEALRLQGRMMLKLLWNRTEADQATS
IPCLCKSVILHTLRHKLKVDQDNENSHISQHNRKIFTLKSGLRSSTYSNKSHLNPNSRPKEAASKRDPEEEQARGSGQSRRDRAFGPTTRPACTGRVLPPLFDPWCPQLP
RFHSVQPKSSPNA