CuGenDBv2

Lag0021787 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021787
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr7:12117874..12118563
RNA-Seq Expression: Lag0021787
Synteny: Lag0021787
Gene Ontology terms: NA
InterPro domains: NA
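
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the span can be parsed and measured as below. The function names `parse_location` and `span_length` are illustrative and not part of any CuGenDB tooling.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:12117874..12118563")
print(chrom, start, end, span_length(start, end))  # chr7 12117874 12118563 690
```

Under that assumption the gene spans 690 bp on chr7.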


Homology
GenBank top hits (e value, % identity, alignment)
KAG7018455.1 hypothetical protein SDJN02_20323, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 7.0e-92; % identity: 78.33
Query:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC
        MEIKHK K+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASI  GN NV+T+YHKTP+FSCDCFYC
Subjt:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRI RQ+ T+KS+PVV RP PV DECVV+P SPER+SAAA V ++ + + V+ V  
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG

Query:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
         G      GDH QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022955886.1 uncharacterized protein LOC111457737 [Cucurbita moschata]; e value: 8.3e-93; % identity: 78.75
Query:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC
        MEIKHK K+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASIN GN NV+T+YHKTP+FSCDCFYC
Subjt:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRI RQ+ T+KS+PVV RP PV DECVV+P SPER+SAAA V ++ + + V+ V  
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG

Query:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
         G      GDH QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022979917.1 uncharacterized protein LOC111479467 [Cucurbita maxima]; e value: 7.5e-94; % identity: 79.58
Query:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC
        MEIKHK K+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASIN GN NV+T YHKTPMFSCDCFYC
Subjt:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESA-AAVKDQVESAAVEVVAG
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRIGRQ+ T+KS+PVV RPPPVA ECVV+P SPER+SA A+V ++ + + V+ V  
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESA-AAVKDQVESAAVEVVAG

Query:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
         G      GDH QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_023528171.1 uncharacterized protein LOC111791162 [Cucurbita pepo subsp. pepo]; e value: 2.4e-92; % identity: 78.19
Query:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC
        MEIKHK K+HPS     PSSSSSVFKLLPAA+LALV+VLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASI+ GN NV+T YHKTPMFSCDCFYC
Subjt:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRIGRQ+ T+KS+PVV RPPP+ADECVV+P SPER+SAAA V ++ + + V+ V  
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG

Query:  GG-----GGDHDQKGLATKVLPDVLGFFNSR---LWSLWSPNL
         G      GDH QKGLATKVLPDVLGFFNSR   LWSLWSPNL
Subjt:  GG-----GGDHDQKGLATKVLPDVLGFFNSR---LWSLWSPNL

XP_038904588.1 uncharacterized protein LOC120090946 [Benincasa hispida]; e value: 2.3e-87; % identity: 75.82
Query:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNV-----TTAYHKTPMFSC
        MEIKHK KIHPS     PSSSSSVFKLLP A+LALVS+LSLD+REVLAYMIARSIQSSAFTS+  SRKKST+KASIN GNGNV     TT YHKTPMFSC
Subjt:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNV-----TTAYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPER-----ESAAAVKDQVE
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ  T K++PV+  PPPVADECV +P   ER        + VK+  E
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPER-----ESAAAVKDQVE

Query:  SAAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
        S  VE V  GGGGDH +KGL TKVLPDVLGF NSRLWSLWSPNL
Subjt:  SAAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KTW2 Uncharacterized protein; e value: 9.9e-84; % identity: 74.69
Query:  MEIKHKPKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTS-SHDSRKKSTKKASINAGNGNV--------TTAYHKTPMFS
        MEIKHK KIHPS  PSSSSSVFKLLPAA+LAL S+LSLD+REVLAYMIARSIQSSAFTS +  SRKKSTKK  IN GN NV         T YHKTP+FS
Subjt:  MEIKHKPKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTS-SHDSRKKSTKKASINAGNGNV--------TTAYHKTPMFS

Query:  CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQ-STTEKSVPVVPRPPPVADECVVLPPSPERESAAAVKDQVESAA
        CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ ST  K++PVV  P  V DECV +P SP  E   +V  +VE + 
Subjt:  CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQ-STTEKSVPVVPRPPPVADECVVLPPSPERESAAAVKDQVESAA

Query:  VEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
          VV   GGG+H QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  VEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A1S3BXS0 uncharacterized protein LOC103494567; e value: 7.1e-82; % identity: 72.84
Query:  MEIKHKPKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINAGNGNV-------TTAYHKTPMFSC
        MEIKHK KIHPS  PSSSSSVFKLLPAA+LAL S+LSLD+REVLAYMIARSIQSSA  TS+  SRKKSTKK SIN GN NV       TT YHKTP+FSC
Subjt:  MEIKHKPKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINAGNGNV-------TTAYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST----TEKSVPVVPRPPPVADECVVLPPSPERESAAAVKDQVES
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ +      K++PV+  P  VADECV +  SP  E   ++  +VE 
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST----TEKSVPVVPRPPPVADECVVLPPSPERESAAAVKDQVES

Query:  AAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
            VV   GGG+H QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  AAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A5A7TRT9 Uncharacterized protein; e value: 2.4e-82; % identity: 73.25
Query:  MEIKHKPKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINAGNGNV-------TTAYHKTPMFSC
        MEIKHK KIHPS  PSSSSSVFKLLPAA+LAL S+LSLD+REVLAYMIARSIQSSA  TS+  SRKKSTKK SINAGN NV       TT YHKTP+FSC
Subjt:  MEIKHKPKIHPS--PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSA-FTSSHDSRKKSTKKASINAGNGNV-------TTAYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST----TEKSVPVVPRPPPVADECVVLPPSPERESAAAVKDQVES
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKK+ GRGKRRDRIGRQ +      K++PV+  P  VADECV +  SP  E   ++  +VE 
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQST----TEKSVPVVPRPPPVADECVVLPPSPERESAAAVKDQVES

Query:  AAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
            VV   GGG+H QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  AAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1GV32 uncharacterized protein LOC111457737; e value: 4.0e-93; % identity: 78.75
Query:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC
        MEIKHK K+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASIN GN NV+T+YHKTP+FSCDCFYC
Subjt:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRI RQ+ T+KS+PVV RP PV DECVV+P SPER+SAAA V ++ + + V+ V  
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAA-VKDQVESAAVEVVAG

Query:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
         G      GDH QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1IQ09 uncharacterized protein LOC111479467; e value: 3.6e-94; % identity: 79.58
Query:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC
        MEIKHK K+HPS     PSSSSSVFKLLPAA+LALVSVLSLDEREVLAYMIARSIQSSA TS+HDSRKKS KKASIN GN NV+T YHKTPMFSCDCFYC
Subjt:  MEIKHKPKIHPS-----PSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESA-AAVKDQVESAAVEVVAG
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKK+ G+GKRRDRIGRQ+ T+KS+PVV RPPPVA ECVV+P SPER+SA A+V ++ + + V+ V  
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESA-AAVKDQVESAAVEVVAG

Query:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL
         G      GDH QKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G12020.1 unknown protein; e value: 9.7e-23; % identity: 40
Query:  EIKHKPKIHPSP---SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYCYTA
        ++  K  +HPSP    S+  +  LLP A+ +L +VLS ++REVLAY+I+        T+S+   +  T + +    +       H +P+F CDCF CYT+
Subjt:  EIKHKPKIHPSP---SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYCYTA

Query:  YWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQST
        YW RWDSSP+R+LIH+ I+AFED L   +  KK+  G+  RR R G+ S+
Subjt:  YWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQST

AT1G24270.1 unknown protein; e value: 1.0e-24; % identity: 43.71
Query:  MEIKHKPKIHPSP----SSSS------SVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSC
        M++  K K+HPSP    SSSS      SVFKLL +A+L LVSVLS ++ EVLAY+I RS+ ++   S    R                    HK P+  C
Subjt:  MEIKHKPKIHPSP----SSSS------SVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE-----KPKKSAGRGKRRDRIGRQSTTEKSV
         CF CYT+YW +WDSS NRELI+Q IEAFEDHLT  E       KK+  R K+ + I  +    KS+
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE-----KPKKSAGRGKRRDRIGRQSTTEKSV

AT1G62422.1 unknown protein; e value: 5.3e-21; % identity: 40.4
Query:  KPKIHPSP----SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYCYTAYWC
        K  +HPSP     +      LLP A+L+LV+ LS+++REVLAY+I+ S  S+       SR K  K+ +            H +P+F CDCF CYT+YW 
Subjt:  KPKIHPSP----SSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYCYTAYWC

Query:  RWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGR-GKRRDRIGRQSTTEKS
        RWD+SP R+LIH+ I+A+ED L   +K K    R GK   R+    T+  S
Subjt:  RWDSSPNRELIHQAIEAFEDHLTSGEKPKKSAGR-GKRRDRIGRQSTTEKS

AT5G13090.1 unknown protein; e value: 4.0e-37; % identity: 40.29
Query:  MEIKHKPKIHPSP---------SSSS-----------SVFKLLPAAVLALVSVLSLDEREVLAYMIAR--SIQSSAFTSSHDSRKKSTKKASINAGNGNV
        M++K K K++PSP         SSSS           SV KLLPA +L LVSVLS +EREVLAY+I R  +I     +SS +  KK + K+S N      
Subjt:  MEIKHKPKIHPSP---------SSSS-----------SVFKLLPAAVLALVSVLSLDEREVLAYMIAR--SIQSSAFTSSHDSRKKSTKKASINAGNGNV

Query:  TTAYHKTPMFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQSTTEKSVPV-------------VPRP------
            HK P+F C+CF CYT YW RWDSSPNRELIH+ IEAFE+H        +S + RGK++++ GR+ T   S P              V  P      
Subjt:  TTAYHKTPMFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKS-AGRGKRRDRIGRQSTTEKSVPV-------------VPRP------

Query:  --------PPVADECVVLPPSPERESAAAVKDQVESAAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPN
                P    E  V    PE E       Q E + V V           KGLA KVLPDVLG F+S  W LW+PN
Subjt:  --------PPVADECVVLPPSPERESAAAVKDQVESAAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSRLWSLWSPN
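
In the alignments above, the unlabelled middle line repeats the residue wherever query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. A percent identity can be estimated from that line alone. The sketch below assumes identity is counted over every alignment column, gaps included; the page does not state its formula, so treat this as an illustration rather than the database's method. The name `percent_identity` is hypothetical, and the input is the final block of the KAG7018455.1 alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment")
    # Letters in the midline mark identities; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Final block of the KAG7018455.1 alignment (40 columns).
query = "GG-----GGDHDQKGLATKVLPDVLGFFNSRLWSLWSPNL"
midline = " G" + " " * 6 + "GDH QKGLATKVLPDVLGFFNSRLWSLWSPNL"

print(f"{percent_identity(query, midline):.2f}")  # 80.00
```

This covers a single 40-column block; the % identity reported for each hit refers to the whole alignment, so the per-block value is not expected to match it.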


Sequences
CDS sequence
ATGGAGATTAAGCACAAGCCTAAAATACACCCCTCGCCGTCCTCCTCCTCCTCCGTCTTCAAGCTCCTTCCGGCCGCTGTTTTGGCCCTAGTTTCAGTCCTCTCTCTCGA
CGAACGCGAGGTCTTGGCCTACATGATCGCCAGGTCCATCCAGTCCTCCGCATTCACTTCTTCTCACGATTCCAGGAAGAAATCCACCAAAAAAGCTTCGATCAATGCCG
GAAACGGTAATGTTACTACTGCTTATCACAAAACTCCGATGTTCAGTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAATCGCGAA
CTAATCCACCAGGCGATCGAGGCGTTTGAAGATCACTTGACCAGCGGCGAGAAGCCGAAGAAGAGCGCCGGCCGAGGCAAGAGGAGAGACAGAATCGGCCGTCAAAGTAC
TACTGAGAAGTCTGTGCCTGTTGTTCCACGACCGCCGCCAGTGGCCGATGAGTGCGTCGTTCTTCCGCCGTCGCCGGAGCGTGAATCTGCGGCGGCAGTGAAGGACCAAG
TTGAGAGTGCTGCGGTGGAGGTGGTGGCGGGCGGCGGCGGCGGCGACCATGATCAGAAGGGTTTGGCGACGAAGGTACTGCCGGACGTGTTGGGGTTTTTCAATTCGCGT
TTGTGGAGTCTGTGGAGTCCGAATCTTTAA
mRNA sequence
ATGGAGATTAAGCACAAGCCTAAAATACACCCCTCGCCGTCCTCCTCCTCCTCCGTCTTCAAGCTCCTTCCGGCCGCTGTTTTGGCCCTAGTTTCAGTCCTCTCTCTCGA
CGAACGCGAGGTCTTGGCCTACATGATCGCCAGGTCCATCCAGTCCTCCGCATTCACTTCTTCTCACGATTCCAGGAAGAAATCCACCAAAAAAGCTTCGATCAATGCCG
GAAACGGTAATGTTACTACTGCTTATCACAAAACTCCGATGTTCAGTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAATCGCGAA
CTAATCCACCAGGCGATCGAGGCGTTTGAAGATCACTTGACCAGCGGCGAGAAGCCGAAGAAGAGCGCCGGCCGAGGCAAGAGGAGAGACAGAATCGGCCGTCAAAGTAC
TACTGAGAAGTCTGTGCCTGTTGTTCCACGACCGCCGCCAGTGGCCGATGAGTGCGTCGTTCTTCCGCCGTCGCCGGAGCGTGAATCTGCGGCGGCAGTGAAGGACCAAG
TTGAGAGTGCTGCGGTGGAGGTGGTGGCGGGCGGCGGCGGCGGCGACCATGATCAGAAGGGTTTGGCGACGAAGGTACTGCCGGACGTGTTGGGGTTTTTCAATTCGCGT
TTGTGGAGTCTGTGGAGTCCGAATCTTTAA
Protein sequence
MEIKHKPKIHPSPSSSSSVFKLLPAAVLALVSVLSLDEREVLAYMIARSIQSSAFTSSHDSRKKSTKKASINAGNGNVTTAYHKTPMFSCDCFYCYTAYWCRWDSSPNRE
LIHQAIEAFEDHLTSGEKPKKSAGRGKRRDRIGRQSTTEKSVPVVPRPPPVADECVVLPPSPERESAAAVKDQVESAAVEVVAGGGGGDHDQKGLATKVLPDVLGFFNSR
LWSLWSPNL
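
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies (the record does not name a translation table); `translate` is an illustrative helper, not a CuGenDB function. It is applied to the first 36 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGAGATTAAGCACAAGCCTAAAATACACCCCTCG"  # first 36 nt of the CDS
cds_end = "CCGAATCTTTAA"                            # last 12 nt of the CDS

print(translate(cds_start))  # MEIKHKPKIHPS
print(translate(cds_end))    # PNL*
```

Both fragments agree with the start (`MEIKHKPKIHPS`) and end (`PNL`) of the listed protein, with the final `TAA` read as the stop codon.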