CuGenDBv2

Spg030123 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030123
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold6:11591164..11596815
RNA-Seq Expression: Spg030123
Synteny: Spg030123
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
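The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts as shown below. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("scaffold6:11591164..11596815")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```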


Homology
GenBank top hits | e value | % identity
KAG6600670.1 hypothetical protein SDJN03_05903, partial [Cucurbita argyrosperma subsp. sororia] | 5.0e-69 | 77.44
Query:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK
        MK STIAAW  N+NNLK+NP+ALDS+DM LLEF +KPGVDLIRNCDLPPPQK+FTAS       R RGREE E+ G+EEKLELLKALRLSQTRAREAERK
Subjt:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK

Query:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI
        AAKLMEERD ISRA EDEARLIFCYRQ ++L+ELRVSKL+KRK   EEEE+  + NGNG  G+KWVWALAICLSVVGVG LLGYTC NVDEDPF+
Subjt:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI

KAG7031309.1 hypothetical protein SDJN02_05349, partial [Cucurbita argyrosperma subsp. argyrosperma] | 8.5e-69 | 76.92
Query:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK
        MK STIAAW  N+NNLK+NP+ALDS+DM LLEF +KPGVDLIRNCDLPPPQK+FTAS       R RGREE E+ G+EEKLELLKALRLSQTRAREAERK
Subjt:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK

Query:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI
        AAKLMEERD ISRA EDEARLIFCYRQ ++L+ELRVSKL+KRKEEEE+       NGNG  G+KWVWALAICLSVVGVG LLGYTC N DEDPF+
Subjt:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI

XP_022136673.1 uncharacterized protein LOC111008325 [Momordica charantia] | 8.0e-59 | 74.44
Query:  NPKALDSSDMLL--LEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGRE--EAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEA
        NPKA+DS+DMLL  LEFSHKPGVDLIRNCDLPPPQK+FT  +ARV+ RE  EAES GVEEKLELLKALRLSQTRAREAERKAAKLMEERD ISRA EDEA
Subjt:  NPKALDSSDMLL--LEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGRE--EAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEA

Query:  RLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEED------YNGNGEGMKWVWALAICLSVVGVGFLLGYTCNVDEDPFI
        RLIF YRQ V+LL+LR+S LQK  +EEE     +         G GE MKWVWALAIC +VVGVGFL GYTCNVDEDP +
Subjt:  RLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEED------YNGNGEGMKWVWALAICLSVVGVGFLLGYTCNVDEDPFI

XP_022943244.1 uncharacterized protein LOC111448032 [Cucurbita moschata] | 2.1e-67 | 76.41
Query:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK
        MK STIA W  N+NNLK+NP+ALDS+DM LLEF +KP VDLIRNCDLPPPQK+FTAS      AR RGREE E+ G+EEKLELLKALRLSQTRAREAERK
Subjt:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK

Query:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI
        AAKLMEERD ISRA EDEARLIFCYRQ ++L+ELRVSKL+KRKEEEE+       NGNG  G+KWVWALAICLSVVGVG LLGYTC NV EDPF+
Subjt:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI

XP_022970087.1 uncharacterized protein LOC111469054 [Cucurbita maxima] | 2.0e-70 | 77.84
Query:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK
        MK STIAAW  N+NNLK+NP+ALDS+DM LLEF +KPGVDLIRNCDLPPPQK+FTAS      AR RGREE E+ G+EEKLELLKALRLSQTRAREAERK
Subjt:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK

Query:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI
        AAKLMEERD ISRA EDEARLIFCYRQ ++L+ELRVSKL+KRK     EEEE++ NGNG G+KWVWALAICLSVVGVG LLGY C NVDEDPF+
Subjt:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI
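In BLAST-style alignments such as those above, the middle line repeats a residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, a % identity figure can be estimated by counting letters in that middle line and dividing by the alignment length. The function name `percent_identity` is hypothetical, and the example uses only the first ten columns of the first alignment, so it is not expected to reproduce the values in the table.

```python
def percent_identity(query, midline):
    """Estimate % identity from an aligned query line and its BLAST midline.

    Letters in the midline mark identical positions; '+' and blanks do not.
    The midline is padded to the query length in case trailing blanks were lost.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment")
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / length, 2)

# First ten alignment columns of the KAG6600670.1 hit (query line and midline).
print(percent_identity("MKPSTIAAWP", "MK STIAAW "))
```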

TrEMBL top hits | e value | % identity
A0A1S3C2P7 uncharacterized protein LOC103495794 | 3.1e-48 | 69.88
Query:  NPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGREEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEARLIF
        N ++LDS DM+      + GV+LIRNCDLPPPQKVF               +G+EEK+ELLKALRLSQTRAREAERKAAKLMEERD ISRA EDEARL+F
Subjt:  NPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGREEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEARLIF

Query:  CYRQCVRLLELRVSKLQKR---KEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTCN
        CYRQ ++LLELRV KLQK+   + EEEEEEEE D NG   GMKWVWALAICLSVVGVGFLLGYTCN
Subjt:  CYRQCVRLLELRVSKLQKR---KEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTCN

A0A5D3BEN7 Uncharacterized protein | 3.1e-48 | 69.88
Query:  NPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGREEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEARLIF
        N ++LDS DM+      + GV+LIRNCDLPPPQKVF               +G+EEK+ELLKALRLSQTRAREAERKAAKLMEERD ISRA EDEARL+F
Subjt:  NPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGREEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEARLIF

Query:  CYRQCVRLLELRVSKLQKR---KEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTCN
        CYRQ ++LLELRV KLQK+   + EEEEEEEE D NG   GMKWVWALAICLSVVGVGFLLGYTCN
Subjt:  CYRQCVRLLELRVSKLQKR---KEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTCN

A0A6J1C873 uncharacterized protein LOC111008325 | 3.9e-59 | 74.44
Query:  NPKALDSSDMLL--LEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGRE--EAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEA
        NPKA+DS+DMLL  LEFSHKPGVDLIRNCDLPPPQK+FT  +ARV+ RE  EAES GVEEKLELLKALRLSQTRAREAERKAAKLMEERD ISRA EDEA
Subjt:  NPKALDSSDMLL--LEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGRE--EAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALEDEA

Query:  RLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEED------YNGNGEGMKWVWALAICLSVVGVGFLLGYTCNVDEDPFI
        RLIF YRQ V+LL+LR+S LQK  +EEE     +         G GE MKWVWALAIC +VVGVGFL GYTCNVDEDP +
Subjt:  RLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEED------YNGNGEGMKWVWALAICLSVVGVGFLLGYTCNVDEDPFI

A0A6J1FTQ9 uncharacterized protein LOC111448032 | 1.0e-67 | 76.41
Query:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK
        MK STIA W  N+NNLK+NP+ALDS+DM LLEF +KP VDLIRNCDLPPPQK+FTAS      AR RGREE E+ G+EEKLELLKALRLSQTRAREAERK
Subjt:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK

Query:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI
        AAKLMEERD ISRA EDEARLIFCYRQ ++L+ELRVSKL+KRKEEEE+       NGNG  G+KWVWALAICLSVVGVG LLGYTC NV EDPF+
Subjt:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNG-EGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI

A0A6J1HY55 uncharacterized protein LOC111469054 | 9.8e-71 | 77.84
Query:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK
        MK STIAAW  N+NNLK+NP+ALDS+DM LLEF +KPGVDLIRNCDLPPPQK+FTAS      AR RGREE E+ G+EEKLELLKALRLSQTRAREAERK
Subjt:  MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTAS-----VARVRGREEAESAGVEEKLELLKALRLSQTRAREAERK

Query:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI
        AAKLMEERD ISRA EDEARLIFCYRQ ++L+ELRVSKL+KRK     EEEE++ NGNG G+KWVWALAICLSVVGVG LLGY C NVDEDPF+
Subjt:  AAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTC-NVDEDPFI

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G01240.1 unknown protein | 6.0e-12 | 30.24
Query:  KALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKV-----------------------------FTASVARVRGREEAESAGVE-------EKLELLKALRL
        ++    D L L  + K     I+NCDLPPPQK+                             F  S++     E   ++ +         K +LL+ALR 
Subjt:  KALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKV-----------------------------FTASVARVRGREEAESAGVE-------EKLELLKALRL

Query:  SQTRAREAERKAAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEE----------EEEDYNGNGEGMKWVWALAICLSVVGVGF
        SQTRAREAER A +   E+D +   L  +A  +  Y+Q ++LLE+    LQ +KEEE+EE+          +  +    GE  +++ A A+  S++G G 
Subjt:  SQTRAREAERKAAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEE----------EEEDYNGNGEGMKWVWALAICLSVVGVGF

Query:  LLGYT
        LLG+T
Subjt:  LLGYT

AT1G01240.2 unknown protein | 6.0e-12 | 30.24
Query:  KALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKV-----------------------------FTASVARVRGREEAESAGVE-------EKLELLKALRL
        ++    D L L  + K     I+NCDLPPPQK+                             F  S++     E   ++ +         K +LL+ALR 
Subjt:  KALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKV-----------------------------FTASVARVRGREEAESAGVE-------EKLELLKALRL

Query:  SQTRAREAERKAAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEE----------EEEDYNGNGEGMKWVWALAICLSVVGVGF
        SQTRAREAER A +   E+D +   L  +A  +  Y+Q ++LLE+    LQ +KEEE+EE+          +  +    GE  +++ A A+  S++G G 
Subjt:  SQTRAREAERKAAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEE----------EEEDYNGNGEGMKWVWALAICLSVVGVGF

Query:  LLGYT
        LLG+T
Subjt:  LLGYT

AT1G01240.3 unknown protein | 6.0e-12 | 30.24
Query:  KALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKV-----------------------------FTASVARVRGREEAESAGVE-------EKLELLKALRL
        ++    D L L  + K     I+NCDLPPPQK+                             F  S++     E   ++ +         K +LL+ALR 
Subjt:  KALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKV-----------------------------FTASVARVRGREEAESAGVE-------EKLELLKALRL

Query:  SQTRAREAERKAAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEE----------EEEDYNGNGEGMKWVWALAICLSVVGVGF
        SQTRAREAER A +   E+D +   L  +A  +  Y+Q ++LLE+    LQ +KEEE+EE+          +  +    GE  +++ A A+  S++G G 
Subjt:  SQTRAREAERKAAKLMEERDYISRALEDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEE----------EEEDYNGNGEGMKWVWALAICLSVVGVGF

Query:  LLGYT
        LLG+T
Subjt:  LLGYT

AT2G46550.1 unknown protein | 5.1e-11 | 30.69
Query:  VDLIRNCDLPPPQKV------------------FTASVARVRG---------REEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALE
        +D + NCDLP PQK+                  ++ S   ++G         R EA S     K ELL+ALR SQTRAREAE  A +   E++++ + L 
Subjt:  VDLIRNCDLPPPQKV------------------FTASVARVRG---------REEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALE

Query:  DEARLIFCYRQCVRLLELRVSKLQKRKEE---------------------EEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYT
         +A  +F Y+Q ++LL+L    LQ + +E                      +E  +     G   G K+   LA+ +S+VG G LLG+T
Subjt:  DEARLIFCYRQCVRLLELRVSKLQKRKEE---------------------EEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYT

AT2G46550.2 unknown protein | 5.1e-11 | 30.69
Query:  VDLIRNCDLPPPQKV------------------FTASVARVRG---------REEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALE
        +D + NCDLP PQK+                  ++ S   ++G         R EA S     K ELL+ALR SQTRAREAE  A +   E++++ + L 
Subjt:  VDLIRNCDLPPPQKV------------------FTASVARVRG---------REEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRALE

Query:  DEARLIFCYRQCVRLLELRVSKLQKRKEE---------------------EEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYT
         +A  +F Y+Q ++LL+L    LQ + +E                      +E  +     G   G K+   LA+ +S+VG G LLG+T
Subjt:  DEARLIFCYRQCVRLLELRVSKLQKRKEE---------------------EEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYT


Sequences
CDS sequence
ATGAAGCCGTCGACGATCGCCGCCTGGCCGACGAATCACAACAACCTCAAGGCCAATCCGAAAGCCCTAGATTCCAGCGACATGCTCCTGCTGGAATTTTCCCACAAACC
CGGCGTTGACCTAATTCGGAACTGCGATCTGCCGCCGCCGCAGAAGGTGTTCACGGCGTCGGTGGCTAGAGTTAGAGGTCGGGAGGAAGCGGAATCGGCGGGGGTGGAGG
AGAAATTGGAGTTGCTGAAGGCTCTGAGACTGTCGCAAACGAGGGCGAGAGAGGCGGAGAGGAAGGCGGCGAAATTGATGGAGGAGAGGGATTACATAAGTAGGGCTTTG
GAAGATGAGGCGAGATTGATCTTCTGTTACCGACAGTGCGTGAGATTGTTGGAGCTTAGGGTTTCGAAGTTGCAGAAAAGGAAAGAGGAAGAAGAAGAAGAAGAAGAAGA
AGAAGATTACAATGGCAATGGGGAAGGAATGAAATGGGTTTGGGCTTTGGCAATTTGTTTGAGTGTTGTGGGAGTGGGCTTTCTATTGGGCTATACGTGTAATGTTGACG
AAGACCCATTTATTACTTGGGGATTTATTTTCCAATTTGTGATTGCAAATCAAACTGGAAAATTACTAGGAGGCGTTAGAATTGAGATTAGGGAGGATCATACTCCTCTT
GCTGTTGCAAACAGGATCTCAAAGCTGATTGAAAAATCCTCTAAGGATAAGGTTGCAGTCAAAGACAACCCGCTGTTCGAATCTGTCGTTCCAACATCTAAGCAGCCAAA
TGATGCACTAAATCCTGATGTGATGTCTGTCATGATGGCTGATGTAGACCAGGATGAAAGAATGGCAGAGATGGAAAGCAAACTCAATCTCTTGATGAAGGCAGTTGATG
AAAGAGGTCTGGAGATTGCCTATTTGAAGAACCAGCTGCAAAACCAAGAAACGGCTGAGTCTAGCCAAACCCCTGATGAGAAAGAAGGCTGGACCCTTGTCGTTCGTCGC
AAAAAGCAAAAGCAAAGTTACACACAAAAAGAGTCCCGCCTATTTCGAGACAGTAAAAGAAAGGTTAAGTCTCAAAGGAAGAAGGGAAAAAAGAAGTCAAGGAGGTCAAA
GCCTGTCGTGGAGGAAAATGAAGATTCCTTTTGCCCTCCACAACCCATAACTTTGGCAGAATACTTCCCAAGGCGCTTTCTCGATGATAGTCGAGGAGAAGCACTTGAAA
TCGTCACGTGTCACATTGTGGACGTGGTGGAAGATGATGATGTTCCTGCTAATTCCTCGGAAACGGTGGCAAGTCCAGGAGACTTATCCTCCTTTAGCATAAAGGACTTA
TTGTCACTTCCTCAGGAAGCTAAAACTGTTGAGGATGTGAAGGCATCTGACCTGAAAAAGGGTGAAACATCTACAAGCCTTGTGAAACCTAAAGTTGTAGAGGATGAGAA
GTGTTCACCTGTCCTACGATACGTCCCTTTATCCCGGCGTAAAAAGGGTGAATCACCTTTCACTGAATGTCCAAAAAGCATAAAGAAGAAGCTTCTAAAGGAAGGCTATG
GTCTGCCTCCGACGAGAAAATGA
mRNA sequence
ATGAAGCCGTCGACGATCGCCGCCTGGCCGACGAATCACAACAACCTCAAGGCCAATCCGAAAGCCCTAGATTCCAGCGACATGCTCCTGCTGGAATTTTCCCACAAACC
CGGCGTTGACCTAATTCGGAACTGCGATCTGCCGCCGCCGCAGAAGGTGTTCACGGCGTCGGTGGCTAGAGTTAGAGGTCGGGAGGAAGCGGAATCGGCGGGGGTGGAGG
AGAAATTGGAGTTGCTGAAGGCTCTGAGACTGTCGCAAACGAGGGCGAGAGAGGCGGAGAGGAAGGCGGCGAAATTGATGGAGGAGAGGGATTACATAAGTAGGGCTTTG
GAAGATGAGGCGAGATTGATCTTCTGTTACCGACAGTGCGTGAGATTGTTGGAGCTTAGGGTTTCGAAGTTGCAGAAAAGGAAAGAGGAAGAAGAAGAAGAAGAAGAAGA
AGAAGATTACAATGGCAATGGGGAAGGAATGAAATGGGTTTGGGCTTTGGCAATTTGTTTGAGTGTTGTGGGAGTGGGCTTTCTATTGGGCTATACGTGTAATGTTGACG
AAGACCCATTTATTACTTGGGGATTTATTTTCCAATTTGTGATTGCAAATCAAACTGGAAAATTACTAGGAGGCGTTAGAATTGAGATTAGGGAGGATCATACTCCTCTT
GCTGTTGCAAACAGGATCTCAAAGCTGATTGAAAAATCCTCTAAGGATAAGGTTGCAGTCAAAGACAACCCGCTGTTCGAATCTGTCGTTCCAACATCTAAGCAGCCAAA
TGATGCACTAAATCCTGATGTGATGTCTGTCATGATGGCTGATGTAGACCAGGATGAAAGAATGGCAGAGATGGAAAGCAAACTCAATCTCTTGATGAAGGCAGTTGATG
AAAGAGGTCTGGAGATTGCCTATTTGAAGAACCAGCTGCAAAACCAAGAAACGGCTGAGTCTAGCCAAACCCCTGATGAGAAAGAAGGCTGGACCCTTGTCGTTCGTCGC
AAAAAGCAAAAGCAAAGTTACACACAAAAAGAGTCCCGCCTATTTCGAGACAGTAAAAGAAAGGTTAAGTCTCAAAGGAAGAAGGGAAAAAAGAAGTCAAGGAGGTCAAA
GCCTGTCGTGGAGGAAAATGAAGATTCCTTTTGCCCTCCACAACCCATAACTTTGGCAGAATACTTCCCAAGGCGCTTTCTCGATGATAGTCGAGGAGAAGCACTTGAAA
TCGTCACGTGTCACATTGTGGACGTGGTGGAAGATGATGATGTTCCTGCTAATTCCTCGGAAACGGTGGCAAGTCCAGGAGACTTATCCTCCTTTAGCATAAAGGACTTA
TTGTCACTTCCTCAGGAAGCTAAAACTGTTGAGGATGTGAAGGCATCTGACCTGAAAAAGGGTGAAACATCTACAAGCCTTGTGAAACCTAAAGTTGTAGAGGATGAGAA
GTGTTCACCTGTCCTACGATACGTCCCTTTATCCCGGCGTAAAAAGGGTGAATCACCTTTCACTGAATGTCCAAAAAGCATAAAGAAGAAGCTTCTAAAGGAAGGCTATG
GTCTGCCTCCGACGAGAAAATGA
Protein sequence
MKPSTIAAWPTNHNNLKANPKALDSSDMLLLEFSHKPGVDLIRNCDLPPPQKVFTASVARVRGREEAESAGVEEKLELLKALRLSQTRAREAERKAAKLMEERDYISRAL
EDEARLIFCYRQCVRLLELRVSKLQKRKEEEEEEEEEEDYNGNGEGMKWVWALAICLSVVGVGFLLGYTCNVDEDPFITWGFIFQFVIANQTGKLLGGVRIEIREDHTPL
AVANRISKLIEKSSKDKVAVKDNPLFESVVPTSKQPNDALNPDVMSVMMADVDQDERMAEMESKLNLLMKAVDERGLEIAYLKNQLQNQETAESSQTPDEKEGWTLVVRR
KKQKQSYTQKESRLFRDSKRKVKSQRKKGKKKSRRSKPVVEENEDSFCPPQPITLAEYFPRRFLDDSRGEALEIVTCHIVDVVEDDDVPANSSETVASPGDLSSFSIKDL
LSLPQEAKTVEDVKASDLKKGETSTSLVKPKVVEDEKCSPVLRYVPLSRRKKGESPFTECPKSIKKKLLKEGYGLPPTRK
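The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB; the example translates only the first and last 30 nucleotides of the CDS listed above.

```python
# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last 30 nucleotides of the CDS listed above.
print(translate("ATGAAGCCGTCGACGATCGCCGCCTGGCCG"))
print(translate("GGCTATGGTCTGCCTCCGACGAGAAAATGA"))
```

Passing the full CDS text, line breaks included, to `translate` and comparing the result with the protein sequence would extend this check to the whole record.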