CuGenDBv2

CSPI07G14620 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G14620
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Genomic DNA, chromosome 3, P1 clone: MJL12
Genome location: Chr7:13101049..13102068
RNA-Seq Expression: CSPI07G14620
Synteny: CSPI07G14620
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
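
The genome location above is written in a `Chr:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state explicitly), the span covered by the gene could be computed as follows. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chromosome = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chromosome, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("Chr7:13101049..13102068"))
```

Under that assumption, the record's location spans 1020 bases.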


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593013.1 hypothetical protein SDJN03_12489, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.5e-56; % identity: 56.63
Query:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN------I
        EE + SP S F+S+ISQF SL++SHPLYFSYFLFFSPY+L++LSF SPLL  TFLL+L P    FFSH+HQ    DQ F L    EW+N  FN       
Subjt:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN------I

Query:  IQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKKNRCIS
          FPL    QEPEI K+  +EE  DR D   +  ENGI         ++G    +V     CK FE DE++MDLLWEKYE+K+             +  +
Subjt:  IQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKKNRCIS

Query:  KKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT
         KKDLRSLVN QKEMEE E++EEEEE E GKICCLQALK ST KMRFGMGKK+GLKKISKAFKGLK LHQL T  KNKT
Subjt:  KKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT

KAG7025422.1 hypothetical protein SDJN02_11917, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 7.5e-56; % identity: 56.58
Query:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN------I
        EE + SP S F+S+ISQF SL++SHPLYFSYFLFFSPY+L++LSF SPLL  TFLL+L P    FFSH+HQ    DQ F L    EW+N  FN       
Subjt:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN------I

Query:  IQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKKNRCIS
          FPL    QEPEI K+  +EE  DR D   +  ENGI         ++G    +V     CK FE DE++MDLLWEKYE+K+             +  +
Subjt:  IQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKKNRCIS

Query:  KKKDLRSLVNQQKEMEELEDQEEEEEE--ENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT
         KKDLRSLVN QKEMEE E++EEEEEE  E GKICCLQALK ST KMRFGMGKK+GLKKISKAFKGLK LHQL T  KNKT
Subjt:  KKKDLRSLVNQQKEMEELEDQEEEEEE--ENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT

XP_016902223.1 PREDICTED: uncharacterized protein LOC107991592 [Cucumis melo]; e value: 3.1e-126; % identity: 93.96
Query:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLLEEA
        EEEKLSPSSIFLSQISQF SLIISHPLYFSYFLFFSPYILKVLSF SPL +VTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFN IQFPLLEEA
Subjt:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLLEEA

Query:  QEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRSLVN
        QEPEIKKEINQEETKD H HH DIIE+GISTRN SKEEKVGTNCSI+KSVMECKVFEDEEKMDLLWEKYED+ELVVVI+EEVNKKNRCISKKKDLRSLVN
Subjt:  QEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRSLVN

Query:  QQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNG
        QQKEMEELE  EEEEEEENGKICCLQALKFS+SKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNG
Subjt:  QQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNG

XP_023004158.1 uncharacterized protein LOC111497570 [Cucurbita maxima]; e value: 3.7e-55; % identity: 55.63
Query:  MSTEEEKL--SPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN--
        MS EEE+L  SPS+ F+S+ISQ  SL++SHPLYFSYFLFFSPY+L++LSF SPLL+ TFLL+L P    FFSH+HQ    DQ F L    EW+N  FN  
Subjt:  MSTEEEKL--SPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN--

Query:  ----IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKK
               FPL    QEP    EIN++E K+      +  ENGI         ++G    +V     CK FE DE+KMDLLWEKYE+K+            
Subjt:  ----IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKK

Query:  NRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT
         +  + KKDLRSLVN QKEMEE    EEEEEEE GKICCLQALK ST KMRFGMGKK+GLKKISKAFKG K LHQL T  KNKT
Subjt:  NRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT

XP_023513792.1 uncharacterized protein LOC111778295 [Cucurbita pepo subsp. pepo]; e value: 1.2e-56; % identity: 55.24
Query:  MSTEEE----KLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN
        MS EEE    + SP + F+S+ISQF SL++SHPLYFSYFLFFSPY+ ++LSF SPLL+ TFLL+L P    FFSH+HQ    DQ F L    EW+N  FN
Subjt:  MSTEEE----KLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN

Query:  ------IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVN
                 FPL    QEPEI K+  +EE  +  +      ENGI         ++G    +V     CK FE DE+KMDLLWEKYE+K+          
Subjt:  ------IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVN

Query:  KKNRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT
           +  + KKDLRSLVN QKEMEE E++EEEEEEE GKICCLQALK ST KMRFGMGKK+GLKKISKAFKGLK LHQL T  KNKT
Subjt:  KKNRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K4L5 Genomic DNA, chromosome 3, P1 clone: MJL12; e value: 2.1e-141; % identity: 99.27
Query:  MSTEEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLL
        MSTEEEKLSPSSIFLSQISQF SLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLL
Subjt:  MSTEEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLL

Query:  EEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRS
        EEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRS
Subjt:  EEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRS

Query:  LVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKTHS
        LVNQQKEMEELEDQ EEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKTHS
Subjt:  LVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKTHS

A0A1S4E1X2 uncharacterized protein LOC107991592; e value: 1.5e-126; % identity: 93.96
Query:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLLEEA
        EEEKLSPSSIFLSQISQF SLIISHPLYFSYFLFFSPYILKVLSF SPL +VTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFN IQFPLLEEA
Subjt:  EEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLLEEA

Query:  QEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRSLVN
        QEPEIKKEINQEETKD H HH DIIE+GISTRN SKEEKVGTNCSI+KSVMECKVFEDEEKMDLLWEKYED+ELVVVI+EEVNKKNRCISKKKDLRSLVN
Subjt:  QEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRSLVN

Query:  QQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNG
        QQKEMEELE  EEEEEEENGKICCLQALKFS+SKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNG
Subjt:  QQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNG

A0A6J1DVV3 uncharacterized protein LOC111023976; e value: 3.6e-32; % identity: 44.24
Query:  EKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLLE-EAQ
        E+L  S IF +    F SLI SHPLYF Y LFFSPY+LK+L F SPLL+ T L  L   L   F    Q+  H        W N  F   + P+ E E +
Subjt:  EKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLLE-EAQ

Query:  EPEIKKEI-NQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFED-------EEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKK
          +I + I N+E  +      C   +  I  R+  +            + +  K FED       +++MDLLWE YE KE  +    + +K+    SKKK
Subjt:  EPEIKKEI-NQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFED-------EEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKK

Query:  DLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKTHS
        DLRSLVN+    +E E+ EE EEEE GKICCLQALKFST KMR G+GK++GL KISKAFKGLKFLH L  +GK   HS
Subjt:  DLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKTHS

A0A6J1H9Q5 uncharacterized protein LOC111460896 isoform X1; e value: 2.3e-55; % identity: 56.34
Query:  MSTEEEKL--SPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN--
        MS EEE+L  SP + F+S+ISQF SL+ISHPLYFSYFLFFSPY+L++LSF SPLL  TFLL+L P    FFSH+HQ    DQ F L    EW+N  FN  
Subjt:  MSTEEEKL--SPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN--

Query:  ----IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKK
               FPL    QEPEI K+  +EE  DR D   +  ENGI         ++G    +V     CK FE DE++MDLLWEKYE+K+            
Subjt:  ----IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKK

Query:  NRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT
         +  + KKDLRSLVN QKEME  E++EEEEE E GKICCLQALK ST KMRFGMGKK+GLKKISKAFKGLK LH L T  KNKT
Subjt:  NRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT

A0A6J1KYN7 uncharacterized protein LOC111497570; e value: 1.8e-55; % identity: 55.63
Query:  MSTEEEKL--SPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN--
        MS EEE+L  SPS+ F+S+ISQ  SL++SHPLYFSYFLFFSPY+L++LSF SPLL+ TFLL+L P    FFSH+HQ    DQ F L    EW+N  FN  
Subjt:  MSTEEEKL--SPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLD---EWYNNFFN--

Query:  ----IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKK
               FPL    QEP    EIN++E K+      +  ENGI         ++G    +V     CK FE DE+KMDLLWEKYE+K+            
Subjt:  ----IIQFPLLEEAQEPEIKKEINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFE-DEEKMDLLWEKYEDKELVVVIKEEVNKK

Query:  NRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT
         +  + KKDLRSLVN QKEMEE    EEEEEEE GKICCLQALK ST KMRFGMGKK+GLKKISKAFKG K LHQL T  KNKT
Subjt:  NRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G25130.1 unknown protein; e value: 6.4e-13; % identity: 25.56
Query:  STEEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLL---------------------PFLFTFFSHSHQNQDH--
        S    K+  SS+ LS    F S I++HP YFSY LFFSPYI K+LSF SPL   T LLLL                       FLF+F S      +H  
Subjt:  STEEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLL---------------------PFLFTFFSHSHQNQDH--

Query:  -----------------------------------DQLFLLDEWYNN-----------------------FFNIIQFPLLEEAQEPEIKKEINQEETKDR
                                           D+L  +D++ ++                       F ++I     EE ++ E K+E+ +++ K +
Subjt:  -----------------------------------DQLFLLDEWYNN-----------------------FFNIIQFPLLEEAQEPEIKKEINQEETKDR

Query:  HDHHCD---------------------------------IIENGISTRNISKEEKVGTNCSIV-----------KSVMECKVFEDE------EKMDLLWE
         D   D                                  +  G   RN+  + +   N S+            +  + CK+FE+       + MD LWE
Subjt:  HDHHCD---------------------------------IIENGISTRNISKEEKVGTNCSIV-----------KSVMECKVFEDE------EKMDLLWE

Query:  KYEDKELVVVIKEEVNKKNRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGK
         YE +       EE  KK     KKK    +  +  E E + ++E+++  ++ ++CCLQALKFST KM  G+ + N L K+SKAFKG+   +    + K
Subjt:  KYEDKELVVVIKEEVNKKNRCISKKKDLRSLVNQQKEMEELEDQEEEEEEENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGK


Sequences
CDS sequence
ATGAGTACTGAAGAAGAAAAACTCTCCCCTTCATCTATATTCCTCTCTCAAATCTCCCAATTTGGTTCCTTAATTATCTCACACCCTCTCTATTTCTCTTACTTCCTCTT
CTTTTCCCCTTACATCCTCAAAGTCCTCTCCTTTTTCTCCCCACTTTTGTCCGTTACTTTTCTTCTCCTTCTTCTACCATTTCTCTTCACATTCTTCTCTCATTCCCACC
AAAATCAAGATCATGACCAATTGTTCCTTCTCGATGAGTGGTACAACAATTTCTTCAACATCATCCAATTCCCGCTACTTGAAGAAGCTCAAGAACCTGAAATCAAGAAA
GAAATCAACCAAGAAGAAACCAAAGATCGTCATGATCATCATTGTGATATTATTGAAAATGGGATCAGTACAAGAAATATCTCAAAAGAAGAAAAAGTGGGAACAAATTG
CAGTATTGTTAAGAGTGTTATGGAGTGCAAAGTGTTTGAAGATGAAGAGAAAATGGATTTGCTTTGGGAAAAGTATGAGGACAAAGAATTGGTAGTAGTTATTAAAGAAG
AGGTGAATAAGAAGAATAGATGCATTTCAAAGAAGAAGGATTTGAGGAGTTTGGTGAATCAACAAAAGGAAATGGAAGAATTAGAAGATCAGGAAGAAGAAGAAGAAGAA
GAAAATGGGAAGATTTGTTGCTTACAAGCATTGAAATTTTCCACTTCAAAAATGAGATTTGGAATGGGAAAGAAAAATGGTTTGAAGAAGATTTCTAAAGCTTTTAAAGG
GCTTAAATTCTTGCATCAACTCACTACTAATGGTAAGAACAAGACACACTCTTGA
mRNA sequence
CCCTCAATTTCTAAAACACAACACACACTTTCACAAACTGATTCACACCAAAACTTCCCTATTGAAAATCTTTTCTTTCTCCTACAAGATTCCATAGAAAAAAAGAAAAA
TAGGGGAGAAAATGAGTACTGAAGAAGAAAAACTCTCCCCTTCATCTATATTCCTCTCTCAAATCTCCCAATTTGGTTCCTTAATTATCTCACACCCTCTCTATTTCTCT
TACTTCCTCTTCTTTTCCCCTTACATCCTCAAAGTCCTCTCCTTTTTCTCCCCACTTTTGTCCGTTACTTTTCTTCTCCTTCTTCTACCATTTCTCTTCACATTCTTCTC
TCATTCCCACCAAAATCAAGATCATGACCAATTGTTCCTTCTCGATGAGTGGTACAACAATTTCTTCAACATCATCCAATTCCCGCTACTTGAAGAAGCTCAAGAACCTG
AAATCAAGAAAGAAATCAACCAAGAAGAAACCAAAGATCGTCATGATCATCATTGTGATATTATTGAAAATGGGATCAGTACAAGAAATATCTCAAAAGAAGAAAAAGTG
GGAACAAATTGCAGTATTGTTAAGAGTGTTATGGAGTGCAAAGTGTTTGAAGATGAAGAGAAAATGGATTTGCTTTGGGAAAAGTATGAGGACAAAGAATTGGTAGTAGT
TATTAAAGAAGAGGTGAATAAGAAGAATAGATGCATTTCAAAGAAGAAGGATTTGAGGAGTTTGGTGAATCAACAAAAGGAAATGGAAGAATTAGAAGATCAGGAAGAAG
AAGAAGAAGAAGAAAATGGGAAGATTTGTTGCTTACAAGCATTGAAATTTTCCACTTCAAAAATGAGATTTGGAATGGGAAAGAAAAATGGTTTGAAGAAGATTTCTAAA
GCTTTTAAAGGGCTTAAATTCTTGCATCAACTCACTACTAATGGTAAGAACAAGACACACTCTTGAAATCTTTTTCTACTCAGAATTTGAATGCTTTATTTGTATTTTTT
TCTGGGTTTCTTCTCTCTCTCCGTTTTTTT
Protein sequence
MSTEEEKLSPSSIFLSQISQFGSLIISHPLYFSYFLFFSPYILKVLSFFSPLLSVTFLLLLLPFLFTFFSHSHQNQDHDQLFLLDEWYNNFFNIIQFPLLEEAQEPEIKK
EINQEETKDRHDHHCDIIENGISTRNISKEEKVGTNCSIVKSVMECKVFEDEEKMDLLWEKYEDKELVVVIKEEVNKKNRCISKKKDLRSLVNQQKEMEELEDQEEEEEE
ENGKICCLQALKFSTSKMRFGMGKKNGLKKISKAFKGLKFLHQLTTNGKNKTHS
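
The protein sequence above can be cross-checked against the CDS by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base; `translate_cds` is a hypothetical helper name and not a CuGenDB tool.

```python
# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(cds, to_stop=True):
    """Translate a DNA coding sequence into a one-letter protein string.

    Whitespace and line breaks are ignored, so the wrapped sequence can
    be pasted as-is. Codons with characters other than A/C/G/T become 'X'.
    If to_stop is True, translation ends at the first stop codon and the
    stop itself is not included in the result.
    """
    sequence = "".join(cds.split()).upper()
    protein = []
    # Walk over complete codons only; a trailing partial codon is ignored.
    for position in range(0, len(sequence) - len(sequence) % 3, 3):
        residue = CODON_TABLE.get(sequence[position:position + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS listed in this record.
    print(translate_cds("ATGAGTACTGAAGAAGAAAAACTC"))
```

Pasting the full CDS from this record into `translate_cds` and comparing the result with the listed protein sequence is one way to confirm that the two entries agree.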