CuGenDBv2

Clc09G09045 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G09045
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Genomic DNA, chromosome 3, P1 clone: MJL12
Genome location: ClcChr09:7794306..7795148
RNA-Seq Expression: Clc09G09045
Synteny: Clc09G09045
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
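
The genome location for this gene is given as ClcChr09:7794306..7795148. The following is a minimal sketch of how such a string could be split into chromosome, start and end, and how the span length could be computed. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chromosome:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr09:7794306..7795148")
print(chrom, start, end, span_length(start, end))
# prints: ClcChr09 7794306 7795148 843
```

If the coordinates were 0-based and half-open instead, the span would be `end - start`, one base shorter.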


Homology
GenBank top hits (e value, %identity, alignment):
KAG6593013.1 hypothetical protein SDJN03_12489, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.3e-52; %identity: 56.03)
Query:  EEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN
        +EEEE+EEE E S  S F+S+ISQF SL++S+PLYFSY LFFSPY+L++LSF+ PLL TTFLL+L P    FFSH+HQ    DQ F L    EWF+  FN
Subjt:  EEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN

Query:  ------TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCI
              T  FP    +QEPEI K+  KEE ++   +  ENG         +MG   +    +GCK FE DE++MDLLWE YE+K+          + SC 
Subjt:  ------TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCI

Query:  SISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT
          + KKDLRSLVN QKE+EE EEEEEEEE    KICCLQAL+ ST KMRFGMGKKSGL+KISKAFKGLK LHQL T  KNKT
Subjt:  SISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT

KAG7025422.1 hypothetical protein SDJN02_11917, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.8e-52; %identity: 56.34)
Query:  EEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN
        +EEEE+EEE E S  S F+S+ISQF SL++S+PLYFSY LFFSPY+L++LSF+ PLL TTFLL+L P    FFSH+HQ    DQ F L    EWF+  FN
Subjt:  EEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN

Query:  ------TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCI
              T  FP    +QEPEI K+  KEE ++   +  ENG         +MG   +    +GCK FE DE++MDLLWE YE+K+          + SC 
Subjt:  ------TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCI

Query:  SISKKKDLRSLVNQQKEIEELEEEEEEEEEANE--KICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT
          + KKDLRSLVN QKE+EE EEEEEEEEE  E  KICCLQAL+ ST KMRFGMGKKSGL+KISKAFKGLK LHQL T  KNKT
Subjt:  SISKKKDLRSLVNQQKEIEELEEEEEEEEEANE--KICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT

XP_016902223.1 PREDICTED: uncharacterized protein LOC107991592 [Cucumis melo] (e value: 4.5e-101; %identity: 82.53)
Query:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLDEWFSNFFNTIQFPQL
        E EEEE+LS SS+FLSQISQFCSLIIS+PLYFSY LFFSPYILKVLSFL PL T TFLLLLLPFLFTFFSHSHQNQDHDQ FLLDEW++NFFNTIQFP L
Subjt:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLDEWFSNFFNTIQFPQL

Query:  EEVQEPEIKKEIDKEETKDHHH--DITENGF-IRNSSKEEKMGTN-YYNKRFLGCKVFEDEEKMDLLWEMYEDKELVVV-EEEINKKNSCISISKKKDLR
        EE QEPEIKKEI++EETKDHHH  DI E+G   RNSSKEEK+GTN    K  + CKVFEDEEKMDLLWE YED+ELVVV EEE+NKKN C  ISKKKDLR
Subjt:  EEVQEPEIKKEIDKEETKDHHH--DITENGF-IRNSSKEEKMGTN-YYNKRFLGCKVFEDEEKMDLLWEMYEDKELVVV-EEEINKKNSCISISKKKDLR

Query:  SLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNG
        SLVNQQKE+EELEEEEEEEE  N KICCLQAL+FS+SKMRFGMGKK+GL+KISKAFKGLKFLHQLTTNG
Subjt:  SLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNG

XP_022960018.1 uncharacterized protein LOC111460896 isoform X1 [Cucurbita moschata] (e value: 1.4e-49; %identity: 55.04)
Query:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN----
        +EEEE E S  + F+S+ISQF SL+IS+PLYFSY LFFSPY+L++LSF+ PLL TTFLL+L P    FFSH+HQ    DQ F L    EWF+  FN    
Subjt:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN----

Query:  --TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCISISK
          T  FP    +QEPEI K+  KEE ++   +  ENG         +MG   +    +GCK FE DE++MDLLWE YE+K+          + SC   + 
Subjt:  --TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCISISK

Query:  KKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT
        KKDLRSLVN QKE+E  EEEEEEEE    KICCLQAL+ ST KMRFGMGKKSGL+KISKAFKGLK LH L T  KNKT
Subjt:  KKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT

XP_023513792.1 uncharacterized protein LOC111778295 [Cucurbita pepo subsp. pepo] (e value: 7.4e-51; %identity: 55.21)
Query:  EEEEEEEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWF
        +++++  +EEEEEE E S  + F+S+ISQF SL++S+PLYFSY LFFSPY+ ++LSF+ PLL TTFLL+L P    FFSH+HQ    DQ F L    EWF
Subjt:  EEEEEEEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWF

Query:  SNFFN------TIQFPQLEEVQEPEIKKEIDKEETKDHH-HDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEIN
        +  FN      T  FP    +QEPEI K+  KEE  DH   +  ENG         +MG   +    +GCK FE DE+KMDLLWE YE+K+         
Subjt:  SNFFN------TIQFPQLEEVQEPEIKKEIDKEETKDHH-HDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEIN

Query:  KKNSCISISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT
         + SC   + KKDLRSLVN QKE+EE EEEEEEEEE   KICCLQAL+ ST KMRFGMGKKSGL+KISKAFKGLK LHQL T  KNKT
Subjt:  KKNSCISISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT

TrEMBL top hits (e value, %identity, alignment):
A0A0A0K4L5 Genomic DNA, chromosome 3, P1 clone: MJL12 (e value: 9.9e-102; %identity: 81.32)
Query:  EEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLDEWFSNFFNTIQFPQLEEV
        EEE+LS SS+FLSQISQFCSLIIS+PLYFSY LFFSPYILKVLSF  PLL+ TFLLLLLPFLFTFFSHSHQNQDHDQ FLLDEW++NFFN IQFP LEE 
Subjt:  EEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLDEWFSNFFNTIQFPQLEEV

Query:  QEPEIKKEIDKEETK---DHHHDITENGF-IRNSSKEEKMGTN-YYNKRFLGCKVFEDEEKMDLLWEMYEDKELVVV-EEEINKKNSCISISKKKDLRSL
        QEPEIKKEI++EETK   DHH DI ENG   RN SKEEK+GTN    K  + CKVFEDEEKMDLLWE YEDKELVVV +EE+NKKN C  ISKKKDLRSL
Subjt:  QEPEIKKEIDKEETK---DHHHDITENGF-IRNSSKEEKMGTN-YYNKRFLGCKVFEDEEKMDLLWEMYEDKELVVV-EEEINKKNSCISISKKKDLRSL

Query:  VNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKTRS
        VNQQKE+EELE++EEEEEE N KICCLQAL+FSTSKMRFGMGKK+GL+KISKAFKGLKFLHQLTTNGKNKT S
Subjt:  VNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKTRS

A0A1S4E1X2 uncharacterized protein LOC107991592 (e value: 2.2e-101; %identity: 82.53)
Query:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLDEWFSNFFNTIQFPQL
        E EEEE+LS SS+FLSQISQFCSLIIS+PLYFSY LFFSPYILKVLSFL PL T TFLLLLLPFLFTFFSHSHQNQDHDQ FLLDEW++NFFNTIQFP L
Subjt:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLDEWFSNFFNTIQFPQL

Query:  EEVQEPEIKKEIDKEETKDHHH--DITENGF-IRNSSKEEKMGTN-YYNKRFLGCKVFEDEEKMDLLWEMYEDKELVVV-EEEINKKNSCISISKKKDLR
        EE QEPEIKKEI++EETKDHHH  DI E+G   RNSSKEEK+GTN    K  + CKVFEDEEKMDLLWE YED+ELVVV EEE+NKKN C  ISKKKDLR
Subjt:  EEVQEPEIKKEIDKEETKDHHH--DITENGF-IRNSSKEEKMGTN-YYNKRFLGCKVFEDEEKMDLLWEMYEDKELVVV-EEEINKKNSCISISKKKDLR

Query:  SLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNG
        SLVNQQKE+EELEEEEEEEE  N KICCLQAL+FS+SKMRFGMGKK+GL+KISKAFKGLKFLHQLTTNG
Subjt:  SLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNG

A0A6J1DVV3 uncharacterized protein LOC111023976 (e value: 3.1e-31; %identity: 45.79)
Query:  EELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQFLLDEWFSNFFNTIQFPQLEEVQEP
        EEL LS +F +    F SLI S+PLYF Y+LFFSPY+LK+L FL PLLTTT L  L   L   F    Q+  H       W +  F        E V+E 
Subjt:  EELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQFLLDEWFSNFFNTIQFPQLEEVQEP

Query:  EIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRF--LGCKVFED-------EEKMDLLWEMYEDKELVVVEEEINKKNSCISISKKKDLRSL
        E K +   E   +   D + +    +S K++    + + +R   L  K FED       +++MDLLWEMYE KE  + E   + K    + SKKKDLRSL
Subjt:  EIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRF--LGCKVFED-------EEKMDLLWEMYEDKELVVVEEEINKKNSCISISKKKDLRSL

Query:  VNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKTRS
        VN+    +E EE EE EEE   KICCLQAL+FST KMR G+GK+SGL KISKAFKGLKFLH L  +GK    S
Subjt:  VNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKTRS

A0A6J1H9Q5 uncharacterized protein LOC111460896 isoform X1 (e value: 6.7e-50; %identity: 55.04)
Query:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN----
        +EEEE E S  + F+S+ISQF SL+IS+PLYFSY LFFSPY+L++LSF+ PLL TTFLL+L P    FFSH+HQ    DQ F L    EWF+  FN    
Subjt:  EEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---EWFSNFFN----

Query:  --TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCISISK
          T  FP    +QEPEI K+  KEE ++   +  ENG         +MG   +    +GCK FE DE++MDLLWE YE+K+          + SC   + 
Subjt:  --TIQFPQLEEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEEINKKNSCISISK

Query:  KKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT
        KKDLRSLVN QKE+E  EEEEEEEE    KICCLQAL+ ST KMRFGMGKKSGL+KISKAFKGLK LH L T  KNKT
Subjt:  KKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT

A0A6J1KYN7 uncharacterized protein LOC111497570 (e value: 8.2e-48; %identity: 53.45)
Query:  MSEEEEEEEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---E
        + + +++++  +EEEE E S S+ F+S+ISQ  SL++S+PLYFSY LFFSPY+L++LSF+ PLL TTFLL+L P    FFSH+HQ    DQ F L    E
Subjt:  MSEEEEEEEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQ-FLLD---E

Query:  WFSNFFN------TIQFPQLEEVQEPEIKKEIDKEETKDHHHDIT-ENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEE
        WF+  FN      T  FP    +QEPEI K+  KEE  DH  + T ENG         +MG   +    +GCK FE DE+KMDLLWE YE+K+       
Subjt:  WFSNFFN------TIQFPQLEEVQEPEIKKEIDKEETKDHHHDIT-ENGFIRNSSKEEKMGTNYYNKRFLGCKVFE-DEEKMDLLWEMYEDKELVVVEEE

Query:  INKKNSCISISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT
           + SC   + KKDLRSLVN QKE+    EEEEEEEE   KICCLQAL+ ST KMRFGMGKKSGL+KISKAFKG K LHQL T  KNKT
Subjt:  INKKNSCISISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKT

SwissProt top hits (e value, %identity, alignment):
No hits found
Arabidopsis top hits (e value, %identity, alignment):
AT3G25130.1 unknown protein (e value: 9.4e-12; %identity: 25.62)
Query:  EEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLL---------------------PFLFTFFSHSHQNQD
        +EE     ++ LSSL LS    FCS I+++P YFSY+LFFSPYI K+LSFL PL  TT LLLL                       FLF+F S      +
Subjt:  EEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLL---------------------PFLFTFFSHSHQNQD

Query:  H-------------------------------------DQFLLDEWFSNFFNTIQFP--------------------------------QLEEVQEPEIK
        H                                     D+    + F +  +T+                                   + EEV+E ++K
Subjt:  H-------------------------------------DQFLLDEWFSNFFNTIQFP--------------------------------QLEEVQEPEIK

Query:  KEID----------KEETKDHHHDIT----------------------ENGFIRNSSKEEKMGTNYYN--------KRFLGCKVFEDE------EKMDLL
         + D          KEE+K    D+                       +   +  + +E+ +    +         +R L CK+FE+       + MD L
Subjt:  KEID----------KEETKDHHHDIT----------------------ENGFIRNSSKEEKMGTNYYN--------KRFLGCKVFEDE------EKMDLL

Query:  WEMYEDKELVVVEEEINKKNSCISISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTN
        WE YE +     + E  KK       KKK    +  +  E E + EEE+++   ++++CCLQAL+FST KM  G+ + + L K+SKAFKG+   +    +
Subjt:  WEMYEDKELVVVEEEINKKNSCISISKKKDLRSLVNQQKEIEELEEEEEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTN

Query:  GK
         K
Subjt:  GK


Sequences
CDS sequence
ATGAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACTCTCCCTCTCATCTCTATTCCTCTCTCAAATCTCCCAATTTTGTTCCTTAATTAT
TTCATACCCTCTTTATTTCTCTTATATCCTCTTCTTCTCCCCTTATATCCTCAAAGTTCTTTCCTTTTTATACCCACTTTTGACCACTACTTTCCTCCTCCTCCTTCTAC
CCTTTCTTTTCACATTCTTCTCTCATTCCCACCAAAATCAAGATCATGACCAATTCCTTCTTGATGAGTGGTTCAGCAATTTCTTCAACACCATCCAATTCCCACAACTT
GAAGAAGTTCAAGAACCTGAAATCAAGAAAGAAATCGACAAAGAAGAAACAAAGGATCATCATCATGATATTACAGAAAATGGGTTCATCAGAAATAGTTCAAAAGAAGA
AAAGATGGGAACAAATTATTACAATAAGAGGTTTTTGGGGTGTAAGGTGTTTGAAGATGAAGAGAAAATGGATTTGCTTTGGGAAATGTATGAGGACAAGGAATTGGTAG
TAGTTGAAGAAGAGATCAATAAAAAGAACAGCTGCATTTCAATTTCAAAGAAGAAGGATTTGAGGAGTTTGGTGAATCAACAAAAGGAAATAGAGGAATTAGAAGAAGAA
GAAGAAGAAGAAGAAGAAGCAAATGAGAAGATTTGTTGCTTACAAGCATTGAGATTTTCCACTTCAAAAATGAGATTTGGAATGGGAAAGAAAAGTGGTTTGAGGAAGAT
TTCAAAAGCTTTCAAAGGCCTTAAATTCTTGCATCAACTCACCACTAATGGTAAGAACAAGACACGTTCTTGA
mRNA sequence
ATGAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACTCTCCCTCTCATCTCTATTCCTCTCTCAAATCTCCCAATTTTGTTCCTTAATTAT
TTCATACCCTCTTTATTTCTCTTATATCCTCTTCTTCTCCCCTTATATCCTCAAAGTTCTTTCCTTTTTATACCCACTTTTGACCACTACTTTCCTCCTCCTCCTTCTAC
CCTTTCTTTTCACATTCTTCTCTCATTCCCACCAAAATCAAGATCATGACCAATTCCTTCTTGATGAGTGGTTCAGCAATTTCTTCAACACCATCCAATTCCCACAACTT
GAAGAAGTTCAAGAACCTGAAATCAAGAAAGAAATCGACAAAGAAGAAACAAAGGATCATCATCATGATATTACAGAAAATGGGTTCATCAGAAATAGTTCAAAAGAAGA
AAAGATGGGAACAAATTATTACAATAAGAGGTTTTTGGGGTGTAAGGTGTTTGAAGATGAAGAGAAAATGGATTTGCTTTGGGAAATGTATGAGGACAAGGAATTGGTAG
TAGTTGAAGAAGAGATCAATAAAAAGAACAGCTGCATTTCAATTTCAAAGAAGAAGGATTTGAGGAGTTTGGTGAATCAACAAAAGGAAATAGAGGAATTAGAAGAAGAA
GAAGAAGAAGAAGAAGAAGCAAATGAGAAGATTTGTTGCTTACAAGCATTGAGATTTTCCACTTCAAAAATGAGATTTGGAATGGGAAAGAAAAGTGGTTTGAGGAAGAT
TTCAAAAGCTTTCAAAGGCCTTAAATTCTTGCATCAACTCACCACTAATGGTAAGAACAAGACACGTTCTTGA
Protein sequence
MSEEEEEEEEEEEEEEEELSLSSLFLSQISQFCSLIISYPLYFSYILFFSPYILKVLSFLYPLLTTTFLLLLLPFLFTFFSHSHQNQDHDQFLLDEWFSNFFNTIQFPQL
EEVQEPEIKKEIDKEETKDHHHDITENGFIRNSSKEEKMGTNYYNKRFLGCKVFEDEEKMDLLWEMYEDKELVVVEEEINKKNSCISISKKKDLRSLVNQQKEIEELEEE
EEEEEEANEKICCLQALRFSTSKMRFGMGKKSGLRKISKAFKGLKFLHQLTTNGKNKTRS
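
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the `translate` function and the table constants are hypothetical names introduced for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
# ASSUMPTION: the record does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First five codons of the CDS listed in this record.
print(translate("ATGAGTGAAGAAGAA"))
# prints: MSEEE
```

Passing the full CDS, with its line breaks left in, should give a string that can be compared directly with the protein sequence once the protein's own line breaks are removed.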