CuGenDBv2

Sed0021826 (gene) of Chayote v1 genome

Gene ID: Sed0021826
Organism: Sechium edule (Chayote v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: LG12:7598742..7601575
RNA-Seq Expression: Sed0021826
Synteny: Sed0021826
Gene Ontology terms: NA
InterPro domains: IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
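
The genome location field above packs the linkage group and the start and end coordinates into a single string. A minimal Python sketch of splitting it apart follows. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into sequence ID, start, and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG12:7598742..7601575")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene region spans more bases than the mRNA listed further down, which would be consistent with introns, although the record does not list the exon structure.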


Homology
GenBank top hits | e value | %identity (alignment follows each hit)
KAG6606968.1 Alkylated DNA repair protein ALKBH8-like protein, partial [Cucurbita argyrosperma subsp. sororia] | 5.7e-79 | 63.88
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M E+LDDE EILKQVFG+SSED   ED EDQTV+ DS+YE G I RWEQ KEIKGLWLCRSFLS QQESSLL+AIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFTRVDETSCDPTSKG 
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-

Query:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
         DVS SKVP+YL PGS+VLLWGEARYLWKHEINRKPGFQMWE QELTQGRRTSITLRKLC+VD
Subjt:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

XP_022948590.1 uncharacterized protein LOC111452220 [Cucurbita moschata] | 2.4e-77 | 63.5
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M E+LDDE EILKQVFG+SSED   ED EDQTV+ DS+YE G I RWEQ KEIKGLWLCRSFLS QQESSLLSAIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFTRVDETS +PTSKG 
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-

Query:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
         DVS SKVP+YL PGS+VLLWGEARYLWKHEINRKPGFQMWE QELTQGRRTSITLRKLC+VD
Subjt:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

XP_022998149.1 uncharacterized protein LOC111492881 [Cucurbita maxima] | 2.6e-79 | 64.26
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M E+LDDE EILKQVFG+SSED   ED EDQTV+ DS+YE G I RWEQ KEIKGLWLCRSFLS QQESSLLSAIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFTRVDETSCDPTSKG 
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-

Query:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
         DVS SKVP+YL PGS+VLLWGEARYLWKHEINRKPGFQMWE QELTQGRRTSITLRKLC+VD
Subjt:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

XP_023524850.1 uncharacterized protein LOC111788655 [Cucurbita pepo subsp. pepo] | 1.3e-78 | 63.5
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M E+LDDE EILKQVFG+SSED   ED EDQTV+ DS+YE G I RWEQ KEI+GLWLCRSFLS QQESSLL+AIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFTRVDETSCDPTSKG 
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-

Query:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
         DVS SKVP+YL PGS+VLLWGEARYLWKHEINRKPGFQMWE QELTQGRRTSITLRKLC+VD
Subjt:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

XP_038905579.1 uncharacterized protein LOC120091559 [Benincasa hispida] | 3.1e-77 | 61.6
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M EKLDDE+EILKQVFG+SS+D   EDFEDQT+M D SYE G IH+WEQVKEIKGLWLC+ FLS QQ+SSLLSAIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFTRVDETSCDP+ KG 
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-

Query:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
         D+S +KVPVYL PGS+V+LWGEARYLWKHEINRKPGFQ+WE QELTQGRRTSITLRKLC V+
Subjt:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

TrEMBL top hits | e value | %identity (alignment follows each hit)
A0A0A0KV56 Fe2OG dioxygenase domain-containing protein | 9.2e-75 | 60.46
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M EKLDDE EILKQVFG+SSED   EDF D+TV  DSSYE G IH+WEQVK+IKGLWLCR FLS QQ+SSLLSAIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGD
                                                                 GICAHVDLMRFEDGIAIVSLES C+MHFT+VD+TSCDP+ KG+
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGD

Query:  V--STSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
        V  STSKVPVYL PGS+V+LWGEARY WKHEINRKPGFQ+WE QEL QGRRTSITLRKLC V+
Subjt:  V--STSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

A0A1S3BYI6 uncharacterized protein LOC103494783 | 5.1e-73 | 59.32
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M EKLDDE EILKQVFG+SSED   EDF D+TV  DSS+E G IH+WEQVK+IKGLWLCR FLS QQ+SSLLSAIRNEGWFMEAS+NQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGD
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFT VDE+SCDP+ KG+
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGD

Query:  --VSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
          +S  KVPVYL PGS+V+LWGEARYLWKHEINRKPGFQ+WE QEL+QGRRTSITLRKLC V+
Subjt:  --VSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

A0A5A7TNC5 Alkylated DNA repair protein alkB-like protein 8 | 5.1e-73 | 59.32
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M EKLDDE EILKQVFG+SSED   EDF D+TV  DSS+E G IH+WEQVK+IKGLWLCR FLS QQ+SSLLSAIRNEGWFMEAS+NQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGD
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFT VDE+SCDP+ KG+
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGD

Query:  --VSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
          +S  KVPVYL PGS+V+LWGEARYLWKHEINRKPGFQ+WE QEL+QGRRTSITLRKLC V+
Subjt:  --VSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

A0A6J1GAA7 uncharacterized protein LOC111452220 | 1.2e-77 | 63.5
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M E+LDDE EILKQVFG+SSED   ED EDQTV+ DS+YE G I RWEQ KEIKGLWLCRSFLS QQESSLLSAIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFTRVDETS +PTSKG 
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-

Query:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
         DVS SKVP+YL PGS+VLLWGEARYLWKHEINRKPGFQMWE QELTQGRRTSITLRKLC+VD
Subjt:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

A0A6J1KDK2 uncharacterized protein LOC111492881 | 1.2e-79 | 64.26
Query:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------
        M E+LDDE EILKQVFG+SSED   ED EDQTV+ DS+YE G I RWEQ KEIKGLWLCRSFLS QQESSLLSAIRNEGWFMEASQNQ            
Subjt:  MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ------------

Query:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-
                                                                 GICAHVDLMRFEDGIAIVSLES CVMHFTRVDETSCDPTSKG 
Subjt:  ---------------------------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKG-

Query:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
         DVS SKVP+YL PGS+VLLWGEARYLWKHEINRKPGFQMWE QELTQGRRTSITLRKLC+VD
Subjt:  -DVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD

SwissProt top hits | e value | %identity (alignment follows each hit)
Q8RWY1 Alkylated DNA repair protein ALKBH8 homolog | 8.5e-09 | 36.79
Query:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS
        G+  H+D    FED I  +SL   C+M F R        ++ D    GD S  K  +YL P S++LL GEARY W H I      ++ +       RR S
Subjt:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS

Query:  ITLRKL
         TLRK+
Subjt:  ITLRKL

Q9UT12 Uncharacterized protein P8A3.02c | 1.4e-06 | 36.19
Query:  GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQE---LTQGRRTSITL
        GI  H DL  F DG+AI S  S   M FT        P  K      K  + L  GS++L+ G ARY W HEI  + G  +  D E   +++ +R S+T+
Subjt:  GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQE---LTQGRRTSITL

Query:  RKLCE
        R++ E
Subjt:  RKLCE

Arabidopsis top hits | e value | %identity (alignment follows each hit)
AT1G31600.1 RNA-binding (RRM/RBD/RNP motifs) family protein | 6.0e-10 | 36.79
Query:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS
        G+  H+D    FED I  +SL   C+M F R        ++ D    GD S  K  +YL P S++LL GEARY W H I      ++ +       RR S
Subjt:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS

Query:  ITLRKL
         TLRK+
Subjt:  ITLRKL

AT1G31600.2 RNA-binding (RRM/RBD/RNP motifs) family protein | 6.0e-10 | 36.79
Query:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS
        G+  H+D    FED I  +SL   C+M F R        ++ D    GD S  K  +YL P S++LL GEARY W H I      ++ +       RR S
Subjt:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS

Query:  ITLRKL
         TLRK+
Subjt:  ITLRKL

AT1G31600.3 RNA-binding (RRM/RBD/RNP motifs) family protein | 6.0e-10 | 36.79
Query:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS
        G+  H+D    FED I  +SL   C+M F R        ++ D    GD S  K  +YL P S++LL GEARY W H I      ++ +       RR S
Subjt:  GICAHVDL-MRFEDGIAIVSLESACVMHFTRVD-----ETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTS

Query:  ITLRKL
         TLRK+
Subjt:  ITLRKL

AT4G02485.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 5.6e-40 | 44.17
Query:  LDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ----------------
        +D+E E L+  FG+SS+D   ED  D+   E  +   G    WE+V+EI GLWL R+FLS   +S LLSAI NEGWF+E S NQ                
Subjt:  LDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQ----------------

Query:  -------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGDVSTSKVPVYLTPGSIVLLWG
                                             GICAHVDL+RFEDGIAIVSLES CVM F+        P  K +     V V L PGS++L+ G
Subjt:  -------------------------------------GICAHVDLMRFEDGIAIVSLESACVMHFTRVDETSCDPTSKGDVSTSKVPVYLTPGSIVLLWG

Query:  EARYLWKHEINRKP-GFQMWEDQELTQGRRTSITLRKLCE
        EARY WKHEINRK  GFQ+WE +E+ Q RR SITLRKLC+
Subjt:  EARYLWKHEINRKP-GFQMWEDQELTQGRRTSITLRKLCE


Sequences
CDS sequence
ATGGCGGAAAAGCTAGACGATGAAGAAGAAATCCTGAAGCAAGTTTTTGGCAATTCATCCGAAGACGAAGAAGAAGAAGACTTCGAAGATCAAACTGTAATGGAGGATTC
ATCCTATGAAGCAGGTCGCATTCACAGATGGGAGCAAGTCAAGGAAATCAAAGGGTTATGGCTCTGCAGAAGCTTCCTCTCGCACCAACAAGAATCGTCGCTGCTCTCCG
CAATCCGAAACGAAGGGTGGTTCATGGAAGCCTCTCAAAATCAGGGGATCTGTGCACATGTTGACCTCATGCGCTTTGAAGATGGAATTGCCATCGTCTCCCTCGAGTCA
GCATGCGTGATGCATTTTACCCGAGTTGATGAAACTTCCTGCGATCCTACAAGCAAGGGAGATGTATCGACGTCGAAAGTGCCCGTATATCTAACCCCAGGATCAATTGT
TCTATTGTGGGGCGAAGCTCGCTACCTCTGGAAGCACGAGATCAATCGCAAGCCCGGGTTTCAAATGTGGGAAGACCAGGAACTTACTCAGGGAAGACGAACTTCCATTA
CACTGAGAAAGCTCTGTGAAGTTGATTAG
mRNA sequence
CTCTAACAGAAGTCCCAGAGACGAGCGAGGTTCCGAGAACTGGAGAGAAATGGCGGAAAAGCTAGACGATGAAGAAGAAATCCTGAAGCAAGTTTTTGGCAATTCATCCG
AAGACGAAGAAGAAGAAGACTTCGAAGATCAAACTGTAATGGAGGATTCATCCTATGAAGCAGGTCGCATTCACAGATGGGAGCAAGTCAAGGAAATCAAAGGGTTATGG
CTCTGCAGAAGCTTCCTCTCGCACCAACAAGAATCGTCGCTGCTCTCCGCAATCCGAAACGAAGGGTGGTTCATGGAAGCCTCTCAAAATCAGGGGATCTGTGCACATGT
TGACCTCATGCGCTTTGAAGATGGAATTGCCATCGTCTCCCTCGAGTCAGCATGCGTGATGCATTTTACCCGAGTTGATGAAACTTCCTGCGATCCTACAAGCAAGGGAG
ATGTATCGACGTCGAAAGTGCCCGTATATCTAACCCCAGGATCAATTGTTCTATTGTGGGGCGAAGCTCGCTACCTCTGGAAGCACGAGATCAATCGCAAGCCCGGGTTT
CAAATGTGGGAAGACCAGGAACTTACTCAGGGAAGACGAACTTCCATTACACTGAGAAAGCTCTGTGAAGTTGATTAGAGGAAGTTCACAGTTGGAAGCTGCATTTTCTT
ATGTTCGTTCTTTGTTGTGTTCGACTGAATTCGAGTTAAAGCAGAATTCGAACAAACGGTTCGCTGTCATTTGATTAGGATATCTTTAGTTCACTGTTGTAGATCCGAAA
CCTCCTTTGACTGGAAATCATTTTTCTTCAAGGTTTA
Protein sequence
MAEKLDDEEEILKQVFGNSSEDEEEEDFEDQTVMEDSSYEAGRIHRWEQVKEIKGLWLCRSFLSHQQESSLLSAIRNEGWFMEASQNQGICAHVDLMRFEDGIAIVSLES
ACVMHFTRVDETSCDPTSKGDVSTSKVPVYLTPGSIVLLWGEARYLWKHEINRKPGFQMWEDQELTQGRRTSITLRKLCEVD
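
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal Python sketch that assumes the standard nuclear genetic code; the record does not say which translation table was used. The helper name `translate` is hypothetical, and only the first and last few codons of the CDS are shown so that the example stays short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS given in this record.
print(translate("ATGGCGGAAAAGCTAGACGATGAAGAAGAA"))
print(translate("GAAGTTGATTAG"))
```

To check the full record, paste the complete CDS into `translate` and compare the result with the protein sequence after removing its line breaks.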