CuGenDBv2

ClCG00G000140 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG00G000140
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Midasin like
Genome location: CG_Chr00:177328..179343
RNA-Seq Expression: ClCG00G000140
Synteny: ClCG00G000140
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: NA


Homology
GenBank top hits: e value, %identity, alignment
KAG6596817.1 hypothetical protein SDJN03_09997, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.5e-95; %identity: 76.89)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAG ENVKPRQLSDAI+DEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRS DGMKNEITETVQS+Y+ LANPKA+E+AEAST  AIP WKEADNNGSMKASTS+ EH+ET+P+EP G+SFAGNHTN EKQHV+ELQF  R     DN
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D R       N++SDADNV   PGFVSN KHNQMC D  SDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

XP_004147961.1 uncharacterized protein LOC101206797 [Cucumis sativus] (e value: 9.7e-111; %identity: 84.58)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  AIP  KE DNNGSMKASTS+ EHSE DP+EPPGFSFAGNHTNN +QH+E+LQFP  HEGRH+N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG
        D RNVE H+PNNVSDADNV LPPGFVSNRKHNQM KDAGS  DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG

XP_008448942.1 PREDICTED: uncharacterized protein LOC103490958 [Cucumis melo] (e value: 2.9e-107; %identity: 82.87)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  A+P WKE DNNGSMKASTS+ EHSE DP EPPGFSFAGNHTNN KQH+E+LQFP   EGRH N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D RN+E H+PNNVSDADNV LPPGFVSNRKHN    D   DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

XP_022152780.1 uncharacterized protein LOC111020416 [Momordica charantia] (e value: 1.2e-100; %identity: 79.28)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEV P ISGEDVIEKLKDDGDFD LRLKIIRKLKDN           EELR+NI+AIVKQSA+LNRAGTENVKPRQLSDAIYDEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPK  E+A AST       KEA NNG MKASTS FE SE DP+EPPGFSFAGNH NNEKQHVEE +FP RHEGRHD 
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        + RNVE HH NNV DAD V LPPGF SN+ HN+MCKDAGSDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

XP_038903340.1 uncharacterized protein LOC120089960 [Benincasa hispida] (e value: 1.4e-112; %identity: 86.06)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        ME+GPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNI+AIVKQSA+LNRAG ENVKPRQLSDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQ  IP  KE  NNGSMKASTS+F+HSE DP+EPPGFSFAGNHTNN KQHVEELQFP  HEGRH+N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D RNVE HHPNNV DAD+V LPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

TrEMBL top hits: e value, %identity, alignment
A0A0A0L4H3 Uncharacterized protein (e value: 4.7e-111; %identity: 84.58)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  AIP  KE DNNGSMKASTS+ EHSE DP+EPPGFSFAGNHTNN +QH+E+LQFP  HEGRH+N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG
        D RNVE H+PNNVSDADNV LPPGFVSNRKHNQM KDAGS  DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG

A0A1S3BKW9 uncharacterized protein LOC103490958 (e value: 1.4e-107; %identity: 82.87)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  A+P WKE DNNGSMKASTS+ EHSE DP EPPGFSFAGNHTNN KQH+E+LQFP   EGRH N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D RN+E H+PNNVSDADNV LPPGFVSNRKHN    D   DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

A0A6J1DEX6 uncharacterized protein LOC111020416 (e value: 5.7e-101; %identity: 79.28)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEV P ISGEDVIEKLKDDGDFD LRLKIIRKLKDN           EELR+NI+AIVKQSA+LNRAGTENVKPRQLSDAIYDEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPK  E+A AST       KEA NNG MKASTS FE SE DP+EPPGFSFAGNH NNEKQHVEE +FP RHEGRHD 
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        + RNVE HH NNV DAD V LPPGF SN+ HN+MCKDAGSDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

A0A6J1FJ59 uncharacterized protein LOC111446176 (e value: 2.1e-95; %identity: 76.49)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAG ENVKPRQLSDAI+DEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRS DGMKNEITETVQS+Y+ LANPKA+E+AEAST  AIP WKEADNNGSMKASTS+ EH+E++P+EP G+SFAGNHTN EKQHV+ELQF  R     DN
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D R       N++SDADNV   PGFVSN KHNQMC D  SDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

A0A6J1KZH9 uncharacterized protein LOC111498492 (e value: 1.4e-94; %identity: 76.1)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAG ENVKPRQLSD I+DEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRS DGMKNEITETVQS+YN LANPKA+E+AEA+T  AIP WKEADNNGSMKASTS+ EH+ET+P+EP GFSFAGNHTN EKQHV+ELQF  R      N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D R       N++SDADNV   PGFVSN KH+QMC D  SDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

SwissProt top hits: e value, %identity, alignment
No hits found
Arabidopsis top hits: e value, %identity, alignment
AT1G12530.1 unknown protein (e value: 1.6e-34; %identity: 41.15)
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        M +  KIS E+V+EKLKDDGDFD+LRLKIIR+LK+N           E+LRNN++++VK+S SL R G +N+K RQLSDAI++EVG ++++++SD LW I
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQH-VEELQFPNRHEGRHD
        IRS DGMKNEI ETVQSVY  L+NP+ E+                        S  + EH    P +     F  +  + +KQ  ++     N+ E   D
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQH-VEELQFPNRHEGRHD

Query:  NDRRNVERHHPNNVSDADNVQLPPGF
        + + N    + +   D ++ +LPPGF
Subjt:  NDRRNVERHHPNNVSDADNVQLPPGF

AT1G56420.1 unknown protein (e value: 2.0e-13; %identity: 31.48)
Query:  ISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEIIRSADG
        I  EDV+E L +DG  D LRL+II +LK N           EEL++  + + ++S  LN  G E    R+L DA+  E+   ++ K S ++W++I   DG
Subjt:  ISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEIIRSADG

Query:  MKNEITETVQSVYNKLANPKAEENAEASTQQA-IPFWKEADNNGSMKASTSKFEHSETDPLE
        +  EI ETV+ V+  L+  +    + ++ ++  +   KE +   S K    K   SE +  E
Subjt:  MKNEITETVQSVYNKLANPKAEENAEASTQQA-IPFWKEADNNGSMKASTSKFEHSETDPLE


Sequences
CDS sequence
ATGGAGGTCGGGCCCAAGATTAGTGGAGAAGATGTGATCGAAAAGCTCAAGGACGATGGCGACTTCGACAAACTCCGTCTCAAAATCATTCGCAAGCTGAAAGAC
AATAGAATGGAGCTGTGTCGAAAGAGGGATTTCTTGGAAGAATTGCGCAATAATATAGTTGCAATAGTGAAGCAATCGGCATCTCTTAATCGGGCAGGAACTGAA
AATGTAAAGCCTCGGCAACTGTCTGATGCGATATATGACGAGGTTGGGGAAGAAATAATGAACAAGGTTTCTGATAACTTATGGGAGATAATCAGATCAGCTGAT
GGCATGAAAAATGAAATCACAGAAACTGTACAATCCGTCTACAATAAGTTAGCGAACCCAAAAGCCGAGGAAAATGCCGAAGCATCTACCCAACAAGCTATACCA
TTTTGGAAGGAAGCTGATAATAATGGTTCTATGAAGGCCTCCACCAGTAAATTTGAACATTCTGAGACTGATCCACTAGAACCACCAGGTTTTTCTTTTGCTGGT
AATCATACGAACAACGAAAAGCAGCACGTAGAGGAGCTTCAATTTCCAAATCGTCATGAAGGAAGGCATGATAATGATAGACGAAATGTAGAGAGACACCATCCA
AACAATGTGTCTGATGCAGATAATGTTCAACTGCCGCCAGGCTTTGTTTCAAATAGGAAGCACAACCAAATGTGTAAAGATGCTGGTAGTGATGAAGACCCGGAC
GTTCCCCCTGGTTTCGGTTGA
mRNA sequence
ATGGAGGTCGGGCCCAAGATTAGTGGAGAAGATGTGATCGAAAAGCTCAAGGACGATGGCGACTTCGACAAACTCCGTCTCAAAATCATTCGCAAGCTGAAAGAC
AATAGAATGGAGCTGTGTCGAAAGAGGGATTTCTTGGAAGAATTGCGCAATAATATAGTTGCAATAGTGAAGCAATCGGCATCTCTTAATCGGGCAGGAACTGAA
AATGTAAAGCCTCGGCAACTGTCTGATGCGATATATGACGAGGTTGGGGAAGAAATAATGAACAAGGTTTCTGATAACTTATGGGAGATAATCAGATCAGCTGAT
GGCATGAAAAATGAAATCACAGAAACTGTACAATCCGTCTACAATAAGTTAGCGAACCCAAAAGCCGAGGAAAATGCCGAAGCATCTACCCAACAAGCTATACCA
TTTTGGAAGGAAGCTGATAATAATGGTTCTATGAAGGCCTCCACCAGTAAATTTGAACATTCTGAGACTGATCCACTAGAACCACCAGGTTTTTCTTTTGCTGGT
AATCATACGAACAACGAAAAGCAGCACGTAGAGGAGCTTCAATTTCCAAATCGTCATGAAGGAAGGCATGATAATGATAGACGAAATGTAGAGAGACACCATCCA
AACAATGTGTCTGATGCAGATAATGTTCAACTGCCGCCAGGCTTTGTTTCAAATAGGAAGCACAACCAAATGTGTAAAGATGCTGGTAGTGATGAAGACCCGGAC
GTTCCCCCTGGTTTCGGTTGA
Protein sequence
MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEIIRSAD
GMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKASTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDNDRRNVERHHP
NNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
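
The CDS and protein sequences listed above should correspond to each other under the standard genetic code. The following is a minimal sketch for checking that, not part of the CuGenDB record: the function name `translate` is hypothetical, and the use of the standard code (NCBI translation table 1) is an assumption, since the page does not state which table was used. It translates a coding sequence codon by codon and marks a stop codon with `*`.

```python
# Standard genetic code (NCBI translation table 1), assumed here.
# Amino acids are listed for codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a nucleotide coding sequence; '*' marks a stop codon."""
    # Drop line breaks and spaces so a multi-line sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with characters other than A, C, G, T raise KeyError.
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First six codons and last four codons of the CDS listed above.
print(translate("ATGGAGGTCGGGCCCAAG"))  # MEVGPK
print(translate("GGTTTCGGTTGA"))        # GFG*
```

Passing the full CDS and removing the trailing `*` gives a string that can be compared directly with the protein sequence.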