; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G03600 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G03600
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionMidasin like
Genome locationClcChr10:3928604..3931066
RNA-Seq ExpressionClc10G03600
SyntenyClc10G03600
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596817.1 hypothetical protein SDJN03_09997, partial [Cucurbita argyrosperma subsp. sororia]4.4e-9576.49Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAG ENVKPRQLSDAI+DEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRS DGMKNEITETVQS+Y+ LANPKA+E+AEAST  AIP WKEADNNGSMK+STS+ EH+ET+P+EP G+SFAGNHTN EKQHV+ELQF  R     DN
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D R       N++SDADNV   PGFVSN KHNQMC D  SDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

XP_004147961.1 uncharacterized protein LOC101206797 [Cucumis sativus]2.8e-11084.19Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  AIP  KE DNNGSMK+STS+ EHSE DP+EPPGFSFAGNHTNN +QH+E+LQFP  HEGRH+N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG
        D RNVE H+PNNVSDADNV LPPGFVSNRKHNQM KDAGS  DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG

XP_008448942.1 PREDICTED: uncharacterized protein LOC103490958 [Cucumis melo]8.5e-10782.47Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  A+P WKE DNNGSMK+STS+ EHSE DP EPPGFSFAGNHTNN KQH+E+LQFP   EGRH N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D RN+E H+PNNVSDADNV LPPGFVSNRKHN    D   DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

XP_022152780.1 uncharacterized protein LOC111020416 [Momordica charantia]3.5e-10078.88Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEV P ISGEDVIEKLKDDGDFD LRLKIIRKLKDN           EELR+NI+AIVKQSA+LNRAGTENVKPRQLSDAIYDEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPK  E+A AST       KEA NNG MK+STS FE SE DP+EPPGFSFAGNH NNEKQHVEE +FP RHEGRHD 
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        + RNVE HH NNV DAD V LPPGF SN+ HN+MCKDAGSDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

XP_038903340.1 uncharacterized protein LOC120089960 [Benincasa hispida]3.9e-11285.66Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        ME+GPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNI+AIVKQSA+LNRAG ENVKPRQLSDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQ  IP  KE  NNGSMK+STS+F+HSE DP+EPPGFSFAGNHTNN KQHVEELQFP  HEGRH+N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D RNVE HHPNNV DAD+V LPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

TrEMBL top hitse value%identityAlignment
A0A0A0L4H3 Uncharacterized protein1.4e-11084.19Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  AIP  KE DNNGSMK+STS+ EHSE DP+EPPGFSFAGNHTNN +QH+E+LQFP  HEGRH+N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG
        D RNVE H+PNNVSDADNV LPPGFVSNRKHNQM KDAGS  DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGS--DEDPDVPPGFG

A0A1S3BKW9 uncharacterized protein LOC1034909584.1e-10782.47Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGE VIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAGTENVKPRQ+SDAIYDEVGEEIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPKAEENAEAST  A+P WKE DNNGSMK+STS+ EHSE DP EPPGFSFAGNHTNN KQH+E+LQFP   EGRH N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D RN+E H+PNNVSDADNV LPPGFVSNRKHN    D   DEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

A0A6J1DEX6 uncharacterized protein LOC1110204161.7e-10078.88Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEV P ISGEDVIEKLKDDGDFD LRLKIIRKLKDN           EELR+NI+AIVKQSA+LNRAGTENVKPRQLSDAIYDEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRSADGMKNEITETVQSVYNKLANPK  E+A AST       KEA NNG MK+STS FE SE DP+EPPGFSFAGNH NNEKQHVEE +FP RHEGRHD 
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        + RNVE HH NNV DAD V LPPGF SN+ HN+MCKDAGSDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

A0A6J1FJ59 uncharacterized protein LOC1114461766.1e-9576.1Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAG ENVKPRQLSDAI+DEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRS DGMKNEITETVQS+Y+ LANPKA+E+AEAST  AIP WKEADNNGSMK+STS+ EH+E++P+EP G+SFAGNHTN EKQHV+ELQF  R     DN
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D R       N++SDADNV   PGFVSN KHNQMC D  SDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

A0A6J1KZH9 uncharacterized protein LOC1114984924.0e-9475.7Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDN           EELRNNIVAIVKQSA+LNRAG ENVKPRQLSD I+DEVG+EIM+KVSDNLWEI
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN
        IRS DGMKNEITETVQS+YN LANPKA+E+AEA+T  AIP WKEADNNGSMK+STS+ EH+ET+P+EP GFSFAGNHTN EKQHV+ELQF  R      N
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDN

Query:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG
        D R       N++SDADNV   PGFVSN KH+QMC D  SDEDPDVPPGFG
Subjt:  DRRNVERHHPNNVSDADNVQLPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12530.1 unknown protein1.6e-3441.15Show/hide
Query:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI
        M +  KIS E+V+EKLKDDGDFD+LRLKIIR+LK+N           E+LRNN++++VK+S SL R G +N+K RQLSDAI++EVG ++++++SD LW I
Subjt:  MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEI

Query:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQH-VEELQFPNRHEGRHD
        IRS DGMKNEI ETVQSVY  L+NP+ E+                       +S  + EH    P +     F  +  + +KQ  ++     N+ E   D
Subjt:  IRSADGMKNEITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQH-VEELQFPNRHEGRHD

Query:  NDRRNVERHHPNNVSDADNVQLPPGF
        + + N    + +   D ++ +LPPGF
Subjt:  NDRRNVERHHPNNVSDADNVQLPPGF

AT1G56420.1 unknown protein2.0e-1331.48Show/hide
Query:  ISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEIIRSADG
        I  EDV+E L +DG  D LRL+II +LK N           EEL++  + + ++S  LN  G E    R+L DA+  E+   ++ K S ++W++I   DG
Subjt:  ISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEIIRSADG

Query:  MKNEITETVQSVYNKLANPKAEENAEASTQQA-IPFWKEADNNGSMKSSTSKFEHSETDPLE
        +  EI ETV+ V+  L+  +    + ++ ++  +   KE +   S K+   K   SE +  E
Subjt:  MKNEITETVQSVYNKLANPKAEENAEASTQQA-IPFWKEADNNGSMKSSTSKFEHSETDPLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTCGGGCCCAAGATTAGTGGAGAAGATGTGATCGAAAAGCTCAAGGACGATGGCGACTTCGACAAACTCCGTCTCAAAATCATTCGCAAGCTGAAAGACAATAG
AATGGAGCTGTGTCGAAAGAGGGATTTCTTGGAAGAATTGCGCAATAATATAGTTGCAATAGTGAAGCAATCGGCATCTCTTAATCGGGCAGGAACTGAAAATGTAAAGC
CTCGGCAACTGTCTGATGCGATATATGACGAGGTTGGGGAAGAAATAATGAACAAGGTTTCTGATAACTTATGGGAGATAATCAGATCAGCTGATGGCATGAAAAATGAA
ATCACAGAAACTGTACAATCCGTCTACAATAAGTTAGCGAACCCAAAAGCCGAGGAAAATGCCGAAGCATCTACCCAACAAGCTATACCATTTTGGAAGGAAGCTGATAA
TAATGGTTCTATGAAGTCCTCCACCAGTAAATTTGAACATTCTGAGACTGATCCACTAGAACCACCAGGTTTTTCTTTTGCTGGTAATCATACGAACAACGAAAAGCAGC
ACGTAGAGGAGCTTCAATTTCCGAATCGTCATGAAGGAAGGCATGATAATGATAGACGAAATGTAGAGAGACACCATCCAAACAATGTGTCTGATGCAGATAATGTTCAA
CTGCCGCCAGGCTTTGTTTCAAATAGGAAGCACAACCAAATGTGTAAAGATGCTGGTAGTGATGAAGACCCGGACGTTCCCCCTGGTTTCGGTTGA
mRNA sequenceShow/hide mRNA sequence
TTACAATGCTATTTGTGTAGGGCAAACGATCTTGATAATAGAAGAACTCGAAGCTTGGAAGAAAAACACAGCTTGTTATTAATGGAAGTCGGGCCCAAGATTAGTGGAGA
AGATGTGATCGAAAAGCTCAAGGACGATGGCGACTTCGACAAACTCCGTCTCAAAATCATTCGCAAGCTGAAAGACAATAGAATGGAGCTGTGTCGAAAGAGGGATTTCT
TGGAAGAATTGCGCAATAATATAGTTGCAATAGTGAAGCAATCGGCATCTCTTAATCGGGCAGGAACTGAAAATGTAAAGCCTCGGCAACTGTCTGATGCGATATATGAC
GAGGTTGGGGAAGAAATAATGAACAAGGTTTCTGATAACTTATGGGAGATAATCAGATCAGCTGATGGCATGAAAAATGAAATCACAGAAACTGTACAATCCGTCTACAA
TAAGTTAGCGAACCCAAAAGCCGAGGAAAATGCCGAAGCATCTACCCAACAAGCTATACCATTTTGGAAGGAAGCTGATAATAATGGTTCTATGAAGTCCTCCACCAGTA
AATTTGAACATTCTGAGACTGATCCACTAGAACCACCAGGTTTTTCTTTTGCTGGTAATCATACGAACAACGAAAAGCAGCACGTAGAGGAGCTTCAATTTCCGAATCGT
CATGAAGGAAGGCATGATAATGATAGACGAAATGTAGAGAGACACCATCCAAACAATGTGTCTGATGCAGATAATGTTCAACTGCCGCCAGGCTTTGTTTCAAATAGGAA
GCACAACCAAATGTGTAAAGATGCTGGTAGTGATGAAGACCCGGACGTTCCCCCTGGTTTCGGTTGAAGAACTAAATTTTCACTTTGGATTTCTTCTGATTATGCAGTCT
CCAGTTGTTGAGCATCTTGTAATGAGTATTTGGTCGAACCCTGTGTTGTGATGTGATGAAAAAAGTCGGGATTAATCGATCGATCATCTCGTTTGCCTTGGTTTGTAATG
GATGCAGGTGATGATTATGTATAAAGTTTGAGTTCCATGTGGATTGAATTGTGAAACATTATGTTGAGAAGAGAATGAAGTTTGATAGAAAGCCACAAGTTTGAAATATC
TATCGTAAGTTAATGTGCGTTTGCCTTGAACTTAAAATTGATAGCATAAAATTATTTTCGTTGAGAAATCTTCTCTCTGAATAGGAAATTCCTAATATTTACTTGATTTA
TT
Protein sequenceShow/hide protein sequence
MEVGPKISGEDVIEKLKDDGDFDKLRLKIIRKLKDNRMELCRKRDFLEELRNNIVAIVKQSASLNRAGTENVKPRQLSDAIYDEVGEEIMNKVSDNLWEIIRSADGMKNE
ITETVQSVYNKLANPKAEENAEASTQQAIPFWKEADNNGSMKSSTSKFEHSETDPLEPPGFSFAGNHTNNEKQHVEELQFPNRHEGRHDNDRRNVERHHPNNVSDADNVQ
LPPGFVSNRKHNQMCKDAGSDEDPDVPPGFG