; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G03370 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G03370
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUnknown protein
Genome locationChr3:2714126..2717244
RNA-Seq ExpressionCSPI03G03370
SyntenyCSPI03G03370
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147961.1 uncharacterized protein LOC101206797 [Cucumis sativus]5.3e-13099.59Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGN TNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

XP_008448942.1 PREDICTED: uncharacterized protein LOC103490958 [Cucumis melo]2.7e-11892.56Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEVGPK+SGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQSVYNKLANPKAEENAEASTHHA+P  KEGDNNGSMKASTSQLEHSEADP EPPGFSFAGN TNNG+QHIEDLQFPKH EGRH+NDSRN+EGHNPN
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        NVSDADNVDLPPGFVSNRKHN        DDDEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

XP_022152780.1 uncharacterized protein LOC111020416 [Momordica charantia]2.6e-9779.34Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEV P +SGE VIEKLKDDGDFD LRLKIIRKLKDNEELR+NI+AIVKQSAALNRAGTENVKPRQ+SDAIYDEVG+EIMSKVSDNLWEIIRSADGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQSVYNKLANPK  E+A AST H     KE  NNG MKASTS  E SE DPVEPPGFSFAGN  NN +QH+E+ +FP+ HEGRH+ +SRNVEGH+ N
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        NV DAD VDLPPGF SN+ HN+M KDAGS  DEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

XP_022940681.1 uncharacterized protein LOC111446176 [Cucurbita moschata]6.7e-9377.27Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEVGPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAG ENVKPRQ+SDAI+DEVG+EIMSKVSDNLWEIIRS DGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQS+Y+ LANPKA+E+AEASTHHAIP  KE DNNGSMKASTSQ EH+E++P+EP G+SFAGN TN  +QH+++LQF K      +NDSR       N
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        ++SDADNVD  PGFVSN KHNQM  D   D DEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

XP_038903340.1 uncharacterized protein LOC120089960 [Benincasa hispida]1.1e-11489.67Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        ME+GPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDNEELRNNI+AIVKQSAALNRAG ENVKPRQ+SDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQSVYNKLANPKAEENAEAST H IP RKE  NNGSMKASTSQ +HSEADP+EPPGFSFAGN TNNG+QH+E+LQFPK HEGRHNNDSRNVEGH+PN
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        NV DAD+VDLPPGFVSNRKHNQM KDAGS  DEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

TrEMBL top hitse value%identityAlignment
A0A0A0L4H3 Uncharacterized protein2.6e-13099.59Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGN TNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

A0A1S3BKW9 uncharacterized protein LOC1034909581.3e-11892.56Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEVGPK+SGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQSVYNKLANPKAEENAEASTHHA+P  KEGDNNGSMKASTSQLEHSEADP EPPGFSFAGN TNNG+QHIEDLQFPKH EGRH+NDSRN+EGHNPN
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        NVSDADNVDLPPGFVSNRKHN        DDDEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

A0A6J1DEX6 uncharacterized protein LOC1110204161.3e-9779.34Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEV P +SGE VIEKLKDDGDFD LRLKIIRKLKDNEELR+NI+AIVKQSAALNRAGTENVKPRQ+SDAIYDEVG+EIMSKVSDNLWEIIRSADGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQSVYNKLANPK  E+A AST H     KE  NNG MKASTS  E SE DPVEPPGFSFAGN  NN +QH+E+ +FP+ HEGRH+ +SRNVEGH+ N
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        NV DAD VDLPPGF SN+ HN+M KDAGS  DEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

A0A6J1FJ59 uncharacterized protein LOC1114461763.3e-9377.27Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEVGPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAG ENVKPRQ+SDAI+DEVG+EIMSKVSDNLWEIIRS DGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQS+Y+ LANPKA+E+AEASTHHAIP  KE DNNGSMKASTSQ EH+E++P+EP G+SFAGN TN  +QH+++LQF K      +NDSR       N
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        ++SDADNVD  PGFVSN KHNQM  D   D DEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

A0A6J1KZH9 uncharacterized protein LOC1114984921.6e-9276.86Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        MEVGPK+SGE VIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAG ENVKPRQ+SD I+DEVG+EIMSKVSDNLWEIIRS DGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
        TETVQS+YN LANPKA+E+AEA+THHAIP  KE DNNGSMKASTSQ EH+E +P+EP GFSFAGN TN  +QH+++LQF K       NDSR       N
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        ++SDADNVD  PGFVSN KH+QM  D   D DEDPDVPPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12530.1 unknown protein1.7e-3840.08Show/hide
Query:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI
        M +  K+S E V+EKLKDDGDFD+LRLKIIR+LK+NE+LRNN++++VK+S +L R G +N+K RQ+SDAI++EVG +++S++SD LW IIRS DGMKNEI
Subjt:  MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEI

Query:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN
         ETVQSVY  L+NP+ E+                        S  ++EH    P +     F  +  +  +Q +                   ++G   +
Subjt:  TETVQSVYNKLANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPN

Query:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG
        N  +A + D     V + K      +A +DD+EDP++PPGFG
Subjt:  NVSDADNVDLPPGFVSNRKHNQMFKDAGSDDDEDPDVPPGFG

AT1G56420.1 unknown protein1.1e-1331.13Show/hide
Query:  VSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEITETVQS
        +  E V+E L +DG  D LRL+II +LK NEEL++  + + ++S  LN  G E    R++ DA+  E+   ++ K S ++W++I   DG+  EI ETV+ 
Subjt:  VSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEITETVQS

Query:  VYNKLANPKAEENAEASTHHA-IPARKEGDNNGSMKASTSQLEHSEADPVE
        V+  L+  +    + ++     +   KE +   S K    +   SE +  E
Subjt:  VYNKLANPKAEENAEASTHHA-IPARKEGDNNGSMKASTSQLEHSEADPVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTCGGCCCTAAGGTTAGTGGAGAAGCTGTGATCGAAAAGCTCAAGGATGACGGCGACTTCGACAAACTCCGTCTCAAAATCATTCGCAAGCTGAAAGACAATGA
AGAATTGCGCAATAATATTGTTGCAATAGTGAAGCAATCGGCAGCACTTAATCGGGCAGGAACTGAAAATGTGAAGCCTCGGCAAATTTCTGATGCGATATATGATGAGG
TTGGGGAAGAAATAATGAGCAAGGTTTCTGATAACTTATGGGAGATCATCAGATCAGCTGATGGCATGAAAAATGAAATCACAGAAACGGTGCAATCTGTCTACAATAAG
TTAGCGAACCCAAAAGCGGAGGAAAATGCCGAAGCATCTACCCACCATGCGATACCAGCTCGGAAGGAAGGTGATAATAATGGTTCTATGAAGGCCTCCACCAGTCAATT
AGAACATTCCGAGGCTGATCCGGTAGAACCTCCCGGTTTTTCTTTTGCTGGTAATCAAACAAACAATGGAAGGCAGCACATAGAGGATCTGCAATTTCCGAAACATCATG
AAGGAAGACACAATAACGATAGTCGAAACGTAGAGGGACATAATCCAAACAATGTGTCTGATGCAGATAATGTTGATCTGCCGCCAGGCTTTGTTTCAAACAGGAAGCAC
AACCAAATGTTTAAAGATGCTGGTAGTGATGATGATGAAGACCCGGATGTTCCTCCTGGTTTCGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATTTTGTGTAGGGCAATAGAAGAACATAGGCCTTCGAAGCTTGAAAGAAATACACTGATTGTTGTTCATGGAGGTCGGCCCTAAGGTTAGTGGAGAAGCTGTGATCGAAA
AGCTCAAGGATGACGGCGACTTCGACAAACTCCGTCTCAAAATCATTCGCAAGCTGAAAGACAATGAAGAATTGCGCAATAATATTGTTGCAATAGTGAAGCAATCGGCA
GCACTTAATCGGGCAGGAACTGAAAATGTGAAGCCTCGGCAAATTTCTGATGCGATATATGATGAGGTTGGGGAAGAAATAATGAGCAAGGTTTCTGATAACTTATGGGA
GATCATCAGATCAGCTGATGGCATGAAAAATGAAATCACAGAAACGGTGCAATCTGTCTACAATAAGTTAGCGAACCCAAAAGCGGAGGAAAATGCCGAAGCATCTACCC
ACCATGCGATACCAGCTCGGAAGGAAGGTGATAATAATGGTTCTATGAAGGCCTCCACCAGTCAATTAGAACATTCCGAGGCTGATCCGGTAGAACCTCCCGGTTTTTCT
TTTGCTGGTAATCAAACAAACAATGGAAGGCAGCACATAGAGGATCTGCAATTTCCGAAACATCATGAAGGAAGACACAATAACGATAGTCGAAACGTAGAGGGACATAA
TCCAAACAATGTGTCTGATGCAGATAATGTTGATCTGCCGCCAGGCTTTGTTTCAAACAGGAAGCACAACCAAATGTTTAAAGATGCTGGTAGTGATGATGATGAAGACC
CGGATGTTCCTCCTGGTTTCGGTTGAAGAACTAGATTTTTACTTTGGTTTTCTTCTTATAATGCCATCACCAGTTTTGAGCATATAGCAACAAGTTTTTGACCAAATCTC
ATGCTGCTACGCGATGAACAAAGTCGTGATTGATCGACTGATCATCTGTCTCTCATGGTTTGTAATGGATGCAGGTGATGGTTATAATGTATAAAGTCTGAGTTCCATGT
CGATTGAATTGTAAAACACTACATTGAAGAATAGAATGTAGTTCATGTATAAGGGGTATAAGATGTTTGATAGAAATATCAAACCATATGAGATGTTTTTGCTATCCTAA
ATTACTG
Protein sequenceShow/hide protein sequence
MEVGPKVSGEAVIEKLKDDGDFDKLRLKIIRKLKDNEELRNNIVAIVKQSAALNRAGTENVKPRQISDAIYDEVGEEIMSKVSDNLWEIIRSADGMKNEITETVQSVYNK
LANPKAEENAEASTHHAIPARKEGDNNGSMKASTSQLEHSEADPVEPPGFSFAGNQTNNGRQHIEDLQFPKHHEGRHNNDSRNVEGHNPNNVSDADNVDLPPGFVSNRKH
NQMFKDAGSDDDEDPDVPPGFG