CuGenDBv2

CSPI02G05220 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G05220
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Unknown protein
Genome location: Chr2:3714674..3717374
RNA-Seq Expression: CSPI02G05220
Synteny: CSPI02G05220
Gene Ontology terms: NA
InterPro domains: NA
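
The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of splitting such a string into its parts and computing the span is given here; the function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption, not something this record states.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("Chr2:3714674..3717374")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2701 bp on Chr2. With 0-based or half-open coordinates the length would differ by one.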


Homology
GenBank top hits | e value | % identity
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus] | 6.9e-136 | 99.22
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKK TLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+PVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo] | 2.0e-127 | 93.05
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKKATLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVI+N+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R LEDE KNIFPRSEHDAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia] | 7.7e-87 | 68.46
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK+TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPL--AQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL
        EKSS+KREASLK L  AQA D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV +++S P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPL--AQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL

Query:  QAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        QAGFEP R+E+ PKN F RSE+D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus] | 2.0e-111 | 99.07
Query:  MKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDL
        MKKKLLIAKRKK TLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDL
Subjt:  MKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDL

Query:  NQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRA
        NQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGF+PVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRA
Subjt:  NQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRA

Query:  GKRKISWQDQVALRA
        GKRKISWQDQVALRA
Subjt:  GKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida] | 3.0e-107 | 79.84
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKG+A +S ACALFE+SM+G+KHQSLLQDYEEL NETEAMK+KLLIAKRKK TLL EVRFLRHRYELLK +PAN QPKV F+ P +LE+ PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
         KSSRK EASLKPLA+AHD+NQRGGIYNG+EA SRKSQSFF++NQKS  CSKKEV +  S P FDQKERVYR HE   +RNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R+EDE KN+F RSEHDAKNS+LVLSSMCRND NGSN AGKRKISWQDQVALRA
Subjt:  GFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

TrEMBL top hits | e value | % identity
A0A0A0LH29 Uncharacterized protein | 3.3e-136 | 99.22
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKK TLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+PVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC103493683 | 9.7e-128 | 93.05
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKKATLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVI+N+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R LEDE KNIFPRSEHDAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein | 9.7e-128 | 93.05
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKKATLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVI+N+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R LEDE KNIFPRSEHDAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC111024696 | 3.7e-87 | 68.46
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK+TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPL--AQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL
        EKSS+KREASLK L  AQA D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV +++S P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPL--AQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL

Query:  QAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        QAGFEP R+E+ PKN F RSE+D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC111494012 | 1.1e-75 | 67.08
Query:  MVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVN
        M+ I H  LLQDY EL NETEAMK+KLLI K+KK+TLL EVRFLRH+YELLK  P   QPKVGFK P+NL+++PP  KKE  SRKRE        A ++N
Subjt:  MVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVN

Query:  QRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEP---KNIFPRS
        QRGGI +G+EA++RK++S  ++NQKS  CSKKE+ +   FP   QKERVYRAHE A N NMTPVFDLNQISREEEELQ GFEP+R EDE    KNI  RS
Subjt:  QRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEP---KNIFPRS

Query:  EHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        E DAKNS+L++SSMCRN  NGSNRAGKRKISWQD+VALRA
Subjt:  EHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT4G30630.1 unknown protein | 1.3e-12 | 31.98
Query:  ELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKK-QPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASS
        EL  E E  +K+L + K+K+ TL  EVRFLR RYE LK+ Q     P++  +   +  L+ P  +K    RK+++ ++      D+  +  I N  EA +
Subjt:  ELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKK-QPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASS

Query:  RKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCR
            S  DL++K       +V+      TF             +  +  P FDLNQISREEEE +   E + + +  KN    +     + E  L  +C 
Subjt:  RKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEHDAKNSELVLSSMCR

Query:  NDDNGSNRAGKRKISWQDQVAL
        + +   NRA KRK++WQD VAL
Subjt:  NDDNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein | 4.0e-17 | 34
Query:  FESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQA
        FE   V  +H SL+QDY ELH ETEAM+K+L   + +KATL+ EVRFLR RY  L++                   +P  +KK + S             
Subjt:  FESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQA

Query:  HDVNQRGGIYNGVEAS-SRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFP
              GG    VE S S KS++             K V    S P  +  E+ +   + +  R + P+FDLNQIS EEE+     E   +++  +N   
Subjt:  HDVNQRGGIYNGVEAS-SRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFP

Query:  RSEHDAKNSELVLSSM-----------CRNDDNGSNRAGKRKISWQDQVA
        R E       L++SS+           CRN  NGSN   KRKISWQD VA
Subjt:  RSEHDAKNSELVLSSM-----------CRNDDNGSNRAGKRKISWQDQVA
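
The % identity column can be related to the alignment blocks above: in BLAST-style output the middle row repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts letters in the middle rows and divides by the aligned length, gaps included. The function name `percent_identity` is hypothetical, and the two-decimal rounding is an assumption about how the page reports the value.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity from the query rows and middle rows of an alignment.

    The aligned length is taken from the query rows (gap characters count),
    because trailing blanks in a middle row may have been stripped.
    """
    aligned = sum(len(row) for row in query_rows)
    if aligned == 0:
        raise ValueError("empty alignment")
    # Only letters in the middle row mark identical positions;
    # '+' and blanks are mismatches or gaps.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identical / aligned, 2)


# Toy fragment: one '+' among six aligned positions.
print(percent_identity(["GFEPVR"], ["GF+PVR"]))
```

For the first GenBank hit, the middle rows appear to differ from the query at two of 258 aligned positions, which would give 256/258, or about 99.22%, in line with the reported value.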


Sequences
CDS sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTGCGAATTCAATGGCGTGTGCTCTGTTTGAGAGTTCGATGGTTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGAAAAAACTACTGATTGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTCGATTCCTGAGGCATAGATATGAATTGTTGAAGA
AGCAGCCTGCAAACATCCAGCCAAAGGTTGGTTTCAAGCGGCCACGGAACCTTGAACTCAAACCTCCCACCGTTAAGAAAGAAAAGAGTTCACGGAAAAGAGAAGCTTCT
TTGAAGCCGCTTGCTCAAGCTCATGACGTAAACCAAAGGGGAGGAATCTACAATGGGGTTGAAGCCTCTTCTCGAAAATCTCAGTCGTTTTTCGACCTAAACCAGAAGTC
AAATACGTGCAGCAAGAAGGAAGTCATTGTGAACAGTTCTTTTCCTACTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAAGCTGCTGCCAACAGGAACATGACCC
CGGTTTTCGACCTTAACCAGATTTCGAGGGAGGAAGAAGAACTGCAAGCTGGTTTCGAACCAGTGAGACTGGAGGACGAGCCGAAGAATATCTTCCCAAGAAGCGAACAC
GATGCAAAGAACAGCGAGTTGGTGTTATCATCAATGTGTAGGAATGATGATAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAG
AGCATGA
mRNA sequence
TTAAAACAGTTCAACACAAACAGAAATTCATTCCCTTTTCTCTTTCCAAAAAAACAAACAATTAATAATAAAGAAAAAAAGTTATACACATCACTCACACAATTCACTGA
ATTTTTGTTCTCAAGATTTTTCACCATCCACACATCAATGGAATCCTCTTTCTATATCTCATCAATGGCCCAGTTGGACCAAACCAAAAATTATTCTCTCCAACAAATTC
CTAATTCAATCAGAGAAAAACTGCCTAAATTTCTCTCAACTTCCCTGAATCACCCTCCGATTTGGGTTTTTTCTCCCTTCCCTTCTTAGTTCTCTTCTAGGTTAAGATTT
AACCAAAACCCTTCATTCTTTTCGCCCCCCCCTTTTTTTTTCCTTTTCCGCCATTGTTTCTCCTTCTTCTCCTCATTCTCTTTTCTGGGTCATCTCTTTTTTCGACCATC
GTTTCGATTTTTCTCTTTTAATCGAATGAAGAAAGCTCGAAAAGGGGTGGCTGCGAATTCAATGGCGTGTGCTCTGTTTGAGAGTTCGATGGTTGGGATCAAACATCAAA
GTCTCTTGCAGGATTACGAGGAGTTGCATAACGAAACAGAAGCCATGAAGAAAAAACTACTGATTGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTCGATTCCTG
AGGCATAGATATGAATTGTTGAAGAAGCAGCCTGCAAACATCCAGCCAAAGGTTGGTTTCAAGCGGCCACGGAACCTTGAACTCAAACCTCCCACCGTTAAGAAAGAAAA
GAGTTCACGGAAAAGAGAAGCTTCTTTGAAGCCGCTTGCTCAAGCTCATGACGTAAACCAAAGGGGAGGAATCTACAATGGGGTTGAAGCCTCTTCTCGAAAATCTCAGT
CGTTTTTCGACCTAAACCAGAAGTCAAATACGTGCAGCAAGAAGGAAGTCATTGTGAACAGTTCTTTTCCTACTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAA
GCTGCTGCCAACAGGAACATGACCCCGGTTTTCGACCTTAACCAGATTTCGAGGGAGGAAGAAGAACTGCAAGCTGGTTTCGAACCAGTGAGACTGGAGGACGAGCCGAA
GAATATCTTCCCAAGAAGCGAACACGATGCAAAGAACAGCGAGTTGGTGTTATCATCAATGTGTAGGAATGATGATAATGGATCAAACAGAGCAGGAAAAAGGAAGATCT
CATGGCAAGATCAAGTGGCTTTAAGAGCATGAAGTCCTCTCTAATTGAAAAAATCATTGAGGATCCTTCATTTGCATAGTCTGCATAGGTGCAAAAACTCTGATTCCTTG
AAATCCCATTATTTTTATTTTTTCATAATTCAAGCGAGAAAACGGTGACCATTCATGTTGAATGACTTTGTAAGTGCTTAAAAATGTTAGACAAGAAATTTTTAAGTACT
TAGAAAGTCATTACAAATAGTTCATACCTAAAGTCGTAGTTAGTGGTAGGTAATTATTGTGATTGGAATTAATGAAAGAATGTTTGAAATTTCGATTTCTTCATCTCAGG
ACTATGGTTGCGAATTGTTTAAAAAGCGG
Protein sequence
MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREAS
LKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEH
DAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
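
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing anything other than A, C, G, T become 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAAGAAAGCTCGAAAAGGG"))  # MKKARKG
```

The printed residues match the start of the protein sequence. Passing the full CDS, with its line breaks left in, should reproduce the whole protein if the two records are consistent.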