; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G19633 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G19633
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionUnknown protein
Genome locationctg4:3681308..3684039
RNA-Seq ExpressionCucsat.G19633
SyntenyCucsat.G19633
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]8.91e-179100Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]9.75e-16592.28Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKK TLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVI+N+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFKPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+P+R LEDE KNIFPRSEHDAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFKPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]4.84e-11268.08Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLA--QAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV +++S P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPLA--QAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL

Query:  QAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        QAGF+P R+E+ PKN F RSE+D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]2.71e-147100Show/hide
Query:  MKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDL
        MKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDL
Subjt:  MKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDL

Query:  NQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRA
        NQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRA
Subjt:  NQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRA

Query:  GKRKISWQDQVALRA
        GKRKISWQDQVALRA
Subjt:  GKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]1.06e-13879.46Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKG+A +S ACALFE+SM+G+KHQSLLQDYEEL NETEAMK+KLLIAKRKK TLL EVRFLRHRYELLK +PAN QPKV F+ P +LE+ PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
         KSSRK EASLKPLA+AHD+NQRGGIYNG+EA SRKSQSFF++NQKS  CSKKEV +  S P FDQKERVYR HE   +RNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+P+R+EDE KN+F RSEHDAKNS+LVLSSMCRND NGSN AGKRKISWQDQVALRA
Subjt:  GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

TrEMBL top hitse value%identityAlignment
A0A0A0LH29 Uncharacterized protein4.31e-179100Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC1034936834.72e-16592.28Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKK TLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVI+N+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFKPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+P+R LEDE KNIFPRSEHDAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFKPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein4.72e-16592.28Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKK TLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVI+N+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFKPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+P+R LEDE KNIFPRSEHDAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFKPVR-LEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC1110246962.34e-11268.08Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKK

Query:  EKSSRKREASLKPLA--QAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV +++S P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPLA--QAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL

Query:  QAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        QAGF+P R+E+ PKN F RSE+D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC1114940121.37e-9766.67Show/hide
Query:  MVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVN
        M+ I H  LLQDY EL NETEAMK+KLLI K+KK TLL EVRFLRH+YELLK  P   QPKVGFK P+NL+++PP  KKE  SRKREA         ++N
Subjt:  MVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVN

Query:  QRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEP---KNIFPRS
        QRGGI +G+EA++RK++S  ++NQKS  CSKKE+ +   FP   QKERVYRAHE A N NMTPVFDLNQISREEEELQ GF+P+R EDE    KNI  RS
Subjt:  QRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEP---KNIFPRS

Query:  EHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        E DAKNS+L++SSMCRN  NGSNRAGKRKISWQD+VALRA
Subjt:  EHDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein1.1e-1131.53Show/hide
Query:  ELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKK-QPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASS
        EL  E E  +K+L + K+K+ TL  EVRFLR RYE LK+ Q     P++  +   +  L+ P  +K    RK+++ ++      D+  +  I N  EA +
Subjt:  ELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKK-QPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQAHDVNQRGGIYNGVEASS

Query:  RKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCR
            S  DL++K       +V+      TF             +  +  P FDLNQISREEEE +   + + + +  KN    +     + E  L  +C 
Subjt:  RKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFPRSEHDAKNSELVLSSMCR

Query:  NDDNGSNRAGKRKISWQDQVAL
        + +   NRA KRK++WQD VAL
Subjt:  NDDNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein7.6e-1633.2Show/hide
Query:  FESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQA
        FE   V  +H SL+QDY ELH ETEAM+K+L   + +K TL+ EVRFLR RY  L++                   +P  +KK + S             
Subjt:  FESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREASLKPLAQA

Query:  HDVNQRGGIYNGVEAS-SRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFP
              GG    VE S S KS++             K V    S P  +  E+ +   + +  R + P+FDLNQIS EEE+     +   +++  +N   
Subjt:  HDVNQRGGIYNGVEAS-SRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFP

Query:  RSEHDAKNSELVLSSM-----------CRNDDNGSNRAGKRKISWQDQVA
        R E       L++SS+           CRN  NGSN   KRKISWQD VA
Subjt:  RSEHDAKNSELVLSSM-----------CRNDDNGSNRAGKRKISWQDQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTGCGAATTCAATGGCGTGTGCTCTGTTTGAGAGTTCGATGGTTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGAAAAAACTACTGATTGCGAAGCGGAAAAAGGAAACCCTTTTGGATGAAGTTCGATTCCTGAGGCATAGATATGAATTGTTGAAGA
AGCAGCCTGCAAACATCCAGCCAAAGGTTGGTTTCAAGCGGCCACGGAACCTTGAACTCAAACCTCCCACCGTTAAGAAAGAAAAGAGTTCACGGAAAAGAGAAGCTTCT
TTGAAGCCGCTTGCTCAAGCTCATGACGTAAACCAAAGGGGAGGAATCTACAATGGGGTTGAAGCCTCTTCTCGAAAATCTCAGTCGTTTTTCGACCTAAACCAGAAGTC
AAATACGTGCAGCAAGAAGGAAGTCATTGTGAACAGTTCTTTTCCTACTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAAGCTGCTGCCAACAGGAACATGACCC
CGGTTTTCGACCTTAACCAGATTTCGAGGGAGGAAGAAGAACTGCAAGCTGGTTTCAAACCAGTGAGACTGGAGGACGAGCCGAAGAATATCTTCCCAAGAAGCGAACAC
GATGCAAAGAACAGCGAGTTGGTGTTATCATCAATGTGTAGGAATGATGATAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAG
AGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTGCGAATTCAATGGCGTGTGCTCTGTTTGAGAGTTCGATGGTTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGAAAAAACTACTGATTGCGAAGCGGAAAAAGGAAACCCTTTTGGATGAAGTTCGATTCCTGAGGCATAGATATGAATTGTTGAAGA
AGCAGCCTGCAAACATCCAGCCAAAGGTTGGTTTCAAGCGGCCACGGAACCTTGAACTCAAACCTCCCACCGTTAAGAAAGAAAAGAGTTCACGGAAAAGAGAAGCTTCT
TTGAAGCCGCTTGCTCAAGCTCATGACGTAAACCAAAGGGGAGGAATCTACAATGGGGTTGAAGCCTCTTCTCGAAAATCTCAGTCGTTTTTCGACCTAAACCAGAAGTC
AAATACGTGCAGCAAGAAGGAAGTCATTGTGAACAGTTCTTTTCCTACTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAAGCTGCTGCCAACAGGAACATGACCC
CGGTTTTCGACCTTAACCAGATTTCGAGGGAGGAAGAAGAACTGCAAGCTGGTTTCAAACCAGTGAGACTGGAGGACGAGCCGAAGAATATCTTCCCAAGAAGCGAACAC
GATGCAAAGAACAGCGAGTTGGTGTTATCATCAATGTGTAGGAATGATGATAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAG
AGCATGA
Protein sequenceShow/hide protein sequence
MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKETLLDEVRFLRHRYELLKKQPANIQPKVGFKRPRNLELKPPTVKKEKSSRKREAS
LKPLAQAHDVNQRGGIYNGVEASSRKSQSFFDLNQKSNTCSKKEVIVNSSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFKPVRLEDEPKNIFPRSEH
DAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA