CuGenDBv2

HG10006297 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10006297
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Genome location: Chr07:17068789..17070803
RNA-Seq Expression: HG10006297
Synteny: HG10006297
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus] | 6.0e-116 | 84.11
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK +A +S ACALF+SSM+GIKHQSLLQDYEELHNETEAMK+KLLIAKRKK TLL EVRFLRHRYE LK QPANIQPKVGF+ PRNLE++PP +KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQ HD+NQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV +++SFP FDQKERVYRAHE AAN NMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA

Query:  GSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        G KP R+EDEPKNI  RSEHDAKNS+LV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  GSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo] | 2.3e-115 | 84.56
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK +A DS ACALF++SM+GIKHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLL EVRFLRHRYE LKNQPANIQPKVGF+L RNLE+RPPI+KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQ HDLNQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV ++NSFP FDQKERVYRAHE AAN NMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA

Query:  GSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        G +P R +EDE KNI  RSEHDAKNSDLV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  GSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia] | 2.2e-94 | 73.46
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK  A+D +A ALF++ MIG KH  LLQDYE+L N TE MKE+LLIAKRKK+TLLAEVRFLRHRYEFLKNQ  N QPK G E P+N EIRPP  KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEEL
        EKSS+KREASLK L  AQ  DLNQRGGIY+GMEA SRK +  F  NQK RMCS  EV++HNS PIF+ KE +YR HE AA+ NMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEEL

Query:  QAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        QAG +PPRME+ PKN   RSE+D KNSDL+IS MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  QAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus] | 4.5e-95 | 84.19
Query:  MKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDT
        MK+KLLIAKRKK TLL EVRFLRHRYE LK QPANIQPKVGF+ PRNLE++PP +KKEKSSRKREASLKPLAQ HD+NQRGGIYNG+EA SRK QSFFD 
Subjt:  MKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDT

Query:  NQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRA
        NQKS  CSKKEV +++SFP FDQKERVYRAHE AAN NMTPVFDLNQISREEEELQAG KP R+EDEPKNI  RSEHDAKNS+LV+SSMCRND NGSNRA
Subjt:  NQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRA

Query:  GKRKISWQDQVALRA
        GKRKISWQDQVALRA
Subjt:  GKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida] | 4.3e-114 | 84.88
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK +AVDSEACALF++SMIG+KHQSLLQDYEEL NETEAMKEKLLIAKRKK TLLAEVRFLRHRYE LKN+PAN QPKV FELP +LEI PPI KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA
         KSSRK EASLKPLA+ HDLNQRGGIYNGMEAPSRK QSFF+ NQKSRMCSKKEVTI +S PIFDQKERVYR HE   + NMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA

Query:  GSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        G +P RMEDE KN+ SRSEHDAKNSDLV+SSMCRNDGNGSN AGKRKISWQDQVALRA
Subjt:  GSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

TrEMBL top hits | e value | % identity
A0A0A0LH29 Uncharacterized protein | 2.9e-116 | 84.11
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK +A +S ACALF+SSM+GIKHQSLLQDYEELHNETEAMK+KLLIAKRKK TLL EVRFLRHRYE LK QPANIQPKVGF+ PRNLE++PP +KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQ HD+NQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV +++SFP FDQKERVYRAHE AAN NMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA

Query:  GSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        G KP R+EDEPKNI  RSEHDAKNS+LV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  GSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC103493683 | 1.1e-115 | 84.56
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK +A DS ACALF++SM+GIKHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLL EVRFLRHRYE LKNQPANIQPKVGF+L RNLE+RPPI+KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQ HDLNQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV ++NSFP FDQKERVYRAHE AAN NMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA

Query:  GSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        G +P R +EDE KNI  RSEHDAKNSDLV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  GSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein | 1.1e-115 | 84.56
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK +A DS ACALF++SM+GIKHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLL EVRFLRHRYE LKNQPANIQPKVGF+L RNLE+RPPI+KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQ HDLNQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV ++NSFP FDQKERVYRAHE AAN NMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQA

Query:  GSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        G +P R +EDE KNI  RSEHDAKNSDLV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  GSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC111024696 | 1.1e-94 | 73.46
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        MKKARK  A+D +A ALF++ MIG KH  LLQDYE+L N TE MKE+LLIAKRKK+TLLAEVRFLRHRYEFLKNQ  N QPK G E P+N EIRPP  KK
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEEL
        EKSS+KREASLK L  AQ  DLNQRGGIY+GMEA SRK +  F  NQK RMCS  EV++HNS PIF+ KE +YR HE AA+ NMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEEL

Query:  QAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        QAG +PPRME+ PKN   RSE+D KNSDL+IS MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  QAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC111494012 | 1.3e-84 | 73.33
Query:  MIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLN
        MI I H  LLQDY EL NETEAMKEKLLI K+KK+TLLAEVRFLRH+YE LKN P   QPKVGF+LP+NL+IRPP+ KKE  SRKREA         +LN
Subjt:  MIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLN

Query:  QRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEP---KNICSRS
        QRGGI +GMEA +RK +S  + NQKSRMCSKKE++I + FPI  QKERVYRAHEVA NTNMTPVFDLNQISREEEELQ G +P R EDE    KNICSRS
Subjt:  QRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEP---KNICSRS

Query:  EHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        E DAKNSDL++SSMCRN GNGSNRAGKRKISWQD+VALRA
Subjt:  EHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT4G30630.1 unknown protein | 8.3e-15 | 31.03
Query:  ELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLK-NQPANIQPKV-------GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIY
        EL  E E  +++L + K+K+ TL +EVRFLR RYE LK +Q     P++       G E+PR          K    RK+++ ++      DL  +  I 
Subjt:  ELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLK-NQPANIQPKV-------GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIY

Query:  NGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDL
        N  EA +    S     ++ R      +T   S P  + +          + T+  P FDLNQISREEEE +        E         +  D + SDL
Subjt:  NGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDL

Query:  VIS---SMCRNDGNGSNRAGKRKISWQDQVAL
         +     +C +     NRA KRK++WQD VAL
Subjt:  VIS---SMCRNDGNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein | 3.3e-19 | 34.84
Query:  FQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQV
        F+   +  +H SL+QDY ELH ETEAM+++L   + +KATL+AEVRFLR RY  L+                    +P  IKK + S             
Subjt:  FQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQV

Query:  HDLNQRGGIYNGME-APSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEE-ELQA-GSKPPRMEDEPKNI
              GG    +E +PS K ++                T H S P  +  E+ +   + +    + P+FDLNQIS EEE E +A  +       E  N 
Subjt:  HDLNQRGGIYNGME-APSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEE-ELQA-GSKPPRMEDEPKNI

Query:  CSR---SEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVA
        C R   S  + +  D+  SS CRN GNGSN   KRKISWQD VA
Subjt:  CSR---SEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVA


Sequences
CDS sequence
ATGAAGAAAGCTCGAAAACGGATGGCTGTGGATTCAGAGGCATGTGCTCTGTTCCAGAGCTCGATGATTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAG
GAGTTGCATAACGAAACAGAAGCCATGAAGGAGAAACTGCTGATCGCAAAGCGGAAAAAGGCGACCCTTTTGGCTGAAGTTCGATTTTTGAGGCATAGATATGAA
TTCTTGAAGAACCAGCCTGCAAACATCCAACCGAAGGTTGGTTTCGAGCTGCCACGGAACCTTGAAATCAGACCTCCCATCATTAAGAAAGAAAAGAGTTCTCGA
AAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGTTCATGACTTAAACCAAAGGGGAGGAATCTACAATGGGATGGAAGCCCCTTCTCGAAAACCTCAGTCGTTT
TTCGACACAAACCAGAAGTCAAGGATGTGCAGCAAGAAGGAAGTCACTATACACAATTCTTTTCCTATTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAA
GTTGCTGCCAACACGAACATGACCCCAGTTTTCGACCTTAACCAGATCTCGAGAGAGGAGGAAGAACTGCAGGCTGGTTCCAAACCACCGAGAATGGAGGACGAG
CCAAAGAATATCTGTTCAAGAAGCGAACACGATGCGAAGAACAGTGACTTGGTGATATCATCAATGTGTAGGAATGATGGTAATGGATCAAACAGAGCAGGAAAA
AGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGA
mRNA sequence
ATGAAGAAAGCTCGAAAACGGATGGCTGTGGATTCAGAGGCATGTGCTCTGTTCCAGAGCTCGATGATTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAG
GAGTTGCATAACGAAACAGAAGCCATGAAGGAGAAACTGCTGATCGCAAAGCGGAAAAAGGCGACCCTTTTGGCTGAAGTTCGATTTTTGAGGCATAGATATGAA
TTCTTGAAGAACCAGCCTGCAAACATCCAACCGAAGGTTGGTTTCGAGCTGCCACGGAACCTTGAAATCAGACCTCCCATCATTAAGAAAGAAAAGAGTTCTCGA
AAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGTTCATGACTTAAACCAAAGGGGAGGAATCTACAATGGGATGGAAGCCCCTTCTCGAAAACCTCAGTCGTTT
TTCGACACAAACCAGAAGTCAAGGATGTGCAGCAAGAAGGAAGTCACTATACACAATTCTTTTCCTATTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAA
GTTGCTGCCAACACGAACATGACCCCAGTTTTCGACCTTAACCAGATCTCGAGAGAGGAGGAAGAACTGCAGGCTGGTTCCAAACCACCGAGAATGGAGGACGAG
CCAAAGAATATCTGTTCAAGAAGCGAACACGATGCGAAGAACAGTGACTTGGTGATATCATCAATGTGTAGGAATGATGGTAATGGATCAAACAGAGCAGGAAAA
AGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGA
Protein sequence
MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSR
KREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDE
PKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
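
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the table layout are illustrative choices and are not part of the database record.

```python
# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper: whitespace and line breaks are removed first so
    that a sequence pasted across several lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed for HG10006297.
print(translate("ATGAAGAAAGCTCGAAAACGGATGGCTGTG"))  # MKKARKRMAV
```

Passing the full CDS (all eight lines joined) to `translate` should reproduce the protein sequence if the record is internally consistent; the sketch does not handle ambiguous bases such as N.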