; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004777 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004777
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionzinc finger homeobox protein 4-like isoform X1
Genome locationChr08:20334565..20335860
RNA-Seq ExpressionHG10004777
SyntenyHG10004777
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444812.1 PREDICTED: uncharacterized protein LOC103488048 [Cucumis melo]2.7e-8866.55Show/hide
Query:  SAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        S H+CS S +SD+FTP EH V+ IL +  LL Q+S+FSLGL PSWP+RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  SAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKSSASNL
        N  N K+SK+KA ++KKFQYL+TIDKLTH+N+AL  DV AMK+H+  LKTINS+LKAKKQE  MILGGS NQS+IPEIGTSSS          KSS SN+
Subjt:  NNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKSSASNL

Query:  ENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAA
        EN+LH+ +PS+KNQ AP+AEQSN NQN QIPTG IPL D    PMGIPDLNL+ E+  Q   ++YMAA+ARQNRIRI KNK  NN+NG A
Subjt:  ENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAA

XP_008444813.1 PREDICTED: uncharacterized protein LOC103488049 [Cucumis melo]1.6e-9971.43Show/hide
Query:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+S HQCS S DSDDF+P+E  V+QIL    LL Q+S FSLGLSPSWPIRRKRSAV SPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV
        PLSRSESDEN T  K+SKKKA ++KK QYL+TIDKLTH+ +ALEGD+ AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLTV
Subjt:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV

Query:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAAKLR
        KSS SN+EN+  + EPS+KNQ  P AEQ NSN+N+QIP G IPLYDPSLGPMGIPDLNLS E+I  +  ++Y+AARARQNRI+I KNKN NNNNGA KL+
Subjt:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAAKLR

Query:  S
        S
Subjt:  S

XP_011649663.1 uncharacterized protein LOC105434650 [Cucumis sativus]1.8e-7158.33Show/hide
Query:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSR
        MAST      S S +SDDFTP++H V+ IL +  LL Q+S+FSLGL PSWPIRRKRSAV SP   ++V+ QPPPPPSSSE  KE+SPTTPLSL+SL LSR
Subjt:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSR

Query:  SESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKS
        SESDEN  N K+SK+KA + KKF+  +++DKLTH+N+AL  +  A K+ +   KTINS+LKAKKQE  MILGGS N+S+IPE GTS+S          KS
Subjt:  SESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKS

Query:  SASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNK-NKNNNNGAAKLRS
        S  N+EN+LH+ EPS KNQ APMAEQSN NQN QIP   IPL D     MGIPDLNL+ E+  +    + +AA+ARQNR RICKNK NK N       R+
Subjt:  SASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNK-NKNNNNGAAKLRS

XP_011649664.1 myocardin-related transcription factor A [Cucumis sativus]5.3e-10070.53Show/hide
Query:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL
        MAST+S HQCS S DSDDF+P+EH V+QIL    LL QQS FSLGLSPSWPIRRKRSAV SPPD++S+I QPP PP     SSE  KESSPTTPLSL+SL
Subjt:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV
        PLSRSESDEN T  K+SKKKA ++KK QYL+TI+KLTH+ +ALEGD+ AMKRH+  LKTINS+LKAKKQE ILGG +N S  P+ GTS+SVAME+AKLTV
Subjt:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV

Query:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNK-NKNNNNGAAKL
        KSS SN+EN+  + EPS+KNQ  P+AEQSNS QN+QIP G IPLYDPSLGPMGIPDLNLS E+I  +  ++Y+AA+ARQNRI+I KNK N NNNNGA KL
Subjt:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNK-NKNNNNGAAKL

Query:  RS
        +S
Subjt:  RS

XP_022996985.1 zinc finger homeobox protein 4-like isoform X1 [Cucurbita maxima]1.6e-7258.5Show/hide
Query:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVI----PQPPPPPSSSENVKESSPTTPLSLNSL
        MAST   HQC+ + D D  TPDE L  QIL +F LL QQ +FSLGL P+WP+R KRSAV SPPDS S++    P PPPPP SS   KESSPTTP SL+SL
Subjt:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVI----PQPPPPPSSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV
        PLSR ESDE N    L  KK  ++KK QYL+T+ +LT +N+AL G V  +KRHYN+LKT NS+LKAK+Q+MI   S  +S  PEI  SSS A++  K TV
Subjt:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV

Query:  KSSASNLENHLHQREPSIKNQKAPMA-----EQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRI--CKNKNKNNN
        K      ++H HQ +P IKNQ A  A     EQSNS QN +IP GAI +YDPS GP GIPDLNLSF+EI+Q   +R MAA+ARQNRI+I   KN N NNN
Subjt:  KSSASNLENHLHQREPSIKNQKAPMA-----EQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRI--CKNKNKNNN

Query:  NGAAKL
        NGAA+L
Subjt:  NGAAKL

TrEMBL top hitse value%identityAlignment
A0A0A0LRP1 Uncharacterized protein2.6e-10070.53Show/hide
Query:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL
        MAST+S HQCS S DSDDF+P+EH V+QIL    LL QQS FSLGLSPSWPIRRKRSAV SPPD++S+I QPP PP     SSE  KESSPTTPLSL+SL
Subjt:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV
        PLSRSESDEN T  K+SKKKA ++KK QYL+TI+KLTH+ +ALEGD+ AMKRH+  LKTINS+LKAKKQE ILGG +N S  P+ GTS+SVAME+AKLTV
Subjt:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV

Query:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNK-NKNNNNGAAKL
        KSS SN+EN+  + EPS+KNQ  P+AEQSNS QN+QIP G IPLYDPSLGPMGIPDLNLS E+I  +  ++Y+AA+ARQNRI+I KNK N NNNNGA KL
Subjt:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNK-NKNNNNGAAKL

Query:  RS
        +S
Subjt:  RS

A0A1S3BAR4 uncharacterized protein LOC1034880497.5e-10071.43Show/hide
Query:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+S HQCS S DSDDF+P+E  V+QIL    LL Q+S FSLGLSPSWPIRRKRSAV SPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV
        PLSRSESDEN T  K+SKKKA ++KK QYL+TIDKLTH+ +ALEGD+ AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLTV
Subjt:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV

Query:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAAKLR
        KSS SN+EN+  + EPS+KNQ  P AEQ NSN+N+QIP G IPLYDPSLGPMGIPDLNLS E+I  +  ++Y+AARARQNRI+I KNKN NNNNGA KL+
Subjt:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAAKLR

Query:  S
        S
Subjt:  S

A0A1S3BC34 uncharacterized protein LOC1034880481.3e-8866.55Show/hide
Query:  SAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        S H+CS S +SD+FTP EH V+ IL +  LL Q+S+FSLGL PSWP+RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  SAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKSSASNL
        N  N K+SK+KA ++KKFQYL+TIDKLTH+N+AL  DV AMK+H+  LKTINS+LKAKKQE  MILGGS NQS+IPEIGTSSS          KSS SN+
Subjt:  NNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKSSASNL

Query:  ENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAA
        EN+LH+ +PS+KNQ AP+AEQSN NQN QIPTG IPL D    PMGIPDLNL+ E+  Q   ++YMAA+ARQNRIRI KNK  NN+NG A
Subjt:  ENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAA

A0A5A7VA15 Uncharacterized protein1.3e-8866.55Show/hide
Query:  SAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        S H+CS S +SD+FTP EH V+ IL +  LL Q+S+FSLGL PSWP+RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  SAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKSSASNL
        N  N K+SK+KA ++KKFQYL+TIDKLTH+N+AL  DV AMK+H+  LKTINS+LKAKKQE  MILGGS NQS+IPEIGTSSS          KSS SN+
Subjt:  NNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSKIPEIGTSSSVAMEMAKLTVKSSASNL

Query:  ENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAA
        EN+LH+ +PS+KNQ AP+AEQSN NQN QIPTG IPL D    PMGIPDLNL+ E+  Q   ++YMAA+ARQNRIRI KNK  NN+NG A
Subjt:  ENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAA

A0A5A7VHE1 Uncharacterized protein7.5e-10071.43Show/hide
Query:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+S HQCS S DSDDF+P+E  V+QIL    LL Q+S FSLGLSPSWPIRRKRSAV SPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV
        PLSRSESDEN T  K+SKKKA ++KK QYL+TIDKLTH+ +ALEGD+ AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLTV
Subjt:  PLSRSESDENNTNPKLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTV

Query:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAAKLR
        KSS SN+EN+  + EPS+KNQ  P AEQ NSN+N+QIP G IPLYDPSLGPMGIPDLNLS E+I  +  ++Y+AARARQNRI+I KNKN NNNNGA KL+
Subjt:  KSSASNLENHLHQREPSIKNQKAPMAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAAKLR

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTACTTCCGCTCATCAATGCTCCACCTCCTCCGACTCCGACGACTTCACCCCTGATGAGCACCTTGTCTCTCAAATCCTCGTCGATTTTGCTCTTCTCGC
TCAACAATCCCAGTTTTCTCTTGGCTTATCCCCTTCCTGGCCTATCCGACGCAAGAGATCTGCCGTCGCTTCCCCGCCCGACTCCGCCTCCGTCATCCCCCAACCGCCGC
CTCCTCCATCGTCGTCCGAGAACGTCAAGGAGTCTAGCCCTACTACTCCACTTTCACTTAACTCTTTACCTTTGTCGCGGAGTGAATCTGACGAGAATAACACCAACCCG
AAGCTCTCCAAGAAGAAAGCCTCTATCAATAAGAAATTTCAGTATTTGGATACCATTGACAAATTGACCCACCGGAATCGAGCTCTGGAAGGGGACGTTGGTGCTATGAA
GCGACATTATAATCAACTGAAAACCATTAATTCGAAGTTGAAGGCCAAGAAGCAAGAGATGATTCTGGGTGGTTCCAATAATCAATCAAAAATTCCAGAAATTGGGACCT
CAAGTTCGGTCGCCATGGAAATGGCTAAGTTAACTGTGAAATCCTCAGCCTCAAATCTGGAGAATCATCTTCATCAACGTGAACCGTCGATCAAGAATCAGAAGGCTCCG
ATGGCAGAACAGAGCAACAGTAATCAGAATTTCCAAATTCCAACTGGGGCAATTCCTTTATATGATCCTTCATTAGGTCCAATGGGGATTCCTGATTTGAACCTCTCTTT
TGAAGAAATTAATCAGGAGTATTGTTCAAGATACATGGCTGCTAGAGCAAGACAGAACAGGATTCGGATCTGTAAGAACAAGAACAAGAACAACAACAATGGAGCTGCCA
AATTGCGATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCACTACTTCCGCTCATCAATGCTCCACCTCCTCCGACTCCGACGACTTCACCCCTGATGAGCACCTTGTCTCTCAAATCCTCGTCGATTTTGCTCTTCTCGC
TCAACAATCCCAGTTTTCTCTTGGCTTATCCCCTTCCTGGCCTATCCGACGCAAGAGATCTGCCGTCGCTTCCCCGCCCGACTCCGCCTCCGTCATCCCCCAACCGCCGC
CTCCTCCATCGTCGTCCGAGAACGTCAAGGAGTCTAGCCCTACTACTCCACTTTCACTTAACTCTTTACCTTTGTCGCGGAGTGAATCTGACGAGAATAACACCAACCCG
AAGCTCTCCAAGAAGAAAGCCTCTATCAATAAGAAATTTCAGTATTTGGATACCATTGACAAATTGACCCACCGGAATCGAGCTCTGGAAGGGGACGTTGGTGCTATGAA
GCGACATTATAATCAACTGAAAACCATTAATTCGAAGTTGAAGGCCAAGAAGCAAGAGATGATTCTGGGTGGTTCCAATAATCAATCAAAAATTCCAGAAATTGGGACCT
CAAGTTCGGTCGCCATGGAAATGGCTAAGTTAACTGTGAAATCCTCAGCCTCAAATCTGGAGAATCATCTTCATCAACGTGAACCGTCGATCAAGAATCAGAAGGCTCCG
ATGGCAGAACAGAGCAACAGTAATCAGAATTTCCAAATTCCAACTGGGGCAATTCCTTTATATGATCCTTCATTAGGTCCAATGGGGATTCCTGATTTGAACCTCTCTTT
TGAAGAAATTAATCAGGAGTATTGTTCAAGATACATGGCTGCTAGAGCAAGACAGAACAGGATTCGGATCTGTAAGAACAAGAACAAGAACAACAACAATGGAGCTGCCA
AATTGCGATCCTAA
Protein sequenceShow/hide protein sequence
MASTTSAHQCSTSSDSDDFTPDEHLVSQILVDFALLAQQSQFSLGLSPSWPIRRKRSAVASPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTNP
KLSKKKASINKKFQYLDTIDKLTHRNRALEGDVGAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSKIPEIGTSSSVAMEMAKLTVKSSASNLENHLHQREPSIKNQKAP
MAEQSNSNQNFQIPTGAIPLYDPSLGPMGIPDLNLSFEEINQEYCSRYMAARARQNRIRICKNKNKNNNNGAAKLRS