; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004776 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004776
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionzinc finger homeobox protein 4-like isoform X1
Genome locationChr08:20329687..20330954
RNA-Seq ExpressionHG10004776
SyntenyHG10004776
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444812.1 PREDICTED: uncharacterized protein LOC103488048 [Cucumis melo]1.9e-8966.55Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        STH+CS SF+SD+FTP +HFVA IL +  LL+Q+S+FSLGL PSW +RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  ++KKFQYLETIDKLTH+N+AL  DV+AMK+H+  LKTINS+LKAKKQE  MILGGS NQSEIPEIGTSSS          KSS SN+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA
        EN+LH+ +PS+KNQTAP+ EQSN  QN QIP G IPL D    PMGIPDLNL+ E+  Q   +++MAA+ARQNRIRIWKNK  +N+NG A
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA

XP_008444813.1 PREDICTED: uncharacterized protein LOC103488049 [Cucumis melo]1.4e-10372.43Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++ FVAQIL   PLL+Q+S FSLGLSPSW IRRKRSAVDSPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETIDKLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ
        KSS SN+EN+  + EPS+KNQT P  EQ NS +N+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AARARQNRI+IWKNKN +NNNGA KLQ
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ

Query:  S
        S
Subjt:  S

XP_011649663.1 uncharacterized protein LOC105434650 [Cucumis sativus]2.1e-7260.14Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        ST   S S +SDDFTP+DH VA IL + PLL+Q+S+FSLGL PSW IRRKRSAV SP   ++V+ QPPPPPSSSE  KE+SPTTPLSL+SL LSRSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  + KKF+  E++DKLTH+N+AL  + +A K+ +   KTINS+LKAKKQE  MILGGS N+SEIPE GTS+S          KSS  N+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNN
        EN+LH+ EPS KNQTAPM EQSN  QN QIPI  IPL D     MGIPDLNL+ E+  +    + +AA+ARQNR RI KNK    N
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNN

XP_011649664.1 myocardin-related transcription factor A [Cucumis sativus]9.4e-10571.85Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++HFVAQIL   PLL+QQS FSLGLSPSW IRRKRSAVDSPPD++S+I QPP PP     SSE  KESSPTTPLSL+SL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETI+KLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE ILGG +N S  P+ GTS+SVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL
        KSS SN+EN+  + EPS+KNQT P+ EQSNS QN+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AA+ARQNRI+IWKNK N +NNNGA KL
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL

Query:  QS
        QS
Subjt:  QS

XP_022996985.1 zinc finger homeobox protein 4-like isoform X1 [Cucurbita maxima]2.1e-7258.17Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVI----PQPPPPPSSSENVKESSPTTPLSLNSL
        MAST   HQC+   D D  TPD+    QIL +FPLLVQQ +FSLGL P+W +R KRSAV SPPDS S++    P PPPPP SS   KESSPTTP SL+SL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVI----PQPPPPPSSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSR ESDE N    L  KK  ++KK QYLET+ +LT +N+AL G V  +KRHYN+LKT NS+LKAK+Q+MI   S  +S  PEI  SSS A++  K T 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTA-----PMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIW--KNKNKHNN
        K      ++H HQ +P IKNQTA        EQSNS QN +IP GAI +YDPS GP GIPDLNLSF+EI+Q   +R MAA+ARQNRI+IW  KN N +NN
Subjt:  KSSASNLENHLHQREPSIKNQTA-----PMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIW--KNKNKHNN

Query:  NGAAKL
        NGAA+L
Subjt:  NGAAKL

TrEMBL top hitse value%identityAlignment
A0A0A0LRP1 Uncharacterized protein4.5e-10571.85Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++HFVAQIL   PLL+QQS FSLGLSPSW IRRKRSAVDSPPD++S+I QPP PP     SSE  KESSPTTPLSL+SL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPP----SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETI+KLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE ILGG +N S  P+ GTS+SVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL
        KSS SN+EN+  + EPS+KNQT P+ EQSNS QN+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AA+ARQNRI+IWKNK N +NNNGA KL
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNK-NKHNNNGAAKL

Query:  QS
        QS
Subjt:  QS

A0A1S3BAR4 uncharacterized protein LOC1034880496.6e-10472.43Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++ FVAQIL   PLL+Q+S FSLGLSPSW IRRKRSAVDSPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETIDKLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ
        KSS SN+EN+  + EPS+KNQT P  EQ NS +N+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AARARQNRI+IWKNKN +NNNGA KLQ
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ

Query:  S
        S
Subjt:  S

A0A1S3BC34 uncharacterized protein LOC1034880489.2e-9066.55Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        STH+CS SF+SD+FTP +HFVA IL +  LL+Q+S+FSLGL PSW +RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  ++KKFQYLETIDKLTH+N+AL  DV+AMK+H+  LKTINS+LKAKKQE  MILGGS NQSEIPEIGTSSS          KSS SN+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA
        EN+LH+ +PS+KNQTAP+ EQSN  QN QIP G IPL D    PMGIPDLNL+ E+  Q   +++MAA+ARQNRIRIWKNK  +N+NG A
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA

A0A5A7VA15 Uncharacterized protein9.2e-9066.55Show/hide
Query:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE
        STH+CS SF+SD+FTP +HFVA IL +  LL+Q+S+FSLGL PSW +RRKRSAV SPPD +SV+ QPPPPPSSSE  KESSPTTPLS N   L RSESDE
Subjt:  STHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDE

Query:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL
        N  N K+SK+K  ++KKFQYLETIDKLTH+N+AL  DV+AMK+H+  LKTINS+LKAKKQE  MILGGS NQSEIPEIGTSSS          KSS SN+
Subjt:  NNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQE--MILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNL

Query:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA
        EN+LH+ +PS+KNQTAP+ EQSN  QN QIP G IPL D    PMGIPDLNL+ E+  Q   +++MAA+ARQNRIRIWKNK  +N+NG A
Subjt:  ENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAA

A0A5A7VHE1 Uncharacterized protein6.6e-10472.43Show/hide
Query:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL
        MAST+STHQCS SFDSDDF+P++ FVAQIL   PLL+Q+S FSLGLSPSW IRRKRSAVDSPPD+ S+I QPP PP     SSE  KESSPTTPLSLNSL
Subjt:  MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA
        PLSRSESDEN T  K+SKKK  ++KK QYLETIDKLTH+ +ALEGD++AMKRH+  LKTINS+LKAKKQE IL G  N S  PEIGTSSSVAME+AKLT 
Subjt:  PLSRSESDENNTNTKLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTA

Query:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ
        KSS SN+EN+  + EPS+KNQT P  EQ NS +N+QIPIG IPLYDPSLGPMGIPDLNLS E+I     ++++AARARQNRI+IWKNKN +NNNGA KLQ
Subjt:  KSSASNLENHLHQREPSIKNQTAPMEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQ

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTACTTCCACTCATCAATGCTCCACCTCCTTCGATTCCGACGACTTCACCCCTGATGACCACTTCGTCGCTCAAATCCTCATCGATTTTCCTCTTCTCGT
TCAACAATCACAGTTTTCTCTTGGCTTATCCCCTTCCTGGTCTATCCGACGCAAGAGATCCGCCGTCGATTCCCCGCCGGACTCCGCCTCCGTCATCCCCCAACCGCCGC
CTCCTCCATCGTCGTCCGAGAACGTCAAGGAGTCTAGCCCTACTACTCCACTTTCACTCAACTCTTTACCTTTGTCGCGGAGTGAATCTGACGAGAATAACACCAACACG
AAGCTCTCCAAGAAGAAAACCTCTATCAATAAGAAATTTCAGTATTTGGAAACCATTGACAAATTGACCCACCGGAATCGAGCTCTGGAAGGGGACGTTGATGCTATGAA
GCGACATTATAATCAACTGAAAACTATTAATTCGAAGTTGAAAGCCAAGAAGCAAGAGATGATTCTGGGTGGTTCCAATAATCAATCAGAAATTCCAGAAATTGGGACCT
CAAGTTCGGTCGCCATGGAAATGGCTAAGTTAACTGCGAAATCCTCAGCCTCAAATCTGGAGAATCATCTTCATCAACGTGAACCGTCGATCAAGAATCAGACGGCTCCG
ATGGAAGAACAGAGCAACAGTAAACAGAATTTCCAAATTCCAATTGGGGCAATTCCTTTATATGATCCTTCATTAGGTCCAATGGGGATTCCTGATTTGAACCTCTCTTT
TGAAGAAATTAATCAGGGATATTGTTCAAGACACATGGCTGCTAGAGCAAGACAGAACAGGATTCGGATCTGGAAGAACAAGAACAAGCACAACAACAATGGAGCTGCCA
AATTGCAATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCACTACTTCCACTCATCAATGCTCCACCTCCTTCGATTCCGACGACTTCACCCCTGATGACCACTTCGTCGCTCAAATCCTCATCGATTTTCCTCTTCTCGT
TCAACAATCACAGTTTTCTCTTGGCTTATCCCCTTCCTGGTCTATCCGACGCAAGAGATCCGCCGTCGATTCCCCGCCGGACTCCGCCTCCGTCATCCCCCAACCGCCGC
CTCCTCCATCGTCGTCCGAGAACGTCAAGGAGTCTAGCCCTACTACTCCACTTTCACTCAACTCTTTACCTTTGTCGCGGAGTGAATCTGACGAGAATAACACCAACACG
AAGCTCTCCAAGAAGAAAACCTCTATCAATAAGAAATTTCAGTATTTGGAAACCATTGACAAATTGACCCACCGGAATCGAGCTCTGGAAGGGGACGTTGATGCTATGAA
GCGACATTATAATCAACTGAAAACTATTAATTCGAAGTTGAAAGCCAAGAAGCAAGAGATGATTCTGGGTGGTTCCAATAATCAATCAGAAATTCCAGAAATTGGGACCT
CAAGTTCGGTCGCCATGGAAATGGCTAAGTTAACTGCGAAATCCTCAGCCTCAAATCTGGAGAATCATCTTCATCAACGTGAACCGTCGATCAAGAATCAGACGGCTCCG
ATGGAAGAACAGAGCAACAGTAAACAGAATTTCCAAATTCCAATTGGGGCAATTCCTTTATATGATCCTTCATTAGGTCCAATGGGGATTCCTGATTTGAACCTCTCTTT
TGAAGAAATTAATCAGGGATATTGTTCAAGACACATGGCTGCTAGAGCAAGACAGAACAGGATTCGGATCTGGAAGAACAAGAACAAGCACAACAACAATGGAGCTGCCA
AATTGCAATCCTAA
Protein sequenceShow/hide protein sequence
MASTTSTHQCSTSFDSDDFTPDDHFVAQILIDFPLLVQQSQFSLGLSPSWSIRRKRSAVDSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTNT
KLSKKKTSINKKFQYLETIDKLTHRNRALEGDVDAMKRHYNQLKTINSKLKAKKQEMILGGSNNQSEIPEIGTSSSVAMEMAKLTAKSSASNLENHLHQREPSIKNQTAP
MEEQSNSKQNFQIPIGAIPLYDPSLGPMGIPDLNLSFEEINQGYCSRHMAARARQNRIRIWKNKNKHNNNGAAKLQS