CuGenDBv2

CmoCh12G003580 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh12G003580
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: zinc finger homeobox protein 4-like isoform X1
Genome location: Cmo_Chr12:2224813..2226358
RNA-Seq Expression: CmoCh12G003580
Synteny: CmoCh12G003580
Gene Ontology terms: NA
InterPro domains: NA
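
The genome location above follows a `sequence:start..end` pattern. A minimal Python sketch for splitting it into its parts is given here; the helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its parts (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("Cmo_Chr12:2224813..2226358")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1546 bp on Cmo_Chr12.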


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585540.1 hypothetical protein SDJN03_18273, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.2e-139; identity: 97.8%)
Query:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
        MAATPHC     AFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
Subjt:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK

Query:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN
        KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQP PMAEQSNN
Subjt:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN

Query:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
        GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
Subjt:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP

KAG7020453.1 hypothetical protein SDJN02_17137, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.2e-138; identity: 97.44%)
Query:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
        MAATPHC     AFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
Subjt:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK

Query:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN
        KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQP PMAEQSNN
Subjt:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN

Query:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
        GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNG+TKLQTPPNNP
Subjt:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP

XP_022951578.1 uncharacterized protein LOC111454352 [Cucurbita moschata] (e value: 2.5e-144; identity: 100%)
Query:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
        MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
Subjt:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK

Query:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN
        KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN
Subjt:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN

Query:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
        GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
Subjt:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP

XP_023002465.1 uncharacterized protein LOC111496295 [Cucurbita maxima] (e value: 3.2e-139; identity: 97.81%)
Query:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
        MAATP CASLDFAF+PKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
Subjt:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK

Query:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTH-QPDPMAEQSN
        KK SLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESS H QP PMAEQSN
Subjt:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTH-QPDPMAEQSN

Query:  NGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
        NGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
Subjt:  NGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP

XP_023537124.1 uncharacterized protein LOC111798295 [Cucurbita pepo subsp. pepo] (e value: 6.8e-142; identity: 98.53%)
Query:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
        MAATPHCASLDFAF+PKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKH+K
Subjt:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK

Query:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN
        KKPS DKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQP PMAEQSNN
Subjt:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN

Query:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
        GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
Subjt:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LRP1 Uncharacterized protein (e value: 9.2e-60; identity: 53.8%)
Query:  ATPHCASLDF---AFSPKEHEVAQILLEFS---KKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPP---------SKKVKESSPTSPLVLNSLPLSR
        ++ H  S+ F    FSP+EH VAQIL +     ++S   LG  P W +RRKRSA+ SPP++S+ +  PP         S++ KESSPT+PL L+SLPLSR
Subjt:  ATPHCASLDF---AFSPKEHEVAQILLEFS---KKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPP---------SKKVKESSPTSPLVLNSLPLSR

Query:  SESDEHTN-AKHSKKKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSS-AMEVVKLLTVE
        SESDE+T  AK SKKK  +DKKSQ++E I++LT Q Q L+G+ EAMK+H+ +LK INSELKAKKQE ILG  S  N S  P+ GTS+S AME+ K LTV+
Subjt:  SESDEHTN-AKHSKKKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSS-AMEVVKLLTVE

Query:  SS---------------THQPDPMAEQSNNGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNK-----NNGVTK
        SS                +Q  P+AEQSN+  QN+QIPIGGIP YDP SL PMGIPDLN+SLE+I  +NY++++AA+AR+NRIQI KNK     NNG  K
Subjt:  SS---------------THQPDPMAEQSNNGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNK-----NNGVTK

Query:  LQT
        LQ+
Subjt:  LQT

A0A1S3BAR4 uncharacterized protein LOC103488049 (e value: 9.2e-60; identity: 54.15%)
Query:  ATPHCASLDF---AFSPKEHEVAQILLEFS---KKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPP---------SKKVKESSPTSPLVLNSLPLSR
        ++ H  S+ F    FSP+E  VAQIL +     +KS   LG  P W +RRKRSA+ SPP++ + +  PP         S++ KESSPT+PL LNSLPLSR
Subjt:  ATPHCASLDF---AFSPKEHEVAQILLEFS---KKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPP---------SKKVKESSPTSPLVLNSLPLSR

Query:  SESDEH-TNAKHSKKKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSS-AMEVVKLLTVE
        SESDE+ T AK SKKK  +DKKSQ++E ID+LT Q Q L+G+ EAMK+H+ +LK INSELKAKKQE++ G     N S  PEIGTSSS AMEV K LTV+
Subjt:  SESDEH-TNAKHSKKKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSS-AMEVVKLLTVE

Query:  SS---------------THQPDPMAEQSNNGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNK---NNGVTKLQ
        SS                +Q  P AEQ N+ ++N+QIPIGGIP YDP SL PMGIPDLN+SLE+I  ++Y++++AARAR+NRIQI KNK   NNG  KLQ
Subjt:  SS---------------THQPDPMAEQSNNGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNK---NNGVTKLQ

Query:  T
        +
Subjt:  T

A0A5A7VHE1 Uncharacterized protein (e value: 9.2e-60; identity: 54.15%)
Query:  ATPHCASLDF---AFSPKEHEVAQILLEFS---KKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPP---------SKKVKESSPTSPLVLNSLPLSR
        ++ H  S+ F    FSP+E  VAQIL +     +KS   LG  P W +RRKRSA+ SPP++ + +  PP         S++ KESSPT+PL LNSLPLSR
Subjt:  ATPHCASLDF---AFSPKEHEVAQILLEFS---KKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPP---------SKKVKESSPTSPLVLNSLPLSR

Query:  SESDEH-TNAKHSKKKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSS-AMEVVKLLTVE
        SESDE+ T AK SKKK  +DKKSQ++E ID+LT Q Q L+G+ EAMK+H+ +LK INSELKAKKQE++ G     N S  PEIGTSSS AMEV K LTV+
Subjt:  SESDEH-TNAKHSKKKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSS-AMEVVKLLTVE

Query:  SS---------------THQPDPMAEQSNNGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNK---NNGVTKLQ
        SS                +Q  P AEQ N+ ++N+QIPIGGIP YDP SL PMGIPDLN+SLE+I  ++Y++++AARAR+NRIQI KNK   NNG  KLQ
Subjt:  SS---------------THQPDPMAEQSNNGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNK---NNGVTKLQ

Query:  T
        +
Subjt:  T

A0A6J1GI34 uncharacterized protein LOC111454352 (e value: 1.2e-144; identity: 100%)
Query:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
        MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
Subjt:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK

Query:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN
        KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN
Subjt:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNN

Query:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
        GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
Subjt:  GSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP

A0A6J1KP15 uncharacterized protein LOC111496295 (e value: 1.5e-139; identity: 97.81%)
Query:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
        MAATP CASLDFAF+PKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK
Subjt:  MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSK

Query:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTH-QPDPMAEQSN
        KK SLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESS H QP PMAEQSN
Subjt:  KKPSLDKKSQFVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTH-QPDPMAEQSN

Query:  NGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
        NGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP
Subjt:  NGSQNFQIPIGGIPFYDPSSLSPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNP

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
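
The % identity values listed with the hits can be related to the alignment midlines (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST convention: a letter in the midline marks an identical residue, `+` marks a similar but non-identical one, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical and this is not necessarily how the database computes its figures.

```python
def percent_identity(query_lines: list[str], midlines: list[str]) -> float:
    """Percent identity = identical columns / alignment length * 100."""
    # Alignment length is taken from the Query lines (gap characters '-'
    # count as columns), so stripped trailing spaces in a midline do not matter.
    length = sum(len(q) for q in query_lines)
    if length == 0:
        raise ValueError("empty alignment")
    # Only letters in the midline count as identities; '+' and ' ' do not.
    identical = sum(ch.isalpha() for m in midlines for ch in m)
    return 100.0 * identical / length


# Short fragments copied from the alignments on this page.
print(round(percent_identity(["MAATPHCASLDFAFSPKE"], ["MAATPHC     AFSPKE"]), 2))
print(round(percent_identity(["NNGVTK"], ["NNG+TK"]), 2))
```

Applied to the full first GenBank hit (273 aligned columns, 6 of them without a letter in the midline), this calculation gives about 97.8, which seems consistent with the value reported for that hit.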

Sequences
CDS sequence
ATGGCGGCGACTCCTCATTGCGCCTCTCTCGATTTCGCCTTCTCTCCTAAAGAACACGAAGTCGCTCAAATCCTCCTCGAATTCTCGAAGAAATCCGTCGTTTATCTCGG
ATTTATCCCCGTCTGGACACTCCGACGCAAGAGATCCGCCTTAGTTTCCCCGCCGGAATCCTCCGCCTCCGTCCCTTCGCCGCCGTCCAAGAAGGTCAAGGAGTCCAGCC
CTACCTCTCCTCTCGTCCTCAACTCCTTGCCGTTGTCGCGAAGTGAATCCGATGAACATACCAACGCTAAACACTCCAAGAAGAAGCCCTCTCTCGATAAGAAATCTCAG
TTTGTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGGTTTGAAAGGGGAATTTGAAGCTATGAAGCAACATTATAATCATCTGAAAGCTATCAATTCGGAATTGAA
GGCCAAAAAGCAAGAGATGATTCTCGGTTCTAACAGCTCTAAGAACGAATCAGCAATTCCAGAAATAGGAACCTCAAGTTCGGCCATGGAAGTCGTTAAGCTGCTCACAG
TCGAATCCTCAACTCATCAGCCAGATCCCATGGCGGAACAGAGTAATAATGGCAGTCAGAATTTTCAAATCCCGATTGGGGGAATTCCTTTCTATGATCCTTCTTCATTG
AGCCCAATGGGGATTCCTGATTTGAACATCTCTCTTGAAGAAATCAACCAGAGGAATTACTCCAGATTCATGGCGGCTCGAGCAAGAAAGAACAGGATTCAGATCTGCAA
GAATAAGAACAACGGCGTTACCAAATTGCAGACTCCTCCAAACAATCCCTATTTTTGA
mRNA sequence
GTCTTCTTCAACCAAACACTCACACGCTCTCCTTTCGTCATGTTCTTCCTTCTATAAAATCCTCATACTCTCCTTCTTCTTCCTTTCTTCTTCAACCTTCCATGGAATTT
TCCTTGCCCTAAGCCTCCCAACTTCGCTCCCATGGCGGCGACTCCTCATTGCGCCTCTCTCGATTTCGCCTTCTCTCCTAAAGAACACGAAGTCGCTCAAATCCTCCTCG
AATTCTCGAAGAAATCCGTCGTTTATCTCGGATTTATCCCCGTCTGGACACTCCGACGCAAGAGATCCGCCTTAGTTTCCCCGCCGGAATCCTCCGCCTCCGTCCCTTCG
CCGCCGTCCAAGAAGGTCAAGGAGTCCAGCCCTACCTCTCCTCTCGTCCTCAACTCCTTGCCGTTGTCGCGAAGTGAATCCGATGAACATACCAACGCTAAACACTCCAA
GAAGAAGCCCTCTCTCGATAAGAAATCTCAGTTTGTGGAAGCCATTGACGAATTGACCAAGCAGAATCAAGGTTTGAAAGGGGAATTTGAAGCTATGAAGCAACATTATA
ATCATCTGAAAGCTATCAATTCGGAATTGAAGGCCAAAAAGCAAGAGATGATTCTCGGTTCTAACAGCTCTAAGAACGAATCAGCAATTCCAGAAATAGGAACCTCAAGT
TCGGCCATGGAAGTCGTTAAGCTGCTCACAGTCGAATCCTCAACTCATCAGCCAGATCCCATGGCGGAACAGAGTAATAATGGCAGTCAGAATTTTCAAATCCCGATTGG
GGGAATTCCTTTCTATGATCCTTCTTCATTGAGCCCAATGGGGATTCCTGATTTGAACATCTCTCTTGAAGAAATCAACCAGAGGAATTACTCCAGATTCATGGCGGCTC
GAGCAAGAAAGAACAGGATTCAGATCTGCAAGAATAAGAACAACGGCGTTACCAAATTGCAGACTCCTCCAAACAATCCCTATTTTTGA
Protein sequence
MAATPHCASLDFAFSPKEHEVAQILLEFSKKSVVYLGFIPVWTLRRKRSALVSPPESSASVPSPPSKKVKESSPTSPLVLNSLPLSRSESDEHTNAKHSKKKPSLDKKSQ
FVEAIDELTKQNQGLKGEFEAMKQHYNHLKAINSELKAKKQEMILGSNSSKNESAIPEIGTSSSAMEVVKLLTVESSTHQPDPMAEQSNNGSQNFQIPIGGIPFYDPSSL
SPMGIPDLNISLEEINQRNYSRFMAARARKNRIQICKNKNNGVTKLQTPPNNPYF
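
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal sketch in Python follows, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; the function name `translate` is hypothetical and the two inputs are the first 24 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGGCGACTCCTCATTGCGCC"))  # MAATPHCA
print(translate("AACAATCCCTATTTTTGA"))        # NNPYF
```

Both fragments agree with the start (`MAATPHCA`) and end (`NNPYF`) of the listed protein sequence; passing the full CDS text, line breaks included, would translate the whole sequence the same way.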