; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0003682 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0003682
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptionzinc finger homeobox protein 4-like isoform X1
Genome locationchr03:25114036..25115467
RNA-Seq ExpressionIVF0003682
SyntenyIVF0003682
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444812.1 PREDICTED: uncharacterized protein LOC103488048 [Cucumis melo]2.78e-192100Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
        MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN

Query:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
        IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
Subjt:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP

Query:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
        SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
Subjt:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN

XP_008444813.1 PREDICTED: uncharacterized protein LOC103488049 [Cucumis melo]2.71e-11667.35Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFNLL---R
        +STH+CSISF+SD+F+P+E FVA IL++L LLIQKS FSLGL PSWP+RRKRSAV SPPD  S++ QPP PP     SSER KESSPTTPLS N L   R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFNLL---R

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDENI   KVSK+KAP+DKK QYLETIDKLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   IL G  N S  PEIGTSSS          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDP----MGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP
        S+SNVENN  EC+PSMKNQT P AEQ N N+N+QIP G IPL DP    MGIPDLNL++E     +YTKY+AA+ARQNRI+IWKNK NNN+  P
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDP----MGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP

XP_011649663.1 uncharacterized protein LOC105434650 [Cucumis sativus]1.48e-13574.66Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLL---RSES
        MAST   SIS NSD+FTP++H VADIL+EL LLIQKSEFSLGLPPSWP+RRKRSAVVSP  CS+VVAQPPPPPSSSE  KE+SPTTPLS + L   RSES
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLL---RSES

Query:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE
        DENIANVKVSKRKAPL KKF+  E++DKLTHQNQAL ++ EA KQ F H KTINSELKAKKQE  MILGGSTN+SEIPE GTS+SGTKSS  N+ENNLHE
Subjt:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE

Query:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN
        C+PS KNQTAP+AEQSN NQNHQIP  EIPLLD MGIPDLNLT+EQN ++NY K +AAKARQNR RI KNK+N           NN NGPAN
Subjt:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN

XP_011649664.1 myocardin-related transcription factor A [Cucumis sativus]1.76e-11767.46Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFNLL---R
        +STH+CSISF+SD+F+P+EHFVA IL++L LLIQ+S FSLGL PSWP+RRKRSAV SPPD SS++ QPP PP     SSER KESSPTTPLS + L   R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFNLL---R

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDEN    KVSK+KAP+DKK QYLETI+KLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   ILGG +N S  P+ GTS+S          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDP----MGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA
        S SNVENN  EC+PSMKNQT PVAEQSN  QN+QIP G IPL DP    MGIPDLNL++E  +  NYTKY+AAKARQNRI+IWKNK NNN+N  A
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDP----MGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA

XP_022996985.1 zinc finger homeobox protein 4-like isoform X1 [Cucurbita maxima]1.21e-7052Show/hide
Query:  MAST---HKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQP----PPPPSSSERAKESSPTTPLSFNLL
        MAST   H+C+   + D  TP E     IL E  LL+Q+ EFSLGLPP+WPVR KRSAVVSPPD  S+V  P    PPPP SS + KESSPTTP S + L
Subjt:  MAST---HKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQP----PPPPSSSERAKESSPTTPLSFNLL

Query:  ---RSESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSN
           R ESDE I    +  +K  LDKK QYLET+ +LT QNQALV  V+ +K+H+  LKT NSELKAK+Q++   +  S  +S  PEI  SSS  K+    
Subjt:  ---RSESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSN

Query:  VE-------NNLHECQPSMKNQTA-PVA----EQSNHNQNHQIPTGEIPLLDPM----GIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSN
        V+       ++ H+CQP +KNQTA P A    EQSN +QN +IP G I + DP     GIPDLNL+ ++  Q NYT+ MAA+ARQNRI+IWK+K NNN+N
Subjt:  VE-------NNLHECQPSMKNQTA-PVA----EQSNHNQNHQIPTGEIPLLDPM----GIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSN

TrEMBL top hitse value%identityAlignment
A0A0A0LL74 Uncharacterized protein1.0e-10674.66Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFN---LLRSES
        MAST   SIS NSD+FTP++H VADIL+EL LLIQKSEFSLGLPPSWP+RRKRSAVVSP  CS+VVAQPPPPPSSSE  KE+SPTTPLS +   L RSES
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFN---LLRSES

Query:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE
        DENIANVKVSKRKAPL KKF+  E++DKLTHQNQAL ++ EA KQ F H KTINSELKAKKQE  MILGGSTN+SEIPE GTS+SGTKSS  N+ENNLHE
Subjt:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE

Query:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN
        C+PS KNQTAP+AEQSN NQNHQIP  EIPLLD MGIPDLNLT+EQN ++NY K +AAKARQNR RI KNK+N           NN NGPAN
Subjt:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN

A0A0A0LRP1 Uncharacterized protein4.9e-9367.46Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPP----SSSERAKESSPTTPLSFN---LLR
        +STH+CSISF+SD+F+P+EHFVA IL++L LLIQ+S FSLGL PSWP+RRKRSAV SPPD SS++ QPP PP     SSER KESSPTTPLS +   L R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPP----SSSERAKESSPTTPLSFN---LLR

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDEN    KVSK+KAP+DKK QYLETI+KLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   ILGG +N S  P+ GTS+S          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA
        S SNVENN  EC+PSMKNQT PVAEQSN  QN+QIP G IPL D    PMGIPDLNL++E  +  NYTKY+AAKARQNRI+IWKNK NNN+N  A
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA

A0A1S3BC34 uncharacterized protein LOC1034880489.7e-150100Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
        MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN

Query:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
        IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
Subjt:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP

Query:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
        SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
Subjt:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN

A0A5A7VA15 Uncharacterized protein9.7e-150100Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
        MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN

Query:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
        IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
Subjt:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP

Query:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
        SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
Subjt:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN

A0A5A7VHE1 Uncharacterized protein3.2e-9267.35Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFN---LLR
        +STH+CSISF+SD+F+P+E FVA IL++L LLIQKS FSLGL PSWP+RRKRSAV SPPD  S++ QPP PP     SSER KESSPTTPLS N   L R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFN---LLR

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDENI   KVSK+KAP+DKK QYLETIDKLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   IL G  N S  PEIGTSSS          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP
        S+SNVENN  EC+PSMKNQT P AEQ N N+N+QIP G IPL D    PMGIPDLNL++E     +YTKY+AA+ARQNRI+IWKNK NNN+  P
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTCATAAATGCTCCATCTCCTTCAATTCCGACGAGTTCACCCCTAAAGAACACTTTGTCGCTGACATCCTCGAAGAATTAACTCTTCTCATTCAAAAATC
CGAATTCTCTCTTGGATTACCTCCTTCCTGGCCTGTCCGACGCAAGAGATCCGCTGTCGTTTCCCCGCCGGACTGTTCCTCGGTCGTCGCTCAACCGCCGCCTCCTCCAT
CGTCATCCGAGAGAGCAAAGGAGTCTAGCCCTACTACTCCACTTTCATTCAACTTGTTAAGGAGTGAGTCTGATGAGAATATTGCCAACGTCAAGGTTTCCAAGAGGAAA
GCCCCACTCGATAAGAAATTTCAATATTTGGAAACCATTGATAAATTGACCCACCAGAATCAAGCTCTGGTAAAGGACGTTGAAGCTATGAAGCAACATTTTGTTCATCT
GAAAACTATCAATTCGGAGTTGAAGGCCAAGAAGCAAGAGGTCCCCATGATTCTTGGTGGTTCCACTAACCAATCAGAAATTCCCGAAATTGGGACCTCAAGTTCGGGTA
CGAAATCCTCACACTCAAATGTGGAGAATAATCTCCATGAATGTCAACCGTCGATGAAGAATCAGACGGCTCCGGTGGCAGAACAGAGTAACCATAATCAGAATCACCAA
ATCCCAACTGGGGAAATTCCTTTACTGGACCCAATGGGGATTCCTGATTTGAACCTAACTATCGAACAAAATGTTCAAATGAATTACACAAAATATATGGCTGCCAAAGC
AAGACAGAACAGGATTCGAATCTGGAAGAACAAGAAGAACAACAACAGCAATGGACCTGCAAATTAA
mRNA sequenceShow/hide mRNA sequence
GTGTCAACAGCCACATCCCTTTTCCAAAATCTTTTCCTATTTGGTTTTGTGAAACCCAGTCCCCAAGCAACTCCCCGCTCTTCATCTTCCTCCGTTGTTATGAATCCTCA
TTCTTTTCCAGCTTCCATGGATTTTCTCTTTCTCTGAACTTTCGCATCGATTCTTTCAATCAAAATCAACAGATTGATATCTCAATGGCTTCCACTCATAAATGCTCCAT
CTCCTTCAATTCCGACGAGTTCACCCCTAAAGAACACTTTGTCGCTGACATCCTCGAAGAATTAACTCTTCTCATTCAAAAATCCGAATTCTCTCTTGGATTACCTCCTT
CCTGGCCTGTCCGACGCAAGAGATCCGCTGTCGTTTCCCCGCCGGACTGTTCCTCGGTCGTCGCTCAACCGCCGCCTCCTCCATCGTCATCCGAGAGAGCAAAGGAGTCT
AGCCCTACTACTCCACTTTCATTCAACTTGTTAAGGAGTGAGTCTGATGAGAATATTGCCAACGTCAAGGTTTCCAAGAGGAAAGCCCCACTCGATAAGAAATTTCAATA
TTTGGAAACCATTGATAAATTGACCCACCAGAATCAAGCTCTGGTAAAGGACGTTGAAGCTATGAAGCAACATTTTGTTCATCTGAAAACTATCAATTCGGAGTTGAAGG
CCAAGAAGCAAGAGGTCCCCATGATTCTTGGTGGTTCCACTAACCAATCAGAAATTCCCGAAATTGGGACCTCAAGTTCGGGTACGAAATCCTCACACTCAAATGTGGAG
AATAATCTCCATGAATGTCAACCGTCGATGAAGAATCAGACGGCTCCGGTGGCAGAACAGAGTAACCATAATCAGAATCACCAAATCCCAACTGGGGAAATTCCTTTACT
GGACCCAATGGGGATTCCTGATTTGAACCTAACTATCGAACAAAATGTTCAAATGAATTACACAAAATATATGGCTGCCAAAGCAAGACAGAACAGGATTCGAATCTGGA
AGAACAAGAAGAACAACAACAGCAATGGACCTGCAAATTAACAATCCTAATTCCACAAATTTTTCCTTCTTTTTATTAAACCTAAGGATTCAATCAACCAATTTTATGGG
AGAACTGTTTTGAATCTTGGGGTAGTTTTTTCCCCTTTATTAATT
Protein sequenceShow/hide protein sequence
MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDENIANVKVSKRK
APLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQ
IPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN