; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0018549 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0018549
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptionzinc finger homeobox protein 4-like isoform X1
Genome locationchr03:27900356..27901842
RNA-Seq ExpressionPay0018549
SyntenyPay0018549
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444812.1 PREDICTED: uncharacterized protein LOC103488048 [Cucumis melo]2.0e-149100Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
        MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN

Query:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
        IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
Subjt:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP

Query:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
        SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
Subjt:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN

XP_008444813.1 PREDICTED: uncharacterized protein LOC103488049 [Cucumis melo]6.5e-9267.35Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFN---LLR
        +STH+CSISF+SD+F+P+E FVA IL++L LLIQKS FSLGL PSWP+RRKRSAV SPPD  S++ QPP PP     SSER KESSPTTPLS N   L R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFN---LLR

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDENI   KVSK+KAP+DKK QYLETIDKLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   IL G  N S  PEIGTSSS          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP
        S+SNVENN  EC+PSMKNQT P AEQ N N+N+QIP G IPL D    PMGIPDLNL++E     +YTKY+AA+ARQNRI+IWKNK NNN+  P
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP

XP_011649663.1 uncharacterized protein LOC105434650 [Cucumis sativus]2.1e-10674.66Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFN---LLRSES
        MAST   SIS NSD+FTP++H VADIL+EL LLIQKSEFSLGLPPSWP+RRKRSAVVSP  CS+VVAQPPPPPSSSE  KE+SPTTPLS +   L RSES
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFN---LLRSES

Query:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE
        DENIANVKVSKRKAPL KKF+  E++DKLTHQNQAL ++ EA KQ F H KTINSELKAKKQE  MILGGSTN+SEIPE GTS+SGTKSS  N+ENNLHE
Subjt:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE

Query:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN
        C+PS KNQTAP+AEQSN NQNHQIP  EIPLLD MGIPDLNLT+EQN ++NY K +AAKARQNR RI KNK+N           NN NGPAN
Subjt:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN

XP_011649664.1 myocardin-related transcription factor A [Cucumis sativus]1.0e-9267.46Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPP----SSSERAKESSPTTPLSFN---LLR
        +STH+CSISF+SD+F+P+EHFVA IL++L LLIQ+S FSLGL PSWP+RRKRSAV SPPD SS++ QPP PP     SSER KESSPTTPLS +   L R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPP----SSSERAKESSPTTPLSFN---LLR

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDEN    KVSK+KAP+DKK QYLETI+KLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   ILGG +N S  P+ GTS+S          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA
        S SNVENN  EC+PSMKNQT PVAEQSN  QN+QIP G IPL D    PMGIPDLNL++E  +  NYTKY+AAKARQNRI+IWKNK NNN+N  A
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA

XP_022996985.1 zinc finger homeobox protein 4-like isoform X1 [Cucurbita maxima]5.2e-5752Show/hide
Query:  MAST---HKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVV----AQPPPPPSSSERAKESSPTTPLSFN--
        MAST   H+C+   + D  TP E     IL E  LL+Q+ EFSLGLPP+WPVR KRSAVVSPPD  S+V      PPPPP SS + KESSPTTP S +  
Subjt:  MAST---HKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVV----AQPPPPPSSSERAKESSPTTPLSFN--

Query:  -LLRSESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSN
         L R ESDE I    +  +K  LDKK QYLET+ +LT QNQALV  V+ +K+H+  LKT NSELKAK+Q+   ++  S  +S  PEI  SSS  K+    
Subjt:  -LLRSESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSN

Query:  V-------ENNLHECQPSMKNQTA-PVA----EQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSN
        V       +++ H+CQP +KNQTA P A    EQSN +QN +IP G I + D    P GIPDLNL+ ++  Q NYT+ MAA+ARQNRI+IWK+K NNN+N
Subjt:  V-------ENNLHECQPSMKNQTA-PVA----EQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSN

TrEMBL top hitse value%identityAlignment
A0A0A0LL74 Uncharacterized protein1.0e-10674.66Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFN---LLRSES
        MAST   SIS NSD+FTP++H VADIL+EL LLIQKSEFSLGLPPSWP+RRKRSAVVSP  CS+VVAQPPPPPSSSE  KE+SPTTPLS +   L RSES
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFN---LLRSES

Query:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE
        DENIANVKVSKRKAPL KKF+  E++DKLTHQNQAL ++ EA KQ F H KTINSELKAKKQE  MILGGSTN+SEIPE GTS+SGTKSS  N+ENNLHE
Subjt:  DENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHE

Query:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN
        C+PS KNQTAP+AEQSN NQNHQIP  EIPLLD MGIPDLNLT+EQN ++NY K +AAKARQNR RI KNK+N           NN NGPAN
Subjt:  CQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKN-----------NNSNGPAN

A0A0A0LRP1 Uncharacterized protein4.9e-9367.46Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPP----SSSERAKESSPTTPLSFN---LLR
        +STH+CSISF+SD+F+P+EHFVA IL++L LLIQ+S FSLGL PSWP+RRKRSAV SPPD SS++ QPP PP     SSER KESSPTTPLS +   L R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPP----SSSERAKESSPTTPLSFN---LLR

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDEN    KVSK+KAP+DKK QYLETI+KLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   ILGG +N S  P+ GTS+S          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA
        S SNVENN  EC+PSMKNQT PVAEQSN  QN+QIP G IPL D    PMGIPDLNL++E  +  NYTKY+AAKARQNRI+IWKNK NNN+N  A
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPA

A0A1S3BC34 uncharacterized protein LOC1034880489.7e-150100Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
        MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN

Query:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
        IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
Subjt:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP

Query:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
        SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
Subjt:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN

A0A5A7VA15 Uncharacterized protein9.7e-150100Show/hide
Query:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
        MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN
Subjt:  MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDEN

Query:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
        IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP
Subjt:  IANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQP

Query:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
        SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN
Subjt:  SMKNQTAPVAEQSNHNQNHQIPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN

A0A5A7VHE1 Uncharacterized protein3.2e-9267.35Show/hide
Query:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFN---LLR
        +STH+CSISF+SD+F+P+E FVA IL++L LLIQKS FSLGL PSWP+RRKRSAV SPPD  S++ QPP PP     SSER KESSPTTPLS N   L R
Subjt:  ASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPS----SSERAKESSPTTPLSFN---LLR

Query:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS
        SESDENI   KVSK+KAP+DKK QYLETIDKLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   IL G  N S  PEIGTSSS          KS
Subjt:  SESDENIANVKVSKRKAPLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSS--------GTKS

Query:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP
        S+SNVENN  EC+PSMKNQT P AEQ N N+N+QIP G IPL D    PMGIPDLNL++E     +YTKY+AA+ARQNRI+IWKNK NNN+  P
Subjt:  SHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQIPTGEIPLLD----PMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTCATAAATGCTCCATCTCCTTCAATTCCGACGAGTTCACCCCTAAAGAACACTTTGTCGCTGACATCCTCGAAGAATTAACTCTTCTCATTCAAAAATC
CGAATTCTCTCTTGGATTACCTCCTTCCTGGCCTGTCCGACGCAAGAGATCCGCTGTCGTTTCCCCGCCGGACTGTTCCTCGGTCGTCGCTCAACCGCCGCCTCCTCCAT
CGTCATCCGAGAGAGCAAAGGAGTCTAGCCCTACTACTCCACTTTCATTCAACTTGTTAAGGAGCGAGTCTGATGAGAATATTGCCAACGTCAAGGTTTCCAAGAGGAAA
GCCCCACTCGATAAGAAATTTCAATATTTGGAAACCATTGATAAATTGACCCACCAGAATCAAGCTCTGGTAAAGGACGTTGAAGCTATGAAGCAACATTTTGTTCATCT
GAAAACTATCAATTCGGAGTTGAAGGCCAAGAAGCAAGAGGTCCCCATGATTCTTGGTGGTTCCACTAACCAATCAGAAATTCCCGAAATTGGGACCTCAAGTTCGGGTA
CGAAATCCTCACACTCAAATGTGGAGAATAATCTCCATGAATGTCAACCGTCGATGAAGAATCAGACGGCTCCGGTGGCAGAACAGAGTAACCATAATCAGAATCACCAA
ATCCCAACTGGGGAAATTCCTTTACTGGACCCAATGGGGATTCCTGATTTGAACCTAACTATCGAACAAAATGTTCAAATGAATTACACAAAATATATGGCTGCCAAAGC
AAGACAGAACAGGATTCGAATCTGGAAGAACAAGAAGAACAACAACAGCAATGGACCTGCAAATTAA
mRNA sequenceShow/hide mRNA sequence
TGAATATCCATCTAACGGACACGTGTCAACAGCCACATCCCTTTTCCAAAATCTTTTCCTATTTGGTTTTGTGAAACCCAGTCCCCAAGCAACTCCCCGCTCTTCATCTT
CCTCAGTTGTTATGAATCCTCATTCTTTTCCAGCTTCCATGGATTTTCTCCTTCTCTGAACTTTCGCATCGATTCTTTCCATCAAAATCAACAGATTGATATCTCAATGG
CTTCCACTCATAAATGCTCCATCTCCTTCAATTCCGACGAGTTCACCCCTAAAGAACACTTTGTCGCTGACATCCTCGAAGAATTAACTCTTCTCATTCAAAAATCCGAA
TTCTCTCTTGGATTACCTCCTTCCTGGCCTGTCCGACGCAAGAGATCCGCTGTCGTTTCCCCGCCGGACTGTTCCTCGGTCGTCGCTCAACCGCCGCCTCCTCCATCGTC
ATCCGAGAGAGCAAAGGAGTCTAGCCCTACTACTCCACTTTCATTCAACTTGTTAAGGAGCGAGTCTGATGAGAATATTGCCAACGTCAAGGTTTCCAAGAGGAAAGCCC
CACTCGATAAGAAATTTCAATATTTGGAAACCATTGATAAATTGACCCACCAGAATCAAGCTCTGGTAAAGGACGTTGAAGCTATGAAGCAACATTTTGTTCATCTGAAA
ACTATCAATTCGGAGTTGAAGGCCAAGAAGCAAGAGGTCCCCATGATTCTTGGTGGTTCCACTAACCAATCAGAAATTCCCGAAATTGGGACCTCAAGTTCGGGTACGAA
ATCCTCACACTCAAATGTGGAGAATAATCTCCATGAATGTCAACCGTCGATGAAGAATCAGACGGCTCCGGTGGCAGAACAGAGTAACCATAATCAGAATCACCAAATCC
CAACTGGGGAAATTCCTTTACTGGACCCAATGGGGATTCCTGATTTGAACCTAACTATCGAACAAAATGTTCAAATGAATTACACAAAATATATGGCTGCCAAAGCAAGA
CAGAACAGGATTCGAATCTGGAAGAACAAGAAGAACAACAACAGCAATGGACCTGCAAATTAACAATCCTAATTCCACAAATTTTTCCTTCTTTTTATTAAACCTAAGGA
TTCAATCAATCAATTTTATGGGAGAACTGTTTTGAATCTTGGGGTAGTTTTTTTCCCCTTTATTAATTGGGTTGAATTGTGAAGGTAGATAGAACCCCAC
Protein sequenceShow/hide protein sequence
MASTHKCSISFNSDEFTPKEHFVADILEELTLLIQKSEFSLGLPPSWPVRRKRSAVVSPPDCSSVVAQPPPPPSSSERAKESSPTTPLSFNLLRSESDENIANVKVSKRK
APLDKKFQYLETIDKLTHQNQALVKDVEAMKQHFVHLKTINSELKAKKQEVPMILGGSTNQSEIPEIGTSSSGTKSSHSNVENNLHECQPSMKNQTAPVAEQSNHNQNHQ
IPTGEIPLLDPMGIPDLNLTIEQNVQMNYTKYMAAKARQNRIRIWKNKKNNNSNGPAN