; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020535 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020535
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptionprotein DOUBLE-STRAND BREAK FORMATION
Genome locationchr12:15404518..15409304
RNA-Seq ExpressionPay0020535
SyntenyPay0020535
Gene Ontology termsGO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domainsIPR044969 - Protein DOUBLE-STRAND BREAK FORMATION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139510.2 protein DOUBLE-STRAND BREAK FORMATION isoform X1 [Cucumis sativus]2.8e-11593.25Show/hide
Query:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
        MFCSYSLFLSRLRSRRFDDSTLRILE FPASKDATSL DV SSF E+LRFESLSIIRET+EKTDD KLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
Subjt:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL

Query:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI
        KSFNQPWLQVSH EWLNFAEHSLHAGFFSI+IKAYEQALSSLQQSDTANYTSHGSFK TEV+EKINRLKDHALNL+GSHSVQALTSDYLKKKVTER+RKI
Subjt:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI

Query:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQ
        SSSCTRKFTASTLF NGIRNYNARKLHEYRS  GVNQ
Subjt:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQ

XP_008464290.1 PREDICTED: uncharacterized protein LOC103502216 isoform X1 [Cucumis melo]3.8e-12899.59Show/hide
Query:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
        MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
Subjt:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL

Query:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI
        KSFNQPWLQVSH EWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI
Subjt:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI

Query:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPVL
        SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPVL
Subjt:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPVL

XP_022970619.1 uncharacterized protein LOC111469552 isoform X1 [Cucurbita maxima]2.0e-10085.04Show/hide
Query:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN
        YSLF SRLRSRRFDDSTLRILE F ASKD     DVKS   ELLRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFN
Subjt:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN

Query:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC
        QP LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALSSLQQSDTANYTSHGS K  EVIEKI RLKDHAL  +GSHSVQALTS+YLKKKVTER+RKISSSC
Subjt:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC

Query:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK
        TRKFTASTLFRNGIRN+NA+KLHEY+++ G+  +
Subjt:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK

XP_022970621.1 uncharacterized protein LOC111469552 isoform X2 [Cucurbita maxima]1.2e-10083.33Show/hide
Query:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN
        YSLF SRLRSRRFDDSTLRILE F ASKD     DVKS   ELLRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFN
Subjt:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN

Query:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC
        QP LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALSSLQQSDTANYTSHGS K  EVIEKI RLKDHAL  +GSHSVQALTS+YLKKKVTER+RKISSSC
Subjt:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC

Query:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPV
        TRKFTASTLFRNGIRN+NA+KLHEY+++ G+       P+
Subjt:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPV

XP_038895344.1 protein DOUBLE-STRAND BREAK FORMATION isoform X1 [Benincasa hispida]6.8e-10183.06Show/hide
Query:  MFCS----YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALN
        M CS    YSLF SRLRSRRFDDSTLRILE FPASKDA SL DVKS   E LRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALN
Subjt:  MFCS----YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALN

Query:  FRVLKSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTER
        FR+LKSFNQPWLQVSH EWLNFAEHSL AGFFSIAIKAYEQALSSLQQ+DT NYTSHGS K  EVIEKI RLKDHAL  +GSHSVQALTS+YL KKVTER
Subjt:  FRVLKSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTER

Query:  SRKISSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK
        + KISSSCTRK TASTLFRNG RN+NA+KLHEY+ + G+  +
Subjt:  SRKISSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK

TrEMBL top hitse value%identityAlignment
A0A0A0LSV1 Uncharacterized protein1.4e-11593.25Show/hide
Query:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
        MFCSYSLFLSRLRSRRFDDSTLRILE FPASKDATSL DV SSF E+LRFESLSIIRET+EKTDD KLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
Subjt:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL

Query:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI
        KSFNQPWLQVSH EWLNFAEHSLHAGFFSI+IKAYEQALSSLQQSDTANYTSHGSFK TEV+EKINRLKDHALNL+GSHSVQALTSDYLKKKVTER+RKI
Subjt:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI

Query:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQ
        SSSCTRKFTASTLF NGIRNYNARKLHEYRS  GVNQ
Subjt:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQ

A0A1S3CL48 uncharacterized protein LOC103502216 isoform X11.8e-12899.59Show/hide
Query:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
        MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL
Subjt:  MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVL

Query:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI
        KSFNQPWLQVSH EWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI
Subjt:  KSFNQPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKI

Query:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPVL
        SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPVL
Subjt:  SSSCTRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPVL

A0A6J1HPP0 uncharacterized protein LOC111464906 isoform X11.8e-9984.19Show/hide
Query:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN
        YSLF SRLRSRR DDSTLRILE F ASKD  SL DVKS   ELL FESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFN
Subjt:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN

Query:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC
        QP LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALSSLQQSDTANYTSHGS K  EVIEKI RLKDHAL  +GSHSVQALTS+YLKK+VTER+RKISSSC
Subjt:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC

Query:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK
        TRKFTASTLFRNGIRN+NA++LHEY+++ G+  +
Subjt:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK

A0A6J1I136 uncharacterized protein LOC111469552 isoform X25.6e-10183.33Show/hide
Query:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN
        YSLF SRLRSRRFDDSTLRILE F ASKD     DVKS   ELLRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFN
Subjt:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN

Query:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC
        QP LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALSSLQQSDTANYTSHGS K  EVIEKI RLKDHAL  +GSHSVQALTS+YLKKKVTER+RKISSSC
Subjt:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC

Query:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPV
        TRKFTASTLFRNGIRN+NA+KLHEY+++ G+       P+
Subjt:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQKVAQNPV

A0A6J1I645 uncharacterized protein LOC111469552 isoform X19.6e-10185.04Show/hide
Query:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN
        YSLF SRLRSRRFDDSTLRILE F ASKD     DVKS   ELLRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFN
Subjt:  YSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFN

Query:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC
        QP LQVSH EWLNFAEHSL+AGFFSIAIKAYEQALSSLQQSDTANYTSHGS K  EVIEKI RLKDHAL  +GSHSVQALTS+YLKKKVTER+RKISSSC
Subjt:  QPWLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSC

Query:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK
        TRKFTASTLFRNGIRN+NA+KLHEY+++ G+  +
Subjt:  TRKFTASTLFRNGIRNYNARKLHEYRSIGGVNQK

SwissProt top hitse value%identityAlignment
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION2.4e-3246.55Show/hide
Query:  LFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQP
        LF++R++ RRFD+ +LRILEL   + +  S  +V+S   + +R ES+ I  E   ++   KL V+EF  RAFAL+GD+ESCLA+RYEALN R LKS +  
Subjt:  LFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQP

Query:  WLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHS
        WL VSH EW  FA  S+  GF SIA KA E AL SL++       S  +    +  EK+ RL+D A +L+ SHS
Subjt:  WLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHS

Arabidopsis top hitse value%identityAlignment
AT1G07060.1 unknown protein1.7e-3346.55Show/hide
Query:  LFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQP
        LF++R++ RRFD+ +LRILEL   + +  S  +V+S   + +R ES+ I  E   ++   KL V+EF  RAFAL+GD+ESCLA+RYEALN R LKS +  
Subjt:  LFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQP

Query:  WLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHS
        WL VSH EW  FA  S+  GF SIA KA E AL SL++       S  +    +  EK+ RL+D A +L+ SHS
Subjt:  WLQVSHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTGTTCATACTCTCTCTTTCTTTCGCGCCTCAGAAGCCGAAGATTTGATGATTCTACTCTGCGAATTCTGGAATTATTTCCCGCTTCCAAAGACGCGACCTCGTT
GGCGGATGTTAAATCCAGCTTTTCTGAACTTCTCAGATTTGAATCTCTATCTATCATTCGTGAAACCGCTGAGAAAACTGATGATCAAAAGCTTCTAGTCATCGAATTTC
TTGTTCGAGCTTTTGCCCTCGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATACGAGGCCTTGAATTTTCGGGTACTGAAGTCTTTCAATCAACCATGGCTTCAAGTA
TCACACGAAGAATGGTTAAACTTCGCTGAGCATTCATTGCATGCTGGCTTTTTCTCAATTGCCATAAAGGCATATGAGCAAGCACTGTCAAGCCTTCAGCAGAGTGATAC
TGCAAACTACACATCACATGGTTCCTTTAAACACACAGAAGTCATCGAGAAGATAAATAGACTCAAAGATCATGCTCTGAATTTATCTGGTTCCCATTCTGTTCAAGCTC
TCACATCTGATTATTTGAAAAAGAAAGTAACTGAAAGGAGCAGAAAGATTTCTTCATCCTGCACAAGAAAGTTTACAGCGAGCACTCTATTCAGAAATGGTATCAGAAAC
TACAATGCAAGAAAGCTGCATGAATATCGAAGTATTGGTGGGGTTAACCAGAAAGTCGCACAAAATCCAGTTTTGTGA
mRNA sequenceShow/hide mRNA sequence
CTTATCTATATTATCTACTATTATAGTATTTTCAATAAGGTTACTCCCTACAATTGTATTCTATTGTGATGAACTCATTGTCATTTCAATACCAATTTCGGATTTAGAAG
AACAAAGCTTTCCTTTCCCACCGATTATTAATTGATGTTCTGTTCATACTCTCTCTTTCTTTCGCGCCTCAGAAGCCGAAGATTTGATGATTCTACTCTGCGAATTCTGG
AATTATTTCCCGCTTCCAAAGACGCGACCTCGTTGGCGGATGTTAAATCCAGCTTTTCTGAACTTCTCAGATTTGAATCTCTATCTATCATTCGTGAAACCGCTGAGAAA
ACTGATGATCAAAAGCTTCTAGTCATCGAATTTCTTGTTCGAGCTTTTGCCCTCGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATACGAGGCCTTGAATTTTCGGGT
ACTGAAGTCTTTCAATCAACCATGGCTTCAAGTATCACACGAAGAATGGTTAAACTTCGCTGAGCATTCATTGCATGCTGGCTTTTTCTCAATTGCCATAAAGGCATATG
AGCAAGCACTGTCAAGCCTTCAGCAGAGTGATACTGCAAACTACACATCACATGGTTCCTTTAAACACACAGAAGTCATCGAGAAGATAAATAGACTCAAAGATCATGCT
CTGAATTTATCTGGTTCCCATTCTGTTCAAGCTCTCACATCTGATTATTTGAAAAAGAAAGTAACTGAAAGGAGCAGAAAGATTTCTTCATCCTGCACAAGAAAGTTTAC
AGCGAGCACTCTATTCAGAAATGGTATCAGAAACTACAATGCAAGAAAGCTGCATGAATATCGAAGTATTGGTGGGGTTAACCAGAAAGTCGCACAAAATCCAGTTTTGT
GATCAGACCTACATGTAGTCTCTCTTCATACCCATCTGACTAGATAAATGTTCAGGGAGTAGATCGGGGAGACTTAGGGGCATAAGAAAAAGTGTTTCTACAAAATTCCA
GTTGATCTCAGTTCTGTGCAAGTCTAAGTTGCCTCTCCCCGTCTGACAGGTCTGTATTATCAATACTCCACCCCACCCCAAATTCTTCATTTCTTCCTTTTTCTGAGTTT
AATTTTTCAATACCATAAATCTTTCATGTGGTAAAATCCACTTGGAGTTCACATGGAAGCCTGGAAATTCTTCTTCATTCCCTCTTTGAGACAAGAGAATGACGTACTTA
TTTCATTACCTTTTCATGCCATAAATTGTTTGGTAACTGTTAGACCTCCAATTGTATTGCAATATAGTAGGAAAGGGACGAAAGGGAGTGCACGTGTGATAAAGAGTGCG
AGCACTAGGAATGGGACCACAGTTAGTTAGAGGAAAGAGTGGTTAAACAGAAGGGAGGCTGGAAAGGAGAGGGGAATCGTTTTGTGATTAGTTTTCTGCTTTCATCCTTG
AGAGATAGGAGAGGCAGGGGGAAATAGTTCTTTTGTTGTCCTTACGGTTCTTCTGCATTTCTATTTCCGAACTGATACTCTGTCAACTCGTATTTGTAATACCTGAGTTT
TATCAATCAAATAAAAGCA
Protein sequenceShow/hide protein sequence
MFCSYSLFLSRLRSRRFDDSTLRILELFPASKDATSLADVKSSFSELLRFESLSIIRETAEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQV
SHEEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDTANYTSHGSFKHTEVIEKINRLKDHALNLSGSHSVQALTSDYLKKKVTERSRKISSSCTRKFTASTLFRNGIRN
YNARKLHEYRSIGGVNQKVAQNPVL