; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020276 (gene) of Chayote v1 genome

Gene IDSed0020276
OrganismSechium edule (Chayote v1)
Descriptionprotein DOUBLE-STRAND BREAK FORMATION
Genome locationLG13:20235989..20241088
RNA-Seq ExpressionSed0020276
SyntenySed0020276
Gene Ontology termsGO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domainsIPR044969 - Protein DOUBLE-STRAND BREAK FORMATION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022964954.1 uncharacterized protein LOC111464906 isoform X1 [Cucurbita moschata]1.2e-9480.17Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RR DDSTLR+L F S SKD MSLMDVKS +K+LL  ESLSIIR++ EKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQALS LQQS +ANYT+H S K  EVI+KIKRLKD+ALKSAGSHSVQALTSEYLK++VTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAK+LHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

XP_022970619.1 uncharacterized protein LOC111469552 isoform X1 [Cucurbita maxima]1.7e-9681.43Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RRFDDSTLR+L F S SKD M  MDVKS +K+LLR ESLSIIR++ EKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQALS LQQS +ANYT+H S KR EVI+KIKRLKD+ALKSAGSHSVQALTSEYLK+KVTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAKKLHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

XP_022970621.1 uncharacterized protein LOC111469552 isoform X2 [Cucurbita maxima]1.7e-9681.43Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RRFDDSTLR+L F S SKD M  MDVKS +K+LLR ESLSIIR++ EKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQALS LQQS +ANYT+H S KR EVI+KIKRLKD+ALKSAGSHSVQALTSEYLK+KVTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAKKLHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

XP_023520164.1 uncharacterized protein LOC111783465 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-9580.59Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RRFDDSTLR+L F S SKD MSLMDVKS +K+LLR ESLSIIR++ +KT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIA+KAYEQALS LQQS +ANYT+H S K  EVI+KIKRLKD++LKSAGSHSVQALTSEYLK+KVTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAKKLHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

XP_023520165.1 uncharacterized protein LOC111783465 isoform X2 [Cucurbita pepo subsp. pepo]1.1e-9580.59Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RRFDDSTLR+L F S SKD MSLMDVKS +K+LLR ESLSIIR++ +KT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIA+KAYEQALS LQQS +ANYT+H S K  EVI+KIKRLKD++LKSAGSHSVQALTSEYLK+KVTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAKKLHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

TrEMBL top hitse value%identityAlignment
A0A1S3CL48 uncharacterized protein LOC103502216 isoform X14.4e-9077.73Show/hide
Query:  FSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALNFRDQNSFN
        +SLF+SRL++RRFDDSTLR+L     SKD  SL DVKSS  +LLR ESLSIIR++AEKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALNFR   SFN
Subjt:  FSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALNFRDQNSFN

Query:  QPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTERNRKTSSSC
        QPWLQVSHAEWLNFAEHSL AGFFSIAIKAYEQALS LQQS +ANYT+H SFK TEVI+KI RLKD+AL  +GSHSVQALTS+YLK+KVTER+RK SSSC
Subjt:  QPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTERNRKTSSSC

Query:  TSTQFTASTLFRNGIRNHNAKKLHEYQDL
        T  +FTASTLFRNGIRN+NA+KLHEY+ +
Subjt:  TSTQFTASTLFRNGIRNHNAKKLHEYQDL

A0A6J1HMC3 uncharacterized protein LOC111464906 isoform X26.0e-9580.17Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RR DDSTLR+L F S SKD MSLMDVKS +K+LL  ESLSIIR++ EKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQALS LQQS +ANYT+H S K  EVI+KIKRLKD+ALKSAGSHSVQALTSEYLK++VTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAK+LHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

A0A6J1HPP0 uncharacterized protein LOC111464906 isoform X16.0e-9580.17Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RR DDSTLR+L F S SKD MSLMDVKS +K+LL  ESLSIIR++ EKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQALS LQQS +ANYT+H S K  EVI+KIKRLKD+ALKSAGSHSVQALTSEYLK++VTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAK+LHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

A0A6J1I136 uncharacterized protein LOC111469552 isoform X28.3e-9781.43Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RRFDDSTLR+L F S SKD M  MDVKS +K+LLR ESLSIIR++ EKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQALS LQQS +ANYT+H S KR EVI+KIKRLKD+ALKSAGSHSVQALTSEYLK+KVTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAKKLHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

A0A6J1I645 uncharacterized protein LOC111469552 isoform X18.3e-9781.43Show/hide
Query:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN
        M  SVAEQ+SLF SRL++RRFDDSTLR+L F S SKD M  MDVKS +K+LLR ESLSIIR++ EKT+DQKL+V+EFLVRAFAL+GDIESCLALRYEALN
Subjt:  MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALN

Query:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER
        FR+  SFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQALS LQQS +ANYT+H S KR EVI+KIKRLKD+ALKSAGSHSVQALTSEYLK+KVTER
Subjt:  FRDQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTER

Query:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL
        NRK SSSCT  +FTASTLFRNGIRNHNAKKLHEYQ L
Subjt:  NRKTSSSCTSTQFTASTLFRNGIRNHNAKKLHEYQDL

SwissProt top hitse value%identityAlignment
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION1.3e-3344.51Show/hide
Query:  SSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALNFR
        S +A+Q  LF++R+++RRFD+ +LR+L    V+ +V S ++V+S L+D +RSES+ I  +   ++   KL VLEF  RAFAL+GD+ESCLA+RYEALN R
Subjt:  SSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALNFR

Query:  DQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHS
           S +  WL VSH+EW  FA  S++ GF SIA KA E AL  L++       + D+    +  +K++RL+D A     SHS
Subjt:  DQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHS

Arabidopsis top hitse value%identityAlignment
AT1G07060.1 unknown protein8.9e-3544.51Show/hide
Query:  SSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALNFR
        S +A+Q  LF++R+++RRFD+ +LR+L    V+ +V S ++V+S L+D +RSES+ I  +   ++   KL VLEF  RAFAL+GD+ESCLA+RYEALN R
Subjt:  SSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALNFR

Query:  DQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHS
           S +  WL VSH+EW  FA  S++ GF SIA KA E AL  L++       + D+    +  +K++RL+D A     SHS
Subjt:  DQNSFNQPWLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTTCAGTTGCAGAGCAATTTTCTCTCTTCGTCTCACGGCTCAAGAACCGAAGATTCGATGATTCTACATTGCGACTTCTGGTATTTCTTTCCGTTTCGAAAGA
CGTGATGTCGTTGATGGATGTCAAATCCAGCTTGAAAGATCTGCTCAGATCTGAATCTCTATCTATTATTCGTGATTCCGCTGAGAAAACTGAAGATCAGAAGCTTATAG
TTCTTGAATTTCTTGTTCGAGCTTTCGCCTTAATCGGAGACATCGAGAGTTGCTTAGCTTTGAGATATGAGGCCTTGAATTTTCGGGACCAGAACTCTTTTAATCAGCCA
TGGCTTCAAGTTTCCCACGCAGAATGGTTAAACTTCGCCGAGCATTCATTGGACGCTGGCTTTTTTTCAATTGCAATTAAGGCATATGAGCAGGCACTTTCACGACTTCA
GCAGAGTGGTTCTGCAAACTACACAGCACATGATTCCTTTAAACGCACGGAAGTTATTGATAAGATAAAGAGACTCAAAGATTATGCTCTGAAATCCGCTGGTTCCCATT
CTGTTCAGGCTCTCACATCTGAGTATTTGAAAAGGAAAGTAACTGAAAGGAATAGAAAGACTTCTTCATCCTGCACAAGTACTCAGTTTACAGCAAGTACTCTATTCAGA
AATGGTATCAGAAACCATAATGCGAAAAAGCTGCATGAATACCAGGATCTGTCTGCTTCCATCAGGCAATAG
mRNA sequenceShow/hide mRNA sequence
GCTGGCTGTGCCCGTTTCTAAAAAAAAGAAAAGAAAGAAGCTAATTCGAACGAAATTAAGAATTATTTAAAAACACTTGGAAACGATTTCATATAAAAGGCTAAAACTAG
TTTCAGATTGCGAAGAACCATTCTTTCCCGCCAATTAATTGATGTCTTCTTCAGTTGCAGAGCAATTTTCTCTCTTCGTCTCACGGCTCAAGAACCGAAGATTCGATGAT
TCTACATTGCGACTTCTGGTATTTCTTTCCGTTTCGAAAGACGTGATGTCGTTGATGGATGTCAAATCCAGCTTGAAAGATCTGCTCAGATCTGAATCTCTATCTATTAT
TCGTGATTCCGCTGAGAAAACTGAAGATCAGAAGCTTATAGTTCTTGAATTTCTTGTTCGAGCTTTCGCCTTAATCGGAGACATCGAGAGTTGCTTAGCTTTGAGATATG
AGGCCTTGAATTTTCGGGACCAGAACTCTTTTAATCAGCCATGGCTTCAAGTTTCCCACGCAGAATGGTTAAACTTCGCCGAGCATTCATTGGACGCTGGCTTTTTTTCA
ATTGCAATTAAGGCATATGAGCAGGCACTTTCACGACTTCAGCAGAGTGGTTCTGCAAACTACACAGCACATGATTCCTTTAAACGCACGGAAGTTATTGATAAGATAAA
GAGACTCAAAGATTATGCTCTGAAATCCGCTGGTTCCCATTCTGTTCAGGCTCTCACATCTGAGTATTTGAAAAGGAAAGTAACTGAAAGGAATAGAAAGACTTCTTCAT
CCTGCACAAGTACTCAGTTTACAGCAAGTACTCTATTCAGAAATGGTATCAGAAACCATAATGCGAAAAAGCTGCATGAATACCAGGATCTGTCTGCTTCCATCAGGCAA
TAGGTTACTGGACATTTTATGCTGAACCTCAAGTTAGCTGCTGCAAACTTATACTTTTATTGCTGGGTGGTGATCGTAGAAGCTTGCTGGCAACATGGAAAATGCTTTCA
ATGAAATTCCAGTTTCACTCTATTGGTATGCAAGTCTTAATCTATTGGTATGCAAGTCTTAAGTCGTCCACTCCAGTATTTTAAAAAACTCAAGGCGCACTAAGGTGCAT
CGGCCCTTTAGAGCCTGAGGCACAAGATGCACCAAAAGGCCCGAGCTTTTTACTAAAGGCGCATACATGTATACATACACAAAACATATATATTAATCTCTTTCAATATA
GAGATTTTACATAGTTATCTCA
Protein sequenceShow/hide protein sequence
MSSSVAEQFSLFVSRLKNRRFDDSTLRLLVFLSVSKDVMSLMDVKSSLKDLLRSESLSIIRDSAEKTEDQKLIVLEFLVRAFALIGDIESCLALRYEALNFRDQNSFNQP
WLQVSHAEWLNFAEHSLDAGFFSIAIKAYEQALSRLQQSGSANYTAHDSFKRTEVIDKIKRLKDYALKSAGSHSVQALTSEYLKRKVTERNRKTSSSCTSTQFTASTLFR
NGIRNHNAKKLHEYQDLSASIRQ