; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006457 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006457
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein DOUBLE-STRAND BREAK FORMATION
Genome locationscaffold761:31698..34934
RNA-Seq ExpressionMS006457
SyntenyMS006457
Gene Ontology termsGO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domainsIPR044969 - Protein DOUBLE-STRAND BREAK FORMATION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583341.1 Protein DOUBLE-STRAND BREAK FORMATION, partial [Cucurbita argyrosperma subsp. sororia]1.5e-8772.94Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLH--------NYL---CLALRYEAL
        RRLDDSTL+ILEF S SKD  SL++ KS +KELLRFESLSIIRETVEKTDDQKLLV+EFLVRAFALVGD EA+ ++         N+L   CLALRYEAL
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLH--------NYL---CLALRYEAL

Query:  SFREMKSSNQKWLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTE
        +FRE+KS NQ  LQVSH EWLNFAEHS+++GF SIAIKAYE ALS LQQSDT N TSH SSKC EVIEKI RLKDHALKSA SHSVQALTSEYLKKKVTE
Subjt:  SFREMKSSNQKWLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTE

Query:  RNRKDSSFCTRTPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY
        RNRK SS CTR  FTASTLFR+GIRNHNA++L EYQ L G  S +    + D  Y
Subjt:  RNRKDSSFCTRTPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY

KAG7019109.1 hypothetical protein SDJN02_18067, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-8673.41Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RRLDDSTL+ILEF S SKD  SL++ KS +KELLRFESLSIIRETVEKTDDQKLLV+EFLVRAFALVGD EA+          LRYEAL+FRE+KS NQ 
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
         LQVSH EWLNFAEHS+++GF SIAIKAYE ALS LQQSDT N TSH SSKC EVIEKI RLKDHALKSA SHSVQALTSEYLKKKVTERNRK SS CTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKYSLPSHPSD
          FTASTLFR+GIRNHNA++L EYQ L GF        +  P YSL ++P D
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKYSLPSHPSD

XP_022154860.1 uncharacterized protein LOC111022017 isoform X1 [Momordica charantia]6.7e-11294.44Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTE+        CLALRYEALSFREMKSSNQK
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
        WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHS SKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRT
        TPFTASTLFRSGIRNHNARKLQEYQGLP FPS +
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRT

XP_022964954.1 uncharacterized protein LOC111464906 isoform X1 [Cucurbita moschata]2.2e-8674.18Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RRLDDSTL+ILEF S SKD  SL++ KS +KELL FESLSIIRETVEKTDDQKLLV+EFLVRAFALVGD E+        CLALRYEAL+FRE+KS NQ 
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
         LQVSH EWLNFAEHS+++GF SIAIKAYE ALS LQQSDT N TSH SSKC EVIEKI RLKDHALKSA SHSVQALTSEYLKK+VTERNRK SS CTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY
          FTASTLFR+GIRNHNA++L EYQ L G  S +    + D  Y
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY

XP_023520165.1 uncharacterized protein LOC111783465 isoform X2 [Cucurbita pepo subsp. pepo]2.8e-8673.77Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RR DDSTL+ILEF S SKD  SL++ KS +KELLRFESLSIIRETV+KTDDQKLLV+EFLVRAFALVGD E+        CLALRYEAL+FRE+KS NQ 
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
         LQVSH EWLNFAEHS+++GF SIA+KAYE ALS LQQSDT N TSH SSKC EVIEKI RLKDH+LKSA SHSVQALTSEYLKKKVTERNRK SS CTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY
          FTASTLFR+GIRNHNA+KL EYQ L G  S +    + D  Y
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY

TrEMBL top hitse value%identityAlignment
A0A6J1DKU7 uncharacterized protein LOC111022017 isoform X13.3e-11294.44Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTE+        CLALRYEALSFREMKSSNQK
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
        WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHS SKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRT
        TPFTASTLFRSGIRNHNARKLQEYQGLP FPS +
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRT

A0A6J1HMC3 uncharacterized protein LOC111464906 isoform X26.8e-8675.63Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RRLDDSTL+ILEF S SKD  SL++ KS +KELL FESLSIIRETVEKTDDQKLLV+EFLVRAFALVGD E+        CLALRYEAL+FRE+KS NQ 
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
         LQVSH EWLNFAEHS+++GF SIAIKAYE ALS LQQSDT N TSH SSKC EVIEKI RLKDHALKSA SHSVQALTSEYLKK+VTERNRK SS CTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSL
          FTASTLFR+GIRNHNA++L EYQ L G  S  +  L
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSL

A0A6J1HPP0 uncharacterized protein LOC111464906 isoform X11.1e-8674.18Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RRLDDSTL+ILEF S SKD  SL++ KS +KELL FESLSIIRETVEKTDDQKLLV+EFLVRAFALVGD E+        CLALRYEAL+FRE+KS NQ 
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
         LQVSH EWLNFAEHS+++GF SIAIKAYE ALS LQQSDT N TSH SSKC EVIEKI RLKDHALKSA SHSVQALTSEYLKK+VTERNRK SS CTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY
          FTASTLFR+GIRNHNA++L EYQ L G  S +    + D  Y
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY

A0A6J1I136 uncharacterized protein LOC111469552 isoform X24.9e-8475.74Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RR DDSTL+ILEF S SKD    ++ KS +KELLRFESLSIIRETVEKTDDQKLLV+EFLVRAFALVGD E+        CLALRYEAL+FRE+KS NQ 
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
         LQVSH EWLNFAEHS+++GF SIAIKAYE ALS LQQSDT N TSH SSK  EVIEKI RLKDHALKSA SHSVQALTSEYLKKKVTERNRK SS CTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTI
          FTASTLFR+GIRNHNA+KL EYQ L G  S  +
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTI

A0A6J1I645 uncharacterized protein LOC111469552 isoform X14.4e-8573.77Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RR DDSTL+ILEF S SKD    ++ KS +KELLRFESLSIIRETVEKTDDQKLLV+EFLVRAFALVGD E+        CLALRYEAL+FRE+KS NQ 
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR
         LQVSH EWLNFAEHS+++GF SIAIKAYE ALS LQQSDT N TSH SSK  EVIEKI RLKDHALKSA SHSVQALTSEYLKKKVTERNRK SS CTR
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTR

Query:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY
          FTASTLFR+GIRNHNA+KL EYQ L G  S +    + D  Y
Subjt:  TPFTASTLFRSGIRNHNARKLQEYQGLPGFPSRTISSLVTDPKY

SwissProt top hitse value%identityAlignment
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION5.7e-2944.83Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RR D+ +L+ILE   V+ +VKS +E +SRL++ +R ES+ I  E   ++   KL VLEF  RAFAL+GD E+        CLA+RYEAL+ R++KS +  
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHS
        WL VSH EW  FA  SM +GF SIA KA E AL  L++       S  +S  ++  EK+ RL+D A    +SHS
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHS

Arabidopsis top hitse value%identityAlignment
AT1G07060.1 unknown protein4.0e-3044.83Show/hide
Query:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK
        RR D+ +L+ILE   V+ +VKS +E +SRL++ +R ES+ I  E   ++   KL VLEF  RAFAL+GD E+        CLA+RYEAL+ R++KS +  
Subjt:  RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQK

Query:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHS
        WL VSH EW  FA  SM +GF SIA KA E AL  L++       S  +S  ++  EK+ RL+D A    +SHS
Subjt:  WLQVSHVEWLNFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CGCAGACTCGATGATTCTACTTTGCAAATTCTGGAATTTGTTTCCGTTTCCAAAGACGTGAAGTCGTTGATCGAAGCCAAATCCAGATTAAAAGAACTACTCAGATTTGA
ATCTCTATCTATCATTCGCGAAACCGTCGAAAAAACTGACGATCAAAAGCTTCTAGTCCTCGAATTTCTGGTTCGAGCTTTCGCTCTTGTTGGAGACACTGAGGCGAGTA
TTCTTTTGCATAATTATCTTTGCCTAGCTTTGAGATATGAGGCCTTGAGTTTTCGGGAAATGAAGTCTTCTAATCAGAAATGGCTTCAAGTTTCACACGTGGAATGGTTA
AACTTCGCTGAGCATTCAATGCATTCTGGCTTTATTTCTATTGCCATTAAGGCATATGAGCTTGCGCTGTCACGCCTTCAGCAGAGTGATACTGAAAACTGCACATCACA
CAGTTCGTCTAAATGCGTGGAAGTTATCGAAAAGATAAATAGACTCAAAGATCATGCTCTGAAATCAGCTGCTTCCCATTCTGTTCAGGCTCTCACATCTGAGTATTTGA
AAAAGAAAGTAACTGAGAGGAACAGAAAGGATTCTTCATTCTGCACAAGAACTCCGTTTACAGCAAGCACTCTATTCAGAAGTGGCATCAGAAACCATAATGCAAGAAAG
CTGCAAGAATATCAGGGTTTGCCGGGGTTTCCTAGTCGTACAATCTCCAGTTTGGTGACCGATCCTAAATATAGTCTCCCTTCACACCCATCAGAC
mRNA sequenceShow/hide mRNA sequence
CGCAGACTCGATGATTCTACTTTGCAAATTCTGGAATTTGTTTCCGTTTCCAAAGACGTGAAGTCGTTGATCGAAGCCAAATCCAGATTAAAAGAACTACTCAGATTTGA
ATCTCTATCTATCATTCGCGAAACCGTCGAAAAAACTGACGATCAAAAGCTTCTAGTCCTCGAATTTCTGGTTCGAGCTTTCGCTCTTGTTGGAGACACTGAGGCGAGTA
TTCTTTTGCATAATTATCTTTGCCTAGCTTTGAGATATGAGGCCTTGAGTTTTCGGGAAATGAAGTCTTCTAATCAGAAATGGCTTCAAGTTTCACACGTGGAATGGTTA
AACTTCGCTGAGCATTCAATGCATTCTGGCTTTATTTCTATTGCCATTAAGGCATATGAGCTTGCGCTGTCACGCCTTCAGCAGAGTGATACTGAAAACTGCACATCACA
CAGTTCGTCTAAATGCGTGGAAGTTATCGAAAAGATAAATAGACTCAAAGATCATGCTCTGAAATCAGCTGCTTCCCATTCTGTTCAGGCTCTCACATCTGAGTATTTGA
AAAAGAAAGTAACTGAGAGGAACAGAAAGGATTCTTCATTCTGCACAAGAACTCCGTTTACAGCAAGCACTCTATTCAGAAGTGGCATCAGAAACCATAATGCAAGAAAG
CTGCAAGAATATCAGGGTTTGCCGGGGTTTCCTAGTCGTACAATCTCCAGTTTGGTGACCGATCCTAAATATAGTCTCCCTTCACACCCATCAGAC
Protein sequenceShow/hide protein sequence
RRLDDSTLQILEFVSVSKDVKSLIEAKSRLKELLRFESLSIIRETVEKTDDQKLLVLEFLVRAFALVGDTEASILLHNYLCLALRYEALSFREMKSSNQKWLQVSHVEWL
NFAEHSMHSGFISIAIKAYELALSRLQQSDTENCTSHSSSKCVEVIEKINRLKDHALKSAASHSVQALTSEYLKKKVTERNRKDSSFCTRTPFTASTLFRSGIRNHNARK
LQEYQGLPGFPSRTISSLVTDPKYSLPSHPSD