CuGenDBv2

Carg07022 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg07022
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Structure-specific endonuclease subunit SLX1 homolog
Genome location: Carg_Chr01:3871128..3872518
RNA-Seq Expression: Carg07022
Synteny: Carg07022
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0006310 - DNA recombination (biological process)
GO:0031323 - regulation of cellular metabolic process (biological process)
GO:0051171 - regulation of nitrogen compound metabolic process (biological process)
GO:0060255 - regulation of macromolecule metabolic process (biological process)
GO:0080090 - regulation of primary metabolic process (biological process)
GO:0031981 - nuclear lumen (cellular component)
GO:0140513 - nuclear protein-containing complex (cellular component)
GO:0004520 - endodeoxyribonuclease activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0060004.1 Structure-specific endonuclease subunit SLX1 [Cucumis melo var. makuwa] | e value: 1.0e-63 | % identity: 59.2
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY
        MVLCIYGFPT VSALQFEWAWQHPNESLAVRS AATFKSLSGVAN                          K+MKNAA CPSLPEH KVQVS I+E PCY
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY

Query:  SEGDQGEHEN-------KELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNE-LHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E
        SEGDQG  EN       +E EE+C FQVYGSMK    E+P+K MDYQTGTDG P   L GC+KELE N++V P SCTP Y D  MSYDL  C +     E
Subjt:  SEGDQGEHEN-------KELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNE-LHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E

Query:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS
         E+  C  S  VAG S  E+II+DGEE++ EG+GM LQ+Q    R+NLTS
Subjt:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS

KAG7037034.1 hypothetical protein SDJN02_00655, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.0e-124 | % identity: 100
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQGEHENKELEEMCHFQVYGSMK
        MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQGEHENKELEEMCHFQVYGSMK
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQGEHENKELEEMCHFQVYGSMK

Query:  VEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPD
        VEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPD
Subjt:  VEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPD

Query:  RRRNLTSSTEMTNSPTFIQL
        RRRNLTSSTEMTNSPTFIQL
Subjt:  RRRNLTSSTEMTNSPTFIQL

XP_022997730.1 uncharacterized protein LOC111492603 isoform X1 [Cucurbita maxima] | e value: 2.2e-90 | % identity: 77.09
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG
        MVLCIYGFPT VSALQF WAWQHP+ESLAVRS AA                       +++ ++ KYMKN ASC SLPEH KVQVSLINE PCYSEGDQG
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG

Query:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE
        E ENKELEEM HFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRV PRSCTPYND DMSYDLHGCGKELESPPCSPSYIVA MSGAEM I 
Subjt:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE

Query:  DGEEDELEGNGMKLQQQQPDRRRNLTS
        + EED LEG GM LQQQQP R+RNLTS
Subjt:  DGEEDELEGNGMKLQQQQPDRRRNLTS

XP_022997731.1 structure-specific endonuclease subunit slx1 isoform X2 [Cucurbita maxima] | e value: 9.2e-65 | % identity: 63.44
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG
        MVLCIYGFPT VSALQF WAWQHP+ESLAVRS AA                       +++ ++ KYMKN ASC SLPEH KVQVSLINE PCYSEGDQG
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG

Query:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE
        E ENKELEEM HFQVYGSMKVEIPRKSMDYQT                                 DMSYDLHGCGKELESPPCSPSYIVA MSGAEM I 
Subjt:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE

Query:  DGEEDELEGNGMKLQQQQPDRRRNLTS
        + EED LEG GM LQQQQP R+RNLTS
Subjt:  DGEEDELEGNGMKLQQQQPDRRRNLTS

XP_038894793.1 structure-specific endonuclease subunit SLX1 [Benincasa hispida] | e value: 9.8e-67 | % identity: 59.44
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY
        MVLCIYGFPT VSALQFEWAWQHPNESLAVRS AATFKSLSGVAN                          KYMKNAA CPSLP+H KVQVS INE PCY
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY

Query:  SEGDQG--EHE-----NKELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGKELES---
        SEGDQ   E+E     N+E EE+C F+VYGSMK    E+P+K MD+Q  TDG P ELHGC+KE E N++V P SC P Y DA MSYDLHGC +ELE    
Subjt:  SEGDQG--EHE-----NKELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGKELES---

Query:  --PPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS
            C+PS IV  +S  E+II+DG+ED++EG GM L+QQ    R+NL+S
Subjt:  --PPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M0Y6 Structure-specific endonuclease subunit SLX1 homolog | e value: 1.2e-62 | % identity: 58.8
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY
        MVLCIYGFPT VSALQFEWAWQHPNESLAVRS AATFKSLSGVAN                          K+MKNAA CPSLPEH KVQVS INE PCY
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY

Query:  SEGDQ------GEHE-NKELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDG-IPNELHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E
        SEGDQ      G+ E N+E EE+C F+VYGSMK    E+P+K MDYQTGTDG  P+ L GC+KELE N++V P SCTP Y D  MSYDL  C +     E
Subjt:  SEGDQ------GEHE-NKELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDG-IPNELHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E

Query:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS
         E+  C  S IVAG S  E++I+D EE++LEG+ M LQ+Q    R NLTS
Subjt:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS

A0A1S3C574 Structure-specific endonuclease subunit SLX1 homolog | e value: 4.9e-64 | % identity: 59.2
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY
        MVLCIYGFPT VSALQFEWAWQHPNESLAVRS AATFKSLSGVAN                          K+MKNAA CPSLPEH KVQVS I+E PCY
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY

Query:  SEGDQGEHEN-------KELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNE-LHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E
        SEGDQG  EN       +E EE+C FQVYGSMK    E+P+K MDYQTGTDG P   L GC+KELE N++V P SCTP Y D  MSYDL  C +     E
Subjt:  SEGDQGEHEN-------KELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNE-LHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E

Query:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS
         E+  C  S  VAG S  E+II+DGEE++ EG+GM LQ+Q    R+NLTS
Subjt:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS

A0A5D3BBR0 Structure-specific endonuclease subunit SLX1 homolog | e value: 4.9e-64 | % identity: 59.2
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY
        MVLCIYGFPT VSALQFEWAWQHPNESLAVRS AATFKSLSGVAN                          K+MKNAA CPSLPEH KVQVS I+E PCY
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVAN--------------------------KYMKNAASCPSLPEHFKVQVSLINEPPCY

Query:  SEGDQGEHEN-------KELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNE-LHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E
        SEGDQG  EN       +E EE+C FQVYGSMK    E+P+K MDYQTGTDG P   L GC+KELE N++V P SCTP Y D  MSYDL  C +     E
Subjt:  SEGDQGEHEN-------KELEEMCHFQVYGSMKV---EIPRKSMDYQTGTDGIPNE-LHGCEKELEGNKRVSPRSCTP-YNDADMSYDLHGCGK-----E

Query:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS
         E+  C  S  VAG S  E+II+DGEE++ EG+GM LQ+Q    R+NLTS
Subjt:  LESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTS

A0A6J1K8A7 structure-specific endonuclease subunit slx1 isoform X2 | e value: 4.4e-65 | % identity: 63.44
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG
        MVLCIYGFPT VSALQF WAWQHP+ESLAVRS AA                       +++ ++ KYMKN ASC SLPEH KVQVSLINE PCYSEGDQG
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG

Query:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE
        E ENKELEEM HFQVYGSMKVEIPRKSMDYQT                                 DMSYDLHGCGKELESPPCSPSYIVA MSGAEM I 
Subjt:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE

Query:  DGEEDELEGNGMKLQQQQPDRRRNLTS
        + EED LEG GM LQQQQP R+RNLTS
Subjt:  DGEEDELEGNGMKLQQQQPDRRRNLTS

A0A6J1KAR6 uncharacterized protein LOC111492603 isoform X1 | e value: 1.1e-90 | % identity: 77.09
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG
        MVLCIYGFPT VSALQF WAWQHP+ESLAVRS AA                       +++ ++ KYMKN ASC SLPEH KVQVSLINE PCYSEGDQG
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFK--------------------SLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQG

Query:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE
        E ENKELEEM HFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRV PRSCTPYND DMSYDLHGCGKELESPPCSPSYIVA MSGAEM I 
Subjt:  EHENKELEEMCHFQVYGSMKVEIPRKSMDYQTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIE

Query:  DGEEDELEGNGMKLQQQQPDRRRNLTS
        + EED LEG GM LQQQQP R+RNLTS
Subjt:  DGEEDELEGNGMKLQQQQPDRRRNLTS

SwissProt top hits (e value, % identity, alignment)
Q0IH86 Structure-specific endonuclease subunit slx1 | e value: 7.2e-04 | % identity: 31.4
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSG-----VANKYMKNAASCPSLP-------EHFKVQVSLINEPPCY
        MVL ++GFP  ++AL+FEWAWQHP+ S  +  V    K  S      +   +M   A    LP       + ++ ++ L+ +PP +
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSG-----VANKYMKNAASCPSLP-------EHFKVQVSLINEPPCY

Q32PI0 Structure-specific endonuclease subunit SLX1 | e value: 2.5e-04 | % identity: 36.11
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVANKYMKNAASCPSLPEHFKVQVSLINEPP
        MVL ++GFP+ V+AL+FEWAWQHP  S   R +    + L G A           +   H +V   ++  PP
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVANKYMKNAASCPSLPEHFKVQVSLINEPP

Q5PQP5 Structure-specific endonuclease subunit SLX1 | e value: 1.5e-04 | % identity: 51.28
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKS
        MVL ++GFP+ V+AL+FEWAWQHP  S  +  V    +S
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKS

Q8BX32 Structure-specific endonuclease subunit SLX1 | e value: 6.5e-05 | % identity: 50
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVA
        MVL I+GFP+ V+AL+FEWAWQHP  S  +  V    +S +  A
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVA

Q9BQ83 Structure-specific endonuclease subunit SLX1 | e value: 2.5e-04 | % identity: 45.45
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVA
        MVL ++GFP+ V+AL+FEWAWQHP+ S  +  V    +  +  A
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVA

Arabidopsis top hits (e value, % identity, alignment)
AT2G30350.1 Excinuclease ABC, C subunit, N-terminal | e value: 1.8e-10 | % identity: 37.86
Query:  LQFEWAWQHPNESLAVRSVAATFKSLSGVA--------------------------NKYMKNAASCPSLPEHFKVQVSLINEPPCYSE-GDQGEHENKEL
        LQFEWAWQHP ES+AVR  AA FKS SGVA                          +KY  +    PSLP H KVQV  + +   +++  D  + E++E 
Subjt:  LQFEWAWQHPNESLAVRSVAATFKSLSGVA--------------------------NKYMKNAASCPSLPEHFKVQVSLINEPPCYSE-GDQGEHENKEL

Query:  EEM
         E+
Subjt:  EEM

AT2G30350.2 Excinuclease ABC, C subunit, N-terminal | e value: 5.3e-18 | % identity: 44.44
Query:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVA--------------------------NKYMKNAASCPSLPEHFKVQVSLINEPPCY
        MVLCIYGFPT VSALQFEWAWQHP ES+AVR  AA FKS SGVA                          +KY  +    PSLP H KVQV  + +   +
Subjt:  MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVA--------------------------NKYMKNAASCPSLPEHFKVQVSLINEPPCY

Query:  SE-GDQGEHENKELEEM
        ++  D  + E++E  E+
Subjt:  SE-GDQGEHENKELEEM
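
The % identity values listed with each hit can be checked against the alignment rows. The sketch below assumes the page follows the usual BLAST convention (identical positions divided by alignment length, gaps included) and that letters on the middle row mark identical residues while `+` marks a conservative substitution. The function name `percent_identity` is hypothetical; the two rows are copied from the Q8BX32 hit above.

```python
def percent_identity(query_row: str, match_row: str) -> float:
    """Identical positions divided by alignment length, as a percentage.

    Assumption: letters on the match row are identities; '+' and spaces are not.
    The alignment length is taken from the query row (gap characters included).
    """
    identical = sum(ch.isalpha() for ch in match_row)
    return round(100.0 * identical / len(query_row), 2)


# Rows from the Q8BX32 SwissProt hit (44 aligned positions, 22 identical).
query_row = "MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVA"
match_row = "MVL I+GFP+ V+AL+FEWAWQHP  S  +  V    +S +  A"

print(percent_identity(query_row, match_row))  # 50.0
```

Because only the letters on the middle row are counted, the result does not depend on how whitespace in that row survived copying.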


Sequences
CDS sequence
ATGGTGTTGTGCATCTATGGCTTTCCCACTCTAGTCTCTGCTCTTCAGTTTGAATGGGCCTGGCAGCATCCGAATGAGTCGTTGGCTGTAAGAAGTGTTGCTGCAACTTT
CAAATCTCTCTCTGGCGTTGCCAACAAATACATGAAGAACGCGGCCAGTTGCCCGAGTTTGCCCGAGCATTTTAAGGTCCAAGTTTCTCTCATCAATGAGCCTCCATGCT
ATTCTGAAGGAGATCAAGGTGAGCATGAAAACAAAGAACTTGAAGAAATGTGTCATTTCCAAGTATATGGATCGATGAAAGTCGAAATTCCTCGAAAGTCGATGGATTAC
CAAACAGGTACAGATGGAATACCTAATGAACTGCATGGATGCGAAAAAGAACTCGAGGGCAACAAACGAGTGTCGCCTCGTTCATGTACTCCTTATAATGATGCAGATAT
GTCTTATGACCTACATGGATGTGGTAAAGAACTCGAGTCTCCTCCGTGTTCCCCATCTTATATCGTCGCAGGTATGTCCGGGGCCGAAATGATCATTGAAGACGGAGAGG
AAGACGAACTAGAAGGGAATGGTATGAAGTTGCAGCAACAACAACCTGATAGAAGGAGGAATTTAACATCATCTACTGAAATGACCAATTCTCCTACGTTTATCCAATTG
TAG
mRNA sequence
ATGGTGTTGTGCATCTATGGCTTTCCCACTCTAGTCTCTGCTCTTCAGTTTGAATGGGCCTGGCAGCATCCGAATGAGTCGTTGGCTGTAAGAAGTGTTGCTGCAACTTT
CAAATCTCTCTCTGGCGTTGCCAACAAATACATGAAGAACGCGGCCAGTTGCCCGAGTTTGCCCGAGCATTTTAAGGTCCAAGTTTCTCTCATCAATGAGCCTCCATGCT
ATTCTGAAGGAGATCAAGGTGAGCATGAAAACAAAGAACTTGAAGAAATGTGTCATTTCCAAGTATATGGATCGATGAAAGTCGAAATTCCTCGAAAGTCGATGGATTAC
CAAACAGGTACAGATGGAATACCTAATGAACTGCATGGATGCGAAAAAGAACTCGAGGGCAACAAACGAGTGTCGCCTCGTTCATGTACTCCTTATAATGATGCAGATAT
GTCTTATGACCTACATGGATGTGGTAAAGAACTCGAGTCTCCTCCGTGTTCCCCATCTTATATCGTCGCAGGTATGTCCGGGGCCGAAATGATCATTGAAGACGGAGAGG
AAGACGAACTAGAAGGGAATGGTATGAAGTTGCAGCAACAACAACCTGATAGAAGGAGGAATTTAACATCATCTACTGAAATGACCAATTCTCCTACGTTTATCCAATTG
TAG
Protein sequence
MVLCIYGFPTLVSALQFEWAWQHPNESLAVRSVAATFKSLSGVANKYMKNAASCPSLPEHFKVQVSLINEPPCYSEGDQGEHENKELEEMCHFQVYGSMKVEIPRKSMDY
QTGTDGIPNELHGCEKELEGNKRVSPRSCTPYNDADMSYDLHGCGKELESPPCSPSYIVAGMSGAEMIIEDGEEDELEGNGMKLQQQQPDRRRNLTSSTEMTNSPTFIQL
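
The protein sequence should be recoverable from the CDS by translation. The sketch below assumes the standard genetic code (nuclear genes) and uses a hypothetical helper, `translate`, rather than any tool the database itself runs. The two inputs are the first 30 and last 15 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGTGTTGTGCATCTATGGCTTTCCCACT"))  # MVLCIYGFPT
print(translate("TTTATCCAATTGTAG"))                 # FIQL
```

Both outputs match the start (`MVLCIYGFPT`) and end (`FIQL`) of the protein sequence listed above. Joining the CDS lines into one string and passing it to `translate` would check the full length the same way.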