; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033581 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033581
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein MICRORCHIDIA 7
Genome locationchr3:387721..393890
RNA-Seq ExpressionLag0033581
SyntenyLag0033581
Gene Ontology termsGO:0002833 - positive regulation of response to biotic stimulus (biological process)
GO:0031349 - positive regulation of defense response (biological process)
GO:0032103 - positive regulation of response to external stimulus (biological process)
GO:0050778 - positive regulation of immune response (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR041006 - Morc, S5 domain 2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587399.1 Protein MICRORCHIDIA 4, partial [Cucurbita argyrosperma subsp. sororia]2.3e-13682.52Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    SGK PDKVYSGK+SEK QKTKDCRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS+TM+A+S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQQ+KEEN ELKERLQRKEAD NELQH RDRCKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

KAG7021383.1 Protein MICRORCHIDIA 4 [Cucurbita argyrosperma subsp. argyrosperma]2.3e-13682.52Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    SGK PDKVYSGK+SEK QKTKDCRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS+TM+A+S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQQ+KEEN ELKERLQRKEAD NELQH RDRCKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

XP_023004868.1 protein MICRORCHIDIA 7-like isoform X1 [Cucurbita maxima]4.3e-13581.9Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    SGK PDKVYSGK+SEK QKTK+CRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS TM+A+S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQ++KEEN ELKERLQRKEAD NELQH RDRCKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

XP_023531889.1 protein MICRORCHIDIA 7-like [Cucurbita pepo subsp. pepo]2.5e-13581.9Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    SGK PDKVYSGK+SEK+QKTKDCRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS+TM+ +S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQQ+K EN ELKERLQRKEAD NELQH RDRCKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

XP_038879188.1 protein MICRORCHIDIA 7 [Benincasa hispida]1.1e-13583.03Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYW +YCHKIGYAPRR NK TN+SPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQS-RKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMT--SEKSRIRP-SSEPASPS-LEVRVDNLHGGQANGTGNETFHEKES
        SSQPSPQS +K TTLSGKKPDKVYSGKE+EK QKTKD RY + HSS D+NSSMT   EKSR+RP SSEP SPS LEVRVDNLHGGQANGT NETF     
Subjt:  SSQPSPQS-RKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMT--SEKSRIRP-SSEPASPS-LEVRVDNLHGGQANGTGNETFHEKES

Query:  HGNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEEL
        HGNDVSMTMKA+S+GGVS+A++ GLGRR  QLKGGDVN SERSLSSSDF MLQQLKEENEELKERLQRKEADH +L+HER+RCKSLEAQLKAAELKIEEL
Subjt:  HGNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEEL

Query:  NKEQESLIDIFSEERDRRETEEKNLRKKLQ
        NKEQESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  NKEQESLIDIFSEERDRRETEEKNLRKKLQ

TrEMBL top hitse value%identityAlignment
A0A6J1C072 protein MICRORCHIDIA 72.1e-13581.71Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNA+GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYW +YCHKIGYAPRR NK TN SPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSR-KYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIR-PSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHG
        SSQPSPQSR K +T SGKKPDKVYSGKESEK QKTKD RYVNGHSS DKNSSM  +KS +R  SSE  SPS LEV+VDNLH  QANGTG++TF++KESHG
Subjt:  SSQPSPQSR-KYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIR-PSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHG

Query:  NDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNK
        NDVSMTMKA+S+GGV ++QE+G+GRRGSQLKGGDVN SE S S+S+FHMLQQLKEENE+LKERLQRK  D NELQ ERDRCKSLEAQLKAAELKIEELNK
Subjt:  NDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNK

Query:  EQESLIDIFSEERDRRETEEKNLRKKLQ
        EQESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  EQESLIDIFSEERDRRETEEKNLRKKLQ

A0A6J1EWC6 protein MICRORCHIDIA 7-like isoform X27.9e-13581.6Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    S K PDKVYSGK+SEK QKTKDCRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS+TM+A+S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQ++KEEN ELKERLQRKEAD NELQH RD CKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

A0A6J1F262 protein MICRORCHIDIA 7-like isoform X17.9e-13581.6Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    S K PDKVYSGK+SEK QKTKDCRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS+TM+A+S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQ++KEEN ELKERLQRKEAD NELQH RD CKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

A0A6J1L0Q9 protein MICRORCHIDIA 7-like isoform X12.1e-13581.9Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    SGK PDKVYSGK+SEK QKTK+CRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS TM+A+S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQ++KEEN ELKERLQRKEAD NELQH RDRCKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

A0A6J1L0R5 protein MICRORCHIDIA 7-like isoform X22.1e-13581.9Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        +Q F+     + + PFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWA+YCHKIGYAPRR NK TNQSPDRESSPDDY
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND
        S Q SPQSRK    SGK PDKVYSGK+SEK QKTK+CRYVNG SS D NSSM  E+SR RPSS+P SPS  EV VD+LHGGQAN  GN TFHEKE HGND
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPS-LEVRVDNLHGGQANGTGNETFHEKESHGND

Query:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ
        VS TM+A+S+GGVS+AQEVG G RG QLKGGDVN +ERSLSSS+FHMLQ++KEEN ELKERLQRKEAD NELQH RDRCKSLEAQLKAAELKIEELNKEQ
Subjt:  VSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQ

Query:  ESLIDIFSEERDRRETEEKNLRKKLQ
        ESLIDIFSEERDRRETEE+NLRKKLQ
Subjt:  ESLIDIFSEERDRRETEEKNLRKKLQ

SwissProt top hitse value%identityAlignment
F4JPP0 Protein MICRORCHIDIA 39.3e-1642.86Show/hide
Query:  IRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGY
        I+   I+      +  F+     + + PFW++       G GV+GVLEANF+EPAHDKQ FER+++  RLEARL ++   YW T+CH  GY
Subjt:  IRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGY

F4JRS4 Protein MICRORCHIDIA 71.6e-3636.26Show/hide
Query:  STSIIRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRF
        S  +I   V   K    +Q F+     + + PFWR+WNA+GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE+RL+QMQKTYW+T CHKIGYAPRR 
Subjt:  STSIIRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRF

Query:  NKVTNQSPDRESSPDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQAN
         K      +R+SSP+  + +  P S K  T +    DK YS                                     SS P                  
Subjt:  NKVTNQSPDRESSPDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQAN

Query:  GTGNETFHEKESHGNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEA
                   +H  D           GVS       G+ G++L+                                         EL+ E++R K+LE 
Subjt:  GTGNETFHEKESHGNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEA

Query:  QLKAAELKIEELNKEQESLIDIFSEERDRRETEEKNLRKKLQ
        +++ +  KIEE+ KEQE+LI+IFSEERDRR+ EE+ LR KL+
Subjt:  QLKAAELKIEELNKEQESLIDIFSEERDRRETEEKNLRKKLQ

F4K2G3 Protein MICRORCHIDIA 53.3e-2930.77Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        IQ F+     + + PFWR+WNA+GSDGRGVIG+LEANF++PAH+KQGFERT VLA+LE+RL+  QK YW++ CH+IGYAPRR  K             +Y
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESHGNDV
         S  +   R +  ++  K                         SS+     +   +  + PS     P +E R  +       G  N +++         
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESHGNDV

Query:  SMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQE
                  G+S  +E G     ++L         + +      ++ +L+ + + L+ +LQ  +A    L+  +   + LE QLK ++ +I+ L   QE
Subjt:  SMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQE

Query:  SLIDIFSEERDRRETEEKNLRKKLQ
         +  IF +ER RR+  E  LRKKL+
Subjt:  SLIDIFSEERDRRETEEKNLRKKLQ

F4KAF2 Protein MICRORCHIDIA 45.8e-4238Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNK----VTNQSPDRESS
        +Q F+     + + PFWR+WNA+GSDGRGVIGVLEANFVEPAHDKQGFERTTVL+RLEARL+ MQK YW + CHKIGYA R+  K        + DRESS
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNK----VTNQSPDRESS

Query:  PDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESH
        P ++  + S  SRK T  S  K         +      K          N +++     K  ++ S +    S E       GG+   + +++    +  
Subjt:  PDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESH

Query:  GNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEAD----HNELQHERDRCKSLEAQLKAAELKI
        G        +        + E    R  ++L G     SE     S    L QL++EN EL+ERL +KE        +L+ ER+  K+LEA+++  + K+
Subjt:  GNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEAD----HNELQHERDRCKSLEAQLKAAELKI

Query:  EELNKEQESLIDIFSEERDRRETEEKNLRKKLQVGDWEVVSKPVDPEGLG
        +E++KEQ SLID+F+E+RDRR+ EE+NLR KL+      + K +D +  G
Subjt:  EELNKEQESLIDIFSEERDRRETEEKNLRKKLQVGDWEVVSKPVDPEGLG

Q84WV6 Protein MICRORCHIDIA 11.6e-1536Show/hide
Query:  IRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVT
        I+   I+      I  F+     + + PFW++     + G GV+GVLEANF+EPAHDKQ FER+++  RLEARL ++   YW  +CH  GY   +     
Subjt:  IRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVT

Query:  NQS---PDRESSPDDYSSQPSPQSR
        ++    PD+  + + Y+  P P  R
Subjt:  NQS---PDRESSPDDYSSQPSPQSR

Arabidopsis top hitse value%identityAlignment
AT4G24970.1 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family protein1.2e-3736.26Show/hide
Query:  STSIIRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRF
        S  +I   V   K    +Q F+     + + PFWR+WNA+GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE+RL+QMQKTYW+T CHKIGYAPRR 
Subjt:  STSIIRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRF

Query:  NKVTNQSPDRESSPDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQAN
         K      +R+SSP+  + +  P S K  T +    DK YS                                     SS P                  
Subjt:  NKVTNQSPDRESSPDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQAN

Query:  GTGNETFHEKESHGNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEA
                   +H  D           GVS       G+ G++L+                                         EL+ E++R K+LE 
Subjt:  GTGNETFHEKESHGNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEA

Query:  QLKAAELKIEELNKEQESLIDIFSEERDRRETEEKNLRKKLQ
        +++ +  KIEE+ KEQE+LI+IFSEERDRR+ EE+ LR KL+
Subjt:  QLKAAELKIEELNKEQESLIDIFSEERDRRETEEKNLRKKLQ

AT4G36270.1 ATP binding6.6e-1742.86Show/hide
Query:  IRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGY
        I+   I+      +  F+     + + PFW++       G GV+GVLEANF+EPAHDKQ FER+++  RLEARL ++   YW T+CH  GY
Subjt:  IRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGY

AT5G13130.1 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family protein2.3e-3030.77Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY
        IQ F+     + + PFWR+WNA+GSDGRGVIG+LEANF++PAH+KQGFERT VLA+LE+RL+  QK YW++ CH+IGYAPRR  K             +Y
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDY

Query:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESHGNDV
         S  +   R +  ++  K                         SS+     +   +  + PS     P +E R  +       G  N +++         
Subjt:  SSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESHGNDV

Query:  SMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQE
                  G+S  +E G     ++L         + +      ++ +L+ + + L+ +LQ  +A    L+  +   + LE QLK ++ +I+ L   QE
Subjt:  SMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQLKAAELKIEELNKEQE

Query:  SLIDIFSEERDRRETEEKNLRKKLQ
         +  IF +ER RR+  E  LRKKL+
Subjt:  SLIDIFSEERDRRETEEKNLRKKLQ

AT5G13130.2 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family protein2.0e-2964.71Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNK
        IQ F+     + + PFWR+WNA+GSDGRGVIG+LEANF++PAH+KQGFERT VLA+LE+RL+  QK YW++ CH+IGYAPRR  K
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNK

AT5G50780.1 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family protein4.1e-4338Show/hide
Query:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNK----VTNQSPDRESS
        +Q F+     + + PFWR+WNA+GSDGRGVIGVLEANFVEPAHDKQGFERTTVL+RLEARL+ MQK YW + CHKIGYA R+  K        + DRESS
Subjt:  IQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWATYCHKIGYAPRRFNK----VTNQSPDRESS

Query:  PDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESH
        P ++  + S  SRK T  S  K         +      K          N +++     K  ++ S +    S E       GG+   + +++    +  
Subjt:  PDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRVDNLHGGQANGTGNETFHEKESH

Query:  GNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEAD----HNELQHERDRCKSLEAQLKAAELKI
        G        +        + E    R  ++L G     SE     S    L QL++EN EL+ERL +KE        +L+ ER+  K+LEA+++  + K+
Subjt:  GNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEAD----HNELQHERDRCKSLEAQLKAAELKI

Query:  EELNKEQESLIDIFSEERDRRETEEKNLRKKLQVGDWEVVSKPVDPEGLG
        +E++KEQ SLID+F+E+RDRR+ EE+NLR KL+      + K +D +  G
Subjt:  EELNKEQESLIDIFSEERDRRETEEKNLRKKLQVGDWEVVSKPVDPEGLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCAGGAAAAGAAAGGATACCTTACGTTCTCAGGTTTTGATAGCAAAGAACACCTCAGTTTCAATTTCAACTGGGAGACAAGTAGCACCTCTATCATCCGTGCTAC
AGTCATTCAACACAAGAGACCTACAGTCATTCAAAGCTTCTCTAGAAGGCAATCAAAAAAAGATCTGCTGCCATTTTGGAGGCTTTGGAATGCTTCGGGAAGTGATGGGC
GTGGAGTTATAGGTGTTCTAGAAGCAAATTTTGTTGAACCTGCTCATGATAAGCAGGGGTTTGAGCGAACAACTGTTCTTGCAAGACTTGAAGCACGGTTGATACAGATG
CAGAAAACTTACTGGGCTACCTACTGTCATAAGATTGGCTATGCTCCACGACGATTTAATAAAGTTACCAATCAGTCTCCGGATAGAGAAAGTTCCCCCGATGATTATTC
TTCACAGCCTTCACCTCAATCAAGAAAGTACACTACATTAAGTGGGAAGAAACCTGATAAAGTCTATTCAGGAAAAGAATCAGAAAAGTTACAGAAAACAAAAGACTGTA
GATATGTGAATGGGCATTCAAGTAACGATAAAAACAGCAGCATGACCTCTGAGAAATCCAGGATAAGACCTTCTTCTGAGCCAGCTTCTCCTTCGCTTGAAGTTAGGGTT
GATAACCTGCATGGAGGACAAGCAAATGGTACTGGCAATGAGACGTTTCATGAGAAGGAATCTCATGGAAATGATGTTTCAATGACAATGAAAGCGGCTTCAAGTGGAGG
AGTCAGTGAAGCTCAGGAAGTTGGATTGGGTAGAAGAGGATCCCAACTGAAGGGAGGAGATGTAAACAAAAGTGAGCGTTCCCTTTCAAGTTCCGATTTCCACATGCTAC
AGCAGTTGAAGGAAGAAAATGAAGAATTGAAGGAGAGATTACAGAGAAAGGAAGCTGATCACAACGAATTGCAGCATGAAAGAGATAGGTGTAAATCACTTGAGGCTCAG
CTTAAAGCAGCAGAACTTAAAATTGAGGAATTGAATAAAGAACAAGAAAGCCTAATAGATATTTTCTCAGAGGAGAGAGATCGAAGAGAAACTGAGGAGAAAAATCTGCG
AAAGAAGCTGCAGGTGGGAGACTGGGAGGTGGTTTCTAAGCCGGTTGACCCTGAGGGCTTGGGCTTTGGGAATTTGAGACTTTGCAATGAGGCCCTCGGGATTGTTGGTC
CAGTGGTACCATTTATCTTCTTTGTTCCCTCATTTGTATTGCCTCACTTCTTTGAAGGTCATTCTATAGCATCTGTCTTGACTTCTCCAGGCATTCCCCCTTCTTTTAAT
TTTGGCTTCCCTCTCTCTTTGTCCAATAGGGAGACAACGGATGTTTCTTCGCTTCTTCCTTTGCTAGATCTCGTCCTTGTCTCCTTGGGGAGGACTGAGGAGAGTAGGGT
TATTCATATATGGTCTCCTTGCCCTTCTAAAGGGTTTTCCTGCAAATCTTTGTTCTGTCTTTTGTATGACCCAATCACCTTCAATCCCCTAATAATCTTTATTCTCCTCA
TTGTCGAAGATCAAATCTCCCAGAAAGAGGGTGTTGGAGGGTTTGGATCACATTCTTTGGAGATGTTGGTTTGCTATTGCAATTCGGAACCAGTTTTTGGAGTCTTTGAG
CATCCTTCTAGCTTTGAACTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGCAGGAAAAGAAAGGATACCTTACGTTCTCAGGTTTTGATAGCAAAGAACACCTCAGTTTCAATTTCAACTGGGAGACAAGTAGCACCTCTATCATCCGTGCTAC
AGTCATTCAACACAAGAGACCTACAGTCATTCAAAGCTTCTCTAGAAGGCAATCAAAAAAAGATCTGCTGCCATTTTGGAGGCTTTGGAATGCTTCGGGAAGTGATGGGC
GTGGAGTTATAGGTGTTCTAGAAGCAAATTTTGTTGAACCTGCTCATGATAAGCAGGGGTTTGAGCGAACAACTGTTCTTGCAAGACTTGAAGCACGGTTGATACAGATG
CAGAAAACTTACTGGGCTACCTACTGTCATAAGATTGGCTATGCTCCACGACGATTTAATAAAGTTACCAATCAGTCTCCGGATAGAGAAAGTTCCCCCGATGATTATTC
TTCACAGCCTTCACCTCAATCAAGAAAGTACACTACATTAAGTGGGAAGAAACCTGATAAAGTCTATTCAGGAAAAGAATCAGAAAAGTTACAGAAAACAAAAGACTGTA
GATATGTGAATGGGCATTCAAGTAACGATAAAAACAGCAGCATGACCTCTGAGAAATCCAGGATAAGACCTTCTTCTGAGCCAGCTTCTCCTTCGCTTGAAGTTAGGGTT
GATAACCTGCATGGAGGACAAGCAAATGGTACTGGCAATGAGACGTTTCATGAGAAGGAATCTCATGGAAATGATGTTTCAATGACAATGAAAGCGGCTTCAAGTGGAGG
AGTCAGTGAAGCTCAGGAAGTTGGATTGGGTAGAAGAGGATCCCAACTGAAGGGAGGAGATGTAAACAAAAGTGAGCGTTCCCTTTCAAGTTCCGATTTCCACATGCTAC
AGCAGTTGAAGGAAGAAAATGAAGAATTGAAGGAGAGATTACAGAGAAAGGAAGCTGATCACAACGAATTGCAGCATGAAAGAGATAGGTGTAAATCACTTGAGGCTCAG
CTTAAAGCAGCAGAACTTAAAATTGAGGAATTGAATAAAGAACAAGAAAGCCTAATAGATATTTTCTCAGAGGAGAGAGATCGAAGAGAAACTGAGGAGAAAAATCTGCG
AAAGAAGCTGCAGGTGGGAGACTGGGAGGTGGTTTCTAAGCCGGTTGACCCTGAGGGCTTGGGCTTTGGGAATTTGAGACTTTGCAATGAGGCCCTCGGGATTGTTGGTC
CAGTGGTACCATTTATCTTCTTTGTTCCCTCATTTGTATTGCCTCACTTCTTTGAAGGTCATTCTATAGCATCTGTCTTGACTTCTCCAGGCATTCCCCCTTCTTTTAAT
TTTGGCTTCCCTCTCTCTTTGTCCAATAGGGAGACAACGGATGTTTCTTCGCTTCTTCCTTTGCTAGATCTCGTCCTTGTCTCCTTGGGGAGGACTGAGGAGAGTAGGGT
TATTCATATATGGTCTCCTTGCCCTTCTAAAGGGTTTTCCTGCAAATCTTTGTTCTGTCTTTTGTATGACCCAATCACCTTCAATCCCCTAATAATCTTTATTCTCCTCA
TTGTCGAAGATCAAATCTCCCAGAAAGAGGGTGTTGGAGGGTTTGGATCACATTCTTTGGAGATGTTGGTTTGCTATTGCAATTCGGAACCAGTTTTTGGAGTCTTTGAG
CATCCTTCTAGCTTTGAACTGTGA
Protein sequenceShow/hide protein sequence
MLQEKKGYLTFSGFDSKEHLSFNFNWETSSTSIIRATVIQHKRPTVIQSFSRRQSKKDLLPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQM
QKTYWATYCHKIGYAPRRFNKVTNQSPDRESSPDDYSSQPSPQSRKYTTLSGKKPDKVYSGKESEKLQKTKDCRYVNGHSSNDKNSSMTSEKSRIRPSSEPASPSLEVRV
DNLHGGQANGTGNETFHEKESHGNDVSMTMKAASSGGVSEAQEVGLGRRGSQLKGGDVNKSERSLSSSDFHMLQQLKEENEELKERLQRKEADHNELQHERDRCKSLEAQ
LKAAELKIEELNKEQESLIDIFSEERDRRETEEKNLRKKLQVGDWEVVSKPVDPEGLGFGNLRLCNEALGIVGPVVPFIFFVPSFVLPHFFEGHSIASVLTSPGIPPSFN
FGFPLSLSNRETTDVSSLLPLLDLVLVSLGRTEESRVIHIWSPCPSKGFSCKSLFCLLYDPITFNPLIIFILLIVEDQISQKEGVGGFGSHSLEMLVCYCNSEPVFGVFE
HPSSFEL