; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0015870 (gene) of Chayote v1 genome

Gene IDSed0015870
OrganismSechium edule (Chayote v1)
DescriptionTranscriptional coactivator Hfi1/Transcriptional adapter 1
Genome locationLG07:9550087..9553770
RNA-Seq ExpressionSed0015870
SyntenySed0015870
Gene Ontology termsGO:0070461 - SAGA-type complex (cellular component)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044349.1 Transcriptional coactivator Hfi1/Transcriptional adapter 1 [Cucumis melo var. makuwa]3.4e-17669.76Show/hide
Query:  EKKEVPGWSVNDWAAEEEDLRDGFELKLKSFEPFETEPAMDSQACSTIRAGRFMLSELSSKVAVLILVSNLQITKSNQSNLHKQLKETAKKLPIFHSEEA
        EKK+  GW VND +  +EDLRDGF+ KLKSFE FE E   DSQ C  + +GR                S+  + +  Q           +++ +F +   
Subjt:  EKKEVPGWSVNDWAAEEEDLRDGFELKLKSFEPFETEPAMDSQACSTIRAGRFMLSELSSKVAVLILVSNLQITKSNQSNLHKQLKETAKKLPIFHSEEA

Query:  AKIIPITHFHQFLILVYLLVGDSEGFVSNS--------AMISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVI
           +P       ++ ++      EGFVSNS         M  RKD SRIDT ELKAMIYRKLGHQRS+KYFDQLK LLSL+ NK EFDKFCIQIIGRE+I
Subjt:  AKIIPITHFHQFLILVYLLVGDSEGFVSNS--------AMISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVI

Query:  PLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQ
        PLHNRLI+AIL+NACVAKTPP L+STRKVGGNLSVKVVNGYQRSCLQSLHGDA   SPRKGRSPV+RDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQ
Subjt:  PLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQ

Query:  SATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDIS
        SATELHSLGSRPPVEM SVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GS KTLSNV V G+NYH TTCQ+GGELPDTRLLRTHL++K ETEQ+DIS
Subjt:  SATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDIS

Query:  VDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        VDGVNLLNNALDVYLKRLIEPCLN SRS+ ER K TGNQPITGSR  FQEQ  HRAQQ+NN SLLDFRVAM+LNP+VLG++  +QLEKISLRA EE
Subjt:  VDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

KAG6576923.1 hypothetical protein SDJN03_24497, partial [Cucurbita argyrosperma subsp. sororia]8.0e-17085Show/hide
Query:  YLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACV
        Y  VGDSEGFVSNSA        M+SRKD SRIDT ELKAMIYRKLGHQRSEKYFDQLK LLSL+INK EFDKFCIQIIGRE+IPLHNR IKAIL+NACV
Subjt:  YLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACV

Query:  AKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEM
        AKTPP L STRKV  NLSVKVVNGYQRSCLQSLHGDA   SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPV+M
Subjt:  AKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEM

Query:  PSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVG-KNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLK
         SVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSN+SVG +N H TTCQNGGELPDTRLLRTHLKQK E EQ+DISVDGVNLLNNALDVYLK
Subjt:  PSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVG-KNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLK

Query:  RLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        RLIEPCL+ SRS+ ER + T NQPITGSR   +EQ  HRAQ+L NASLLDFRVAM+LNPEVLG+D M QLEKISLRA EE
Subjt:  RLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

KAG7014950.1 hypothetical protein SDJN02_22581, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-16884.21Show/hide
Query:  YLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACV
        Y  VGDSEGFVSNSA        M+SRKD SRIDT ELKAMIYRKLGHQRSEKYFDQLK LLSL+INK EFDKFCIQIIGRE+IPLHNR IKAIL+NACV
Subjt:  YLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACV

Query:  AKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEM
        AKTPP L STRKV  NLSVKVVNGYQRSCLQSLHGDA   SPRKGRSPV+RD KIRDRPSPLGPCGKPQNIA+EELAFKAQEQQSATELHSLGSRPPV+M
Subjt:  AKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEM

Query:  PSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVG-KNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLK
         SVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSN+SVG +N H TTCQNGGELPDTRLLRTHLKQK E EQ+DISVDGVNLLNNALDVYLK
Subjt:  PSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVG-KNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLK

Query:  RLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        RLIEPCL+ SRS+ ER + T NQPITGSR   +EQ  HRAQ+L NASLLDFRVAM+LNPEVLG+D M QLEKISLRA EE
Subjt:  RLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

KAG7030952.1 hypothetical protein SDJN02_04989 [Cucurbita argyrosperma subsp. argyrosperma]5.7e-16875.06Show/hide
Query:  VAVLILVSNLQITKSNQSNLHKQLKETAKKLPI---FHSEEAAKIIPITHFHQFLILVYLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMI
        +AVLI VS LQITKS++  + KQ KE AK          +     IP+  F  F            GF+S+SA        MI RKD SRIDT ELKAMI
Subjt:  VAVLILVSNLQITKSNQSNLHKQLKETAKKLPI---FHSEEAAKIIPITHFHQFLILVYLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMI

Query:  YRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SP
        Y+KLGHQRSEKYFDQLK LLSL+INK EFDKFCIQIIGREVIPLHNRLI+AILRNACVAK PP L+S+RK G NLSVKVV+GYQ+SCLQSL+GDA   SP
Subjt:  YRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SP

Query:  RKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLS
        RKGRSP++RDRKIRDRPSPLGPCGK QNI LEELAFKAQEQQSATELHSLGSRPPVEM SVEDGEEVEQ+A SPGVQSRSPVTAP GISMNF+GSGKTLS
Subjt:  RKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLS

Query:  NVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQL
        NVSVG+NYH TTCQNGGELPDTRLLRTHLKQK E EQ+DISVDGVNLLNNALDVYLKRLIEPCLN SRS+ ER    GNQPIT SR    E+ WHR+Q+L
Subjt:  NVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQL

Query:  NNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        NNASLLDF  AM+LNPEVLGKDR IQLEKISLRA EE
Subjt:  NNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

XP_038903082.1 uncharacterized protein LOC120089765 [Benincasa hispida]1.1e-16686.52Show/hide
Query:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN
        MI RKD SRIDT ELKAMIYRKLGHQ+SEKYFD LK LLSL+INK EFDKFCIQIIGRE+IPLHNRLI+AIL+NAC AKTPP LNSTRKVGGNLSVKVVN
Subjt:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN

Query:  GYQRSCLQSLHGD---ASPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP
        GYQRSCLQSLHGD   +SPRK RSPV+RDRKIRDRPSPLGPCGKPQN+ALEEL+FKAQEQQSATE+HSLGSRPPVEM SVEDGEEVEQVAGSPGVQSRSP
Subjt:  GYQRSCLQSLHGD---ASPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQP
        VTAPLGISMNFVGS KTLSNV V +NY+ TTCQ+GGELPDTRLLRTHL+QK ETEQ+DISVDGVNLLNNALDVYLKRLIEPCLN SRS+ ER K TGNQP
Subjt:  VTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQP

Query:  ITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        ITGSR  FQEQ  HRAQQLNNASLLDFRVAM+LNPEVLG+D  +QLEKISLRA EE
Subjt:  ITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0KWF9 Uncharacterized protein7.6e-17477.03Show/hide
Query:  LILVSNLQITKSNQSNLHKQLKETAKKLPIFHSEEAAKIIPITHFHQFLILVYLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMIYRKLGH
        L+LVSNL+IT+  QS   KQ  +      I HS++ A         QF   ++      EGFVSNSA        M+ RKD SRIDT ELKAMIYRKLGH
Subjt:  LILVSNLQITKSNQSNLHKQLKETAKKLPIFHSEEAAKIIPITHFHQFLILVYLLVGDSEGFVSNSA--------MISRKDRSRIDTFELKAMIYRKLGH

Query:  QRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSP
        QRS+KYFDQLK LLSL+ NK EFDKFCIQIIGRE+IPLHNRLI+AIL+NACVAKTPP L+STRKVGGNLSVKVVNGYQRSCLQSLHGDA   SPRKGRSP
Subjt:  QRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSP

Query:  VNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVGK
        V+RDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQSATELHSLGSRPPVEM SVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GSGKTLSNV VG 
Subjt:  VNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVGK

Query:  NYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLL
        NYH TTCQ+ GELPDTRLLRTHL++K ETEQ+DISVDGVNLLNNALDVYLKRLIEPCLN SRS+ ER K TGNQPITGSR  FQEQ  HRAQQLNN SLL
Subjt:  NYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLL

Query:  DFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        DFRVAM+LNP+VLG++  +QLEKISLRA EE
Subjt:  DFRVAMELNPEVLGKDRMIQLEKISLRAFEE

A0A1S3BYK6 uncharacterized protein LOC1034947992.2e-16585.43Show/hide
Query:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN
        M  RKD SRIDT ELKAMIYRKLGHQRS+KYFDQLK LLSL+ NK EFDKFCIQIIGRE+IPLHNRLI+AIL+NACVAKTPP L+STRKVGGNLSVKVVN
Subjt:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN

Query:  GYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP
        GYQRSCLQSLHGDA   SPRKGRSPV+RDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQSATELHSLGSRPPVEM SVEDGEEVEQVAGSPGVQSRSP
Subjt:  GYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFVGSGKTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQ
        VTAPLGISMNF+GS KTLSNV V G+NYH TTCQ+GGELPDTRLLRTHL++K ETEQ+DISVDGVNLLNNALDVYLKRLIEPCLN SRS+ ER K TGNQ
Subjt:  VTAPLGISMNFVGSGKTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQ

Query:  PITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        PITGSR  FQEQ  HRAQQ+NN SLLDFRVAM+LNP+VLG++  +QLEKISLRA EE
Subjt:  PITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

A0A5A7TT05 Transcriptional coactivator Hfi1/Transcriptional adapter 11.6e-17669.76Show/hide
Query:  EKKEVPGWSVNDWAAEEEDLRDGFELKLKSFEPFETEPAMDSQACSTIRAGRFMLSELSSKVAVLILVSNLQITKSNQSNLHKQLKETAKKLPIFHSEEA
        EKK+  GW VND +  +EDLRDGF+ KLKSFE FE E   DSQ C  + +GR                S+  + +  Q           +++ +F +   
Subjt:  EKKEVPGWSVNDWAAEEEDLRDGFELKLKSFEPFETEPAMDSQACSTIRAGRFMLSELSSKVAVLILVSNLQITKSNQSNLHKQLKETAKKLPIFHSEEA

Query:  AKIIPITHFHQFLILVYLLVGDSEGFVSNS--------AMISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVI
           +P       ++ ++      EGFVSNS         M  RKD SRIDT ELKAMIYRKLGHQRS+KYFDQLK LLSL+ NK EFDKFCIQIIGRE+I
Subjt:  AKIIPITHFHQFLILVYLLVGDSEGFVSNS--------AMISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVI

Query:  PLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQ
        PLHNRLI+AIL+NACVAKTPP L+STRKVGGNLSVKVVNGYQRSCLQSLHGDA   SPRKGRSPV+RDRKIRDRPSPLGPCGKPQN+ALEE A KAQEQQ
Subjt:  PLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQ

Query:  SATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDIS
        SATELHSLGSRPPVEM SVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GS KTLSNV V G+NYH TTCQ+GGELPDTRLLRTHL++K ETEQ+DIS
Subjt:  SATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDIS

Query:  VDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        VDGVNLLNNALDVYLKRLIEPCLN SRS+ ER K TGNQPITGSR  FQEQ  HRAQQ+NN SLLDFRVAM+LNP+VLG++  +QLEKISLRA EE
Subjt:  VDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

A0A6J1FW12 uncharacterized protein LOC1114478295.4e-16485.39Show/hide
Query:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN
        MI RKD SRIDT ELKAMIY+KLGHQRSEKYFDQLKNLLSL+INK EFDKFCIQIIGREVIPLHNRLI+AILRNACVAK PP L+S+RKVG NLSVKVV+
Subjt:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN

Query:  GYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP
        GYQ+SCLQSL+GDA   SPRKGRSPV+RDRKIRDRPSPLGPCGKPQNI LEELAFKAQEQQSATELHSLGSRPPVEM SVEDGEEVEQ+A SPGVQSRSP
Subjt:  GYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQP
        VTAP GISMNF+GSGKTLSNV+VG+NY  TTCQNGGELPDTRLLRTHLKQK ETEQ+DISVDGVNLLNNALDVYLKRLIEPCLN SRS+ ER    GNQP
Subjt:  VTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQP

Query:  ITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        IT SR    EQ WHR+Q+L+NASLLDF VAM+LNPE+LGKDR IQLEKISLRA EE
Subjt:  ITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

A0A6J1J7C3 uncharacterized protein LOC1114832434.6e-16385.71Show/hide
Query:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN
        M+ RKD SRIDT ELKAMIYRKLGHQRSEKYFDQLK LLSL+INK EFDKFCIQIIGRE+IPLHNR IKAIL+NACVAKTPP L+STRKV  NLSVKVVN
Subjt:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN

Query:  GYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP
        GYQRSCLQSLHGDA   SPRKGRSPV+RDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPV+M SVEDGEEVEQVAGSPGVQSRSP
Subjt:  GYQRSCLQSLHGDA---SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFVGSGKTLSNVSVG-KNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQ
        VTAPLGISMNFVGSGKTLSN+SVG +N H TTCQNGGELPDTRLLRTHLKQK E EQ+DISVDGVNLLNNALD+YLKRLIEPCL+ S+S+ ER + T NQ
Subjt:  VTAPLGISMNFVGSGKTLSNVSVG-KNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQ

Query:  PITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
        PITGSR   QEQ  HRAQ+L NASLLDFRVAM+LNPEVLG+D   QLEKISLRA EE
Subjt:  PITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein3.2e-6845.43Show/hide
Query:  SRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCL
        SR+++ E+KA+IY+K+GHQR++ YFDQL   L+ RI+KSEFDK C + +GRE I LHNRL+++IL+NA VAK+PP                     R   
Subjt:  SRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNGYQRSCL

Query:  QSLHGD----ASPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLG
        +SL+GD     SPRK RS     RK RDRPSPLGP GKPQ++             +  E  S   R P+E+ SVEDGEEVEQ+ GSP VQSRSP+TAPLG
Subjt:  QSLHGD----ASPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRSPVTAPLG

Query:  ISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRN
        +S +     K+ +  S     +  TCQ+ GELPD   LR  L++K E E + +S+D  NLLN  L+ Y++RLIEPCL+L+                    
Subjt:  ISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRN

Query:  GFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
             Q  RA  ++N S+LDF  AME+NP VLG++  IQLEKI  RA EE
Subjt:  GFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

AT2G24530.1 unknown protein8.3e-4032.84Show/hide
Query:  ISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPP---------ALNSTRKVGG
        + R    RI   ELK  I +K G +RS +YF  L   LS ++ KSEFDK C++++GRE + LHN+LI++ILRNA VAK+PP           N+ +  G 
Subjt:  ISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPP---------ALNSTRKVGG

Query:  NL---SVKVVNGYQRSCLQSLHG--DASPRKGRSPVNRDRKIRDRPSPLGPCGKPQ--------------NIALEELAFKAQEQQSATELHSLGSRPPVE
         L      + N  Q   + S +G    SPRK RS + ++RK RDRPSPLG  GK +              ++ +E   ++   +  A E      R PVE
Subjt:  NL---SVKVVNGYQRSCLQSLHG--DASPRKGRSPVNRDRKIRDRPSPLGPCGKPQ--------------NIALEELAFKAQEQQSATELHSLGSRPPVE

Query:  MPSVEDGEEVEQVAGSPGVQ---------SRSPVTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMD-ISVDGVNL
         P + + E++  V+               S SP+ APLGI       G +   + V  N    +C + G LPD  +LR  ++     + ++ +S++    
Subjt:  MPSVEDGEEVEQVAGSPGVQ---------SRSPVTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMD-ISVDGVNL

Query:  LNNALDVYLKRLIEPCLNL---------------SRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISL
        LNN LDVYLK+LI  C +L                + + + + + G  P    +           Q  ++ S+LDFR AMELNP  LG+D     E+ISL
Subjt:  LNNALDVYLKRLIEPCLNL---------------SRSKLERRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISL

Query:  RAFEE
        R+FEE
Subjt:  RAFEE

AT4G31440.1 unknown protein7.5e-4134.11Show/hide
Query:  ISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNG
        + R    RID  ELK  I +K+G +RS +YF  L   LS ++ KSEFDK C +++GRE + LHN+LI++ILRNA +AK+PP+++ +   G +L +   +G
Subjt:  ISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVNG

Query:  YQRSCLQSLHGD----------ASPRKGRSPVNRDRKIRDRPSPLGPCGKPQN-IALEELAFKAQEQQSA----TELHSLGSRPPVEMPSVEDGEEVEQV
         + S  +SL+ D              K R     DR IRD+P PLG  GK     A         E+ SA     E  ++  +  V  P   D E   ++
Subjt:  YQRSCLQSLHGD----------ASPRKGRSPVNRDRKIRDRPSPLGPCGKPQN-IALEELAFKAQEQQSA----TELHSLGSRPPVEMPSVEDGEEVEQV

Query:  AGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQM-DISVDGVNLLNNALDVYLKRLIEPCLNLS--
          +P      PV APLGI       G     V V  +  A +C + G L DT +LR  ++    T+ +  +S +   +LNN LD+YLK+L++ C++L+  
Subjt:  AGSPGVQSRSPVTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQM-DISVDGVNLLNNALDVYLKRLIEPCLNLS--

Query:  --------RSKLERRKLTGNQPITGSR--NGFQEQQWHR----AQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
                +  LE+++ + ++ + G R  N F  Q  ++     ++ ++ SLLDFRVAMELNP  LG+D  +  E+IS+  FEE
Subjt:  --------RSKLERRKLTGNQPITGSR--NGFQEQQWHR----AQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

AT4G33890.1 unknown protein2.0e-8149.59Show/hide
Query:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN
        M S +  SR+DT E+KA+IYR++G+QR+E YF+QL    +L+I KSEFDK CI+ IGR+ I LHNRLI++I++NAC+AK+PP +    K GG+  V+  N
Subjt:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN

Query:  G--YQRSCLQSLHGDA----SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIAL--EELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVA-GSPG
        G   + S +Q LHGD+    S RK RS     RK+RDRPSPLGP GKP ++    EE   KA   QSATEL SLGSRPPVE+ SVE+GEEVEQ+A GSP 
Subjt:  G--YQRSCLQSLHGDA----SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIAL--EELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVA-GSPG

Query:  VQSRSPVTAPLGISMNFVGSG--KTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLE
        VQSR P+TAPLG+SM+       K++SNVS+  ++++  TCQN GELPDTR LR+ L+++ E E + I++D V+LLN+ LDV+++RLIEPCL+L+ ++  
Subjt:  VQSRSPVTAPLGISMNFVGSG--KTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLE

Query:  RRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
                  T        Q   ++++L+  S+ DFR  MELN E+LG+D  + +EKI  RA ++
Subjt:  RRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE

AT4G33890.2 unknown protein2.0e-8149.59Show/hide
Query:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN
        M S +  SR+DT E+KA+IYR++G+QR+E YF+QL    +L+I KSEFDK CI+ IGR+ I LHNRLI++I++NAC+AK+PP +    K GG+  V+  N
Subjt:  MISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNSTRKVGGNLSVKVVN

Query:  G--YQRSCLQSLHGDA----SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIAL--EELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVA-GSPG
        G   + S +Q LHGD+    S RK RS     RK+RDRPSPLGP GKP ++    EE   KA   QSATEL SLGSRPPVE+ SVE+GEEVEQ+A GSP 
Subjt:  G--YQRSCLQSLHGDA----SPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIAL--EELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVA-GSPG

Query:  VQSRSPVTAPLGISMNFVGSG--KTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLE
        VQSR P+TAPLG+SM+       K++SNVS+  ++++  TCQN GELPDTR LR+ L+++ E E + I++D V+LLN+ LDV+++RLIEPCL+L+ ++  
Subjt:  VQSRSPVTAPLGISMNFVGSG--KTLSNVSV-GKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLE

Query:  RRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE
                  T        Q   ++++L+  S+ DFR  MELN E+LG+D  + +EKI  RA ++
Subjt:  RRKLTGNQPITGSRNGFQEQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATGGAGAAGAAAGAGGTTCCTGGTTGGAGCGTAAATGATTGGGCGGCTGAAGAAGAAGATTTGAGAGATGGGTTTGAATTGAAGTTGAAATCTTTTGAGCCGTT
TGAAACAGAGCCTGCAATGGACTCTCAGGCATGTTCGACAATCAGAGCTGGAAGGTTCATGTTATCAGAACTCTCATCGAAGGTCGCTGTTTTGATTTTGGTCTCCAATC
TTCAAATCACCAAAAGCAATCAATCCAATCTCCATAAACAATTGAAAGAAACTGCAAAAAAGCTTCCAATTTTCCATTCTGAAGAAGCCGCCAAAATCATCCCTATTACT
CATTTCCATCAATTTCTTATTTTAGTCTACTTGTTGGTGGGGGATTCTGAAGGCTTTGTCTCTAATTCAGCCATGATTTCGAGGAAAGACCGTTCTCGTATCGACACTTT
CGAGCTGAAGGCGATGATTTATCGAAAGCTTGGACATCAGAGGTCGGAGAAGTACTTCGATCAGCTTAAGAATTTGTTGAGTTTAAGGATTAACAAAAGCGAATTCGACA
AGTTTTGTATTCAGATCATTGGGAGGGAGGTTATACCTCTTCATAATCGGCTTATCAAAGCAATTCTTCGAAATGCCTGTGTTGCTAAAACTCCTCCTGCTCTTAACAGT
ACTAGGAAAGTAGGGGGCAATCTCAGTGTGAAGGTTGTGAATGGGTATCAGAGGAGCTGTCTTCAATCCCTTCATGGAGATGCATCCCCTCGAAAGGGCAGGTCTCCGGT
TAATAGAGACCGTAAGATTCGTGATCGGCCGAGTCCTCTGGGACCATGTGGGAAGCCCCAGAATATTGCACTTGAAGAACTTGCTTTCAAGGCACAAGAACAACAGAGTG
CAACTGAGTTGCATTCTCTTGGCAGCCGTCCGCCTGTCGAAATGCCGTCTGTAGAAGACGGAGAAGAGGTTGAGCAGGTGGCTGGGAGTCCAGGAGTTCAGAGCAGAAGC
CCGGTTACTGCTCCGCTTGGTATATCTATGAACTTTGTTGGATCTGGTAAAACACTGTCTAATGTATCTGTAGGAAAAAATTACCATGCAACAACATGTCAAAATGGCGG
TGAGTTACCCGACACAAGGCTGCTAAGAACTCATTTGAAGCAGAAGTTTGAAACAGAGCAGATGGATATATCTGTCGATGGTGTAAATCTTCTCAACAACGCGCTGGATG
TTTATTTAAAGAGGTTAATTGAGCCATGTTTGAATTTATCTCGGTCAAAGCTCGAGCGACGGAAACTTACAGGGAATCAACCGATAACTGGCTCAAGAAATGGATTCCAG
GAACAACAATGGCATAGAGCTCAACAATTGAACAATGCATCCTTGTTGGACTTCCGTGTCGCAATGGAACTGAACCCTGAAGTACTTGGGAAAGACCGGATGATACAGCT
CGAGAAAATCAGTTTACGAGCTTTTGAAGAGTAA
mRNA sequenceShow/hide mRNA sequence
TGGGCGACCAAAATCTGAACATCGCAATTGAATAGCAGCCGGCAAGAGAAGGGAAGAAATGAAAATGAATACCTTTTGGAGAATATCATTAGATTGCAAGTGAGTAGCGT
GGGTGTACGATGAAGATGGAGAAGAAAGAGGTTCCTGGTTGGAGCGTAAATGATTGGGCGGCTGAAGAAGAAGATTTGAGAGATGGGTTTGAATTGAAGTTGAAATCTTT
TGAGCCGTTTGAAACAGAGCCTGCAATGGACTCTCAGGCATGTTCGACAATCAGAGCTGGAAGGTTCATGTTATCAGAACTCTCATCGAAGGTCGCTGTTTTGATTTTGG
TCTCCAATCTTCAAATCACCAAAAGCAATCAATCCAATCTCCATAAACAATTGAAAGAAACTGCAAAAAAGCTTCCAATTTTCCATTCTGAAGAAGCCGCCAAAATCATC
CCTATTACTCATTTCCATCAATTTCTTATTTTAGTCTACTTGTTGGTGGGGGATTCTGAAGGCTTTGTCTCTAATTCAGCCATGATTTCGAGGAAAGACCGTTCTCGTAT
CGACACTTTCGAGCTGAAGGCGATGATTTATCGAAAGCTTGGACATCAGAGGTCGGAGAAGTACTTCGATCAGCTTAAGAATTTGTTGAGTTTAAGGATTAACAAAAGCG
AATTCGACAAGTTTTGTATTCAGATCATTGGGAGGGAGGTTATACCTCTTCATAATCGGCTTATCAAAGCAATTCTTCGAAATGCCTGTGTTGCTAAAACTCCTCCTGCT
CTTAACAGTACTAGGAAAGTAGGGGGCAATCTCAGTGTGAAGGTTGTGAATGGGTATCAGAGGAGCTGTCTTCAATCCCTTCATGGAGATGCATCCCCTCGAAAGGGCAG
GTCTCCGGTTAATAGAGACCGTAAGATTCGTGATCGGCCGAGTCCTCTGGGACCATGTGGGAAGCCCCAGAATATTGCACTTGAAGAACTTGCTTTCAAGGCACAAGAAC
AACAGAGTGCAACTGAGTTGCATTCTCTTGGCAGCCGTCCGCCTGTCGAAATGCCGTCTGTAGAAGACGGAGAAGAGGTTGAGCAGGTGGCTGGGAGTCCAGGAGTTCAG
AGCAGAAGCCCGGTTACTGCTCCGCTTGGTATATCTATGAACTTTGTTGGATCTGGTAAAACACTGTCTAATGTATCTGTAGGAAAAAATTACCATGCAACAACATGTCA
AAATGGCGGTGAGTTACCCGACACAAGGCTGCTAAGAACTCATTTGAAGCAGAAGTTTGAAACAGAGCAGATGGATATATCTGTCGATGGTGTAAATCTTCTCAACAACG
CGCTGGATGTTTATTTAAAGAGGTTAATTGAGCCATGTTTGAATTTATCTCGGTCAAAGCTCGAGCGACGGAAACTTACAGGGAATCAACCGATAACTGGCTCAAGAAAT
GGATTCCAGGAACAACAATGGCATAGAGCTCAACAATTGAACAATGCATCCTTGTTGGACTTCCGTGTCGCAATGGAACTGAACCCTGAAGTACTTGGGAAAGACCGGAT
GATACAGCTCGAGAAAATCAGTTTACGAGCTTTTGAAGAGTAA
Protein sequenceShow/hide protein sequence
MKMEKKEVPGWSVNDWAAEEEDLRDGFELKLKSFEPFETEPAMDSQACSTIRAGRFMLSELSSKVAVLILVSNLQITKSNQSNLHKQLKETAKKLPIFHSEEAAKIIPIT
HFHQFLILVYLLVGDSEGFVSNSAMISRKDRSRIDTFELKAMIYRKLGHQRSEKYFDQLKNLLSLRINKSEFDKFCIQIIGREVIPLHNRLIKAILRNACVAKTPPALNS
TRKVGGNLSVKVVNGYQRSCLQSLHGDASPRKGRSPVNRDRKIRDRPSPLGPCGKPQNIALEELAFKAQEQQSATELHSLGSRPPVEMPSVEDGEEVEQVAGSPGVQSRS
PVTAPLGISMNFVGSGKTLSNVSVGKNYHATTCQNGGELPDTRLLRTHLKQKFETEQMDISVDGVNLLNNALDVYLKRLIEPCLNLSRSKLERRKLTGNQPITGSRNGFQ
EQQWHRAQQLNNASLLDFRVAMELNPEVLGKDRMIQLEKISLRAFEE