; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002447 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002447
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTranscriptional coactivator Hfi1/Transcriptional adapter 1
Genome locationscaffold6:4536507..4541411
RNA-Seq ExpressionSpg002447
SyntenySpg002447
Gene Ontology termsGO:0070461 - SAGA-type complex (cellular component)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044349.1 Transcriptional coactivator Hfi1/Transcriptional adapter 1 [Cucumis melo var. makuwa]4.0e-20680.64Show/hide
Query:  MTKMEKKEVSDWGVNGGAAGKEDLRDGFEWKLKSFEQIEGELARDSQTCSESERESTADVAR-----------QIGFWISDERFLAL-SAYSLVGDSEGF
        M   EKK+   WGVN G+ GKEDLRDGF+WKLKSFEQ E E+  DSQTC ES RES+ADV R             GF  SD +   +   +      EGF
Subjt:  MTKMEKKEVSDWGVNGGAAGKEDLRDGFEWKLKSFEQIEGELARDSQTCSESERESTADVAR-----------QIGFWISDERFLAL-SAYSLVGDSEGF

Query:  ISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSS
        +SNS I E+  +KM PRKD+SRIDTSELKAMIYRKLGHQRS+KYFDQLK LLSLK NK EFDKFCIQI+GRE+IPLHNRLIRAIL+NACVAKTPPVLSS+
Subjt:  ISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSS

Query:  RKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVE
        RKVGG+LSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGK QN+ALEEFA KAQEQQSATELHSLGSRPPVEMASVEDGEEVE
Subjt:  RKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVE

Query:  QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFS
        QVAGSPGVQSRSPVTAPLGISMNFIGS KTLSNV V GRNYHVTTCQ+GGELPDTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCLNFS
Subjt:  QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFS

Query:  RSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        RSRCERLKFTGNQPITGSR  FQEQ+ HRAQ++NN SLLDFRVAM+LNP+VLG++WT+QLEKISLRASEE
Subjt:  RSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

KAG6576923.1 hypothetical protein SDJN03_24497, partial [Cucurbita argyrosperma subsp. sororia]2.3e-18588.68Show/hide
Query:  YSLVGDSEGFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACV
        Y  VGDSEGF+SNSAI E+  +KML RKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLK LLSLKINK EFDKFCIQI+GRE+IPLHNR I+AIL+NACV
Subjt:  YSLVGDSEGFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACV

Query:  AKTPPVLSSSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEM
        AKTPPVL S+RKV  +LSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPV+RDRKIRDRPSPLGPCGK QNIALEE AFKAQEQQSATELHSLGSRPPV+M
Subjt:  AKTPPVLSSSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEM

Query:  ASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLK
        ASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GSGKTLSN+SVG RN HVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLK
Subjt:  ASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLK

Query:  RLIEPCLNFSRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        RLIEPCL+FSRSRCERL+FT NQPITGSR A +EQ+ HRAQRL N SLLDFRVAM+LNPEVLG+DW  QLEKISLRASEE
Subjt:  RLIEPCLNFSRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

KAG7014950.1 hypothetical protein SDJN02_22581, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-18488.42Show/hide
Query:  YSLVGDSEGFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACV
        Y  VGDSEGF+SNSAI E+  +KML RKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLK LLSLKINK EFDKFCIQI+GRE+IPLHNR I+AIL+NACV
Subjt:  YSLVGDSEGFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACV

Query:  AKTPPVLSSSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEM
        AKTPPVL S+RKV  +LSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRD KIRDRPSPLGPCGK QNIA+EE AFKAQEQQSATELHSLGSRPPV+M
Subjt:  AKTPPVLSSSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEM

Query:  ASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLK
        ASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNF+GSGKTLSN+SVG RN HVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLK
Subjt:  ASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLK

Query:  RLIEPCLNFSRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        RLIEPCL+FSRSRCERL+FT NQPITGSR A +EQ+ HRAQRL N SLLDFRVAM+LNPEVLG+DW  QLEKISLRASEE
Subjt:  RLIEPCLNFSRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

KAG7030952.1 hypothetical protein SDJN02_04989 [Cucurbita argyrosperma subsp. argyrosperma]1.3e-18088.95Show/hide
Query:  GFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLS
        GFIS+SAI E+T +KM+PRKDSSRIDTSELKAMIY+KLGHQRSEKYFDQLK LLSLKINK EFDKFCIQI+GREVIPLHNRLIRAILRNACVAK PPVLS
Subjt:  GFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLS

Query:  SSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEE
        SSRK G +LSVKVV+GYQ+SCLQSL+GDAFLSSPRKGRSP+SRDRKIRDRPSPLGPCGK+QNI LEE AFKAQEQQSATELHSLGSRPPVEMASVEDGEE
Subjt:  SSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEE

Query:  VEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNF
        VEQ+A SPGVQSRSPVTAP GISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNF
Subjt:  VEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNF

Query:  SRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        SRSRCER    GNQPIT SR    E+HWHR+QRLNN SLLDF  AM+LNPEVLGKD TIQLEKISLRA EE
Subjt:  SRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

XP_004152270.1 uncharacterized protein LOC101211126 [Cucumis sativus]2.0e-18191.57Show/hide
Query:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN
        MLPRKD+SRIDTSELKAMIYRKLGHQRS+KYFDQLK LLSLK NK EFDKFCIQI+GRE+IPLHNRLIRAIL+NACVAKTPPVLSS+RKVGG+LSVKVVN
Subjt:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN

Query:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP
        GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGK QN+ALEEFA KAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP
Subjt:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQP
        VTAPLGISMNFIGSGKTLSNV VG NYHVTTCQ+ GELPDTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQP
Subjt:  VTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQP

Query:  ITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        ITGSR  FQEQH HRAQ+LNN SLLDFRVAM+LNP+VLG++WT+QLEKISLRASEE
Subjt:  ITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

TrEMBL top hitse value%identityAlignment
A0A0A0KWF9 Uncharacterized protein2.0e-18790.05Show/hide
Query:  EGFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVL
        EGF+SNSAI  +  +KMLPRKD+SRIDTSELKAMIYRKLGHQRS+KYFDQLK LLSLK NK EFDKFCIQI+GRE+IPLHNRLIRAIL+NACVAKTPPVL
Subjt:  EGFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVL

Query:  SSSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGE
        SS+RKVGG+LSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGK QN+ALEEFA KAQEQQSATELHSLGSRPPVEMASVEDGE
Subjt:  SSSRKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGE

Query:  EVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLN
        EVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNV VG NYHVTTCQ+ GELPDTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCLN
Subjt:  EVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLN

Query:  FSRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        FSRSRCERLKFTGNQPITGSR  FQEQH HRAQ+LNN SLLDFRVAM+LNP+VLG++WT+QLEKISLRASEE
Subjt:  FSRSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

A0A1S3BYK6 uncharacterized protein LOC1034947997.0e-18090.76Show/hide
Query:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN
        M PRKD+SRIDTSELKAMIYRKLGHQRS+KYFDQLK LLSLK NK EFDKFCIQI+GRE+IPLHNRLIRAIL+NACVAKTPPVLSS+RKVGG+LSVKVVN
Subjt:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN

Query:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP
        GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGK QN+ALEEFA KAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP
Subjt:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFIGSGKTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ
        VTAPLGISMNFIGS KTLSNV V GRNYHVTTCQ+GGELPDTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ
Subjt:  VTAPLGISMNFIGSGKTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ

Query:  PITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        PITGSR  FQEQ+ HRAQ++NN SLLDFRVAM+LNP+VLG++WT+QLEKISLRASEE
Subjt:  PITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

A0A5A7TT05 Transcriptional coactivator Hfi1/Transcriptional adapter 11.9e-20680.64Show/hide
Query:  MTKMEKKEVSDWGVNGGAAGKEDLRDGFEWKLKSFEQIEGELARDSQTCSESERESTADVAR-----------QIGFWISDERFLAL-SAYSLVGDSEGF
        M   EKK+   WGVN G+ GKEDLRDGF+WKLKSFEQ E E+  DSQTC ES RES+ADV R             GF  SD +   +   +      EGF
Subjt:  MTKMEKKEVSDWGVNGGAAGKEDLRDGFEWKLKSFEQIEGELARDSQTCSESERESTADVAR-----------QIGFWISDERFLAL-SAYSLVGDSEGF

Query:  ISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSS
        +SNS I E+  +KM PRKD+SRIDTSELKAMIYRKLGHQRS+KYFDQLK LLSLK NK EFDKFCIQI+GRE+IPLHNRLIRAIL+NACVAKTPPVLSS+
Subjt:  ISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSS

Query:  RKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVE
        RKVGG+LSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGK QN+ALEEFA KAQEQQSATELHSLGSRPPVEMASVEDGEEVE
Subjt:  RKVGGSLSVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVE

Query:  QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFS
        QVAGSPGVQSRSPVTAPLGISMNFIGS KTLSNV V GRNYHVTTCQ+GGELPDTRLLRTHL++KLE EQIDISVDGVNLLNNALDVYLKRLIEPCLNFS
Subjt:  QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFS

Query:  RSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        RSRCERLKFTGNQPITGSR  FQEQ+ HRAQ++NN SLLDFRVAM+LNP+VLG++WT+QLEKISLRASEE
Subjt:  RSRCERLKFTGNQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

A0A6J1E3E4 uncharacterized protein LOC1114304591.8e-17590.2Show/hide
Query:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN
        ML RKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLK LLSLKINK EFDKFCIQI+GRE+IPLHNR I+AIL+NACVAKTPPVL S+RKV  +LSVKVVN
Subjt:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN

Query:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP
        GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGK QNIALEE AFKAQEQQSATELHSLGSRPPV+MASVEDGEEVEQVAGSPGVQSRSP
Subjt:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ
        VTAPLGISMNF+GSGKTLSN+SVG RN  VTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCL+FSRSRCER +FT NQ
Subjt:  VTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ

Query:  PITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        PITGSR A +EQ+ HRAQRL N SLLDFRVAM+LNPEVLG+DWT QLEKISLRASEE
Subjt:  PITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

A0A6J1J7C3 uncharacterized protein LOC1114832436.5e-17890.76Show/hide
Query:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN
        MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLK LLSLKINK EFDKFCIQI+GRE+IPLHNR I+AIL+NACVAKTPPVLSS+RKV  +LSVKVVN
Subjt:  MLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVN

Query:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP
        GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGK QNIALEE AFKAQEQQSATELHSLGSRPPV+MASVEDGEEVEQVAGSPGVQSRSP
Subjt:  GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSP

Query:  VTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ
        VTAPLGISMNF+GSGKTLSN+SVG RN HVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALD+YLKRLIEPCL+FS+SRCER +FT NQ
Subjt:  VTAPLGISMNFIGSGKTLSNVSVG-RNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ

Query:  PITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
        PITGSR A QEQ+ HRAQRL N SLLDFRVAM+LNPEVLG+DWT QLEKISLRASEE
Subjt:  PITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.1e-7146Show/hide
Query:  SRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNGYQRSCL
        SR+++ E+KA+IY+K+GHQR++ YFDQL   L+ +I+KSEFDK C + VGRE I LHNRL+R+IL+NA VAK+PP                     R   
Subjt:  SRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNGYQRSCL

Query:  QSLHGD-AFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLG
        +SL+GD  F  SPRK RS     RK RDRPSPLGP GK Q++             +  E  S   R P+E+ SVEDGEEVEQ+ GSP VQSRSP+TAPLG
Subjt:  QSLHGD-AFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLG

Query:  ISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRN
        +S +     K+ +  S     +  TCQ+ GELPD   LR  L++KLEME I +S+D  NLLN  L+ Y++RLIEPCL+ +                    
Subjt:  ISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRN

Query:  AFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
                + + ++N S+LDF  AME+NP VLG++W IQLEKI  RASEE
Subjt:  AFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

AT2G24530.1 unknown protein4.4e-4133.58Show/hide
Query:  LPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVV--
        + R    RI   ELK  I +K G +RS +YF  L   LS K+ KSEFDK C++++GRE + LHN+LIR+ILRNA VAK+PP      + G S        
Subjt:  LPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVV--

Query:  --NGYQRSCL----QSLHGDAFLS-----SPRKGRSPVSRDRKIRDRPSPLGPCGKSQ--------------NIALEEFAFKAQEQQSATELHSLGSRPP
          +G ++S       S H   + +     SPRK RS + ++RK RDRPSPLG  GK +              ++ +E   ++   +  A E      R P
Subjt:  --NGYQRSCL----QSLHGDAFLS-----SPRKGRSPVSRDRKIRDRPSPLGPCGKSQ--------------NIALEEFAFKAQEQQSATELHSLGSRPP

Query:  VEMASVEDGEEVEQVAGSPGVQ---------SRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQID-ISVDGV
        VE   + + E++  V+               S SP+ APLGI       G +   + V  N  + +C + G LPD  +LR  ++     + ++ +S++  
Subjt:  VEMASVEDGEEVEQVAGSPGVQ---------SRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQID-ISVDGV

Query:  NLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPI--TGSRNAFQEQHWHR--------------AQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEK
          LNN LDVYLK+LI  C +   +R       G Q I    S+N      W                 Q  ++ S+LDFR AMELNP  LG+DW    E+
Subjt:  NLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPI--TGSRNAFQEQHWHR--------------AQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEK

Query:  ISLRASEE
        ISLR+ EE
Subjt:  ISLRASEE

AT4G31440.1 unknown protein3.9e-4234.29Show/hide
Query:  LPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNG
        + R    RID +ELK  I +K+G +RS +YF  L   LS K+ KSEFDK C +++GRE + LHN+LIR+ILRNA +AK+PP +  S   G SL +   +G
Subjt:  LPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNG

Query:  YQRSCL---QSLHGDAFLSSP--RKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFK-----AQEQQSA----TELHSLGSRPPVEMASVEDGEEVE
         + S       +  D  LS+    K R     DR IRD+P PLG  GK     L  FA+        E+ SA     E  ++  +  V      D E   
Subjt:  YQRSCL---QSLHGDAFLSSP--RKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFK-----AQEQQSA----TELHSLGSRPPVEMASVEDGEEVE

Query:  QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQI-DISVDGVNLLNNALDVYLKRLIEPCLNFS
        ++  +P      PV APLGI       G     V V  +    +C + G L DT +LR  ++     + +  +S +   +LNN LD+YLK+L++ C++ +
Subjt:  QVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQI-DISVDGVNLLNNALDVYLKRLIEPCLNFS

Query:  RSRCE---------RLKFTGNQPITGSR--NAFQEQHWHR----AQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
         +R             + + ++ + G R  N+F  Q  ++     +  ++ SLLDFRVAMELNP  LG+DW +  E+IS+   EE
Subjt:  RSRCE---------RLKFTGNQPITGSR--NAFQEQHWHR----AQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

AT4G33890.1 unknown protein4.3e-8952.92Show/hide
Query:  SSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNG--YQR
        SSR+DT E+KA+IYR++G+QR+E YF+QL    +LKI KSEFDK CI+ +GR+ I LHNRLIR+I++NAC+AK+PP +    K GGS  V+  NG   + 
Subjt:  SSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNG--YQR

Query:  SCLQSLHGD-AFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIAL--EEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVA-GSPGVQSRSP
        S +Q LHGD AF  S RK RS     RK+RDRPSPLGP GK  ++    EE   KA   QSATEL SLGSRPPVE+ SVE+GEEVEQ+A GSP VQSR P
Subjt:  SCLQSLHGD-AFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIAL--EEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVA-GSPGVQSRSP

Query:  VTAPLGISMNFIGSG--KTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTG
        +TAPLG+SM+       K++SNVS+  R+++  TCQN GELPDTR LR+ L+++LEME + I++D V+LLN+ LDV+++RLIEPCL+ + +RC       
Subjt:  VTAPLGISMNFIGSG--KTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTG

Query:  NQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
            T        Q+  +++RL+  S+ DFR  MELN E+LG+DW + +EKI  RAS++
Subjt:  NQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE

AT4G33890.2 unknown protein4.3e-8952.92Show/hide
Query:  SSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNG--YQR
        SSR+DT E+KA+IYR++G+QR+E YF+QL    +LKI KSEFDK CI+ +GR+ I LHNRLIR+I++NAC+AK+PP +    K GGS  V+  NG   + 
Subjt:  SSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSLSVKVVNG--YQR

Query:  SCLQSLHGD-AFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIAL--EEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVA-GSPGVQSRSP
        S +Q LHGD AF  S RK RS     RK+RDRPSPLGP GK  ++    EE   KA   QSATEL SLGSRPPVE+ SVE+GEEVEQ+A GSP VQSR P
Subjt:  SCLQSLHGD-AFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIAL--EEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVA-GSPGVQSRSP

Query:  VTAPLGISMNFIGSG--KTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTG
        +TAPLG+SM+       K++SNVS+  R+++  TCQN GELPDTR LR+ L+++LEME + I++D V+LLN+ LDV+++RLIEPCL+ + +RC       
Subjt:  VTAPLGISMNFIGSG--KTLSNVSV-GRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTG

Query:  NQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE
            T        Q+  +++RL+  S+ DFR  MELN E+LG+DW + +EKI  RAS++
Subjt:  NQPITGSRNAFQEQHWHRAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAATACGAAATTAACAAGGTAACATTTTTGAGAAAACCTGGGCGGCAAGGATGTGAAGAAAAGTCCAGGATGACGAAGATGGAGAAGAAAGAGGTTTCTGATTG
GGGCGTGAATGGTGGGGCGGCTGGGAAAGAAGATTTGAGAGATGGGTTTGAATGGAAGTTGAAATCTTTTGAGCAGATTGAAGGAGAGCTTGCAAGGGATTCTCAGACAT
GTTCGGAATCAGAGCGGGAAAGCACTGCAGATGTTGCTAGACAGATAGGTTTCTGGATTTCTGATGAAAGATTTCTTGCCCTTTCAGCCTACTCGTTGGTGGGGGATTCT
GAGGGCTTTATCTCAAATTCAGCCATTGGTGAAGTTACGGTCGTGAAAATGCTTCCGAGGAAAGACAGTTCTCGTATAGACACTTCAGAGCTGAAAGCGATGATCTATCG
GAAGCTTGGTCATCAGAGGTCGGAGAAATACTTTGATCAGCTCAAAAACTTGTTGAGTTTAAAGATTAACAAGAGTGAATTCGACAAGTTTTGCATTCAGATCGTTGGGA
GAGAGGTTATACCTCTTCATAATCGGCTTATCAGAGCAATTCTTCGAAATGCCTGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTTCTAGGAAAGTAGGAGGCAGTCTC
AGTGTAAAGGTTGTAAATGGGTATCAGAGGAGTTGTCTTCAATCACTTCATGGAGATGCATTCCTTTCTTCCCCTCGAAAGGGCAGGTCTCCGGTCAGTAGAGACCGCAA
GATTCGAGATCGCCCAAGTCCTCTGGGACCATGTGGGAAGTCCCAGAATATTGCACTTGAAGAATTTGCTTTCAAGGCACAAGAACAACAGAGTGCAACTGAGTTGCATT
CTCTTGGCAGTCGTCCGCCTGTCGAAATGGCGTCTGTAGAAGATGGGGAAGAGGTTGAGCAGGTGGCTGGGAGTCCTGGAGTTCAGAGCAGAAGCCCGGTTACTGCTCCA
CTTGGTATATCTATGAACTTCATTGGATCTGGTAAAACTCTGTCTAACGTATCTGTAGGAAGAAATTACCATGTAACAACATGTCAAAACGGTGGCGAGCTACCCGACAC
GAGGTTGCTAAGAACACATTTGAAGCAGAAGTTGGAAATGGAGCAGATTGATATATCTGTTGATGGTGTAAACCTTCTTAACAATGCACTGGATGTTTATTTAAAGAGGT
TAATCGAGCCATGTTTGAATTTCTCTCGGTCAAGGTGCGAGCGACTGAAATTTACAGGTAATCAACCAATAACTGGCTCAAGAAACGCATTCCAGGAACAACATTGGCAT
AGAGCTCAACGATTGAACAACACATCCTTGTTGGACTTTCGTGTTGCAATGGAACTGAATCCTGAAGTACTTGGGAAAGACTGGACGATACAGCTCGAGAAAATCAGTTT
ACGGGCTTCTGAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGAATACGAAATTAACAAGGTAACATTTTTGAGAAAACCTGGGCGGCAAGGATGTGAAGAAAAGTCCAGGATGACGAAGATGGAGAAGAAAGAGGTTTCTGATTG
GGGCGTGAATGGTGGGGCGGCTGGGAAAGAAGATTTGAGAGATGGGTTTGAATGGAAGTTGAAATCTTTTGAGCAGATTGAAGGAGAGCTTGCAAGGGATTCTCAGACAT
GTTCGGAATCAGAGCGGGAAAGCACTGCAGATGTTGCTAGACAGATAGGTTTCTGGATTTCTGATGAAAGATTTCTTGCCCTTTCAGCCTACTCGTTGGTGGGGGATTCT
GAGGGCTTTATCTCAAATTCAGCCATTGGTGAAGTTACGGTCGTGAAAATGCTTCCGAGGAAAGACAGTTCTCGTATAGACACTTCAGAGCTGAAAGCGATGATCTATCG
GAAGCTTGGTCATCAGAGGTCGGAGAAATACTTTGATCAGCTCAAAAACTTGTTGAGTTTAAAGATTAACAAGAGTGAATTCGACAAGTTTTGCATTCAGATCGTTGGGA
GAGAGGTTATACCTCTTCATAATCGGCTTATCAGAGCAATTCTTCGAAATGCCTGTGTGGCTAAAACTCCCCCTGTTCTTAGCAGTTCTAGGAAAGTAGGAGGCAGTCTC
AGTGTAAAGGTTGTAAATGGGTATCAGAGGAGTTGTCTTCAATCACTTCATGGAGATGCATTCCTTTCTTCCCCTCGAAAGGGCAGGTCTCCGGTCAGTAGAGACCGCAA
GATTCGAGATCGCCCAAGTCCTCTGGGACCATGTGGGAAGTCCCAGAATATTGCACTTGAAGAATTTGCTTTCAAGGCACAAGAACAACAGAGTGCAACTGAGTTGCATT
CTCTTGGCAGTCGTCCGCCTGTCGAAATGGCGTCTGTAGAAGATGGGGAAGAGGTTGAGCAGGTGGCTGGGAGTCCTGGAGTTCAGAGCAGAAGCCCGGTTACTGCTCCA
CTTGGTATATCTATGAACTTCATTGGATCTGGTAAAACTCTGTCTAACGTATCTGTAGGAAGAAATTACCATGTAACAACATGTCAAAACGGTGGCGAGCTACCCGACAC
GAGGTTGCTAAGAACACATTTGAAGCAGAAGTTGGAAATGGAGCAGATTGATATATCTGTTGATGGTGTAAACCTTCTTAACAATGCACTGGATGTTTATTTAAAGAGGT
TAATCGAGCCATGTTTGAATTTCTCTCGGTCAAGGTGCGAGCGACTGAAATTTACAGGTAATCAACCAATAACTGGCTCAAGAAACGCATTCCAGGAACAACATTGGCAT
AGAGCTCAACGATTGAACAACACATCCTTGTTGGACTTTCGTGTTGCAATGGAACTGAATCCTGAAGTACTTGGGAAAGACTGGACGATACAGCTCGAGAAAATCAGTTT
ACGGGCTTCTGAAGAGTGA
Protein sequenceShow/hide protein sequence
MMEYEINKVTFLRKPGRQGCEEKSRMTKMEKKEVSDWGVNGGAAGKEDLRDGFEWKLKSFEQIEGELARDSQTCSESERESTADVARQIGFWISDERFLALSAYSLVGDS
EGFISNSAIGEVTVVKMLPRKDSSRIDTSELKAMIYRKLGHQRSEKYFDQLKNLLSLKINKSEFDKFCIQIVGREVIPLHNRLIRAILRNACVAKTPPVLSSSRKVGGSL
SVKVVNGYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKSQNIALEEFAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAP
LGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQPITGSRNAFQEQHWH
RAQRLNNTSLLDFRVAMELNPEVLGKDWTIQLEKISLRASEE