; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029269 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029269
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionC2H2-type domain-containing protein
Genome locationchr8:37064246..37065644
RNA-Seq ExpressionLag0029269
SyntenyLag0029269
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602074.1 hypothetical protein SDJN03_07307, partial [Cucurbita argyrosperma subsp. sororia]2.7e-21793.29Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRALA SMGSKNNES SSSRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_022141510.1 uncharacterized protein LOC111011869 [Momordica charantia]9.8e-22093.98Show/hide
Query:  MASAMAK-AKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL
        MASAMAK AKTQRKL RQ  ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQQPEK QQQHQLPRKEKQQEQATEES+KNCKKMRCSGSLCSNTKVTHRL
Subjt:  MASAMAK-AKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL

Query:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
        EAA SPEVHKKRAL S+GS+NN+SSSS RSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
Subjt:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIP CNVC+IIKNGFKVA EA    A KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EGSMEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_022990074.1 uncharacterized protein LOC111487079 [Cucurbita maxima]9.1e-21893.29Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRAL +SMGSKNNES S SRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLG+TRDPSLRATI P
Subjt:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_023526533.1 uncharacterized protein LOC111789980 [Cucurbita pepo subsp. pepo]5.4e-21893.52Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQQ    QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRALA SMGSKNNES SSSRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_038874395.1 uncharacterized protein LOC120067077 [Benincasa hispida]1.2e-22093.53Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK K QRKL RQ+HENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQP+K   QHQLPRKEKQ+EQATEE SKNCKKM+CSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
        AAASPEVHKKRAL  SMGS+NNESSSSSRS KAPPLNEQ GVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATICP
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMK ETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDG-NDKRAMLVCRVIAGRVKKSSEGSMEDYDS
        RFHCTTL CSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD GKGILTTATSGKAHDSAG+SSDG NDKRAMLVCRVIAGRVKKSSEG+MEDYDS
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDG-NDKRAMLVCRVIAGRVKKSSEGSMEDYDS

Query:  LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

TrEMBL top hitse value%identityAlignment
A0A0A0KN18 C2H2-type domain-containing protein5.2e-21191.01Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        M SAMAK KTQRKL R  HENQKP P KAEKPPSWAVVRGI SCKYLQP      QQQQHQLPRKEK QEQATEE+ KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC
        AAASPEVHKKRAL SMGS+NNESSSS+RSTKA  LNEQ  GVLSAT SSLSA SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATIC
Subjt:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC

Query:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL
        PCPQCGEIFMK ETLELHQ+VRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNEL
Subjt:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL

Query:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD
        LRFHCTTL CSLG NGSSNLCSSIP CNVC+IIKNGFK+AAEATGGDAGKGILTTATSGKAHDS G+SSD GN+KRAMLVCRVIAGRVKKSSEGSMEDYD
Subjt:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD

Query:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A5A7TWE1 C2H2-like zinc finger protein6.8e-21190.78Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL R  HENQKP PQKAEKPPSWAVVRGI SCKYLQP      QQQQHQLPRKEK QEQATEE+ KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC
        AAASPEVHKKR L SMGS+NNESSSS+RSTK   LNEQ  GVLSAT SSLSA SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATIC
Subjt:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC

Query:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL
        PCPQCGEIFMK ETLELHQ+VRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNEL
Subjt:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL

Query:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD
        LRFHCTTL CSLG NGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD+GKGILTTATSGKAHDS G+SSD G+DKRAMLVCRVIAGRVKKS EGSMEDYD
Subjt:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD

Query:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1CIA1 uncharacterized protein LOC1110118694.7e-22093.98Show/hide
Query:  MASAMAK-AKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL
        MASAMAK AKTQRKL RQ  ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQQPEK QQQHQLPRKEKQQEQATEES+KNCKKMRCSGSLCSNTKVTHRL
Subjt:  MASAMAK-AKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL

Query:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
        EAA SPEVHKKRAL S+GS+NN+SSSS RSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
Subjt:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIP CNVC+IIKNGFKVA EA    A KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EGSMEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1HNA2 uncharacterized protein LOC1114644802.9e-21793.06Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRALA SMGSKNNES SSSRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNG SFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1JR26 uncharacterized protein LOC1114870794.4e-21893.29Show/hide
Query:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRAL +SMGSKNNES S SRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLG+TRDPSLRATI P
Subjt:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11490.1 zinc finger (C2H2 type) family protein2.1e-3941.8Show/hide
Query:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPI--CKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIAD
        +  C +C E     +  E H    H+V  L   D S+  VE+I  + +  K   +    I  I K+QN  + ++ FE+YR+ +K +A KL KKH RC+AD
Subjt:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPI--CKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIAD

Query:  GNELLRFHCTTLACSLGL-NGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKR----AMLVCRVIAGRVKK--
        GNE L FH TTL+C+LG  N SSNLC S   C VC+I+++GF   +  T  D  KG+LT +TS  A +S  I +D    R    A+++CRVIAGRV K  
Subjt:  GNELLRFHCTTLACSLGL-NGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKR----AMLVCRVIAGRVKK--

Query:  -SSEGSM--EDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY
         + E S+   ++DSLA   G  S ++ELY+ S KA+LPCFV+I+
Subjt:  -SSEGSM--EDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY

AT1G75710.1 C2H2-like zinc finger protein6.8e-8644.74Show/hide
Query:  QNHENQKPKPQKAEK-PPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCS-------NTKVTHRLEAAASPEVH
        Q H+ QKPK     K P SW  ++ + +CK ++  +  +        P K  Q   +   +    K      S+CS       NT+V HR  A  SP+V 
Subjt:  QNHENQKPKPQKAEK-PPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCS-------NTKVTHRLEAAASPEVH

Query:  KKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSL--SASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPQCGEI
                G+    +S +   T+ P  +  +   S T  S   +AS S  SS+  SFR M FR+  GCYEC M++DP    +R P +   +C C QCGE+
Subjt:  KKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSL--SASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPQCGEI

Query:  FMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTL
        F K E+LELHQ+VRHAVSELGPED+ +NIVEIIF+SSWLKK +PIC+IERILKV NT +TI +FE+ RD++KA+A +  +K  RC ADGNELLRFHCTTL
Subjt:  FMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTL

Query:  ACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGD-AGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKK------------SSEGSM
         CSLG  GSS+LCS++P C VC +I++GF+  +   G + A  G+ TTA+SG+A D    S D   +R MLVCRVIAGRVK+              + ++
Subjt:  ACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGD-AGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKK------------SSEGSM

Query:  ED------------YDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY
        ED            +DS+A  AG+YSNL+EL V++P+AILPCFVVIY
Subjt:  ED------------YDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY

AT2G29660.1 zinc finger (C2H2 type) family protein1.2e-4544.06Show/hide
Query:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKK---QTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPK-----KH
        I PC  CGEIF K   LE H +++HAVSEL   ++S NIV+IIF+S W ++   ++P+  I RILK+ N+ K +++FEEYR+ +KAKA +          
Subjt:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKK---QTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPK-----KH

Query:  PRCIADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDG-----NDKRAMLVCRVIAG
         RC+ADGNELLRF+C+T  C LG NG SNLC     C++C II +GF    +        GI T AT  + H +     +      N KRAMLVCRV+AG
Subjt:  PRCIADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDG-----NDKRAMLVCRVIAG

Query:  R----------VKKSSEGSMEDYDSLAGAAGMYSNL------DELYVFSPKAILPCFVVIY
        R          V KS  G    YDSL G +G  S        DEL VF+P+A+LPCFV++Y
Subjt:  R----------VKKSSEGSMEDYDSLAGAAGMYSNL------DELYVFSPKAILPCFVVIY

AT4G27240.1 zinc finger (C2H2 type) family protein1.2e-5536.57Show/hide
Query:  KAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQ------QEQATEESSKNCKKMRCSGSLCSNTKVTH-RL
        K K ++ + R+N + QK    K + P  W  ++    CK                +PR +K+      +   T        +  CS S+ +   V H   
Subjt:  KAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQ------QEQATEESSKNCKKMRCSGSLCSNTKVTH-RL

Query:  EAAASPEVHKKRALASMGSKN--------NESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDP
             P     R++ S    N        + S+   + T A        +   T  + S+S  S +S  AS      R   G ++ +   D    +  D 
Subjt:  EAAASPEVHKKRALASMGSKN--------NESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDP

Query:  SLRATICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRC
        S       C +CGE F K E  E H   +HAV+EL   D+S+ IVEII ++SWLK +    +I+RILKV N  KT+++FEEYRD++K +A+KL KKHPRC
Subjt:  SLRATICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRC

Query:  IADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGI-SSDGNDKRAMLVCRVIAGRVKKSSE
        IADGNELLRFH TT+AC+LG+NGS++LCSS   C VC II+NGF    E      G G+ T +TS +A +S  I    G D++A++VCRVIAGRV +  E
Subjt:  IADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGI-SSDGNDKRAMLVCRVIAGRVKKSSE

Query:  ------GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI
              G +  +DSLAG  G+Y+N++ELY+ + +A+LPCFV+I
Subjt:  ------GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI

AT5G54630.1 zinc finger protein-related2.9e-6049.38Show/hide
Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        C +CGE F K E  E H   +HAV+EL   D+S+ IVEII ++SWLK +    +I+R+LKV N  KT+++FEEYR+++K +A+KL KKHPRC+ADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISS-------DGNDKRAMLVCRVIAGRVKKSSE--
        RFH TT+AC LG+NGS+++C++   C VC II+NGF    E      G G+ T +TSG+A +S  ++        D   ++ ++VCRVIAGRV +  E  
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISS-------DGNDKRAMLVCRVIAGRVKKSSE--

Query:  ----GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI
            G M  +DSLAG  G+Y+N++ELY+ +PKA+LPCFVVI
Subjt:  ----GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTGCCATGGCCAAAGCCAAAACGCAGAGAAAGCTCATCAGACAGAACCATGAAAACCAGAAGCCGAAGCCGCAGAAGGCAGAGAAGCCGCCGTCTTGGGCAGT
TGTTAGAGGCATATTCAGTTGCAAGTACCTGCAGCCTCAACAGCAACCAGAAAAACAACAACAGCAACATCAATTGCCACGCAAAGAGAAACAACAGGAGCAAGCAACAG
AAGAGAGTAGCAAGAACTGTAAGAAAATGAGGTGTTCGGGTTCACTCTGTAGCAACACTAAGGTAACACACAGGCTTGAAGCAGCAGCATCGCCGGAAGTTCACAAGAAA
AGGGCATTGGCTTCAATGGGTTCCAAAAACAATGAGTCTTCGAGTTCAAGTAGATCCACGAAAGCTCCTCCTTTGAATGAACAAAATGGTGTTCTATCAGCCACATGTTC
ATCATTATCTGCATCATCGTCTTCTAATTCCTCCAATGGTGCCTCTTTCAGGGGAATGCCTTTCAGGAGGTTCTATGGCTGTTATGAATGTAAAATGGTGATTGACCCTG
TTCTTGGGATGACTAGAGATCCTTCACTTAGAGCAACCATTTGCCCTTGTCCTCAATGTGGTGAGATTTTCATGAAAGCTGAAACTTTGGAGCTTCACCAGAGTGTTAGA
CATGCAGTGTCTGAACTTGGTCCTGAAGACACGAGCAAGAACATAGTGGAAATCATATTCCAGTCGAGCTGGCTGAAAAAGCAAACTCCAATTTGCAAAATTGAAAGGAT
TCTCAAAGTCCAGAACACTCCAAAGACCATCTCGAAATTCGAGGAATACAGGGACTCCATTAAAGCCAAAGCCACAAAGCTTCCAAAGAAGCACCCACGTTGCATAGCAG
ACGGCAACGAACTGCTAAGGTTCCACTGCACCACCTTGGCCTGTTCGCTGGGCCTAAATGGCTCCTCTAATCTCTGCAGTTCAATCCCACCATGCAATGTCTGTAACATA
ATCAAGAATGGGTTCAAGGTGGCCGCAGAGGCCACCGGAGGGGACGCCGGAAAGGGCATTCTAACAACGGCAACAAGTGGAAAGGCTCATGACAGTGCCGGAATATCATC
GGACGGGAATGACAAACGAGCAATGCTGGTTTGCCGGGTAATAGCTGGCAGGGTGAAGAAGAGTTCAGAAGGCAGCATGGAAGATTACGACTCATTGGCCGGAGCAGCAG
GCATGTATTCCAATTTGGATGAGTTATACGTATTCAGTCCCAAGGCAATATTGCCCTGTTTTGTTGTGATTTATGGAGGGTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCTGCCATGGCCAAAGCCAAAACGCAGAGAAAGCTCATCAGACAGAACCATGAAAACCAGAAGCCGAAGCCGCAGAAGGCAGAGAAGCCGCCGTCTTGGGCAGT
TGTTAGAGGCATATTCAGTTGCAAGTACCTGCAGCCTCAACAGCAACCAGAAAAACAACAACAGCAACATCAATTGCCACGCAAAGAGAAACAACAGGAGCAAGCAACAG
AAGAGAGTAGCAAGAACTGTAAGAAAATGAGGTGTTCGGGTTCACTCTGTAGCAACACTAAGGTAACACACAGGCTTGAAGCAGCAGCATCGCCGGAAGTTCACAAGAAA
AGGGCATTGGCTTCAATGGGTTCCAAAAACAATGAGTCTTCGAGTTCAAGTAGATCCACGAAAGCTCCTCCTTTGAATGAACAAAATGGTGTTCTATCAGCCACATGTTC
ATCATTATCTGCATCATCGTCTTCTAATTCCTCCAATGGTGCCTCTTTCAGGGGAATGCCTTTCAGGAGGTTCTATGGCTGTTATGAATGTAAAATGGTGATTGACCCTG
TTCTTGGGATGACTAGAGATCCTTCACTTAGAGCAACCATTTGCCCTTGTCCTCAATGTGGTGAGATTTTCATGAAAGCTGAAACTTTGGAGCTTCACCAGAGTGTTAGA
CATGCAGTGTCTGAACTTGGTCCTGAAGACACGAGCAAGAACATAGTGGAAATCATATTCCAGTCGAGCTGGCTGAAAAAGCAAACTCCAATTTGCAAAATTGAAAGGAT
TCTCAAAGTCCAGAACACTCCAAAGACCATCTCGAAATTCGAGGAATACAGGGACTCCATTAAAGCCAAAGCCACAAAGCTTCCAAAGAAGCACCCACGTTGCATAGCAG
ACGGCAACGAACTGCTAAGGTTCCACTGCACCACCTTGGCCTGTTCGCTGGGCCTAAATGGCTCCTCTAATCTCTGCAGTTCAATCCCACCATGCAATGTCTGTAACATA
ATCAAGAATGGGTTCAAGGTGGCCGCAGAGGCCACCGGAGGGGACGCCGGAAAGGGCATTCTAACAACGGCAACAAGTGGAAAGGCTCATGACAGTGCCGGAATATCATC
GGACGGGAATGACAAACGAGCAATGCTGGTTTGCCGGGTAATAGCTGGCAGGGTGAAGAAGAGTTCAGAAGGCAGCATGGAAGATTACGACTCATTGGCCGGAGCAGCAG
GCATGTATTCCAATTTGGATGAGTTATACGTATTCAGTCCCAAGGCAATATTGCCCTGTTTTGTTGTGATTTATGGAGGGTTTTAA
Protein sequenceShow/hide protein sequence
MASAMAKAKTQRKLIRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLEAAASPEVHKK
RALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPQCGEIFMKAETLELHQSVR
HAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNI
IKNGFKVAAEATGGDAGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF