CuGenDBv2

Spg008799 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008799
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: C2H2-type domain-containing protein
Genome location: scaffold10:34602364..34604784
RNA-Seq Expression: Spg008799
Synteny: Spg008799
Gene Ontology terms: NA
InterPro domains: IPR013087 - Zinc finger C2H2-type


Homology
GenBank top hits | e value | % identity
KAG6602074.1 hypothetical protein SDJN03_07307, partial [Cucurbita argyrosperma subsp. sororia] | 7.7e-217 | 93.06
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRALA SMGSKNNES SSSRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD GKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_022141510.1 uncharacterized protein LOC111011869 [Momordica charantia] | 2.8e-219 | 93.75
Query:  MASAMAK-AKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL
        MASAMAK AKTQRKL RQ  ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQQPEK QQQHQLPRKEKQQEQATEES+KNCKKMRCSGSLCSNTKVTHRL
Subjt:  MASAMAK-AKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL

Query:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
        EAA SPEVHKKRAL S+GS+NN+SSSS RSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
Subjt:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIP CNVC+IIKNGFKVA EA      KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EGSMEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_022990074.1 uncharacterized protein LOC111487079 [Cucurbita maxima] | 2.7e-217 | 93.06
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRAL +SMGSKNNES S SRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLG+TRDPSLRATI P
Subjt:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD GKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_023526533.1 uncharacterized protein LOC111789980 [Cucurbita pepo subsp. pepo] | 2.0e-217 | 93.29
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQQ    QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRALA SMGSKNNES SSSRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD GKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_038874395.1 uncharacterized protein LOC120067077 [Benincasa hispida] | 3.0e-221 | 93.76
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK K QRKL RQ+HENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQP+K   QHQLPRKEKQ+EQATEE SKNCKKM+CSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
        AAASPEVHKKRAL  SMGS+NNESSSSSRS KAPPLNEQ GVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATICP
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMK ETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDG-NDKRAMLVCRVIAGRVKKSSEGSMEDYDS
        RFHCTTL CSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAG+SSDG NDKRAMLVCRVIAGRVKKSSEG+MEDYDS
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDG-NDKRAMLVCRVIAGRVKKSSEGSMEDYDS

Query:  LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

TrEMBL top hits | e value | % identity
A0A1S3CQ33 uncharacterized protein LOC103503399 | 8.9e-211 | 90.78
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL R  HENQKP PQKAEKPPSWAVVRGI SCKYLQP      QQQQHQLPRKEK QEQATEE+ KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC
        AAASPEVHKKR L SMGS+NNESSSS+RSTK   LNEQ  GVLSAT SSLSA SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATIC
Subjt:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC

Query:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL
        PCPQCGEIFMK ETLELHQ+VRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNEL
Subjt:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL

Query:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD
        LRFHCTTL CSLG NGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD+GKGILTTATSGKAHDS G+SSD G+DKRAMLVCRVIAGRVKKS EGSMEDYD
Subjt:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD

Query:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A5A7TWE1 C2H2-like zinc finger protein | 8.9e-211 | 90.78
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL R  HENQKP PQKAEKPPSWAVVRGI SCKYLQP      QQQQHQLPRKEK QEQATEE+ KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC
        AAASPEVHKKR L SMGS+NNESSSS+RSTK   LNEQ  GVLSAT SSLSA SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATIC
Subjt:  AAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC

Query:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL
        PCPQCGEIFMK ETLELHQ+VRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNEL
Subjt:  PCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL

Query:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD
        LRFHCTTL CSLG NGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD+GKGILTTATSGKAHDS G+SSD G+DKRAMLVCRVIAGRVKKS EGSMEDYD
Subjt:  LRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSD-GNDKRAMLVCRVIAGRVKKSSEGSMEDYD

Query:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1CIA1 uncharacterized protein LOC111011869 | 1.4e-219 | 93.75
Query:  MASAMAK-AKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL
        MASAMAK AKTQRKL RQ  ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQQPEK QQQHQLPRKEKQQEQATEES+KNCKKMRCSGSLCSNTKVTHRL
Subjt:  MASAMAK-AKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRL

Query:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
        EAA SPEVHKKRAL S+GS+NN+SSSS RSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
Subjt:  EAAASPEVHKKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIP CNVC+IIKNGFKVA EA      KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EGSMEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1HNA2 uncharacterized protein LOC111464480 | 8.3e-217 | 92.82
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRALA SMGSKNNES SSSRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNG SFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAASPEVHKKRALA-SMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD GKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1JR26 uncharacterized protein LOC111487079 | 1.3e-217 | 93.06
Query:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK KTQRKL RQ+HENQKPK QKAEKPPSWAVVRGIFSCKYLQPQ     QQQQ+QLPRKEKQ EQATEESSKNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         AASP+ HKKRAL +SMGSKNNES S SRSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLG+TRDPSLRATI P
Subjt:  AAASPEVHKKRAL-ASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CP+CGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLCSSIP CNVC+IIKNGFKVAAEATGGD GKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G11490.1 zinc finger (C2H2 type) family protein | 3.6e-39 | 41.8
Query:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPI--CKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIAD
        +  C +C E     +  E H    H+V  L   D S+  VE+I  + +  K   +    I  I K+QN  + ++ FE+YR+ +K +A KL KKH RC+AD
Subjt:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPI--CKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIAD

Query:  GNELLRFHCTTLACSLGL-NGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKR----AMLVCRVIAGRVKK--
        GNE L FH TTL+C+LG  N SSNLC S   C VC+I+++GF   +  T  D  KG+LT +TS  A +S  I +D    R    A+++CRVIAGRV K  
Subjt:  GNELLRFHCTTLACSLGL-NGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKR----AMLVCRVIAGRVKK--

Query:  -SSEGSM--EDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY
         + E S+   ++DSLA   G  S ++ELY+ S KA+LPCFV+I+
Subjt:  -SSEGSM--EDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY

AT1G75710.1 C2H2-like zinc finger protein | 1.5e-85 | 44.52
Query:  QNHENQKPKPQKAEK-PPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCS-------NTKVTHRLEAAASPEVH
        Q H+ QKPK     K P SW  ++ + +CK ++  +  +        P K  Q   +   +    K      S+CS       NT+V HR  A  SP+V 
Subjt:  QNHENQKPKPQKAEK-PPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCS-------NTKVTHRLEAAASPEVH

Query:  KKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSL--SASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPQCGEI
                G+    +S +   T+ P  +  +   S T  S   +AS S  SS+  SFR M FR+  GCYEC M++DP    +R P +   +C C QCGE+
Subjt:  KKRALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSL--SASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPQCGEI

Query:  FMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTL
        F K E+LELHQ+VRHAVSELGPED+ +NIVEIIF+SSWLKK +PIC+IERILKV NT +TI +FE+ RD++KA+A +  +K  RC ADGNELLRFHCTTL
Subjt:  FMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTL

Query:  ACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGK-GILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKK------------SSEGSM
         CSLG  GSS+LCS++P C VC +I++GF+  +   G +    G+ TTA+SG+A D    S D   +R MLVCRVIAGRVK+              + ++
Subjt:  ACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGK-GILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKK------------SSEGSM

Query:  ED------------YDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY
        ED            +DS+A  AG+YSNL+EL V++P+AILPCFVVIY
Subjt:  ED------------YDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY

AT2G29660.1 zinc finger (C2H2 type) family protein | 1.2e-45 | 44.06
Query:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKK---QTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPK-----KH
        I PC  CGEIF K   LE H +++HAVSEL   ++S NIV+IIF+S W ++   ++P+  I RILK+ N+ K +++FEEYR+ +KAKA +          
Subjt:  ICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKK---QTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPK-----KH

Query:  PRCIADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDG-----NDKRAMLVCRVIAG
         RC+ADGNELLRF+C+T  C LG NG SNLC     C++C II +GF    +        GI T AT  + H +     +      N KRAMLVCRV+AG
Subjt:  PRCIADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDG-----NDKRAMLVCRVIAG

Query:  R----------VKKSSEGSMEDYDSLAGAAGMYSNL------DELYVFSPKAILPCFVVIY
        R          V KS  G    YDSL G +G  S        DEL VF+P+A+LPCFV++Y
Subjt:  R----------VKKSSEGSMEDYDSLAGAAGMYSNL------DELYVFSPKAILPCFVVIY

AT4G27240.1 zinc finger (C2H2 type) family protein | 5.6e-56 | 36.57
Query:  KAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQ------QEQATEESSKNCKKMRCSGSLCSNTKVTH-RL
        K K ++ + R+N + QK    K + P  W  ++    CK                +PR +K+      +   T        +  CS S+ +   V H   
Subjt:  KAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQ------QEQATEESSKNCKKMRCSGSLCSNTKVTH-RL

Query:  EAAASPEVHKKRALASMGSKN--------NESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDP
             P     R++ S    N        + S+   + T A        +   T  + S+S  S +S  AS      R   G ++ +   D    +  D 
Subjt:  EAAASPEVHKKRALASMGSKN--------NESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDP

Query:  SLRATICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRC
        S       C +CGE F K E  E H   +HAV+EL   D+S+ IVEII ++SWLK +    +I+RILKV N  KT+++FEEYRD++K +A+KL KKHPRC
Subjt:  SLRATICPCPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRC

Query:  IADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGI-SSDGNDKRAMLVCRVIAGRVKKSSE
        IADGNELLRFH TT+AC+LG+NGS++LCSS   C VC II+NGF    E    + G G+ T +TS +A +S  I    G D++A++VCRVIAGRV +  E
Subjt:  IADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGI-SSDGNDKRAMLVCRVIAGRVKKSSE

Query:  ------GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI
              G +  +DSLAG  G+Y+N++ELY+ + +A+LPCFV+I
Subjt:  ------GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI

AT5G54630.1 zinc finger protein-related | 1.7e-60 | 49.38
Query:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        C +CGE F K E  E H   +HAV+EL   D+S+ IVEII ++SWLK +    +I+R+LKV N  KT+++FEEYR+++K +A+KL KKHPRC+ADGNELL
Subjt:  CPQCGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISS-------DGNDKRAMLVCRVIAGRVKKSSE--
        RFH TT+AC LG+NGS+++C++   C VC II+NGF    E    + G G+ T +TSG+A +S  ++        D   ++ ++VCRVIAGRV +  E  
Subjt:  RFHCTTLACSLGLNGSSNLCSSIPPCNVCNIIKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISS-------DGNDKRAMLVCRVIAGRVKKSSE--

Query:  ----GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI
            G M  +DSLAG  G+Y+N++ELY+ +PKA+LPCFVVI
Subjt:  ----GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI


Sequences
CDS sequence
ATGGCATCTGCCATGGCCAAAGCCAAAACGCAGAGAAAGCTCCTTCGACAGAACCATGAAAACCAGAAACCGAAGCCGCAGAAGGCAGAGAAGCCGCCGTCTTGGGCAGT
TGTTAGAGGCATATTCAGTTGCAAGTACCTGCAGCCTCAACAGCAACCAGAAAAACAACAACAGCAACATCAATTGCCACGCAAAGAGAAACAACAGGAGCAAGCAACAG
AAGAGAGTAGCAAGAACTGTAAGAAAATGAGGTGTTCGGGTTCACTCTGTAGCAACACTAAGGTAACACACAGGCTTGAAGCAGCAGCATCGCCGGAAGTTCACAAGAAA
AGGGCATTGGCTTCAATGGGTTCCAAAAACAATGAGTCTTCGAGTTCAAGTAGATCCACGAAAGCTCCTCCTTTGAATGAACAAAATGGTGTTCTATCAGCCACATGTTC
ATCATTATCTGCATCATCTTCTTCTAATTCCTCGAATGGTGCCTCTTTCAGGGGAATGCCTTTCAGGAGGTTCTATGGCTGTTATGAATGTAAAATGGTGATTGACCCTG
TCCTTGGGATGACTAGAGATCCTTCTCTCAGAGCAACCATTTGCCCTTGTCCTCAATGTGGTGAGATTTTCATGAAAGCTGAAACTTTGGAGCTTCACCAGAGTGTTAGA
CATGCAGTGTCTGAACTTGGTCCTGAAGACACGAGCAAGAACATAGTGGAAATCATATTCCAGTCGAGCTGGCTGAAAAAGCAAACACCAATTTGCAAAATTGAAAGGAT
TCTCAAAGTCCAGAACACTCCAAAGACCATCTCGAAATTCGAGGAATACAGGGACTCCATTAAAGCCAAAGCCACAAAGCTTCCAAAGAAGCACCCACGTTGCATAGCAG
ACGGCAACGAACTGCTAAGGTTCCACTGCACCACCTTGGCCTGTTCGCTGGGCCTAAATGGCTCCTCTAATCTCTGCAGTTCAATCCCACCATGCAATGTCTGTAACATA
ATCAAGAATGGGTTCAAGGTGGCCGCAGAGGCCACCGGAGGGGACACCGGAAAGGGCATTCTAACAACGGCAACAAGTGGAAAGGCTCATGACAGTGCCGGAATATCATC
GGACGGGAATGACAAACGAGCAATGCTGGTTTGCCGGGTAATAGCTGGCAGGGTGAAGAAGAGTTCAGAAGGCAGCATGGAAGATTACGACTCATTGGCCGGAGCAGCAG
GCATGTATTCCAATTTGGATGAGTTATACGTATTCAGTCCCAAGGCAATATTGCCTTGTTTTGTTGTGATTTATGGAGGGTTTTAA
mRNA sequence
ATGGCATCTGCCATGGCCAAAGCCAAAACGCAGAGAAAGCTCCTTCGACAGAACCATGAAAACCAGAAACCGAAGCCGCAGAAGGCAGAGAAGCCGCCGTCTTGGGCAGT
TGTTAGAGGCATATTCAGTTGCAAGTACCTGCAGCCTCAACAGCAACCAGAAAAACAACAACAGCAACATCAATTGCCACGCAAAGAGAAACAACAGGAGCAAGCAACAG
AAGAGAGTAGCAAGAACTGTAAGAAAATGAGGTGTTCGGGTTCACTCTGTAGCAACACTAAGGTAACACACAGGCTTGAAGCAGCAGCATCGCCGGAAGTTCACAAGAAA
AGGGCATTGGCTTCAATGGGTTCCAAAAACAATGAGTCTTCGAGTTCAAGTAGATCCACGAAAGCTCCTCCTTTGAATGAACAAAATGGTGTTCTATCAGCCACATGTTC
ATCATTATCTGCATCATCTTCTTCTAATTCCTCGAATGGTGCCTCTTTCAGGGGAATGCCTTTCAGGAGGTTCTATGGCTGTTATGAATGTAAAATGGTGATTGACCCTG
TCCTTGGGATGACTAGAGATCCTTCTCTCAGAGCAACCATTTGCCCTTGTCCTCAATGTGGTGAGATTTTCATGAAAGCTGAAACTTTGGAGCTTCACCAGAGTGTTAGA
CATGCAGTGTCTGAACTTGGTCCTGAAGACACGAGCAAGAACATAGTGGAAATCATATTCCAGTCGAGCTGGCTGAAAAAGCAAACACCAATTTGCAAAATTGAAAGGAT
TCTCAAAGTCCAGAACACTCCAAAGACCATCTCGAAATTCGAGGAATACAGGGACTCCATTAAAGCCAAAGCCACAAAGCTTCCAAAGAAGCACCCACGTTGCATAGCAG
ACGGCAACGAACTGCTAAGGTTCCACTGCACCACCTTGGCCTGTTCGCTGGGCCTAAATGGCTCCTCTAATCTCTGCAGTTCAATCCCACCATGCAATGTCTGTAACATA
ATCAAGAATGGGTTCAAGGTGGCCGCAGAGGCCACCGGAGGGGACACCGGAAAGGGCATTCTAACAACGGCAACAAGTGGAAAGGCTCATGACAGTGCCGGAATATCATC
GGACGGGAATGACAAACGAGCAATGCTGGTTTGCCGGGTAATAGCTGGCAGGGTGAAGAAGAGTTCAGAAGGCAGCATGGAAGATTACGACTCATTGGCCGGAGCAGCAG
GCATGTATTCCAATTTGGATGAGTTATACGTATTCAGTCCCAAGGCAATATTGCCTTGTTTTGTTGTGATTTATGGAGGGTTTTAA
Protein sequence
MASAMAKAKTQRKLLRQNHENQKPKPQKAEKPPSWAVVRGIFSCKYLQPQQQPEKQQQQHQLPRKEKQQEQATEESSKNCKKMRCSGSLCSNTKVTHRLEAAASPEVHKK
RALASMGSKNNESSSSSRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPQCGEIFMKAETLELHQSVR
HAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTLACSLGLNGSSNLCSSIPPCNVCNI
IKNGFKVAAEATGGDTGKGILTTATSGKAHDSAGISSDGNDKRAMLVCRVIAGRVKKSSEGSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
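
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, not part of the CuGenDB record: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence can be pasted in as-is. Codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 nucleotides of the CDS sequence shown above.
    print(translate("ATGGCATCTGCCATGGCC"))
```

Run on the first 18 nucleotides of the CDS, this prints `MASAMA`, which matches the start of the protein sequence. Pasting the full CDS into `translate` and comparing the result with the protein sequence is one way to confirm that the two entries correspond.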