CuGenDBv2

MS015658 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS015658
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: C2H2-type domain-containing protein
Genome location: scaffold983:93985..95357
RNA-Seq Expression: MS015658
Synteny: MS015658
Gene Ontology terms: NA
InterPro domains: IPR013087 - Zinc finger C2H2-type


Homology
GenBank top hits (e value, % identity, alignment)
XP_022141510.1 uncharacterized protein LOC111011869 [Momordica charantia]; e value 5.1e-237; identity 100%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPC
        AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPC
Subjt:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPC

Query:  PECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLR
        PECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLR
Subjt:  PECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLR

Query:  FHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSLAGAAG
        FHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSLAGAAG
Subjt:  FHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSLAGAAG

Query:  MYSNLDELYVFSPKAILPCFVVIYGGF
        MYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  MYSNLDELYVFSPKAILPCFVVIYGGF

XP_022964469.1 uncharacterized protein LOC111464480 [Cucurbita moschata]; e value 1.2e-206; identity 89.81%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK  KTQRKLTRQ+ ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQ    QQQ+QLPRKEKQ EQATEES+KNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         A SP+ HKKRAL +S+GS+NN+S SS RSTKAPPLNEQNGVLSAT SSLSASSSSNSSNG SFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CPECGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIPQCNVCSIIKNGFKVA EA    A KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_022990074.1 uncharacterized protein LOC111487079 [Cucurbita maxima]; e value 8.5e-208; identity 90.28%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK  KTQRKLTRQ+ ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQ    QQQ+QLPRKEKQ EQATEES+KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         A SP+ HKKRAL +S+GS+NN+S S  RSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLG+TRDPSLRATI P
Subjt:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIPQCNVCSIIKNGFKVA EA    A KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_023526533.1 uncharacterized protein LOC111789980 [Cucurbita pepo subsp. pepo]; e value 1.1e-207; identity 90.28%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK  KTQRKLTRQ+ ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQQ   QQQ+QLPRKEKQ EQATEES+KNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         A SP+ HKKRAL +S+GS+NN+S SS RSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CPECGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIPQCNVCSIIKNGFKVA EA    A KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

XP_038874395.1 uncharacterized protein LOC120067077 [Benincasa hispida]; e value 3.7e-211; identity 90.99%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK  K QRKL+RQ+ ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQQP+K  QHQLPRKEKQ+EQATEE +KNCKKM+CSGSLCSNTKVTHRLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRALT-SVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
        AA SPEVHKKRALT S+GSRNN+SSSS RS KAPPLNEQ GVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATICP
Subjt:  AAPSPEVHKKRALT-SVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CPECGEIFMK ETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNELL
Subjt:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSD-SNDKRAMLVCRVIAGRVKKSLEGSMEDYDS
        RFHCTTL CSLGLNGSSNLC+SIPQCNVCSIIKNGFKVA EA      KGILTTATSGKAHDSAG+SSD SNDKRAMLVCRVIAGRVKKS EG+MEDYDS
Subjt:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSD-SNDKRAMLVCRVIAGRVKKSLEGSMEDYDS

Query:  LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  LAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KN18 C2H2-type domain-containing protein; e value 1.7e-201; identity 88.48%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        M SAMAK  KTQRKL+R T ENQKP   KAE+PPSWAVVRGI SCKYLQPQ     QQQHQLPRKEK QEQATEE+ KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC
        AA SPEVHKKRALTS+GSRNN+SSSS RSTKA  LNEQ  GVLSAT SSLSA SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATIC
Subjt:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC

Query:  PCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL
        PCP+CGEIFMK ETLELHQ+VRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNEL
Subjt:  PCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL

Query:  LRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSD-SNDKRAMLVCRVIAGRVKKSLEGSMEDYD
        LRFHCTTL CSLG NGSSNLC+SIPQCNVCSIIKNGFK+A EA    A KGILTTATSGKAHDS G+SSD  N+KRAMLVCRVIAGRVKKS EGSMEDYD
Subjt:  LRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSD-SNDKRAMLVCRVIAGRVKKSLEGSMEDYD

Query:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A5A7TWE1 C2H2-like zinc finger protein; e value 2.2e-201; identity 88.48%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK  KTQRKL+R T ENQKP  QKAE+PPSWAVVRGI SCKYLQPQ     QQQHQLPRKEK QEQATEE+ KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC
        AA SPEVHKKR LTS+GSRNN+SSSS RSTK   LNEQ  GVLSAT SSLSA SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPV+GMTRDPSLRATIC
Subjt:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQ-NGVLSATCSSLSA-SSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATIC

Query:  PCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL
        PCP+CGEIFMK ETLELHQ+VRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKK+PRCIADGNEL
Subjt:  PCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNEL

Query:  LRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSD-SNDKRAMLVCRVIAGRVKKSLEGSMEDYD
        LRFHCTTL CSLG NGSSNLC+SIPQCNVCSIIKNGFKVA EA    + KGILTTATSGKAHDS G+SSD  +DKRAMLVCRVIAGRVKKS EGSMEDYD
Subjt:  LRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSD-SNDKRAMLVCRVIAGRVKKSLEGSMEDYD

Query:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  SLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1CIA1 uncharacterized protein LOC111011869; e value 2.5e-237; identity 100%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPC
        AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPC
Subjt:  AAPSPEVHKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPC

Query:  PECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLR
        PECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLR
Subjt:  PECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLR

Query:  FHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSLAGAAG
        FHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSLAGAAG
Subjt:  FHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSLAGAAG

Query:  MYSNLDELYVFSPKAILPCFVVIYGGF
        MYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  MYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1HNA2 uncharacterized protein LOC111464480; e value 5.9e-207; identity 89.81%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK  KTQRKLTRQ+ ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQ    QQQ+QLPRKEKQ EQATEES+KNCKKMRCSGSLCSNTKVT+RLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         A SP+ HKKRAL +S+GS+NN+S SS RSTKAPPLNEQNGVLSAT SSLSASSSSNSSNG SFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATI P
Subjt:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CPECGEIFMKA+ LELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIPQCNVCSIIKNGFKVA EA    A KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

A0A6J1JR26 uncharacterized protein LOC111487079; e value 4.1e-208; identity 90.28%
Query:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE
        MASAMAK  KTQRKLTRQ+ ENQKPK QKAE+PPSWAVVRGIFSCKYLQPQQ    QQQ+QLPRKEKQ EQATEES+KNCKKMRCSGSLCSNTKVTHRLE
Subjt:  MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLE

Query:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP
         A SP+ HKKRAL +S+GS+NN+S S  RSTKAPPLNEQNGVLSAT SSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLG+TRDPSLRATI P
Subjt:  AAPSPEVHKKRAL-TSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICP

Query:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQ PICKIERILKVQNT KTISKFEEYRDSIKAKA KL KKHPRCIADGNELL
Subjt:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL
        RFHCTTLACSLGLNGSSNLC+SIPQCNVCSIIKNGFKVA EA    A KGILTTATSGKAHDSAGISSD NDKRAMLVCRVIAGRVKKS EG+MEDYDSL
Subjt:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEA----AEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSL

Query:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
        AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF
Subjt:  AGAAGMYSNLDELYVFSPKAILPCFVVIYGGF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G11490.1 zinc finger (C2H2 type) family protein; e value 3.6e-39; identity 40.66%
Query:  ICPCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPI--CKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIAD
        +  C +C E     +  E H    H+V  L   D S+  VE+I  + +  K   +    I  I K+QN  + ++ FE+YR+ +K +A KL KKH RC+AD
Subjt:  ICPCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPI--CKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIAD

Query:  GNELLRFHCTTLACSLGL-NGSSNLCNSIPQCNVCSIIKNGFKVAGEA-AEKGILTTATSGKAHDSAGISSDSNDKR----AMLVCRVIAGRVKKSLEG-
        GNE L FH TTL+C+LG  N SSNLC S   C VC I+++GF         KG+LT +TS  A +S  I +D    R    A+++CRVIAGRV K ++  
Subjt:  GNELLRFHCTTLACSLGL-NGSSNLCNSIPQCNVCSIIKNGFKVAGEA-AEKGILTTATSGKAHDSAGISSDSNDKR----AMLVCRVIAGRVKKSLEG-

Query:  ----SMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY
               ++DSLA   G  S ++ELY+ S KA+LPCFV+I+
Subjt:  ----SMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY

AT1G75710.1 C2H2-like zinc finger protein; e value 3.3e-85; identity 43.97%
Query:  QTQENQKPKQQKA---ERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCS-------NTKVTHRLEAAPSPEV
        QTQ+++  K +KA   ++P SW  ++ + +CK      Q E  + H  P K  Q   +   +    K      S+CS       NT+V HR + +P    
Subjt:  QTQENQKPKQQKA---ERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCS-------NTKVTHRLEAAPSPEV

Query:  HKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSL--SASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPECGE
                VG+    +S +   T+ P  +  +   S T  S   +AS S  SS+  SFR M FR+  GCYEC M++DP    +R P +   +C C +CGE
Subjt:  HKKRALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSL--SASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPECGE

Query:  IFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTT
        +F K E+LELHQ+VRHAVSELGPED+ +NIVEIIF+SSWLKK +PIC+IERILKV NT +TI +FE+ RD++KA+A +  +K  RC ADGNELLRFHCTT
Subjt:  IFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTT

Query:  LACSLGLNGSSNLCNSIPQCNVCSIIKNGFK-----VAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKK------------SLEGS
        L CSLG  GSS+LC+++P C VC++I++GF+          A  G+ TTA+SG+A D    S D+  +R MLVCRVIAGRVK+              + +
Subjt:  LACSLGLNGSSNLCNSIPQCNVCSIIKNGFK-----VAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKK------------SLEGS

Query:  MED------------YDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY
        +ED            +DS+A  AG+YSNL+EL V++P+AILPCFVVIY
Subjt:  MED------------YDSLAGAAGMYSNLDELYVFSPKAILPCFVVIY

AT2G29660.1 zinc finger (C2H2 type) family protein; e value 3.0e-46; identity 44.49%
Query:  ICPCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKK---QTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPK-----KH
        I PC  CGEIF K   LE H +++HAVSEL   ++S NIV+IIF+S W ++   ++P+  I RILK+ N+ K +++FEEYR+ +KAKA +          
Subjt:  ICPCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKK---QTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPK-----KH

Query:  PRCIADGNELLRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDS-----NDKRAMLVCRVIAGRVKK
         RC+ADGNELLRF+C+T  C LG NG SNLC     C++C II +GF    +    GI T AT  + H +     +      N KRAMLVCRV+AGRV  
Subjt:  PRCIADGNELLRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDS-----NDKRAMLVCRVIAGRVKK

Query:  SL-------EGSMEDYDSLAGAAGMYSNL------DELYVFSPKAILPCFVVIY
         L       +     YDSL G +G  S        DEL VF+P+A+LPCFV++Y
Subjt:  SL-------EGSMEDYDSLAGAAGMYSNL------DELYVFSPKAILPCFVVIY

AT4G27240.1 zinc finger (C2H2 type) family protein; e value 1.6e-55; identity 35.75%
Query:  AKAAKTQRKLTRQTQENQKPKQQKAERPPS-WAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCK------KMRCSGSLCSNTKVTH-
        A + ++++K  ++T + +    QK ++ PS W  ++    CK               +PR +K+    + + T          +  CS S+ +   V H 
Subjt:  AKAAKTQRKLTRQTQENQKPKQQKAERPPS-WAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCK------KMRCSGSLCSNTKVTH-

Query:  RLEAAPSPEVHKKRALTSVGSRN--------NDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTR
               P     R++ S    N        ++S+   + T A        +   T  + S+S  S +S  AS      R   G ++ +   D    +  
Subjt:  RLEAAPSPEVHKKRALTSVGSRN--------NDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTR

Query:  DPSLRATICPCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHP
        D S       C +CGE F K E  E H   +HAV+EL   D+S+ IVEII ++SWLK +    +I+RILKV N  KT+++FEEYRD++K +A+KL KKHP
Subjt:  DPSLRATICPCPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHP

Query:  RCIADGNELLRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEK-GILTTATSGKAHDSAGI-SSDSNDKRAMLVCRVIAGRVKKSLE-
        RCIADGNELLRFH TT+AC+LG+NGS++LC+S  +C VC II+NGF    E     G+ T +TS +A +S  I      D++A++VCRVIAGRV + +E 
Subjt:  RCIADGNELLRFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAEK-GILTTATSGKAHDSAGI-SSDSNDKRAMLVCRVIAGRVKKSLE-

Query:  -----GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI
             G +  +DSLAG  G+Y+N++ELY+ + +A+LPCFV+I
Subjt:  -----GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI

AT5G54630.1 zinc finger protein-related; e value 2.8e-60; identity 49.58%
Query:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL
        C +CGE F K E  E H   +HAV+EL   D+S+ IVEII ++SWLK +    +I+R+LKV N  KT+++FEEYR+++K +A+KL KKHPRC+ADGNELL
Subjt:  CPECGEIFMKAETLELHQSVRHAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELL

Query:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAE-KGILTTATSGKAHDSAGISS-------DSNDKRAMLVCRVIAGRVKKSLE-----
        RFH TT+AC LG+NGS+++C +  +C VC II+NGF    E     G+ T +TSG+A +S  ++        D   ++ ++VCRVIAGRV + +E     
Subjt:  RFHCTTLACSLGLNGSSNLCNSIPQCNVCSIIKNGFKVAGEAAE-KGILTTATSGKAHDSAGISS-------DSNDKRAMLVCRVIAGRVKKSLE-----

Query:  -GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI
         G M  +DSLAG  G+Y+N++ELY+ +PKA+LPCFVVI
Subjt:  -GSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVI


Sequences
CDS sequence
ATGGCATCTGCCATGGCCAAAGCAGCGAAAACGCAGAGAAAGCTTACCAGACAGACCCAAGAAAACCAGAAACCGAAGCAGCAGAAGGCAGAGAGGCCGCCGTCTTGGGC
AGTTGTTAGAGGCATATTCAGCTGCAAGTACCTGCAGCCCCAACAGCAACCAGAAAAACAACAACAACATCAATTGCCACGAAAAGAGAAGCAACAGGAGCAAGCGACAG
AAGAGAGTACCAAAAACTGTAAGAAAATGAGGTGTTCGGGTTCGCTCTGTAGCAACACCAAGGTAACTCACAGGCTTGAAGCAGCACCATCGCCGGAAGTTCACAAGAAA
AGGGCATTAACTTCAGTGGGTTCCAGAAACAACGATTCCTCAAGTTCATGTAGATCCACAAAAGCTCCTCCTTTGAACGAACAAAATGGTGTCTTATCAGCCACTTGTTC
ATCGTTATCCGCATCATCTTCTTCCAATTCCTCCAATGGTGCTTCTTTCAGGGGAATGCCTTTCAGGAGGTTCTATGGCTGTTATGAATGTAAAATGGTGATTGACCCTG
TTCTTGGGATGACCAGAGATCCTTCTCTTAGAGCAACCATTTGCCCCTGTCCCGAATGTGGTGAGATTTTCATGAAAGCTGAAACTTTGGAGCTTCACCAGAGTGTTAGA
CATGCAGTATCTGAACTTGGTCCTGAAGACACGAGCAAGAACATAGTGGAAATCATATTCCAGTCAAGCTGGCTGAAAAAGCAAACCCCAATTTGCAAAATTGAGAGGAT
TCTCAAAGTCCAGAACACTCCAAAGACCATCTCGAAATTCGAGGAATACAGAGACTCCATTAAAGCCAAAGCCACCAAGCTTCCAAAGAAGCACCCACGTTGCATAGCAG
ATGGTAATGAACTCCTCAGGTTCCACTGCACCACCTTGGCCTGCTCACTTGGCCTAAATGGCTCCTCCAACCTCTGCAATTCAATTCCACAATGCAATGTCTGTAGTATA
ATCAAGAATGGGTTCAAGGTGGCCGGAGAGGCCGCCGAAAAGGGCATTCTAACGACGGCAACAAGCGGAAAGGCTCACGACAGTGCCGGAATATCATCGGACAGCAATGA
CAAACGGGCGATGCTGGTTTGCCGGGTAATAGCTGGCAGGGTGAAGAAGAGTTTGGAAGGTAGCATGGAGGATTATGACTCATTGGCTGGAGCAGCAGGGATGTATTCCA
ATTTGGATGAGTTATATGTGTTCAGTCCCAAGGCAATATTGCCTTGTTTTGTTGTCATTTACGGAGGGTTT
mRNA sequence
ATGGCATCTGCCATGGCCAAAGCAGCGAAAACGCAGAGAAAGCTTACCAGACAGACCCAAGAAAACCAGAAACCGAAGCAGCAGAAGGCAGAGAGGCCGCCGTCTTGGGC
AGTTGTTAGAGGCATATTCAGCTGCAAGTACCTGCAGCCCCAACAGCAACCAGAAAAACAACAACAACATCAATTGCCACGAAAAGAGAAGCAACAGGAGCAAGCGACAG
AAGAGAGTACCAAAAACTGTAAGAAAATGAGGTGTTCGGGTTCGCTCTGTAGCAACACCAAGGTAACTCACAGGCTTGAAGCAGCACCATCGCCGGAAGTTCACAAGAAA
AGGGCATTAACTTCAGTGGGTTCCAGAAACAACGATTCCTCAAGTTCATGTAGATCCACAAAAGCTCCTCCTTTGAACGAACAAAATGGTGTCTTATCAGCCACTTGTTC
ATCGTTATCCGCATCATCTTCTTCCAATTCCTCCAATGGTGCTTCTTTCAGGGGAATGCCTTTCAGGAGGTTCTATGGCTGTTATGAATGTAAAATGGTGATTGACCCTG
TTCTTGGGATGACCAGAGATCCTTCTCTTAGAGCAACCATTTGCCCCTGTCCCGAATGTGGTGAGATTTTCATGAAAGCTGAAACTTTGGAGCTTCACCAGAGTGTTAGA
CATGCAGTATCTGAACTTGGTCCTGAAGACACGAGCAAGAACATAGTGGAAATCATATTCCAGTCAAGCTGGCTGAAAAAGCAAACCCCAATTTGCAAAATTGAGAGGAT
TCTCAAAGTCCAGAACACTCCAAAGACCATCTCGAAATTCGAGGAATACAGAGACTCCATTAAAGCCAAAGCCACCAAGCTTCCAAAGAAGCACCCACGTTGCATAGCAG
ATGGTAATGAACTCCTCAGGTTCCACTGCACCACCTTGGCCTGCTCACTTGGCCTAAATGGCTCCTCCAACCTCTGCAATTCAATTCCACAATGCAATGTCTGTAGTATA
ATCAAGAATGGGTTCAAGGTGGCCGGAGAGGCCGCCGAAAAGGGCATTCTAACGACGGCAACAAGCGGAAAGGCTCACGACAGTGCCGGAATATCATCGGACAGCAATGA
CAAACGGGCGATGCTGGTTTGCCGGGTAATAGCTGGCAGGGTGAAGAAGAGTTTGGAAGGTAGCATGGAGGATTATGACTCATTGGCTGGAGCAGCAGGGATGTATTCCA
ATTTGGATGAGTTATATGTGTTCAGTCCCAAGGCAATATTGCCTTGTTTTGTTGTCATTTACGGAGGGTTT
Protein sequence
MASAMAKAAKTQRKLTRQTQENQKPKQQKAERPPSWAVVRGIFSCKYLQPQQQPEKQQQHQLPRKEKQQEQATEESTKNCKKMRCSGSLCSNTKVTHRLEAAPSPEVHKK
RALTSVGSRNNDSSSSCRSTKAPPLNEQNGVLSATCSSLSASSSSNSSNGASFRGMPFRRFYGCYECKMVIDPVLGMTRDPSLRATICPCPECGEIFMKAETLELHQSVR
HAVSELGPEDTSKNIVEIIFQSSWLKKQTPICKIERILKVQNTPKTISKFEEYRDSIKAKATKLPKKHPRCIADGNELLRFHCTTLACSLGLNGSSNLCNSIPQCNVCSI
IKNGFKVAGEAAEKGILTTATSGKAHDSAGISSDSNDKRAMLVCRVIAGRVKKSLEGSMEDYDSLAGAAGMYSNLDELYVFSPKAILPCFVVIYGGF