; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G016560 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G016560
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationCG_Chr07:32995053..33009659
RNA-Seq ExpressionClCG07G016560
SyntenyClCG07G016560
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649122.1 hypothetical protein Csa_014515 [Cucumis sativus]4.6e-19992.31Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        MAIAVASSL+FEGA CS SYAFTSSWNR SFDV GRN+KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR VGDYS  DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IEVVEADL +IPVDWG PDVSS+VHPVYFAKCL KVINMEYDR MKHPSNGVSILGCLRPAYADEESY+RRLFYFEESEGYNT+WKGLEGET +LESK
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHST EI+ERF+EKGI+CNIALKALCKKRGLHVE
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]4.6e-19992.31Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        MAIAVASSL+FEGA CS SYAFTSSWNR SFDV GRN+KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR VGDYS  DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IEVVEADL +IPVDWG PDVSS+VHPVYFAKCL KVINMEYDR MKHPSNGVSILGCLRPAYADEESY+RRLFYFEESEGYNT+WKGLEGET +LESK
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHST EI+ERF+EKGI+CNIALKALCKKRGLHVE
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

XP_022137916.1 uncharacterized protein At3g49140 isoform X2 [Momordica charantia]3.1e-19589.95Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        MAIAVASSL+FEGACCSTSYAFTSSWNR S DVRGRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCKRAR+TELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRR VGDY+  DSG GDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IEVVE DL DIPVDWG PDVSSLVHPVYFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPA+ADEESY+RRLFYFE SEGY T+WKGL+GE LS ESK
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEL
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ+AEPDIL+HST EI+E FSEKGIRCNIALKALCKKRGLHVE+
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEL

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]1.8e-20393.49Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        M IAVAS+L+FEGACCSTSYAFTSSWNR SFDVRGRN++FGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKVCKRARNTELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+EFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYS ADSGYGDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWK-------GLEGE
        D+IEVVE DL DIPVDWGAPD SSLVHPVYFAKCLNKVINMEYDRKM HPSNGVSILGCLRPAYADEESYVRRLF+FEESEGYNT+WK       GLEGE
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWK-------GLEGE

Query:  TLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        TLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQ AEPDILLHST EIIERFSEKGIRCNIALKALCKKRGLHVE
Subjt:  TLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]1.5e-20595.23Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        M IAVAS+L+FEGACCSTSYAFTSSWNR SFDVRGRN++FGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKVCKRARNTELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+EFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYS ADSGYGDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IEVVE DL DIPVDWGAPD SSLVHPVYFAKCLNKVINMEYDRKM HPSNGVSILGCLRPAYADEESYVRRLF+FEESEGYNT+WKGLEGETLSLESK
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQ AEPDILLHST EIIERFSEKGIRCNIALKALCKKRGLHVE
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein2.2e-19992.31Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        MAIAVASSL+FEGA CS SYAFTSSWNR SFDV GRN+KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR VGDYS  DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IEVVEADL +IPVDWG PDVSS+VHPVYFAKCL KVINMEYDR MKHPSNGVSILGCLRPAYADEESY+RRLFYFEESEGYNT+WKGLEGET +LESK
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHST EI+ERF+EKGI+CNIALKALCKKRGLHVE
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

A0A1S3BY92 uncharacterized protein At3g49140 isoform X11.7e-19489.92Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        MA+AVASSL+FEGA CSTSYAFTS WNR SFDV GRN+KFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCKRARNTELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE QYV DDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRR +GDYS  DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IE VEADL +IPVDWG PDVSSLVHPVYFAKCLNKV+N+EYDR MKHPSNGV+ILG LRP YADEESYVRRLF FEESEGYNT+WKGLEGET +LE K
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +I+ERF+EKGI+CNIALKALCKKRGLHVE
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

A0A1S3BYP0 uncharacterized protein At3g49140 isoform X23.2e-19090.16Show/hide
Query:  EGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTAAEVARTAVEVN
        EGA CSTSYAFTS WNR SFDV GRN+KFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCKRARNTELTAAEVARTAVEVN
Subjt:  EGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDYDFIEVVEADLV
        SNALLLFPGTVHSEPHEQVSWDE QYV DDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRR +GDYS  DSGYGDV PFDYD+IE VEADL 
Subjt:  SNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDYDFIEVVEADLV

Query:  DIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESKIDRSSQRSTLY
        +IPVDWG PDVSSLVHPVYFAKCLNKV+N+EYDR MKHPSNGV+ILG LRP YADEESYVRRLF FEESEGYNT+WKGLEGET +LE KIDRSSQRSTLY
Subjt:  DIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESKIDRSSQRSTLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        RLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +I+ERF+EKGI+CNIALKALCKKRGLHVE
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

A0A6J1C800 uncharacterized protein At3g49140 isoform X12.0e-19590.19Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        MAIAVASSL+FEGACCSTSYAFTSSWNR S DVRGRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCKRAR+TELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRR VGDY+  DSG GDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IEVVE DL DIPVDWG PDVSSLVHPVYFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPA+ADEESY+RRLFYFE SEGY T+WKGL+GE LS ESK
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ+AEPDIL+HST EI+E FSEKGIRCNIALKALCKKRGLHVE
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

A0A6J1C853 uncharacterized protein At3g49140 isoform X21.5e-19589.95Show/hide
Query:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA
        MAIAVASSL+FEGACCSTSYAFTSSWNR S DVRGRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCKRAR+TELTA
Subjt:  MAIAVASSLSFEGACCSTSYAFTSSWNR-SFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRR VGDY+  DSG GDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDY

Query:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK
        D+IEVVE DL DIPVDWG PDVSSLVHPVYFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPA+ADEESY+RRLFYFE SEGY T+WKGL+GE LS ESK
Subjt:  DFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEL
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ+AEPDIL+HST EI+E FSEKGIRCNIALKALCKKRGLHVE+
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-0532.47Show/hide
Query:  EFRALTHGNCEGIWIRRLLEEPKFSQTLPIHTYNCDNKTTISIAYNSTLHDWSKHVEVDKYFIKEKINAGIICILYL
        E+ AL     E +W++ LL         PI  Y  DN+  ISIA N + H  +KH+++  +F +E++   +IC+ Y+
Subjt:  EFRALTHGNCEGIWIRRLLEEPKFSQTLPIHTYNCDNKTTISIAYNSTLHDWSKHVEVDKYFIKEKINAGIICILYL

Q0WMN5 Uncharacterized protein At3g491409.5e-3026.26Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVN
        ++    A+Y DS  D         YHP E+++    +   ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVN

Query:  MLED-RGAHNPVNALIGMD-MQMYESRRTVG----DYSTADSGYGDVVPFD-------YDFIEVVE------------------ADLVDIPVDWGAPDVS
        +++     +N V  ++G D M+M +    +G    D+ T D   GD    D        +++ ++E                  +D  +   DW   +  
Subjt:  MLED-RGAHNPVNALIGMD-MQMYESRRTVG----DYSTADSGYGDVVPFD-------YDFIEVVE------------------ADLVDIPVDWGAPDVS

Query:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTDWKGL------EGETLSLESKIDRSSQRS-----TLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S + ++L     +   N D + L        +    ES+ID S           Y
Subjt:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTDWKGL------EGETLSLESKIDRSSQRS-----TLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC
        +LE++RI+L +  G Q+EV ++D + A+PD + H++ EII R  E G +   ALK+LC
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein6.7e-3126.26Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVN
        ++    A+Y DS  D         YHP E+++    +   ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVN

Query:  MLED-RGAHNPVNALIGMD-MQMYESRRTVG----DYSTADSGYGDVVPFD-------YDFIEVVE------------------ADLVDIPVDWGAPDVS
        +++     +N V  ++G D M+M +    +G    D+ T D   GD    D        +++ ++E                  +D  +   DW   +  
Subjt:  MLED-RGAHNPVNALIGMD-MQMYESRRTVG----DYSTADSGYGDVVPFD-------YDFIEVVE------------------ADLVDIPVDWGAPDVS

Query:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTDWKGL------EGETLSLESKIDRSSQRS-----TLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S + ++L     +   N D + L        +    ES+ID S           Y
Subjt:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTDWKGL------EGETLSLESKIDRSSQRS-----TLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC
        +LE++RI+L +  G Q+EV ++D + A+PD + H++ EII R  E G +   ALK+LC
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein3.5e-11254.66Show/hide
Query:  MAIAVASSLSFEGACCSTSYA--FTSS---------WNRSFDVRGRNQ----------KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKG
        M IA ASS S   + C  SY   F+SS          NR FD  G              F  + FH  S G DL L+KVSVAADY DSVPDSS Y    G
Subjt:  MAIAVASSLSFEGACCSTSYA--FTSS---------WNRSFDVRGRNQ----------KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKG

Query:  YHPLEDLKVCKRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRR
        YHPLEDLK  KR + T+L+A+EVART VE NS+A+L+FPG +H EPH+  SW EF+YVIDDYGD++FEI D  N+LED GA NPV A  GMD+  YE+ R
Subjt:  YHPLEDLKVCKRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRR

Query:  TVGDYSTADSGYGDVVPFDYDFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEES
           +Y+ +D G  D + FD  + E+++++  DIP+DWG PD S+ VHP+YFAK L+K I+M+YDRKM +PSNGVSILGCLRPA+ DEESY+RRLF  E+ 
Subjt:  TVGDYSTADSGYGDVVPFDYDFIEVVEADLVDIPVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEES

Query:  EGYNTDWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE
        + Y+ + +G +    S  S+ D +   S+LYRLEI+ IEL S+YG +S +SLQDFQDAEPDIL+HST+ IIERF+ +GI  +IALKALCKK+GLH E
Subjt:  EGYNTDWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-0527.34Show/hide
Query:  YGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKAL---CKKRGLHVELYLSQEFRALTHGNCEGIWIRRLLEEPKFSQTLPIHTYNCDNK
        Y  Q+E+ LQ F DA       S  +   R S  G    +    +    KK+ +  +     E+RAL+    E +W+ +   E +   + P   + CDN 
Subjt:  YGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKAL---CKKRGLHVELYLSQEFRALTHGNCEGIWIRRLLEEPKFSQTLPIHTYNCDNK

Query:  TTISIAYNSTLHDWSKHVEVDKYFIKEK
          I IA N+  H+ +KH+E D + ++E+
Subjt:  TTISIAYNSTLHDWSKHVEVDKYFIKEK

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-3025.4Show/hide
Query:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL
        S G+ L  ++    A+Y  S  D         YHP ED++     K   ++ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++
Subjt:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL

Query:  YFEI-----------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDYDFIEVVE---------ADLVDI
        YF++                       FD++ M++D    +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D  + 
Subjt:  YFEI-----------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDYDFIEVVE---------ADLVDI

Query:  PVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTDWKGLEGETLSLESKIDRS
          DW   +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S +++             +E E     ++G+ GE  S    ++ S
Subjt:  PVDWGAPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTDWKGLEGETLSLESKIDRS

Query:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVE
              Y+LEI+RI+L +  G Q+EV ++D + A+PD++  ++  I+ R  E G +   AL++LC +  G+  E
Subjt:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVE

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein1.7e-2925.14Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI----
        ++    A+Y  S  D         YHP ED++     K   ++ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++YF++    
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRARNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI----

Query:  -------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDYDFIEVVE---------ADLVDIPVDWGAPD
                           FD++ M++D    +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D  +   DW   +
Subjt:  -------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDYDFIEVVE---------ADLVDIPVDWGAPD

Query:  VSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTDWKGLEGETLSLESKIDRSSQRSTLYR
             HP+YFA+ + +V + +    M  PS G++I G L P   ++ S +++             +E E     ++G+ GE  S    ++ S      Y+
Subjt:  VSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTDWKGLEGETLSLESKIDRSSQRSTLYR

Query:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVE
        LEI+RI+L +  G Q+EV ++D + A+PD++  ++  I+ R  E G +   AL++LC +  G+  E
Subjt:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTGCTGTAGCTTCTTCACTTTCCTTTGAAGGGGCTTGTTGCTCGACATCATATGCATTCACAAGCAGTTGGAATAGATCTTTTGATGTTCGTGGCAGAAATCA
AAAATTTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTGGCTGCTGATTACCCAGATTCAGTTCCAGATTCATCAAGTT
ATTTAACTAACAAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGAGCACGGAATACTGAACTCACTGCTGCAGAAGTAGCAAGGACCGCTGTGGAGGTCAAT
AGCAATGCTTTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGGGATGAGTTTCAATATGTTATTGACGATTATGGAGATTTGTATTTTGA
AATTTTTGATAGTGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGAAGGACAGTTGGAGATT
ATAGCACGGCAGATAGTGGCTACGGTGATGTTGTTCCTTTTGATTATGATTTTATTGAGGTAGTGGAAGCTGATTTAGTTGATATTCCAGTTGACTGGGGAGCTCCAGAT
GTTTCTAGCCTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGACAGAAAGATGAAGCATCCTTCAAATGGGGTTTCCATTTTGGG
ATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATGTAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGATTGGAAAGGTTTAGAAGGCGAAACCT
TGAGCTTGGAGTCCAAAATTGACAGAAGCAGCCAAAGATCCACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTATGGAGTTCAGTCTGAAGTTAGT
TTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTCTGCACTCTACTACGGAAATTATAGAGCGTTTTAGTGAGAAGGGTATTAGGTGCAATATTGCCCTTAAAGCTCT
TTGCAAAAAGAGGGGCCTTCATGTTGAGTTATATCTCTCACAAGAATTTAGGGCGCTAACTCATGGTAATTGTGAGGGTATATGGATAAGAAGGCTATTGGAAGAACCAA
AATTCTCCCAGACATTGCCCATACACACTTATAATTGTGACAACAAGACAACAATCTCCATTGCCTACAATTCAACCCTTCATGATTGGTCAAAACATGTTGAAGTTGAT
AAATACTTCATAAAAGAGAAGATTAATGCTGGAATAATCTGCATTCTCTACCTTGCGATAATAAAGCAAAGGCTTTCATAG
mRNA sequenceShow/hide mRNA sequence
TATGTCTTGTACTTCACAATTACTTTATATTTGTTTTTCGTCTCAACATTTTGTAGGTATATATAAACAGAATTTATGGACCCATTCGTTATTTTCTACATTTATTAAAC
TGAGATTTCTGAACACGTTGGGACTTGGGACATCTTCCCTTTTAGCTGACGTTTGAATCTTTATGGCCAAATCGTACTGCGACCTTTTCGTAGAAGCCCATACCCATCAC
AGCTTAGTTCGACTTTGATCTCTCTCTCTCATGGCAATTGCTGTAGCTTCTTCACTTTCCTTTGAAGGGGCTTGTTGCTCGACATCATATGCATTCACAAGCAGTTGGAA
TAGATCTTTTGATGTTCGTGGCAGAAATCAAAAATTTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTGGCTGCTGATT
ACCCAGATTCAGTTCCAGATTCATCAAGTTATTTAACTAACAAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGAGCACGGAATACTGAACTCACTGCTGCA
GAAGTAGCAAGGACCGCTGTGGAGGTCAATAGCAATGCTTTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGGGATGAGTTTCAATATGT
TATTGACGATTATGGAGATTTGTATTTTGAAATTTTTGATAGTGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGACATGCAAA
TGTATGAGAGTAGAAGGACAGTTGGAGATTATAGCACGGCAGATAGTGGCTACGGTGATGTTGTTCCTTTTGATTATGATTTTATTGAGGTAGTGGAAGCTGATTTAGTT
GATATTCCAGTTGACTGGGGAGCTCCAGATGTTTCTAGCCTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGACAGAAAGATGAA
GCATCCTTCAAATGGGGTTTCCATTTTGGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATGTAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACA
CAGATTGGAAAGGTTTAGAAGGCGAAACCTTGAGCTTGGAGTCCAAAATTGACAGAAGCAGCCAAAGATCCACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTC
TCTGTGTATGGAGTTCAGTCTGAAGTTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTCTGCACTCTACTACGGAAATTATAGAGCGTTTTAGTGAGAAGGG
TATTAGGTGCAATATTGCCCTTAAAGCTCTTTGCAAAAAGAGGGGCCTTCATGTTGAGTTATATCTCTCACAAGAATTTAGGGCGCTAACTCATGGTAATTGTGAGGGTA
TATGGATAAGAAGGCTATTGGAAGAACCAAAATTCTCCCAGACATTGCCCATACACACTTATAATTGTGACAACAAGACAACAATCTCCATTGCCTACAATTCAACCCTT
CATGATTGGTCAAAACATGTTGAAGTTGATAAATACTTCATAAAAGAGAAGATTAATGCTGGAATAATCTGCATTCTCTACCTTGCGATAATAAAGCAAAGGCTTTCATA
GAGAGAATTCCTCGACTCTCTCGAATGTGCTTAGTTTACTTGTTCTTCTTTTGCATTTTTTCTAGCATTGTTTCATATTGTATTAGTTTACTTACTATTGTTAACATCTA
CAATTTTCTTTTGAGACAATATTCTTGCTTATTGACTTGTCAAACTTTAGAAGATCTAAAAAAATTCCATAACACATCCCTCTCCTTGTAGAAAAAACAAACAAGTGAGT
CTCATACCTGAAGCAATACAGGAACTAAAGAGTAGAAGAGAAACATAGCCACTGAAAATCCAAGAAATGGAACTGTCTGTAACCATT
Protein sequenceShow/hide protein sequence
MAIAVASSLSFEGACCSTSYAFTSSWNRSFDVRGRNQKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRARNTELTAAEVARTAVEVN
SNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSVNMLEDRGAHNPVNALIGMDMQMYESRRTVGDYSTADSGYGDVVPFDYDFIEVVEADLVDIPVDWGAPD
VSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTDWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVS
LQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVELYLSQEFRALTHGNCEGIWIRRLLEEPKFSQTLPIHTYNCDNKTTISIAYNSTLHDWSKHVEVD
KYFIKEKINAGIICILYLAIIKQRLS