; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G034660 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G034660
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchrH02:8638262..8641527
RNA-Seq ExpressionChy2G034660
SyntenyChy2G034660
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637469.1 hypothetical protein CSA_004502 [Cucumis sativus]0.090.24Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFT+ F+KLIQAR DPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILGFMNLSS EDSDKNDKSLDFSLITPFMDLI+IWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSF--------------------------ETLVRV
        PLVCSEVREEFSSGECDVRRLAGVVIAE FLMKLCLDFN GRSRQDLEKDL TWAVGSITQIRNFYSF                          ETLVRV
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSF--------------------------ETLVRV

Query:  LLEATLPVTSLLSINEEALLRKVLYDALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQM
        LLEATLPVTSLLS + EALLRKVLYDALILVDYSFLKPEIAINLP EHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRW+KSQM
Subjt:  LLEATLPVTSLLSINEEALLRKVLYDALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQM

Query:  PSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTT
        PSNENLN PNG SPKVFLEWLLKAEDQGVRVFDNTISNRR+KLVLDTSKSVS+EGDKVDDDLLFYIDKQGGN NGSEEDTTMDESVNAAL SAAPTMSTT
Subjt:  PSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTT

Query:  ENSSVKKLNRKAKKKNKKLKLLSQLKSAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS
        ENSSVKKL+RKAKK+NKKLKLLSQLKSAVE DLLFCI+KQG+NENGNEEDTTM+E VN ALVSAAPTMSTTENS
Subjt:  ENSSVKKLNRKAKKKNKKLKLLSQLKSAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS

TYK10112.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.087.25Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+N FLGENYEFTLAQSIQNVLAEIRKGNVVFS+FTEGF+KLIQAR DPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILG  NLSSCEDS+KNDKSLDF+ ITPF+DLI+IWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFNCG SRQ LE+DLR W VGSIT+IRNFY FETLVR+LLEATLPVTSLLS ++EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLP EH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLN  NGSSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-LSQL
        QGVRVFDNTISN RAK+VLDTSKSV +EGDKVDDDLLFYIDKQG NENG EED TMD+SVNAALVS A TMSTTENSSVKK +RKAKK+NKK     SQL
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-LSQL

Query:  KSAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS
        KSAVE+           N+   +EDTTMDESVN ALVS APTMSTTENS
Subjt:  KSAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS

XP_004135698.1 uncharacterized protein LOC101207150 [Cucumis sativus]0.094.53Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFT+ F+KLIQAR DPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILGFMNLSS EDSDKNDKSLDFSLITPFMDLI+IWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FLMKLCLDFN GRSRQDLEKDL TWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLS + EALLRKVLYD
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED
        ALILVDYSFLKPEIAINLP EHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN PNG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLLSQLK
        QGVRVFDNTISNRR+KLVLDTSKSVS+EGDKVDDDLLFYIDKQGGN NGSEEDTTMDESVNAAL SAAPTMSTTENSSVKKL+RKAKK+NKKLKLLSQLK
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLLSQLK

Query:  SAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS
        SAVE DLLFCI+KQG+NENGNEEDTTM+E VN ALVSAAPTMSTTENS
Subjt:  SAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS

XP_008450830.1 PREDICTED: uncharacterized protein LOC103492302 [Cucumis melo]6.08e-30989.72Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+NPFLGENYEFTL QSIQNVLAEIRKGNVVFS+FTEGF+KLIQAR DPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILG  NLSSCEDS+KNDKSLDF+ ITPF+DLI+IWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFNCG SRQ LE+DLR W VGSIT+IRNFY FETLVR+LLEATLPVTSLLS ++EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLP EH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLN  NGSSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLL
        QGVRVFDNTISN RAK+VLDTSKSV +EGDKVDDDLLFYIDKQG NENG EED TMD+SVNAALVS A TMSTTENSSVKK +RKAKK  K+  L+
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLL

XP_038880003.1 uncharacterized protein LOC120071696 [Benincasa hispida]8.72e-28684.94Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRS-SFNPKGDFLERVAAMKVLFQ
        MAL LVESM+S+NPLK+NPFLGENYEFTLAQSIQNV+AEIRKGN  FSQFTEGF++LIQAR DPPLESIWFYSALTFRS   N KGDFLERVAAMKVLFQ
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRS-SFNPKGDFLERVAAMKVLFQ

Query:  LVCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKND-KSLDFSLITPFMDLINIWTQPNEGLDQ
        LV SCSAPCGSSKTI LLSPVVSEVYKL++DM GKDL SKREKKAMREVKSLVEAILGF+NLSSC+DSDKND +SLDF+LITPF+DLI++WT PNEGLDQ
Subjt:  LVCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKND-KSLDFSLITPFMDLINIWTQPNEGLDQ

Query:  FLPLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVL
        FLPLV SEVR EFSSG CDVRRLAGVVIAETFLMKLCLDFN G SRQDLEKDLR W VGSIT+IRNFY FETLVR LLEATLPVTSLLS  +EALLRKVL
Subjt:  FLPLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVL

Query:  YDALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKA
        YD+LILV+YSFLKPE AI+LP EHVA LAVKRLILT+EAIEFYREHGDQ+RAISYLNAFSSS VSSQIIRWVKSQMPSNEN+ RPNGSSPK+ LEWLL+A
Subjt:  YDALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKA

Query:  EDQGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLL
        EDQGVRVFD TISNR AKLVLDTSKSVS EGDKVDDDLLFYIDKQG +ENGSE DTTMDESVNAALVS A TMSTTEN S KK  R  K+KN+K+K +
Subjt:  EDQGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLL

TrEMBL top hitse value%identityAlignment
A0A0A0M1W0 Uncharacterized protein6.3e-28694.53Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFT+ F+KLIQAR DPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILGFMNLSS EDSDKNDKSLDFSLITPFMDLI+IWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FLMKLCLDFN GRSRQDLEKDL TWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLS + EALLRKVLYD
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED
        ALILVDYSFLKPEIAINLP EHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN PNG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLLSQLK
        QGVRVFDNTISNRR+KLVLDTSKSVS+EGDKVDDDLLFYIDKQGGN NGSEEDTTMDESVNAAL SAAPTMSTTENSSVKKL+RKAKK+NKKLKLLSQLK
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLLSQLK

Query:  SAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS
        SAVE DLLFCI+KQG+NENGNEEDTTM+E VN ALVSAAPTMSTTENS
Subjt:  SAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS

A0A1S3BQ57 uncharacterized protein LOC1034923022.2e-24689.72Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+NPFLGENYEFTL QSIQNVLAEIRKGNVVFS+FTEGF+KLIQAR DPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILG  NLSSCEDS+KNDKSLDF+ ITPF+DLI+IWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFNCG SRQ LE+DLR W VGSIT+IRNFY FETLVR+LLEATLPVTSLLS ++EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLP EH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLN  NGSSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLL
        QGVRVFDNTISN RAK+VLDTSKSV +EGDKVDDDLLFYIDKQG NENG EED TMD+SVNAALVS A TMSTTENSSVKK +RKAKK  K+  L+
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLL

A0A5D3CFU4 Pentatricopeptide repeat-containing protein2.0e-26087.25Show/hide
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+N FLGENYEFTLAQSIQNVLAEIRKGNVVFS+FTEGF+KLIQAR DPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILG  NLSSCEDS+KNDKSLDF+ ITPF+DLI+IWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAETFL+KLCLDFNCG SRQ LE+DLR W VGSIT+IRNFY FETLVR+LLEATLPVTSLLS ++EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLP EH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLN  NGSSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-LSQL
        QGVRVFDNTISN RAK+VLDTSKSV +EGDKVDDDLLFYIDKQG NENG EED TMD+SVNAALVS A TMSTTENSSVKK +RKAKK+NKK     SQL
Subjt:  QGVRVFDNTISNRRAKLVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-LSQL

Query:  KSAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS
        KSAVE+           N+   +EDTTMDESVN ALVS APTMSTTENS
Subjt:  KSAVEDDLLFCIDKQGKNENGNEEDTTMDESVNGALVSAAPTMSTTENS

A0A6J1H8Q6 uncharacterized protein LOC1114615267.6e-21575.75Show/hide
Query:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP
        MES+N  KQ+PFLGENYEFTL QSIQNVLAEIR+GN+ FSQF EGF++LIQAR DPPLESIWFYSALTFRS  +   GDFL+RVA MK+LFQ  CSCSAP
Subjt:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP

Query:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFLPLVCSEV
        CGSSKTI LLSPVV EVYKL+ DM GKDL+SKREKKAMREVKSLVE +LGF+NLSSC+DSD+N +SLDF+L+TPF+DLI+IW   NEGLDQFLPLV SEV
Subjt:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFLPLVCSEV

Query:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYDALILVDY
        R EFSSG CD+RRLAGVVIAETFLMKLCLD N GRSRQDLE DLR WAVGSIT+I+NFY FETLVR LLEATLPV SLLS  +EALLRK+LYDALILVDY
Subjt:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYDALILVDY

Query:  SFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFD
        SFL  E AINLP +HVAFLAVKRLILT+EAIEFYREHGDQNRAISYLNAFS+SLVSSQIIRWVKSQ+PSNEN N P GSSPK+FLEWLLKAED GVRVFD
Subjt:  SFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFD

Query:  NTISNRRAKLVLDTSKSVS----YEGDKVDDDLLFYIDKQGGNENGS-EEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-------
        +TISNRRAKLVLDTSKSVS     EG+ VDD+LLFYIDKQG NENGS EED  MDESVNAALVSAA TMSTT+N S KK  ++  KK KK+K        
Subjt:  NTISNRRAKLVLDTSKSVS----YEGDKVDDDLLFYIDKQGGNENGS-EEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-------

Query:  ---LSQLKSAVEDDLLFCIDKQGKNENGNEED
           +++L+SAVED+     D +G+  N + ++
Subjt:  ---LSQLKSAVEDDLLFCIDKQGKNENGNEED

A0A6J1JEZ4 uncharacterized protein LOC1114851551.9e-21374.81Show/hide
Query:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP
        MES+N  KQ+PFLGENYEFTL QSIQNVLAEIR+GN+VFSQF EGF++LIQAR DPPLESIWFYSALTFRS  +   GDFL+RVA MK+LFQ  CSCSAP
Subjt:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP

Query:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFLPLVCSEV
        CGSSKTI LL+PVV EVYKL+ DM GKDL SKREKKAMREVKSLVE ILGF+NLSSC+DSD+N +SLDF+L+TPF+DLI+IWT  NEGLDQFLPLV SEV
Subjt:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFLPLVCSEV

Query:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYDALILVDY
        R EFSSG CD+RRLAGVVIAETFL+KLCLD N GRSRQDLE DLR WAVGSIT+I+NFY FETLVR LLEATLPV SLLS  +EALLRK+LYDALILVDY
Subjt:  REEFSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYDALILVDY

Query:  SFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFD
        SFL  E AINLP +HVAFLAVKRLILT+EAIEFYREHGDQNRAISYLNAFS+SLVSSQIIRWVKSQ+PS+EN+N P GSSPK+FLEWL KAED GVRVFD
Subjt:  SFLKPEIAINLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFD

Query:  NTISNRRAKLVLDTSKSVS----YEGDKVDDDLLFYIDKQGGNENGS-EEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-------
        +TISNRRAKLVLDTSKSVS     EG+ VDD+LLFYIDKQG NENGS EED  MDE+VNAALVSAA TMSTT+N   KK  R+  KK KK+K        
Subjt:  NTISNRRAKLVLDTSKSVS----YEGDKVDDDLLFYIDKQGGNENGS-EEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKL-------

Query:  ---LSQLKSAVEDDLLFCIDKQGKNENGN---EEDTTMDE
            ++L+SAV+D+     D    +E  N   +ED+ M E
Subjt:  ---LSQLKSAVEDDLLFCIDKQGKNENGN---EEDTTMDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G11780.1 unknown protein5.1e-4628.8Show/hide
Query:  LAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVD-PPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSPVVSEVYKL
        L  SI+ +L + R G   FS F   F +++    + PPLE +WFYSA+ F SS     D  + V      FQL+ S S      K ++LLSPVV ++ +L
Subjt:  LAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVD-PPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSPVVSEVYKL

Query:  VIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWT--------QPNEGLDQFLPLVCSEVREEFSSGECDVR
        VI  R             R+  SL+E I+ ++++   ++    D  +       F DL  +W         +  + L+ F+P     +R+E  S  C V 
Subjt:  VIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWT--------QPNEGLDQFLPLVCSEVREEFSSGECDVR

Query:  RLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSI--NEEALLRKVLYDALI-LVDYSFLKPEIAI
         LAG+V ++ FL+ LC  F+    R +L+KDL+   +  I+   + + F+ ++++LLE  L +TSL+ +   +EA L +++ +A+I  V+  FL P    
Subjt:  RLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSI--NEEALLRKVLYDALI-LVDYSFLKPEIAI

Query:  NLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFDNTISNRRAK
        +  + H+  +A+  L L  + +   R + DQ +   Y N FS+SL+   +I WV SQ     + +     +P  F+EWL+  E+QG RVF+   S   AK
Subjt:  NLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFDNTISNRRAK

Query:  LVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLLSQLKSAVEDDLLF
         V+  S+         D  +   + KQ   E   ++DT M +  N + +S       + N+  +K  R  K+   K+KL     S ++++  F
Subjt:  LVLDTSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLLSQLKSAVEDDLLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGGGTCTTGTTGAATCGATGGAATCTATCAACCCTTTAAAGCAAAATCCTTTTCTCGGAGAAAATTACGAGTTTACTCTTGCGCAATCAATCCAGAAT
GTCTTAGCTGAAATTCGCAAAGGAAATGTTGTTTTTTCTCAATTTACGGAAGGATTCTTCAAATTGATTCAAGCTAGAGTTGACCCACCATTAGAATCGATTTGG
TTTTACTCCGCATTAACGTTTCGTAGTAGTTTCAATCCTAAAGGCGATTTTTTGGAACGAGTGGCGGCCATGAAAGTCTTGTTTCAGTTGGTATGTTCTTGTTCG
GCTCCTTGTGGTTCTTCGAAGACCATTACGTTGCTTTCTCCAGTGGTTTCCGAGGTGTATAAATTGGTTATCGACATGCGTGGAAAGGATTTGAACTCGAAAAGG
GAAAAGAAAGCAATGAGAGAGGTTAAGTCTTTGGTTGAAGCGATTCTTGGTTTTATGAATCTGAGTTCATGTGAGGATTCGGACAAGAATGATAAATCTCTCGAC
TTCAGTTTGATTACTCCTTTTATGGATTTAATTAATATCTGGACACAGCCAAATGAGGGATTGGATCAGTTCCTACCGCTCGTGTGCAGTGAGGTTCGTGAGGAG
TTTAGTTCGGGCGAGTGTGATGTTCGTCGCTTGGCTGGAGTTGTAATCGCTGAGACATTTCTGATGAAACTGTGCTTGGATTTTAACTGTGGACGTTCGAGGCAA
GATTTGGAGAAAGATCTAAGAACATGGGCTGTTGGATCAATAACTCAGATTAGAAATTTCTACTCTTTTGAAACTCTTGTAAGAGTCCTCCTGGAGGCAACTTTA
CCTGTGACGTCTCTTTTGAGTATCAACGAGGAAGCTTTGTTAAGGAAGGTTCTATATGATGCTCTTATATTGGTTGATTATTCATTTTTGAAGCCTGAGATAGCC
ATTAACTTACCTACCGAGCATGTGGCGTTTCTTGCTGTTAAGAGATTGATTCTTACTTATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAACAGAGCC
ATCTCATATCTAAATGCCTTCTCGAGTTCTCTTGTTTCTTCTCAAATTATTAGATGGGTCAAAAGCCAAATGCCTAGCAATGAGAATCTAAATCGCCCCAATGGG
TCGTCACCTAAAGTGTTTCTTGAGTGGCTTCTCAAGGCTGAAGATCAAGGTGTAAGAGTATTTGACAATACCATTTCCAATCGTCGAGCCAAATTAGTTCTTGAT
ACTTCCAAATCAGTCTCATATGAGGGAGATAAAGTAGATGATGATCTTTTGTTTTACATCGATAAGCAAGGGGGAAATGAAAATGGAAGTGAGGAGGACACGACA
ATGGATGAATCGGTAAATGCAGCTCTCGTTTCTGCGGCTCCTACAATGTCAACGACTGAAAATAGTTCGGTAAAGAAGCTAAATAGAAAAGCAAAAAAAAAGAAT
AAAAAATTAAAGTTGTTAAGTCAGTTGAAGTCAGCTGTTGAGGACGATCTTTTGTTTTGCATCGATAAGCAAGGGAAAAATGAAAATGGAAATGAGGAGGACACG
ACAATGGATGAATCGGTAAATGGAGCTCTTGTTTCTGCGGCTCCTACAATGTCAACGACTGAAAATAGTTTG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTGGGTCTTGTTGAATCGATGGAATCTATCAACCCTTTAAAGCAAAATCCTTTTCTCGGAGAAAATTACGAGTTTACTCTTGCGCAATCAATCCAGAAT
GTCTTAGCTGAAATTCGCAAAGGAAATGTTGTTTTTTCTCAATTTACGGAAGGATTCTTCAAATTGATTCAAGCTAGAGTTGACCCACCATTAGAATCGATTTGG
TTTTACTCCGCATTAACGTTTCGTAGTAGTTTCAATCCTAAAGGCGATTTTTTGGAACGAGTGGCGGCCATGAAAGTCTTGTTTCAGTTGGTATGTTCTTGTTCG
GCTCCTTGTGGTTCTTCGAAGACCATTACGTTGCTTTCTCCAGTGGTTTCCGAGGTGTATAAATTGGTTATCGACATGCGTGGAAAGGATTTGAACTCGAAAAGG
GAAAAGAAAGCAATGAGAGAGGTTAAGTCTTTGGTTGAAGCGATTCTTGGTTTTATGAATCTGAGTTCATGTGAGGATTCGGACAAGAATGATAAATCTCTCGAC
TTCAGTTTGATTACTCCTTTTATGGATTTAATTAATATCTGGACACAGCCAAATGAGGGATTGGATCAGTTCCTACCGCTCGTGTGCAGTGAGGTTCGTGAGGAG
TTTAGTTCGGGCGAGTGTGATGTTCGTCGCTTGGCTGGAGTTGTAATCGCTGAGACATTTCTGATGAAACTGTGCTTGGATTTTAACTGTGGACGTTCGAGGCAA
GATTTGGAGAAAGATCTAAGAACATGGGCTGTTGGATCAATAACTCAGATTAGAAATTTCTACTCTTTTGAAACTCTTGTAAGAGTCCTCCTGGAGGCAACTTTA
CCTGTGACGTCTCTTTTGAGTATCAACGAGGAAGCTTTGTTAAGGAAGGTTCTATATGATGCTCTTATATTGGTTGATTATTCATTTTTGAAGCCTGAGATAGCC
ATTAACTTACCTACCGAGCATGTGGCGTTTCTTGCTGTTAAGAGATTGATTCTTACTTATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAACAGAGCC
ATCTCATATCTAAATGCCTTCTCGAGTTCTCTTGTTTCTTCTCAAATTATTAGATGGGTCAAAAGCCAAATGCCTAGCAATGAGAATCTAAATCGCCCCAATGGG
TCGTCACCTAAAGTGTTTCTTGAGTGGCTTCTCAAGGCTGAAGATCAAGGTGTAAGAGTATTTGACAATACCATTTCCAATCGTCGAGCCAAATTAGTTCTTGAT
ACTTCCAAATCAGTCTCATATGAGGGAGATAAAGTAGATGATGATCTTTTGTTTTACATCGATAAGCAAGGGGGAAATGAAAATGGAAGTGAGGAGGACACGACA
ATGGATGAATCGGTAAATGCAGCTCTCGTTTCTGCGGCTCCTACAATGTCAACGACTGAAAATAGTTCGGTAAAGAAGCTAAATAGAAAAGCAAAAAAAAAGAAT
AAAAAATTAAAGTTGTTAAGTCAGTTGAAGTCAGCTGTTGAGGACGATCTTTTGTTTTGCATCGATAAGCAAGGGAAAAATGAAAATGGAAATGAGGAGGACACG
ACAATGGATGAATCGGTAAATGGAGCTCTTGTTTCTGCGGCTCCTACAATGTCAACGACTGAAAATAGTTTG
Protein sequenceShow/hide protein sequence
MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTEGFFKLIQARVDPPLESIWFYSALTFRSSFNPKGDFLERVAAMKVLFQLVCSCS
APCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGFMNLSSCEDSDKNDKSLDFSLITPFMDLINIWTQPNEGLDQFLPLVCSEVREE
FSSGECDVRRLAGVVIAETFLMKLCLDFNCGRSRQDLEKDLRTWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSINEEALLRKVLYDALILVDYSFLKPEIA
INLPTEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWVKSQMPSNENLNRPNGSSPKVFLEWLLKAEDQGVRVFDNTISNRRAKLVLD
TSKSVSYEGDKVDDDLLFYIDKQGGNENGSEEDTTMDESVNAALVSAAPTMSTTENSSVKKLNRKAKKKNKKLKLLSQLKSAVEDDLLFCIDKQGKNENGNEEDT
TMDESVNGALVSAAPTMSTTENSL