CuGenDBv2

Cucsat.G7226 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G7226
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Pentatricopeptide repeat-containing protein
Genome location: ctg1528:3955386..3959480
RNA-Seq Expression: Cucsat.G7226
Synteny: Cucsat.G7226
Gene Ontology terms: NA
InterPro domains: NA
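The genome location field follows a `contig:start..end` pattern. A minimal sketch of splitting it into parts is shown here; the helper names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Parsed form of a 'contig:start..end' location string (hypothetical helper)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a location such as 'ctg1528:3955386..3959480' into its parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("ctg1528:3955386..3959480")
print(loc.contig, loc.start, loc.end, loc.span)  # ctg1528 3955386 3959480 4095
```

Under the inclusive-coordinate assumption the gene spans 4095 bp on the contig, which is longer than the 1827-nt CDS listed under Sequences.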


Homology
GenBank top hits (e value, % identity, alignment)
KAE8637469.1 hypothetical protein CSA_004502 [Cucumis sativus] (e value: 0.0; % identity: 95.9)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSF--------------------------ETLVRV
        PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSF                          ETLVRV
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSF--------------------------ETLVRV

Query:  LLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQM
        LLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQM
Subjt:  LLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQM

Query:  PSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT
        PSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT
Subjt:  PSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT

Query:  ENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPN
        ENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPN
Subjt:  ENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPN

Query:  TDATQLKSAVENNDTHSEGEVHNPHSDKDSDMKQ
        TDATQLKSAVENNDTHSEGEVHNPHSDKDSDMKQ
Subjt:  TDATQLKSAVENNDTHSEGEVHNPHSDKDSDMKQ

TYK10112.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 0.0; % identity: 84.92)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+N FLGENYEFTLAQSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKL-LSQL
        QGVRVFDNTISN R+K+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKKRNKK     SQL
Subjt:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKL-LSQL

Query:  KSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT
        KSAVE            N+   +EDTTM+E VN ALVS APTMSTTENSS K   +KAK+KNK+ K+V+  LVPN DATQLKSAVENNDT
Subjt:  KSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT

XP_004135698.1 uncharacterized protein LOC101207150 [Cucumis sativus] (e value: 0.0; % identity: 100)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLK
        QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLK
Subjt:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLK

Query:  SAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHS
        SAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHS
Subjt:  SAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHS

Query:  DKDSDMKQ
        DKDSDMKQ
Subjt:  DKDSDMKQ

XP_008450830.1 PREDICTED: uncharacterized protein LOC103492302 [Cucumis melo] (e value: 2.97e-301; % identity: 88.71)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+NPFLGENYEFTL QSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLL
        QGVRVFDNTISN R+K+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKK  K+  L+
Subjt:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLL

XP_038880003.1 uncharacterized protein LOC120071696 [Benincasa hispida] (e value: 1.08e-287; % identity: 72.95)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRS-SFNPKGDFLERVAAMKVLFQ
        MAL LVESM+S+NPLK+NPFLGENYEFTLAQSIQNV+AEIRKGN  FSQFT+ FY+LIQARADPPLESIWFYSAL FRS   N KGDFLERVAAMKVLFQ
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRS-SFNPKGDFLERVAAMKVLFQ

Query:  LVCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKND-KSLDFSLITPFMDLISIWTQPNEGLDQ
        LV SCSAPCGSSKTI LLSPVVSEVYKL++DM GKDL S REKKAMREVKSLVEAILGF+NLSS +DSDKND +SLDF+LITPF+DLIS+WT PNEGLDQ
Subjt:  LVCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKND-KSLDFSLITPFMDLISIWTQPNEGLDQ

Query:  FLPLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVL
        FLPLV SEVR EFSSG CDVRRLAGVVIAE FLMKLCLDFN G SRQDLEKDL  W VGSIT+IRNFY FETLVR LLEATLPVTSLLST++EALLRKVL
Subjt:  FLPLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVL

Query:  YDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKA
        YD+LILV+YSFLKPE AI+LPAEHVA LAVKRLILT+EAIEFYREHGDQ+RAISYLNAFSSS VSSQIIRW+KSQMPSNEN+  PNG SPK+ LEWLL+A
Subjt:  YDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKA

Query:  EDQGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQ
        EDQGVRVFD TISNR +KLVLDTSKSVS EGDKVDDDLLFYIDKQG + NGSE DTTMDESVNAAL                                  
Subjt:  EDQGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQ

Query:  LKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNP
                                             VS A TMSTTEN S K  +R  KRKN+K K VKYDLVP++D TQ +S  +NNDT SEG+VHNP
Subjt:  LKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNP

Query:  HSDKDSDMKQ
        HSD DSD+K+
Subjt:  HSDKDSDMKQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M1W0 Uncharacterized protein (e value: 0.0; % identity: 100)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLK
        QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLK
Subjt:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLK

Query:  SAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHS
        SAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHS
Subjt:  SAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHS

Query:  DKDSDMKQ
        DKDSDMKQ
Subjt:  DKDSDMKQ

A0A1S3BQ57 uncharacterized protein LOC103492302 (e value: 1.44e-301; % identity: 88.71)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+NPFLGENYEFTL QSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLL
        QGVRVFDNTISN R+K+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKK  K+  L+
Subjt:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLL

A0A5D3CFU4 Pentatricopeptide repeat-containing protein (e value: 0.0; % identity: 84.92)
Query:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL
        MALGLVESMESINPLK+N FLGENYEFTLAQSIQNVLAEIRKGNVVFS+FT+ FYKLIQARADPPLESIWFYSAL FRSSFNPKGDFLERVAAMKVLFQL
Subjt:  MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQL

Query:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL
        VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNS REKKAMREVKSLVEAILG  NLSS EDS+KNDKSLDF+ ITPF+DLISIWT PNEGLDQFL
Subjt:  VCSCSAPCGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFL

Query:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD
        PLVCSEVREEFSSGECDVRRLAGVVIAE FL+KLCLDFN G SRQ LE+DL  W VGSIT+IRNFY FETLVR+LLEATLPVTSLLSTD+EALLRKVL D
Subjt:  PLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYD

Query:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED
        ALILVDYSFLKPE AINLPAEH AFLAVKRLILTYEA EFYR+HGDQNRAISYLNAFSSSLVSSQIIRW+KSQMPSNENLN  NG SPKVFLEWLLKAED
Subjt:  ALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAED

Query:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKL-LSQL
        QGVRVFDNTISN R+K+VLDTSKSV FEGDKVDDDLLFYIDKQG N NG EED TMD+SVNAAL S A TMSTTENSSVKK SRKAKKRNKK     SQL
Subjt:  QGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKL-LSQL

Query:  KSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT
        KSAVE            N+   +EDTTM+E VN ALVS APTMSTTENSS K   +KAK+KNK+ K+V+  LVPN DATQLKSAVENNDT
Subjt:  KSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDT

A0A6J1H8Q6 uncharacterized protein LOC111461526 (e value: 9.53e-269; % identity: 68.65)
Query:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP
        MES+N  KQ+PFLGENYEFTL QSIQNVLAEIR+GN+ FSQF + FY+LIQAR DPPLESIWFYSAL FRS  +   GDFL+RVA MK+LFQ  CSCSAP
Subjt:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP

Query:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV
        CGSSKTI LLSPVV EVYKL+ DM GKDL+S REKKAMREVKSLVE +LGF+NLSS +DSD+N +SLDF+L+TPF+DLISIW   NEGLDQFLPLV SEV
Subjt:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV

Query:  REEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY
        R EFSSG CD+RRLAGVVIAE FLMKLCLD N GRSRQDLE DL  WAVGSIT+I+NFY FETLVR LLEATLPV SLLST++EALLRK+LYDALILVDY
Subjt:  REEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY

Query:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD
        SFL  E AINLPA+HVAFLAVKRLILT+EAIEFYREHGDQNRAISYLNAFS+SLVSSQIIRW+KSQ+PSNEN N P G SPK+FLEWLLKAED GVRVFD
Subjt:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD

Query:  NTISNRRSKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGSEE-DTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLKSA
        +TISNRR+KLVLDTSKSVS     EG+ VDD+LLFYIDKQG N NGSEE D  MDESVNAAL SAA TMSTT+N S KK  ++  K+ KK+K        
Subjt:  NTISNRRSKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGSEE-DTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLKSA

Query:  VEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHSDK
                                                                         KYDLV N+D T+L+SAVE+NDT SEGEVHNPHSD+
Subjt:  VEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHSDK

Query:  DSDMKQ
        DSD K+
Subjt:  DSDMKQ

A0A6J1JEZ4 uncharacterized protein LOC111485155 (e value: 4.73e-269; % identity: 68.26)
Query:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP
        MES+N  KQ+PFLGENYEFTL QSIQNVLAEIR+GN+VFSQF + FY+LIQARADPPLESIWFYSAL FRS  +   GDFL+RVA MK+LFQ  CSCSAP
Subjt:  MESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNP-KGDFLERVAAMKVLFQLVCSCSAP

Query:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV
        CGSSKTI LL+PVV EVYKL+ DM GKDL S REKKAMREVKSLVE ILGF+NLSS +DSD+N +SLDF+L+TPF+DLISIWT  NEGLDQFLPLV SEV
Subjt:  CGSSKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEV

Query:  REEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY
        R EFSSG CD+RRLAGVVIAE FL+KLCLD N GRSRQDLE DL  WAVGSIT+I+NFY FETLVR LLEATLPV SLLST++EALLRK+LYDALILVDY
Subjt:  REEFSSGECDVRRLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDY

Query:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD
        SFL  E AINLPA+HVAFLAVKRLILT+EAIEFYREHGDQNRAISYLNAFS+SLVSSQIIRW+KSQ+PS+EN+N P G SPK+FLEWL KAED GVRVFD
Subjt:  SFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFD

Query:  NTISNRRSKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLKSAV
        +TISNRR+KLVLDTSKSVS     EG+ VDD+LLFYIDKQG N NGSEE                                                   
Subjt:  NTISNRRSKLVLDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLKSAV

Query:  EGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHSDKD
                           ED  M+E VN ALVSAA TMSTT+N   K  +R+  +K KK K  KYDLVPN+DAT+L+SAV++NDT S+ EVHNPH D+D
Subjt:  EGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHSDKD

Query:  SDMKQ
        SDMK+
Subjt:  SDMKQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G11780.1 unknown protein (e value: 1.1e-44; % identity: 28.02)
Query:  LAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARAD-PPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSPVVSEVYKL
        L  SI+ +L + R G   FS F   F +++    + PPLE +WFYSA++F SS     D  + V      FQL+ S S      K ++LLSPVV ++ +L
Subjt:  LAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARAD-PPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSPVVSEVYKL

Query:  VIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWT--------QPNEGLDQFLPLVCSEVREEFSSGECDVR
        VI  R             R+  SL+E I+ ++++   ++    D  +       F DL  +W         +  + L+ F+P     +R+E  S  C V 
Subjt:  VIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWT--------QPNEGLDQFLPLVCSEVREEFSSGECDVR

Query:  RLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLST--DNEALLRKVLYDALI-LVDYSFLKPEIAI
         LAG+V +++FL+ LC  F+    R +L+KDL    +  I+   + + F+ ++++LLE  L +TSL+    ++EA L +++ +A+I  V+  FL P    
Subjt:  RLAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLST--DNEALLRKVLYDALI-LVDYSFLKPEIAI

Query:  NLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSK
        +  + H+  +A+  L L  + +   R + DQ +   Y N FS+SL+   +I W+ SQ     + +    L+P  F+EWL+  E+QG RVF+   S   +K
Subjt:  NLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSK

Query:  LVLDTSK---SVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLF
         V+  S+   S+     K +++               ++DT M +  N +  S       + N+  +K  R  K+   K+KL     S ++ +  F
Subjt:  LVLDTSK---SVSFEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLF
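In the alignment blocks above, the unlabelled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch, the counts for one block can be tallied from that middle line alone. The function name `midline_stats` is hypothetical, and a single block's figures will generally differ from the whole-hit % identity reported next to each accession.

```python
from typing import Tuple


def midline_stats(midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    `midline` must already be cut to the aligned columns (same offset and
    width as the Query line). Do not strip() it: leading or trailing spaces
    are real mismatch columns.
    """
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative substitutions
    return identities, positives, len(midline)


# Final block of the XP_038880003.1 alignment: Query 'HSDKDSDMKQ'.
ident, pos, length = midline_stats("HSD DSD+K+")
print(f"{ident}/{length} identical ({100 * ident / length:.1f}%), {pos}/{length} positive")
# 7/10 identical (70.0%), 9/10 positive
```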


Sequences
CDS sequence
ATGGCTTTGGGTCTTGTTGAATCGATGGAATCTATCAACCCTTTAAAGCAAAATCCTTTCCTCGGAGAAAATTACGAGTTTACTCTTGCGCAATCAATCCAGAATGTCTT
AGCTGAAATTCGCAAAGGAAATGTTGTTTTTTCTCAATTTACGAAAAGATTCTACAAATTGATTCAAGCTAGAGCTGACCCACCATTAGAATCGATTTGGTTTTACTCCG
CATTAAAGTTTCGTAGTAGTTTCAATCCTAAAGGCGATTTTTTGGAACGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTATGTTCTTGTTCGGCTCCTTGTGGTTCT
TCGAAGACCATTACGTTGCTTTCTCCAGTGGTTTCCGAGGTGTATAAATTGGTTATCGACATGCGTGGAAAGGATTTGAACTCGACAAGGGAAAAGAAAGCAATGAGAGA
GGTTAAGTCTTTAGTTGAAGCAATTCTTGGTTTTATGAATCTGAGTTCACGTGAGGATTCGGACAAGAATGATAAATCTCTCGACTTCAGTTTGATTACTCCTTTTATGG
ATTTAATTAGTATCTGGACGCAGCCAAATGAGGGATTGGATCAGTTCCTACCGCTCGTGTGCAGTGAGGTTCGTGAGGAGTTTAGTTCGGGAGAGTGTGATGTTCGTCGC
TTGGCTGGAGTTGTAATCGCTGAGATATTTCTGATGAAACTGTGCTTGGATTTTAACTATGGACGTTCGAGGCAAGATTTGGAGAAAGATCTAATAACATGGGCTGTTGG
ATCAATAACTCAGATTAGAAATTTCTACTCTTTTGAAACTCTTGTAAGAGTCCTCCTGGAGGCAACTTTACCTGTGACGTCTCTTTTGAGTACTGACAATGAAGCTTTGT
TAAGGAAGGTTCTATATGATGCTCTTATATTGGTTGATTATTCGTTTTTGAAACCTGAGATAGCCATTAACTTACCTGCCGAGCATGTGGCGTTTCTTGCTGTTAAGAGA
TTGATTCTTACTTATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAATAGAGCCATCTCATATCTAAATGCCTTCTCGAGTTCTCTTGTTTCTTCTCAAATTAT
TAGATGGATCAAAAGCCAAATGCCTAGCAATGAGAATCTAAATTGCCCCAATGGGTTGTCACCTAAAGTGTTTCTTGAGTGGCTTCTCAAGGCTGAAGATCAAGGTGTAA
GAGTATTTGACAATACCATTTCCAATCGTCGATCCAAATTAGTTCTTGATACTTCCAAATCAGTCTCATTTGAGGGAGATAAAGTAGATGATGATCTTTTGTTTTACATC
GATAAGCAAGGGGGAAATGTAAATGGAAGTGAGGAGGACACGACAATGGATGAATCGGTAAATGCAGCTCTCGCTTCTGCTGCTCCTACAATGTCAACGACTGAAAATAG
TTCGGTCAAGAAGCTAAGTAGAAAAGCAAAAAAAAGGAATAAAAAGTTAAAGTTGTTAAGTCAGTTGAAGTCAGCTGTTGAGGGCGATCTTTTGTTTTGCATCAATAAGC
AAGGGGAAAATGAAAATGGAAATGAGGAGGACACGACAATGAATGAACCAGTAAATGAAGCTCTTGTTTCTGCGGCTCCTACAATGTCAACGACTGAAAATAGTTCGGTA
AAGAGTCTAAAGAGAAAGGCAAAAAGAAAGAATAAAAAAAATAAATTGGTTAAGTACGATCTGGTTCCGAACACTGATGCTACCCAGTTGAAGTCAGCTGTTGAGAATAA
CGATACACACAGCGAGGGGGAGGTTCATAATCCACACTCGGACAAAGATTCTGACATGAAACAGTAA
mRNA sequence
ATGGCTTTGGGTCTTGTTGAATCGATGGAATCTATCAACCCTTTAAAGCAAAATCCTTTCCTCGGAGAAAATTACGAGTTTACTCTTGCGCAATCAATCCAGAATGTCTT
AGCTGAAATTCGCAAAGGAAATGTTGTTTTTTCTCAATTTACGAAAAGATTCTACAAATTGATTCAAGCTAGAGCTGACCCACCATTAGAATCGATTTGGTTTTACTCCG
CATTAAAGTTTCGTAGTAGTTTCAATCCTAAAGGCGATTTTTTGGAACGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTATGTTCTTGTTCGGCTCCTTGTGGTTCT
TCGAAGACCATTACGTTGCTTTCTCCAGTGGTTTCCGAGGTGTATAAATTGGTTATCGACATGCGTGGAAAGGATTTGAACTCGACAAGGGAAAAGAAAGCAATGAGAGA
GGTTAAGTCTTTAGTTGAAGCAATTCTTGGTTTTATGAATCTGAGTTCACGTGAGGATTCGGACAAGAATGATAAATCTCTCGACTTCAGTTTGATTACTCCTTTTATGG
ATTTAATTAGTATCTGGACGCAGCCAAATGAGGGATTGGATCAGTTCCTACCGCTCGTGTGCAGTGAGGTTCGTGAGGAGTTTAGTTCGGGAGAGTGTGATGTTCGTCGC
TTGGCTGGAGTTGTAATCGCTGAGATATTTCTGATGAAACTGTGCTTGGATTTTAACTATGGACGTTCGAGGCAAGATTTGGAGAAAGATCTAATAACATGGGCTGTTGG
ATCAATAACTCAGATTAGAAATTTCTACTCTTTTGAAACTCTTGTAAGAGTCCTCCTGGAGGCAACTTTACCTGTGACGTCTCTTTTGAGTACTGACAATGAAGCTTTGT
TAAGGAAGGTTCTATATGATGCTCTTATATTGGTTGATTATTCGTTTTTGAAACCTGAGATAGCCATTAACTTACCTGCCGAGCATGTGGCGTTTCTTGCTGTTAAGAGA
TTGATTCTTACTTATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAATAGAGCCATCTCATATCTAAATGCCTTCTCGAGTTCTCTTGTTTCTTCTCAAATTAT
TAGATGGATCAAAAGCCAAATGCCTAGCAATGAGAATCTAAATTGCCCCAATGGGTTGTCACCTAAAGTGTTTCTTGAGTGGCTTCTCAAGGCTGAAGATCAAGGTGTAA
GAGTATTTGACAATACCATTTCCAATCGTCGATCCAAATTAGTTCTTGATACTTCCAAATCAGTCTCATTTGAGGGAGATAAAGTAGATGATGATCTTTTGTTTTACATC
GATAAGCAAGGGGGAAATGTAAATGGAAGTGAGGAGGACACGACAATGGATGAATCGGTAAATGCAGCTCTCGCTTCTGCTGCTCCTACAATGTCAACGACTGAAAATAG
TTCGGTCAAGAAGCTAAGTAGAAAAGCAAAAAAAAGGAATAAAAAGTTAAAGTTGTTAAGTCAGTTGAAGTCAGCTGTTGAGGGCGATCTTTTGTTTTGCATCAATAAGC
AAGGGGAAAATGAAAATGGAAATGAGGAGGACACGACAATGAATGAACCAGTAAATGAAGCTCTTGTTTCTGCGGCTCCTACAATGTCAACGACTGAAAATAGTTCGGTA
AAGAGTCTAAAGAGAAAGGCAAAAAGAAAGAATAAAAAAAATAAATTGGTTAAGTACGATCTGGTTCCGAACACTGATGCTACCCAGTTGAAGTCAGCTGTTGAGAATAA
CGATACACACAGCGAGGGGGAGGTTCATAATCCACACTCGGACAAAGATTCTGACATGAAACAGTAA
Protein sequence
MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQARADPPLESIWFYSALKFRSSFNPKGDFLERVAAMKVLFQLVCSCSAPCGS
SKTITLLSPVVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLITPFMDLISIWTQPNEGLDQFLPLVCSEVREEFSSGECDVRR
LAGVVIAEIFLMKLCLDFNYGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLLSTDNEALLRKVLYDALILVDYSFLKPEIAINLPAEHVAFLAVKR
LILTYEAIEFYREHGDQNRAISYLNAFSSSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSKLVLDTSKSVSFEGDKVDDDLLFYI
DKQGGNVNGSEEDTTMDESVNAALASAAPTMSTTENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEALVSAAPTMSTTENSSV
KSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGEVHNPHSDKDSDMKQ
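
The CDS and protein sequences listed here can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code applies (a reasonable assumption for a nuclear plant gene, but not stated in the record); `translate` is a hypothetical helper and is demonstrated only on the first 33 and last 15 nucleotides of the CDS.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as 'N'
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 11 codons and last 5 codons of the CDS sequence in this record.
print(translate("ATGGCTTTGGGTCTTGTTGAATCGATGGAATCT"))  # MALGLVESMES
print(translate("GACATGAAACAGTAA"))                    # DMKQ*
```

Both fragments agree with the start (`MALGLVESMES...`) and end (`...DMKQ`) of the protein sequence, with the trailing `*` corresponding to the terminal `TAA` stop codon.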