; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22447 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22447
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionTransmembrane protein
Genome locationCarg_Chr03:731719..733152
RNA-Seq ExpressionCarg22447
SyntenyCarg22447
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603211.1 hypothetical protein SDJN03_03820, partial [Cucurbita argyrosperma subsp. sororia]8.5e-25796.92Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSS---SSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRL
        MG+PLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSS   SSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRL
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSS---SSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRL

Query:  LSQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRF
        LSQKPSPIEVPSLENRWIPFRKMWKPKPATV TSTAALQRMGTLHMRGTRAMADLTVVHVSED+GEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRF
Subjt:  LSQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRF

Query:  RPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLR
        RPIIQEENESFLKLL RYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLR
Subjt:  RPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLR

Query:  RWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSN
        RWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVI FPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSN
Subjt:  RWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSN

Query:  SVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        SVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA       GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
Subjt:  SVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

XP_022928599.1 uncharacterized protein LOC111435457 [Cucurbita moschata]5.5e-25696.5Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP--SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLL
        MG+PLAGKSKSTSGENWGMGLLLVFFSDD+PSAIADQNKLFP  SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLL
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP--SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLL

Query:  SQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR
        SQKPSPIEVPSLENRWIPFRKMWKPKPATV TSTAALQRMGTLHMRGTRAMADLTVVHVSED+GEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR
Subjt:  SQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR

Query:  PIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRR
        PIIQEENESFLKLL RYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRR
Subjt:  PIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRR

Query:  WACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNS
        WACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVI FPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKK+NS
Subjt:  WACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNS

Query:  VSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        VSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA       GSSSAPEKMMFLRGNTDNLGEINSV+RKKICSSEIDSSVYTDC
Subjt:  VSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

XP_022967686.1 uncharacterized protein LOC111467140 [Cucurbita maxima]9.7e-25396.29Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ
        MGL LAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSS RRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ

Query:  KPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI
        KPSPIEV SLEN+WIPFRKMWKPKPATV TSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI
Subjt:  KPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI

Query:  IQEENESFLKLLRRYRNLNGTASR-AAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRW
        IQEENESFLKLLRRYRNLNGT SR AAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRW
Subjt:  IQEENESFLKLLRRYRNLNGTASR-AAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRW

Query:  ACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSV
        ACYPMLLGR+RRNFKHVMLVDAKNSLLLGDPLGR RNKATESVI FPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVV+EIARILMQHKKSNSV
Subjt:  ACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSV

Query:  SDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        SDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA       GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
Subjt:  SDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

XP_023543504.1 uncharacterized protein LOC111803371 [Cucurbita pepo subsp. pepo]1.6e-263100Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ
        MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ

Query:  KPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI
        KPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI
Subjt:  KPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI

Query:  IQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRWA
        IQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRWA
Subjt:  IQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRWA

Query:  CYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSVS
        CYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSVS
Subjt:  CYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSVS

Query:  DSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        DSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
Subjt:  DSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

XP_038883664.1 uncharacterized protein LOC120074578 [Benincasa hispida]1.1e-21180.6Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSS-----SSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPR
        MGL L GKSKS++GENWGMGLLLVFFS+DS SAIADQ KLF SSS     SSS RRSNYNLL KAQSTISVCALLVF+SLLLFTLSTF+PAIKMNLTPPR
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSS-----SSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPR

Query:  RLLSQKPSPIEV--PSLENRWIPFRKMWKPKPA-----TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVF
        RLLSQK  PIEV  PS +N+W  F KMWK KPA         STAALQRMGTL+MRGTRAM DLTVVHVSED+GEEDLRLFLRLFHRSGVTAKSDSVFVF
Subjt:  RLLSQKPSPIEV--PSLENRWIPFRKMWKPKPA-----TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVF

Query:  PSPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVVSFDAAEMDSENS
        PSP  SLRF PII+EENESFLKLL +YRNLNGTASR+AAAGFDVT+F+KTKEKKE +EPIWGK++KR+ NDS    DELTR+SYGSVV FDAAE+D ENS
Subjt:  PSPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVVSFDAAEMDSENS

Query:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVV
        LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSL+LGDPL RVRNK TESVI F NKH+KKNSE+SN+HH VNPA+V+GGARG+RRLSNA VV
Subjt:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVV

Query:  EIARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        EIARILMQHKK NSVSDSGV+SHLVNSEF LKNVKVI + ESIPE SSLA       GSSSAPEKMMF RGN  N  EINSVI KKICSSEIDSSVY+DC
Subjt:  EIARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

TrEMBL top hitse value%identityAlignment
A0A0A0KWA6 Uncharacterized protein1.9e-20980.16Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLF----PSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRR
        MGL L GKSKST+G+NWGMGLLLVFFS+DSPS IAD   LF    PSSSS+S RRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTF+P IKMNLTPPRR
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLF----PSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRR

Query:  LLSQKPSPIEV-PSLENRWIPFRKMWKPKPA------TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFP
        LL+QK  PIE+   L NRW  FR+MWK KPA      T   ST ALQRMGTL+MRGTRAM DLTVVHVSEDIGEED RLFLRLFHRSGVTAKSDSVFVFP
Subjt:  LLSQKPSPIEV-PSLENRWIPFRKMWKPKPA------TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFP

Query:  SPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVVSFDAAEMDSENSL
        SPAFSLRF PII++ENESFLKLL RYRNLNGT SR+AAAGFDVT+  K+KEKKE +EPIWGK++KRLGN S    DELTR+SYGSVVSFDA E+D ENSL
Subjt:  SPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVVSFDAAEMDSENSL

Query:  SGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVE
        SGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNK TESVIFF NKHSKKNSEKSNSHH VNP++VIGGARG+RRLSNA  VE
Subjt:  SGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVE

Query:  IARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSL-------AGSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        I RILMQHKK NSVSDSGV+S LVNSEF LKNVKVIMA+ESIPEASSL        GS SAPEKMMF +GN  N GEINSVI KKICSSEIDSSVYT C
Subjt:  IARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSL-------AGSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

A0A1S3CD81 uncharacterized protein LOC1034995406.6e-20777.15Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP-----------------SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTF
        MGL L GKSKST+GENWGMGLLLVFFS+DSPS IAD + LFP                 SSSS+S RRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTF
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP-----------------SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTF

Query:  DPAIKMNLTPPRRLLSQKPSPIEV-PSLENRWIPFRKMWKPKPA------TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRS
        +P IKMNLTPPRRLL+QK  PI+V   L NRW  F KMWK KPA      T   ST ALQRMGTL+MRGTRAM DLTVVHVSED+GEED RLFLRLFHRS
Subjt:  DPAIKMNLTPPRRLLSQKPSPIEV-PSLENRWIPFRKMWKPKPA------TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRS

Query:  GVTAKSDSVFVFPSPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVV
        GVTAKSDSVF+FPSPAFSLRF PII+EEN+SFLKLL RYRNLN TASR+AAAGFDVTR  K+KEKKE +EPIWGK++KR  N S    DELTR+SYGSVV
Subjt:  GVTAKSDSVFVFPSPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVV

Query:  SFDAAEMDSENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGA
        SFDA E+D ENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNK TESVIFF NKH KKNSEKSNSHH VNP++VIGGA
Subjt:  SFDAAEMDSENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGA

Query:  RGVRRLSNAVVVEIARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSS-------SAPEKMMFLRGNTDNLGEINSVIRKKIC
        RG+RR+SNA +VEI R+LMQHKK NSVSDSGV+SHLVNSEF LKNVKVIMA+ESIPEASS  G         SAPEKMMF +GN  N GEINSVI KKIC
Subjt:  RGVRRLSNAVVVEIARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSS-------SAPEKMMFLRGNTDNLGEINSVIRKKIC

Query:  SSEIDSSVYTDC
        SSEIDSSVYTDC
Subjt:  SSEIDSSVYTDC

A0A5D3D8F3 Uncharacterized protein7.8e-20878.37Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP---------SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNL
        MGL L GKSKST+GENWGMGLLLVFFS+DSPS IAD + LFP         SSSS+S RRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTF+P IKMNL
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP---------SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNL

Query:  TPPRRLLSQKPSPIEV-PSLENRWIPFRKMWKPKPA------TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDS
        TPPRRLL+QK  PI+V   L NRW  F KMWK KPA      T   ST ALQRMGTL+MRGTRAM DLTVVHVSED+GEED RLFLRLFHRSGVTAKSDS
Subjt:  TPPRRLLSQKPSPIEV-PSLENRWIPFRKMWKPKPA------TVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDS

Query:  VFVFPSPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVVSFDAAEMD
        VF+FPSPAFSLRF PII+EEN+SFLKLL RYRNLN TASR+AAAGFDVTR  K+KEKKE +EPIWGK++KR  N S    DELTR+SYGSVVSFDA E+D
Subjt:  VFVFPSPAFSLRFRPIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDS----DELTRMSYGSVVSFDAAEMD

Query:  SENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSN
         ENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNK TESVIFF NKH KKNSEKSNSHH VNP++VIGGARG+RR+SN
Subjt:  SENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSN

Query:  AVVVEIARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSS-------SAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSV
        A +VEI R+LMQHKK NSVSDSGV+SHLVNSEF LKNVKVIMA+ESIPEASS  G         SAPEKMMF +GN  N GEINSVI KKICSSEIDSSV
Subjt:  AVVVEIARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSS-------SAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSV

Query:  YTDC
        YTDC
Subjt:  YTDC

A0A6J1ELB2 uncharacterized protein LOC1114354572.7e-25696.5Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP--SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLL
        MG+PLAGKSKSTSGENWGMGLLLVFFSDD+PSAIADQNKLFP  SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLL
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFP--SSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLL

Query:  SQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR
        SQKPSPIEVPSLENRWIPFRKMWKPKPATV TSTAALQRMGTLHMRGTRAMADLTVVHVSED+GEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR
Subjt:  SQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR

Query:  PIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRR
        PIIQEENESFLKLL RYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRR
Subjt:  PIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRR

Query:  WACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNS
        WACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVI FPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKK+NS
Subjt:  WACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNS

Query:  VSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        VSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA       GSSSAPEKMMFLRGNTDNLGEINSV+RKKICSSEIDSSVYTDC
Subjt:  VSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

A0A6J1HSV0 uncharacterized protein LOC1114671404.7e-25396.29Show/hide
Query:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ
        MGL LAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSS RRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ
Subjt:  MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQ

Query:  KPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI
        KPSPIEV SLEN+WIPFRKMWKPKPATV TSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI
Subjt:  KPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPI

Query:  IQEENESFLKLLRRYRNLNGTASR-AAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRW
        IQEENESFLKLLRRYRNLNGT SR AAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRW
Subjt:  IQEENESFLKLLRRYRNLNGTASR-AAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRW

Query:  ACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSV
        ACYPMLLGR+RRNFKHVMLVDAKNSLLLGDPLGR RNKATESVI FPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVV+EIARILMQHKKSNSV
Subjt:  ACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSV

Query:  SDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
        SDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA       GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC
Subjt:  SDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G57400.1 unknown protein8.5e-10647.6Show/hide
Query:  SKSTSGENWGMGLLLVFF------SDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPA----IKMNLTPPRRLL
        +K T     GMGLLLVFF      +DDSPS+ +      P++++    RS+  LL+KAQSTIS+C LL+FL+L LFTLSTF+P+       +  P RR L
Subjt:  SKSTSGENWGMGLLLVFF------SDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPA----IKMNLTPPRRLL

Query:  SQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR
          +       S   R+  F                ALQ MGTL +RGT++M DL VVH+S D  E+DLRLF+RL HRSGVT+KSD V +F S     RF 
Subjt:  SQKPSPIEVPSLENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFR

Query:  PIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKR--------LGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSD
         +I+EEN+SFLKL+  +RN +      +  GF++T+F+K + K    EPIWGKK  R        L N ++    +++GSVV FD  E+D ENSLSGF D
Subjt:  PIIQEENESFLKLLRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKR--------LGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSD

Query:  HIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIAR--
        H+P+SLRRWACYPMLLGRVRRNFKHVMLVDAK SL LGDPL R+RN++ ESV+FF +KHS  + + S    +VNPA++IGGA+G+RRLS+++  EI R  
Subjt:  HIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIAR--

Query:  ILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRG----NTDNLGEINSVIRKKICSSEIDSSVYTDC
        I  QHKK NSV++S V+S LV +    KN +V+ +   +PEASSLA        +SS     +  RG    N++++ +I ++I K+ICS E+DSSVY  C
Subjt:  ILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLA-------GSSSAPEKMMFLRG----NTDNLGEINSVIRKKICSSEIDSSVYTDC

AT5G52500.1 unknown protein4.0e-7944.31Show/hide
Query:  MGLLLVFFSDDSPSAIADQNKLF-PSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFD------PAIKMNLTPPRRLLSQKPSPIEVPSLE
        MGL LV F D + +   D +  + P+ ++  ++ S+  LL+KA+STIS C +L+FL+L LFTLSTF+      PAI    +P  R L             
Subjt:  MGLLLVFFSDDSPSAIADQNKLF-PSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFD------PAIKMNLTPPRRLLSQKPSPIEVPSLE

Query:  NRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPIIQEENESFLKL
        NR I  R+              ALQ MGTL +RGT++M DL + H++    E DLRLF+RL HRSGVT+KSD V +F SP+ + RF  +I++EN SFLKL
Subjt:  NRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPIIQEENESFLKL

Query:  LRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGN-DSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRWACYPMLLGRVR
        +  +RN + T+S +++                 +  IWGKK +   N +S+ELT   +GS+V FD  E+D ENSLSGF D +P+SLRRWACYPMLLGRVR
Subjt:  LRRYRNLNGTASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGN-DSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRWACYPMLLGRVR

Query:  RNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKS-NSVSDSGVVSHLV
        R+FKHVMLVDAK S  +GDP  R+RN++ +SV+FF +KH  KN+       +VNP ++IGGA+G+RRLS+++  EI R  M  K S   V++S V+S LV
Subjt:  RNFKHVMLVDAKNSLLLGDPLGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKS-NSVSDSGVVSHLV

Query:  NSEFSLKNVKVIMAAESIPEAS
         +    KN +V+++   +PEA+
Subjt:  NSEFSLKNVKVIMAAESIPEAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCTTCCTCTCGCCGGAAAATCCAAATCCACTTCCGGCGAGAATTGGGGAATGGGTCTTCTTCTCGTCTTCTTCTCCGACGACTCCCCTTCCGCCATTGCCGACCA
GAACAAGCTTTTTCCGTCTTCTTCTTCTTCTTCAGCTCGTCGGAGTAATTACAATCTTCTCACTAAAGCTCAGTCCACCATTTCCGTCTGTGCTCTGCTCGTTTTTCTTT
CTCTTCTTCTTTTCACTCTCTCCACCTTCGACCCCGCCATTAAAATGAACCTCACTCCCCCCCGGCGGCTTCTCTCCCAGAAACCCTCGCCGATTGAAGTCCCTTCGTTG
GAAAATCGGTGGATTCCGTTTCGGAAAATGTGGAAGCCGAAACCGGCGACGGTGGGGACGTCGACAGCGGCGCTGCAACGAATGGGGACTTTGCATATGCGAGGTACTCG
GGCTATGGCGGACTTGACGGTGGTCCATGTGTCGGAAGACATCGGAGAAGAAGACCTCCGCCTATTTCTTCGACTGTTTCATCGCTCCGGTGTCACCGCGAAATCCGATT
CTGTCTTCGTCTTCCCCTCGCCGGCGTTCTCGTTGAGATTCCGTCCGATTATTCAAGAGGAAAACGAATCGTTTCTGAAACTCCTTCGTCGGTACCGGAATTTGAACGGA
ACAGCCAGCCGGGCCGCGGCGGCGGGATTTGATGTCACCAGGTTTATTAAGACCAAAGAGAAGAAGGAGCCGGACGAGCCGATTTGGGGGAAGAAAATGAAACGACTCGG
AAACGATTCGGACGAGTTGACTCGGATGAGTTACGGCTCGGTAGTGAGTTTCGACGCGGCGGAAATGGATTCAGAGAATTCACTTTCCGGCTTCTCCGATCACATTCCGA
TGAGTCTGCGACGGTGGGCATGTTACCCGATGCTCCTCGGCCGAGTCCGCCGGAATTTCAAGCACGTAATGCTCGTGGACGCCAAAAACTCGCTTCTACTCGGCGATCCA
CTCGGCCGAGTCAGAAACAAAGCAACCGAGTCAGTGATTTTCTTCCCGAACAAGCACAGCAAAAAGAACTCGGAAAAATCAAACTCCCACCATCAGGTAAATCCGGCCGT
CGTGATCGGCGGCGCACGCGGTGTCCGGCGACTATCAAACGCGGTGGTGGTCGAAATCGCCCGAATTCTGATGCAGCACAAGAAGAGTAACTCGGTGTCCGACTCGGGAG
TAGTGAGTCACCTCGTTAACAGTGAGTTTTCATTAAAGAACGTGAAGGTGATTATGGCCGCCGAGTCGATCCCAGAAGCGAGTTCACTCGCCGGATCGTCGTCGGCGCCG
GAGAAGATGATGTTCCTGAGGGGCAATACTGATAATTTGGGTGAAATTAATTCTGTTATTAGGAAAAAAATATGTTCGTCGGAAATTGATTCTTCTGTCTATACCGATTG
TTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTCTTCCTCTCGCCGGAAAATCCAAATCCACTTCCGGCGAGAATTGGGGAATGGGTCTTCTTCTCGTCTTCTTCTCCGACGACTCCCCTTCCGCCATTGCCGACCA
GAACAAGCTTTTTCCGTCTTCTTCTTCTTCTTCAGCTCGTCGGAGTAATTACAATCTTCTCACTAAAGCTCAGTCCACCATTTCCGTCTGTGCTCTGCTCGTTTTTCTTT
CTCTTCTTCTTTTCACTCTCTCCACCTTCGACCCCGCCATTAAAATGAACCTCACTCCCCCCCGGCGGCTTCTCTCCCAGAAACCCTCGCCGATTGAAGTCCCTTCGTTG
GAAAATCGGTGGATTCCGTTTCGGAAAATGTGGAAGCCGAAACCGGCGACGGTGGGGACGTCGACAGCGGCGCTGCAACGAATGGGGACTTTGCATATGCGAGGTACTCG
GGCTATGGCGGACTTGACGGTGGTCCATGTGTCGGAAGACATCGGAGAAGAAGACCTCCGCCTATTTCTTCGACTGTTTCATCGCTCCGGTGTCACCGCGAAATCCGATT
CTGTCTTCGTCTTCCCCTCGCCGGCGTTCTCGTTGAGATTCCGTCCGATTATTCAAGAGGAAAACGAATCGTTTCTGAAACTCCTTCGTCGGTACCGGAATTTGAACGGA
ACAGCCAGCCGGGCCGCGGCGGCGGGATTTGATGTCACCAGGTTTATTAAGACCAAAGAGAAGAAGGAGCCGGACGAGCCGATTTGGGGGAAGAAAATGAAACGACTCGG
AAACGATTCGGACGAGTTGACTCGGATGAGTTACGGCTCGGTAGTGAGTTTCGACGCGGCGGAAATGGATTCAGAGAATTCACTTTCCGGCTTCTCCGATCACATTCCGA
TGAGTCTGCGACGGTGGGCATGTTACCCGATGCTCCTCGGCCGAGTCCGCCGGAATTTCAAGCACGTAATGCTCGTGGACGCCAAAAACTCGCTTCTACTCGGCGATCCA
CTCGGCCGAGTCAGAAACAAAGCAACCGAGTCAGTGATTTTCTTCCCGAACAAGCACAGCAAAAAGAACTCGGAAAAATCAAACTCCCACCATCAGGTAAATCCGGCCGT
CGTGATCGGCGGCGCACGCGGTGTCCGGCGACTATCAAACGCGGTGGTGGTCGAAATCGCCCGAATTCTGATGCAGCACAAGAAGAGTAACTCGGTGTCCGACTCGGGAG
TAGTGAGTCACCTCGTTAACAGTGAGTTTTCATTAAAGAACGTGAAGGTGATTATGGCCGCCGAGTCGATCCCAGAAGCGAGTTCACTCGCCGGATCGTCGTCGGCGCCG
GAGAAGATGATGTTCCTGAGGGGCAATACTGATAATTTGGGTGAAATTAATTCTGTTATTAGGAAAAAAATATGTTCGTCGGAAATTGATTCTTCTGTCTATACCGATTG
TTAG
Protein sequenceShow/hide protein sequence
MGLPLAGKSKSTSGENWGMGLLLVFFSDDSPSAIADQNKLFPSSSSSSARRSNYNLLTKAQSTISVCALLVFLSLLLFTLSTFDPAIKMNLTPPRRLLSQKPSPIEVPSL
ENRWIPFRKMWKPKPATVGTSTAALQRMGTLHMRGTRAMADLTVVHVSEDIGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPAFSLRFRPIIQEENESFLKLLRRYRNLNG
TASRAAAAGFDVTRFIKTKEKKEPDEPIWGKKMKRLGNDSDELTRMSYGSVVSFDAAEMDSENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDP
LGRVRNKATESVIFFPNKHSKKNSEKSNSHHQVNPAVVIGGARGVRRLSNAVVVEIARILMQHKKSNSVSDSGVVSHLVNSEFSLKNVKVIMAAESIPEASSLAGSSSAP
EKMMFLRGNTDNLGEINSVIRKKICSSEIDSSVYTDC