; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg024817 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg024817
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold12:8370461..8373473
RNA-Seq ExpressionSpg024817
SyntenySpg024817
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018312.1 LysM domain receptor-like kinase 3 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-21673.39Show/hide
Query:  LPAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFR-----------------------RV
        LPAAI+M VR KSG+KPLRD++N  YGRTSSKSVATAKRKE+DNRSK+EEQDDALDRLLLVQSDLSALT Q +                          +
Subjt:  LPAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFR-----------------------RV

Query:  YAFVCLILIDEIVVKAFELKEMSNQGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISP
        +  V    IDE+VVKAFELK+M  QGRKEIESFTHVLSDMLSSLKPWVPRFQ  FS PSK SDDGI Q LA+E N LVNDTESNVIDSPD++++QDLISP
Subjt:  YAFVCLILIDEIVVKAFELKEMSNQGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISP

Query:  SPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGI
        SPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKH  YSKS+L G+TS  L GAQPCF ACGDLNENL EGNG+E   SV +P GSDLTKLG NLLE NG 
Subjt:  SPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGI

Query:  EPSGVEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLG
        +PSG EPSGSDLTQVGT HQRGFASP +LSKKN SML+MTPCLKMSPPKSCVLLEPISESS KDK+R Y+ATPFPVG    SSGSD SDGLALKYPELLG
Subjt:  EPSGVEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLG

Query:  IQRAHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGC-HDDKKIFSLEDPVGVSLPHIDNTP
        IQ+AH+SG +KK VEASPDWFMSPPKTCVLLEPSDS SVESA                            + DGC ++ KK F+ +DPVGVSLP IDNTP
Subjt:  IQRAHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGC-HDDKKIFSLEDPVGVSLPHIDNTP

Query:  MLKECESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        MLKECESVFRVGKRAGEETLKKELWLKFEAASANPF  DQ+LQKTS KGFLDLLDEVSCD
Subjt:  MLKECESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

XP_022153546.1 uncharacterized protein LOC111021026 isoform X1 [Momordica charantia]1.6e-21973.74Show/hide
Query:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE
        MP+ GKSG+K LRDVSN K GR SSKSV TA RKE+DNRSK+EEQDDALDRLLLVQSDLSALTHQ              IDE+VVKAFELKEM  QGRKE
Subjt:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE

Query:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
        IESFTHVLSDMLSSLKPWVPRFQK FSHP+  S+  IGQSLA ESN LVNDTE NVIDSPDH++VQ LISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
Subjt:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS

Query:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG
        LSSKHV+YS+S+ +GMTS I    GAQPCFI+CGD NENL EGNGIE                        + S G+PSGSD TK+G+NLLEGNGI P G
Subjt:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG

Query:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR
         E SGS+LTQVG T Q GF SP +LSK N S+L+MTPC KMSPPKSCVLLEPISESSHKD++R Y+ATPFPVG  DY SSGSDASDGLALKYPELLGIQ+
Subjt:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR

Query:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE
         H+SG RKKEVEASPDWFMSPPKTCVLLEPSDS SVE+AACD IDPP+TS VLN QLK S V  G +D+DGCH+ K  FS +DPVGVSL H+D+TPM K 
Subjt:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE

Query:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        CESV R GKRAGEETLK+ELW+KFEAASANPF  +Q L+ TSKKGFLDLLDEVSCD
Subjt:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

XP_022153547.1 uncharacterized protein LOC111021026 isoform X2 [Momordica charantia]1.7e-21673.38Show/hide
Query:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE
        MP+ GKSG+K LRDVSN K GR SSKSV TA RKE+DNRSK+EEQDDALDRLLLVQSDLSALTHQ              IDE+VVKAFELKEM  QGRKE
Subjt:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE

Query:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
        IESFTHVLSDMLSSLKPWVPRFQK FSHP+  S+  IGQSLA ESN LVNDTE NVIDSPDH++VQ LISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
Subjt:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS

Query:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG
        LSSKHV+YS+S+ +GMTS I    GAQPCFI+CGD NENL EGNGIE                        + S G+PSGSD TK+G+NLLEGNGI P G
Subjt:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG

Query:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR
         E SGS+LTQVG T Q GF SP +LSK N S+L+MTPC KMSPPKSCVLLEPISESSHKD++R Y+ATPFPVG  DY SSGSDASDGLALKYPELLGIQ+
Subjt:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR

Query:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE
         H+SG RKKEVEASPDWFMSPPKTCVLLEPSDS SVE+AACD IDPP+TS VLN QLK S V  G +D+DGCH+ K  FS +  VGVSL H+D+TPM K 
Subjt:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE

Query:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        CESV R GKRAGEETLK+ELW+KFEAASANPF  +Q L+ TSKKGFLDLLDEVSCD
Subjt:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

XP_022956198.1 uncharacterized protein LOC111457965 [Cucurbita moschata]8.0e-21976.49Show/hide
Query:  PAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSN
        PAAI+M VR K G+KPLRD++N  YGRTSSKSV+TAKRKE+DNRSK+EEQDDALDRLLLVQSDLSALT Q              IDE+VVKAFELK+M  
Subjt:  PAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSN

Query:  QGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPL
        QGRKEIESFTHVLSDMLSSLKPWVPRFQ  FS PSK SDDGI Q LA+ESN LVN TESNVIDSPD++ +QDLISPSPLVSWRAGCNIERGRQLFLLTPL
Subjt:  QGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPL

Query:  PISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVEPSGSDLTQVGTTHQRGFA
        PISKSLSSKH  YSKS+L G+TS  L GAQPCF ACGDLNENL EGNG+E   SV +P GSDLTKLG NLLEGNG +PSG EPSGSDLTQVGT HQRGFA
Subjt:  PISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVEPSGSDLTQVGTTHQRGFA

Query:  SPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHRSGTRKKEVEASPDWFMSP
        SP +LSKKN SML+MTPCLKMSPPKSCVLLEPISESS KDK+R Y+ATPFPVG  D SSGSD SDGLALKYPELLGIQ+AH+SG +KK VEASPDWFMSP
Subjt:  SPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHRSGTRKKEVEASPDWFMSP

Query:  PKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGC-HDDKKIFSLEDPVGVSLPHIDNTPMLKECESVFRVGKRAGEETLKKEL
        PKTCVLLEPSDS SVESA C                            DGC ++ KK F+ +DPVGVSLP IDNTPMLKECESVFRVGKRAGEETLKKEL
Subjt:  PKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGC-HDDKKIFSLEDPVGVSLPHIDNTPMLKECESVFRVGKRAGEETLKKEL

Query:  WLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        WLKFEAASANPF  DQ+LQKTS KGFLDLLDEVSCD
Subjt:  WLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

XP_038882431.1 uncharacterized protein LOC120073701 isoform X1 [Benincasa hispida]2.0e-21772.86Show/hide
Query:  LPAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMS
        LPAAI+MPVRGKSGK PL DVSN KY RTSSKSV  A RKEN  +SK+EEQ+++LDRLLLVQSDLS LTHQ              IDE+VVKAFELKEM 
Subjt:  LPAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMS

Query:  NQGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTP
         QG++EIESFTHVLSDMLSSLKPWVPR QKVFS PSK SDDGI QSLA+ESN LVND E+NVIDSPDH++ QDLISPSPLVSWRAGCNIERGRQLFLLTP
Subjt:  NQGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTP

Query:  LPISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGI
        LPISKSLSSK+V +SKS+L+GMTS IL  AQPCFIACGDLNE+  EG+GIE                        +  VG+PSGSDLTKLG NL+EGNG+
Subjt:  LPISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGI

Query:  EPSGVEPS-GSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELL
        EPSG E S GSDLTQ G THQRGFASP +LSKKN SML+MTPC KMSPPKSCVLLEPISESSHKDK+R Y+ATPFPVG  DYSSGSDASDGLALKYPELL
Subjt:  EPSGVEPS-GSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELL

Query:  GIQRAHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTP
        GIQ+AH+SG RKK VEASPDW+MSPPKTCVLLEPSDS SVE A C                            DGCH+  K  S +DPVGVSLPHIDNTP
Subjt:  GIQRAHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTP

Query:  MLKECESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        MLKECESVFRVGKRAGEETLKKELWLKFEAASAN F  +QA+QKTSKKGFLDLLDEVSCD
Subjt:  MLKECESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

TrEMBL top hitse value%identityAlignment
A0A6J1DJ80 uncharacterized protein LOC111021026 isoform X28.1e-21773.38Show/hide
Query:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE
        MP+ GKSG+K LRDVSN K GR SSKSV TA RKE+DNRSK+EEQDDALDRLLLVQSDLSALTHQ              IDE+VVKAFELKEM  QGRKE
Subjt:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE

Query:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
        IESFTHVLSDMLSSLKPWVPRFQK FSHP+  S+  IGQSLA ESN LVNDTE NVIDSPDH++VQ LISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
Subjt:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS

Query:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG
        LSSKHV+YS+S+ +GMTS I    GAQPCFI+CGD NENL EGNGIE                        + S G+PSGSD TK+G+NLLEGNGI P G
Subjt:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG

Query:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR
         E SGS+LTQVG T Q GF SP +LSK N S+L+MTPC KMSPPKSCVLLEPISESSHKD++R Y+ATPFPVG  DY SSGSDASDGLALKYPELLGIQ+
Subjt:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR

Query:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE
         H+SG RKKEVEASPDWFMSPPKTCVLLEPSDS SVE+AACD IDPP+TS VLN QLK S V  G +D+DGCH+ K  FS +  VGVSL H+D+TPM K 
Subjt:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE

Query:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        CESV R GKRAGEETLK+ELW+KFEAASANPF  +Q L+ TSKKGFLDLLDEVSCD
Subjt:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

A0A6J1DKY8 uncharacterized protein LOC111021026 isoform X17.8e-22073.74Show/hide
Query:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE
        MP+ GKSG+K LRDVSN K GR SSKSV TA RKE+DNRSK+EEQDDALDRLLLVQSDLSALTHQ              IDE+VVKAFELKEM  QGRKE
Subjt:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE

Query:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
        IESFTHVLSDMLSSLKPWVPRFQK FSHP+  S+  IGQSLA ESN LVNDTE NVIDSPDH++VQ LISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
Subjt:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS

Query:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG
        LSSKHV+YS+S+ +GMTS I    GAQPCFI+CGD NENL EGNGIE                        + S G+PSGSD TK+G+NLLEGNGI P G
Subjt:  LSSKHVEYSKSILSGMTSSILN--GAQPCFIACGDLNENLPEGNGIES-----------------------KSSVGEPSGSDLTKLGQNLLEGNGIEPSG

Query:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR
         E SGS+LTQVG T Q GF SP +LSK N S+L+MTPC KMSPPKSCVLLEPISESSHKD++R Y+ATPFPVG  DY SSGSDASDGLALKYPELLGIQ+
Subjt:  VEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDY-SSGSDASDGLALKYPELLGIQR

Query:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE
         H+SG RKKEVEASPDWFMSPPKTCVLLEPSDS SVE+AACD IDPP+TS VLN QLK S V  G +D+DGCH+ K  FS +DPVGVSL H+D+TPM K 
Subjt:  AHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKE

Query:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        CESV R GKRAGEETLK+ELW+KFEAASANPF  +Q L+ TSKKGFLDLLDEVSCD
Subjt:  CESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

A0A6J1G9T2 uncharacterized protein LOC1114522435.8e-21573.47Show/hide
Query:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE
        MP+RGKSGKKPLRDVSN KYGRTSSKSVATAKRKE+D +SK+EEQDDALDRLLLVQSDLSA T+Q              IDE+ VKAFELKEM  QGRK+
Subjt:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE

Query:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
        IESFTH+LSD+LSSLKPWVPR QKV S PSK  D  I Q L++ESNV+VNDTE++VIDSPD ++V+DLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
Subjt:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS

Query:  LSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLN-----------------------ENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVE
        LSSKHV+YSKS LSGMTSSI+ GAQPCFIAC DLN                       ENL EGNGI    S G+PSGS+LTKLG+NLLEGNGI  SGVE
Subjt:  LSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLN-----------------------ENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVE

Query:  PSGSD-LTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAH
        PSGSD + QV  THQRGFASP +LSKKN SML+MTPCLKMSPPKSCVLLEPISESSHKDK+  Y+ATPFPVG QDYSSG DASDGLALKYPELLGIQ+AH
Subjt:  PSGSD-LTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAH

Query:  RSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKECE
        +  TR KEVEASPDWFMSPPKTCVLLEPSD  SV+SAAC                             GCH+ KK F  E PVGVSLPHIDNTPMLKECE
Subjt:  RSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKECE

Query:  SVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        SVFRVGKRAGEETLKKELWLKFEAASANP+ FDQALQKTSKKGFLD+LDEVSCD
Subjt:  SVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

A0A6J1GW51 uncharacterized protein LOC1114579653.9e-21976.49Show/hide
Query:  PAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSN
        PAAI+M VR K G+KPLRD++N  YGRTSSKSV+TAKRKE+DNRSK+EEQDDALDRLLLVQSDLSALT Q              IDE+VVKAFELK+M  
Subjt:  PAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSN

Query:  QGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPL
        QGRKEIESFTHVLSDMLSSLKPWVPRFQ  FS PSK SDDGI Q LA+ESN LVN TESNVIDSPD++ +QDLISPSPLVSWRAGCNIERGRQLFLLTPL
Subjt:  QGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPL

Query:  PISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVEPSGSDLTQVGTTHQRGFA
        PISKSLSSKH  YSKS+L G+TS  L GAQPCF ACGDLNENL EGNG+E   SV +P GSDLTKLG NLLEGNG +PSG EPSGSDLTQVGT HQRGFA
Subjt:  PISKSLSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVEPSGSDLTQVGTTHQRGFA

Query:  SPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHRSGTRKKEVEASPDWFMSP
        SP +LSKKN SML+MTPCLKMSPPKSCVLLEPISESS KDK+R Y+ATPFPVG  D SSGSD SDGLALKYPELLGIQ+AH+SG +KK VEASPDWFMSP
Subjt:  SPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHRSGTRKKEVEASPDWFMSP

Query:  PKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGC-HDDKKIFSLEDPVGVSLPHIDNTPMLKECESVFRVGKRAGEETLKKEL
        PKTCVLLEPSDS SVESA C                            DGC ++ KK F+ +DPVGVSLP IDNTPMLKECESVFRVGKRAGEETLKKEL
Subjt:  PKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGC-HDDKKIFSLEDPVGVSLPHIDNTPMLKECESVFRVGKRAGEETLKKEL

Query:  WLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        WLKFEAASANPF  DQ+LQKTS KGFLDLLDEVSCD
Subjt:  WLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

A0A6J1KCQ1 uncharacterized protein LOC1114926861.2e-21272.69Show/hide
Query:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE
        MP+RGKSGKKPLRDVSN KYGRTSSKSVATAKRKE+D  SK+EEQDD+LDRLLLVQSDLSA T+Q              IDE+VVKAFELKEM  QGRK+
Subjt:  MPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKE

Query:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
        IESFTH+LSDMLSSLKPWVPR QKV S PSK  D  I Q L++ESNV+VNDTE++VIDSP  ++V+DLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS
Subjt:  IESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKS

Query:  LSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLN-----------------------ENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVE
        LSSKHV+Y KS LSGMTSSIL GAQPCF+AC DLN                       ENL EGNGI    S G+PSGS+LTKLG+NLLEG+GI  SGVE
Subjt:  LSSKHVEYSKSILSGMTSSILNGAQPCFIACGDLN-----------------------ENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVE

Query:  PSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHR
        PSGSD+ QV  THQRGFASP +LSKKN SML+MTPCLKMSPPKSCVLLEPISESSHKDK+  Y+ATPFPVG QDYSSG DASDGLALKYPELLGIQ+AH+
Subjt:  PSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHR

Query:  SGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKECES
           R KEVEASPDWFMSPPKTCVLLEPSD  SV+SAAC                             GC + KK F  E PVGVSLPHID+TPMLKECES
Subjt:  SGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTSGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKECES

Query:  VFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD
        VFRVGKRAGEETLKKELWLKFEAASANP+ FDQALQKTSKKGFLD+LDEVSCD
Subjt:  VFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G12540.1 unknown protein6.9e-5934.83Show/hide
Query:  EEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKEIESFTHVLSDMLSSLK-----------------PWVPRFQKV
        E  D  LD+L LV SD+ ++              L+ IDE+VV+A + K +S  G  E+ESF  VLSDMLSSLK                 PW PR Q+ 
Subjt:  EEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMSNQGRKEIESFTHVLSDMLSSLK-----------------PWVPRFQKV

Query:  FSHPSKASDDGIGQSL--ANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSILSGMTSSILNG
         S      +D   QSL   NE   L +      ++SP+ +Q + L+SPSPLV WR   N ++GRQLFLLTPLP+ KS   KH   SK     +T+  +  
Subjt:  FSHPSKASDDGIGQSL--ANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVEYSKSILSGMTSSILNG

Query:  AQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCV
                 D   N P     E+   V          LG++L+       + VE              +  +SP VL +K +S L+MTPCLK+SPPKSC 
Subjt:  AQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCLKMSPPKSCV

Query:  LLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMT
        + +P+ ESS   K+   ++T   +G    SSG + +D L  KYPELLGIQ  H   TRK ++E+SP W+ SPPKTCVL+EP + +          D P  
Subjt:  LLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMT

Query:  SGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKECESVF-RVGKRAGEETLKKELWLKFEAASANPFGFDQALQKT-----SK
            N+  ++   ++G   +                      +++TP+ KE ES+  R   +AGE TLKKELW +FE A+ +   F+     T     +K
Subjt:  SGVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKECESVF-RVGKRAGEETLKKELWLKFEAASANPFGFDQALQKT-----SK

Query:  KGFLDLLDEVS
        K F+++L+EVS
Subjt:  KGFLDLLDEVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGTTCAAGGTTTCCGAAGCGGAGGCTACCGGCGGCGATCAGGATGCCGGTGAGGGGGAAATCAGGGAAAAAGCCACTGAGGGACGTATCGAACTTCAAATACGG
CAGAACTTCCTCCAAATCTGTCGCTACAGCCAAGAGGAAGGAAAATGACAACAGGTCTAAGATTGAAGAGCAAGATGATGCTCTCGATCGCCTCCTTCTAGTTCAGTCCG
ATCTCTCCGCCCTCACTCACCAGTTAGTATTCCGTAGAGTGTATGCTTTTGTGTGTTTGATACTGATTGATGAAATCGTTGTGAAAGCATTTGAGCTGAAGGAAATGAGC
AACCAAGGGAGGAAAGAAATCGAATCTTTCACTCATGTCTTATCTGATATGCTGTCTTCTTTGAAGCCCTGGGTCCCCAGGTTTCAGAAGGTGTTCTCTCATCCATCAAA
AGCTTCTGATGATGGTATAGGACAATCGTTGGCTAATGAAAGCAATGTTTTGGTGAACGATACGGAAAGCAACGTTATTGATAGCCCAGACCATTCTCAAGTTCAAGATT
TGATCTCCCCTTCACCCCTCGTATCATGGCGGGCTGGATGCAATATTGAGAGAGGAAGACAGTTATTTTTACTCACACCTCTTCCTATTTCTAAATCACTCTCATCGAAA
CATGTGGAATATTCTAAATCTATACTTAGTGGAATGACTTCGAGCATACTCAATGGTGCACAACCATGTTTTATTGCATGTGGAGATTTAAACGAAAATCTGCCTGAAGG
TAATGGAATTGAGTCTAAGTCTAGTGTTGGTGAGCCTTCTGGATCTGATTTAACAAAACTGGGGCAGAATTTGTTAGAAGGTAATGGAATTGAGCCTAGTGGTGTTGAGC
CTTCTGGGTCTGATTTAACACAAGTGGGGACAACTCATCAGCGTGGATTTGCTTCCCCAGCAGTGTTATCAAAGAAAAATGTCTCTATGTTAATTATGACTCCATGCTTA
AAAATGTCGCCTCCAAAATCTTGTGTGCTTCTCGAACCCATTTCAGAGTCATCGCATAAAGACAAAAGAAGGCAGTACAGGGCCACACCTTTTCCCGTTGGAGCTCAAGA
TTACTCTTCTGGCAGTGACGCTTCTGATGGACTGGCTTTAAAGTACCCAGAACTCTTAGGAATTCAACGGGCTCATAGATCGGGAACTAGAAAGAAGGAGGTTGAAGCCT
CGCCGGACTGGTTTATGTCACCTCCAAAAACATGTGTTTTACTGGAGCCGTCTGATTCTCAGTCGGTGGAAAGTGCTGCTTGTGATAAAATCGATCCTCCTATGACTTCT
GGGGTCCTGAATTTGCAGTTGAAATCATCATCTGTATCAAAAGGAATCAATGATATTGATGGATGTCATGATGACAAGAAAATTTTCAGCCTCGAAGATCCAGTTGGTGT
CAGCTTGCCGCACATAGATAACACTCCCATGTTGAAGGAATGTGAAAGTGTATTCCGGGTTGGCAAACGTGCTGGCGAGGAGACTCTTAAAAAAGAACTCTGGCTGAAAT
TTGAAGCAGCATCAGCCAATCCATTTGGTTTTGACCAAGCTCTTCAAAAGACATCAAAGAAAGGTTTTCTGGACTTGCTGGATGAGGTTTCATGTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATAGTTCAAGGTTTCCGAAGCGGAGGCTACCGGCGGCGATCAGGATGCCGGTGAGGGGGAAATCAGGGAAAAAGCCACTGAGGGACGTATCGAACTTCAAATACGG
CAGAACTTCCTCCAAATCTGTCGCTACAGCCAAGAGGAAGGAAAATGACAACAGGTCTAAGATTGAAGAGCAAGATGATGCTCTCGATCGCCTCCTTCTAGTTCAGTCCG
ATCTCTCCGCCCTCACTCACCAGTTAGTATTCCGTAGAGTGTATGCTTTTGTGTGTTTGATACTGATTGATGAAATCGTTGTGAAAGCATTTGAGCTGAAGGAAATGAGC
AACCAAGGGAGGAAAGAAATCGAATCTTTCACTCATGTCTTATCTGATATGCTGTCTTCTTTGAAGCCCTGGGTCCCCAGGTTTCAGAAGGTGTTCTCTCATCCATCAAA
AGCTTCTGATGATGGTATAGGACAATCGTTGGCTAATGAAAGCAATGTTTTGGTGAACGATACGGAAAGCAACGTTATTGATAGCCCAGACCATTCTCAAGTTCAAGATT
TGATCTCCCCTTCACCCCTCGTATCATGGCGGGCTGGATGCAATATTGAGAGAGGAAGACAGTTATTTTTACTCACACCTCTTCCTATTTCTAAATCACTCTCATCGAAA
CATGTGGAATATTCTAAATCTATACTTAGTGGAATGACTTCGAGCATACTCAATGGTGCACAACCATGTTTTATTGCATGTGGAGATTTAAACGAAAATCTGCCTGAAGG
TAATGGAATTGAGTCTAAGTCTAGTGTTGGTGAGCCTTCTGGATCTGATTTAACAAAACTGGGGCAGAATTTGTTAGAAGGTAATGGAATTGAGCCTAGTGGTGTTGAGC
CTTCTGGGTCTGATTTAACACAAGTGGGGACAACTCATCAGCGTGGATTTGCTTCCCCAGCAGTGTTATCAAAGAAAAATGTCTCTATGTTAATTATGACTCCATGCTTA
AAAATGTCGCCTCCAAAATCTTGTGTGCTTCTCGAACCCATTTCAGAGTCATCGCATAAAGACAAAAGAAGGCAGTACAGGGCCACACCTTTTCCCGTTGGAGCTCAAGA
TTACTCTTCTGGCAGTGACGCTTCTGATGGACTGGCTTTAAAGTACCCAGAACTCTTAGGAATTCAACGGGCTCATAGATCGGGAACTAGAAAGAAGGAGGTTGAAGCCT
CGCCGGACTGGTTTATGTCACCTCCAAAAACATGTGTTTTACTGGAGCCGTCTGATTCTCAGTCGGTGGAAAGTGCTGCTTGTGATAAAATCGATCCTCCTATGACTTCT
GGGGTCCTGAATTTGCAGTTGAAATCATCATCTGTATCAAAAGGAATCAATGATATTGATGGATGTCATGATGACAAGAAAATTTTCAGCCTCGAAGATCCAGTTGGTGT
CAGCTTGCCGCACATAGATAACACTCCCATGTTGAAGGAATGTGAAAGTGTATTCCGGGTTGGCAAACGTGCTGGCGAGGAGACTCTTAAAAAAGAACTCTGGCTGAAAT
TTGAAGCAGCATCAGCCAATCCATTTGGTTTTGACCAAGCTCTTCAAAAGACATCAAAGAAAGGTTTTCTGGACTTGCTGGATGAGGTTTCATGTGATTAG
Protein sequenceShow/hide protein sequence
MDSSRFPKRRLPAAIRMPVRGKSGKKPLRDVSNFKYGRTSSKSVATAKRKENDNRSKIEEQDDALDRLLLVQSDLSALTHQLVFRRVYAFVCLILIDEIVVKAFELKEMS
NQGRKEIESFTHVLSDMLSSLKPWVPRFQKVFSHPSKASDDGIGQSLANESNVLVNDTESNVIDSPDHSQVQDLISPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSK
HVEYSKSILSGMTSSILNGAQPCFIACGDLNENLPEGNGIESKSSVGEPSGSDLTKLGQNLLEGNGIEPSGVEPSGSDLTQVGTTHQRGFASPAVLSKKNVSMLIMTPCL
KMSPPKSCVLLEPISESSHKDKRRQYRATPFPVGAQDYSSGSDASDGLALKYPELLGIQRAHRSGTRKKEVEASPDWFMSPPKTCVLLEPSDSQSVESAACDKIDPPMTS
GVLNLQLKSSSVSKGINDIDGCHDDKKIFSLEDPVGVSLPHIDNTPMLKECESVFRVGKRAGEETLKKELWLKFEAASANPFGFDQALQKTSKKGFLDLLDEVSCD