; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028928 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028928
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionF-box domain-containing protein
Genome locationtig00153210:1686560..1697715
RNA-Seq ExpressionSgr028928
SyntenySgr028928
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001810 - F-box domain
IPR008507 - Protein of unknown function DUF789
IPR036047 - F-box-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601277.1 F-box/kelch-repeat protein, partial [Cucurbita argyrosperma subsp. sororia]4.0e-17172.7Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        M KCKRRN+RQRKKRKLKDG+G+ AMDWAGLPRDILVMIF +L L+DCMSVDSVCKQWSNILAELPNWKR GFPWLLMSGQKDREMRTC S+LDN+ WEM
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+G YVWGSFQDWLILVKDLGCYSLEVSLLNPFS ++INLP LWNFY+KM LSGSPAEEN ICAAIHS++REIAFW+QGS+VWHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+E AT ISEIKAQF+EVRSLEINS+HVL+YLV+SSGEVLLVCRY+SEKPDA+ ET+NFEIY LD  QMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++ +                 S+ELG+GI+N IYFSNDD APWWNEWDSNHLKGLSS  GLDNSSRKDWG FHID D NG+FCFRGNRDNWAP WFTAP
Subjt:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

XP_022150678.1 uncharacterized protein LOC111018751 [Momordica charantia]2.8e-18076.67Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        MSKCKRRNRRQRKKRKLKDGR NS MDWAGLPRDILV IFGQLPLIDC+SVDSVCKQWSNILAELPNWKRCGFPWLLMSGQ DREMRTC S+LDN  WE+
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+GGYVWGSFQDWLILVKDLGCYSLEVSLLNPFS+RKINLPRLWNFY+KMVLSGSP EENMICAAIHSEH EIAFWVQGSE WHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                      N+CA+ +SEIKAQF+E R+LEINS HVL+YLVESSGEVLLVCRYFSEKPDA+LETVNFEIY LD SQM W
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        ER+                   STE G+G +NCIYFSNDDA PWWNEWDSNHLKGL+SRLGLDNSSRKDWGTFHI KD NGSFCFRGNRDNWAPIWFTAP
Subjt:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

XP_022957505.1 F-box protein At1g49360-like [Cucurbita moschata]4.0e-17172.7Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        M KCKRRN+RQRKKRKLKDG+G+ AMDWAGLPRDILVMIF +L L+DCMSVDSVCKQWSNILAELPNWKR GFPWLLMSGQKDREMRTC S+LDN+ WEM
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+G YVWGSFQDWLILVKDLGCYSLEVSLLNPFS ++INLP LWNFY+KM LSGSPAEEN ICAAIHS++REIAFW+QGS+VWHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+E AT ISEIKAQF+EVRSLEINS+HVL+YLV+SSGEVLLVCRY+SEKPDA+ ET+NFEIY LD  QMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++ +                 S+ELG+GI+N IYFSNDD APWWNEWDSNHLKGLSS  GLDNSSRKDWG FHID D NG+FCFRGNRDNWAP WFTAP
Subjt:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

XP_022979671.1 F-box protein At3g56470-like [Cucurbita maxima]4.0e-17172.7Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        M KCKRRN+RQRKKRKLKDG+GNS MDWAGLPRDILVMIF +L L+DCMSVDSVCKQWSNILAELPNWKR GFPWLLMSGQKDREMRTC S+LDN+ WEM
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+G YVWGSFQDWLILVKDLGCYSLEVSLLNPFS ++INLP LWNFY+KM LSGSPAEEN ICAAIHS++REIAFW+QGS+VWHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+E AT ISEIKAQF+EVRSLEI+S+HVL+YLV+SSGEVLLVCRYFS+KPDA+ ET+NFEIY LD  QMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++ +                 S+ELG+GI+N IYFSNDD APWWNEWDSNHLKGLSS  GLDNSSRKDW TFHID+D NG+FCFRGNRDNWAP WFTAP
Subjt:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

XP_023516457.1 F-box protein At1g49360-like [Cucurbita pepo subsp. pepo]3.4e-17072.7Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        M KCKRRN+RQRKKRKLKDG+GNSAMDWAGLPRDILVMIF +L L+DCMSVDSVCKQWSNILAELPNWKR GFPWLLMSGQKDREMRTC S+LDN+ WEM
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+G YVWGSFQDWLILVKDLGCYSLEVSLLNPFS ++INLP LWNFY+KM LSGSPAEEN I AAIHS++REIAFW+QGS+VWHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+E AT ISEIK QF+EVRSLEINS+HVL+YLV+SSGEVLLVCRY+SEKPDA+ ET+NFEIY LD  QMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++ +                 S+ELG+GI+N IYFSNDD APWWNEWDSNHLKGLSS  GLDNSSRKDWGTF ID+D NG+FCFRGNRDNWAP WFTAP
Subjt:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

TrEMBL top hitse value%identityAlignment
A0A1S4DWA3 uncharacterized protein LOC1034893021.4e-16470.47Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        MSK KRRN+RQRKKRKL+D RGNS  DWAGLPRDILVMIF QL L+DC+SVD+VCK WSNIL+ELPNWKR GFPWL+MSG+KDREMRTC SIL+NRIWE+
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        EL EA+G Y+WGS QDWLI+VKDLGCYSLEVSLLNPFS+RKINLPRLWNFY+KMVLSGSPAEEN ICAAIHS++REIAFWVQGS  WHKY+LE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+EC   ISEIK QF++VRSLEINS+HVL+YLVESSG+VLLVCRYFSEKPDAVLET+NFEIY LD SQMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++                   ST+LG+ I+N IYFSNDD APWWNEWDSNHLK LSSR GL+NSS KDWGTFHI +D NG FCF GNRDNW PIWFTAP
Subjt:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

A0A5D3CCV3 F-box protein1.4e-16470.47Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        MSK KRRN+RQRKKRKL+D RGNS  DWAGLPRDILVMIF QL L+DC+SVD+VCK WSNIL+ELPNWKR GFPWL+MSG+KDREMRTC SIL+NRIWE+
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        EL EA+G Y+WGS QDWLI+VKDLGCYSLEVSLLNPFS+RKINLPRLWNFY+KMVLSGSPAEEN ICAAIHS++REIAFWVQGS  WHKY+LE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+EC   ISEIK QF++VRSLEINS+HVL+YLVESSG+VLLVCRYFSEKPDAVLET+NFEIY LD SQMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++                   ST+LG+ I+N IYFSNDD APWWNEWDSNHLK LSSR GL+NSS KDWGTFHI +D NG FCF GNRDNW PIWFTAP
Subjt:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

A0A6J1DA29 uncharacterized protein LOC1110187511.4e-18076.67Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        MSKCKRRNRRQRKKRKLKDGR NS MDWAGLPRDILV IFGQLPLIDC+SVDSVCKQWSNILAELPNWKRCGFPWLLMSGQ DREMRTC S+LDN  WE+
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+GGYVWGSFQDWLILVKDLGCYSLEVSLLNPFS+RKINLPRLWNFY+KMVLSGSP EENMICAAIHSEH EIAFWVQGSE WHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                      N+CA+ +SEIKAQF+E R+LEINS HVL+YLVESSGEVLLVCRYFSEKPDA+LETVNFEIY LD SQM W
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        ER+                   STE G+G +NCIYFSNDDA PWWNEWDSNHLKGL+SRLGLDNSSRKDWGTFHI KD NGSFCFRGNRDNWAPIWFTAP
Subjt:  ERIS-----------------FSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

A0A6J1H0R2 F-box protein At1g49360-like2.0e-17172.7Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        M KCKRRN+RQRKKRKLKDG+G+ AMDWAGLPRDILVMIF +L L+DCMSVDSVCKQWSNILAELPNWKR GFPWLLMSGQKDREMRTC S+LDN+ WEM
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+G YVWGSFQDWLILVKDLGCYSLEVSLLNPFS ++INLP LWNFY+KM LSGSPAEEN ICAAIHS++REIAFW+QGS+VWHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+E AT ISEIKAQF+EVRSLEINS+HVL+YLV+SSGEVLLVCRY+SEKPDA+ ET+NFEIY LD  QMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++ +                 S+ELG+GI+N IYFSNDD APWWNEWDSNHLKGLSS  GLDNSSRKDWG FHID D NG+FCFRGNRDNWAP WFTAP
Subjt:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

A0A6J1IRF4 F-box protein At3g56470-like2.0e-17172.7Show/hide
Query:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM
        M KCKRRN+RQRKKRKLKDG+GNS MDWAGLPRDILVMIF +L L+DCMSVDSVCKQWSNILAELPNWKR GFPWLLMSGQKDREMRTC S+LDN+ WEM
Subjt:  MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEM

Query:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------
        ELLEA+G YVWGSFQDWLILVKDLGCYSLEVSLLNPFS ++INLP LWNFY+KM LSGSPAEEN ICAAIHS++REIAFW+QGS+VWHKYRLE       
Subjt:  ELLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLE-------

Query:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW
                                     E+E AT ISEIKAQF+EVRSLEI+S+HVL+YLV+SSGEVLLVCRYFS+KPDA+ ET+NFEIY LD  QMSW
Subjt:  -----------------------------ENECATLISEIKAQFYEVRSLEINSRHVLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSW

Query:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP
        E++ +                 S+ELG+GI+N IYFSNDD APWWNEWDSNHLKGLSS  GLDNSSRKDW TFHID+D NG+FCFRGNRDNWAP WFTAP
Subjt:  ERISF-----------------STELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSFCFRGNRDNWAPIWFTAP

Query:  LWW
        LWW
Subjt:  LWW

SwissProt top hitse value%identityAlignment
Q9LXZ3 F-box protein At3g564701.1e-0423.84Show/hide
Query:  SKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEME
        SK K++ +R+++  K K+        +  LP D+L ++  +LPL D +   +VCK W      L        PWL+   + D         +      + 
Subjt:  SKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEME

Query:  LLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYN-KMVLSGSPAEENMICAAIHS
          E  G  V  S   WL++       S ++   NPF+   + +P LW  Y+ +M  S +P   + +   + S
Subjt:  LLEAHGGYVWGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYN-KMVLSGSPAEENMICAAIHS

Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)8.4e-5845.69Show/hide
Query:  SNLERFLQSVTPSVPAQFFSKSTLRGWKTCDSEIQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGE
        SN+ERFL SVTPSVPA + SK+ +R     D E Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y     + SS + RR GE
Subjt:  SNLERFLQSVTPSVPAQFFSKSTLRGWKTCDSEIQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGE

Query:  ESDSDYRDSSSDGSSDSETKRRIKHTREPLHHNDPSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADKANDGADKI
        ES+SD+RDSSS+GSS SE++R + +++E +         S RMD+LSLR  H    ED SSD+ E  +SQGRL+FEYLERDLPY REP ADK +D A + 
Subjt:  ESDSDYRDSSSDGSSDSETKRRIKHTREPLHHNDPSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADKANDGADKI

Query:  P------------------------------------------------------------------------LRIFGLASYKFKGSSLWMRNGGVEHQL
        P                                                                        L +FGLASYK +G S+W   GG  HQL
Subjt:  P------------------------------------------------------------------------LRIFGLASYKFKGSSLWMRNGGVEHQL

Query:  ANSLLQAADHWLR
        ANSL QAAD+WLR
Subjt:  ANSLLQAADHWLR

AT2G01260.1 Protein of unknown function (DUF789)4.9e-6644.68Show/hide
Query:  GAGVRFGRGR-GDDRFYDSSRARKGLLSRQNDRLCRPQEDVSATPSCAPVVSP-----------LSNLERFLQSVTPSVPAQFFSKSTLRGWKTCD--SE
        GAG +  RGR GDD FY S++ R+   +++ D+L R Q DVS  PS AP  SP            SNL+RFL+SVTPSVPAQF SK+ LR  +  D  ++
Subjt:  GAGVRFGRGR-GDDRFYDSSRARKGLLSRQNDRLCRPQEDVSATPSCAPVVSP-----------LSNLERFLQSVTPSVPAQFFSKSTLRGWKTCD--SE

Query:  IQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTREPLHHND
        + PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++SS K RR G+ SDSD+RDSSSD SSDS+++R             
Subjt:  IQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTREPLHHND

Query:  PSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADKANDGA-------------------------------------
             S R+D +SLRD H    ED SSD+ E   SQGRL+FEYLERDLPY REP ADK  D A                                     
Subjt:  PSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADKANDGA-------------------------------------

Query:  ---------------------------------DKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLLQAADHWL
                                         +K+ L +FGLASYKF+G SLW   GG EHQL NSL QAAD WL
Subjt:  ---------------------------------DKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLLQAADHWL

AT2G01260.2 Protein of unknown function (DUF789)5.8e-5952.43Show/hide
Query:  GAGVRFGRGR-GDDRFYDSSRARKGLLSRQNDRLCRPQEDVSATPSCAPVVSP-----------LSNLERFLQSVTPSVPAQFFSKSTLRGWKTCD--SE
        GAG +  RGR GDD FY S++ R+   +++ D+L R Q DVS  PS AP  SP            SNL+RFL+SVTPSVPAQF SK+ LR  +  D  ++
Subjt:  GAGVRFGRGR-GDDRFYDSSRARKGLLSRQNDRLCRPQEDVSATPSCAPVVSP-----------LSNLERFLQSVTPSVPAQFFSKSTLRGWKTCD--SE

Query:  IQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTREPLHHND
        + PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++SS K RR G+ SDSD+RDSSSD SSDS+++R             
Subjt:  IQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTREPLHHND

Query:  PSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADKANDGADKIP
             S R+D +SLRD H    ED SSD+ E   SQGRL+FEYLERDLPY REP ADK  D A + P
Subjt:  PSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADKANDGADKIP

AT2G01260.3 Protein of unknown function (DUF789)8.4e-5853.1Show/hide
Query:  GAGVRFGRGR-GDDRFYDSSRARKGLLSRQNDRLCRPQEDVSATPSCAPVVSP-----------LSNLERFLQSVTPSVPAQFFSKSTLRGWKTCD--SE
        GAG +  RGR GDD FY S++ R+   +++ D+L R Q DVS  PS AP  SP            SNL+RFL+SVTPSVPAQF SK+ LR  +  D  ++
Subjt:  GAGVRFGRGR-GDDRFYDSSRARKGLLSRQNDRLCRPQEDVSATPSCAPVVSP-----------LSNLERFLQSVTPSVPAQFFSKSTLRGWKTCD--SE

Query:  IQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTREPLHHND
        + PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++SS K RR G+ SDSD+RDSSSD SSDS+++R             
Subjt:  IQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GVESSTKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTREPLHHND

Query:  PSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADK
             S R+D +SLRD H    ED SSD+ E   SQGRL+FEYLERDLPY REP ADK
Subjt:  PSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADK

AT4G16100.1 Protein of unknown function (DUF789)9.3e-4136.94Show/hide
Query:  CRPQEDVSATPSCAPVVSPLSNLERFLQSVTPSVPAQFFSKSTLRGWKTCDSEIQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQL
        C     VS+T +     S  SNL RFL   TP V  Q    ++ +GW+T + E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQL
Subjt:  CRPQEDVSATPSCAPVVSPLSNLERFLQSVTPSVPAQFFSKSTLRGWKTCDSEIQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQL

Query:  YGVES--STKPRRWGEESDSDY-RDSSSDGSSDSETKRRIKHTREPLHHNDPSIPASLRMDRLSLRDHHLGLHEDC---SSDEAE-SFNSQGRLLFEYLE
        Y   S   T  RR GEESD D  RD SSDGS+D                       S  + R SL +      + C   SSDE+E S NS G L+FEYLE
Subjt:  YGVES--STKPRRWGEESDSDY-RDSSSDGSSDSETKRRIKHTREPLHHNDPSIPASLRMDRLSLRDHHLGLHEDC---SSDEAE-SFNSQGRLLFEYLE

Query:  RDLPYSREPLADKAND----------------------------------------------------------------------GADKIPLRIFGLAS
          +P+ REPL DK ++                                                                       + K+PL  FGLAS
Subjt:  RDLPYSREPLADKAND----------------------------------------------------------------------GADKIPLRIFGLAS

Query:  YKFKGSSLWMRNGGVEHQLANSLLQAADHWLRR
        YKFK S     +   E+Q   +LL+ A+ WLRR
Subjt:  YKFKGSSLWMRNGGVEHQLANSLLQAADHWLRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAAGTGTAAAAGGAGGAATCGAAGGCAGAGAAAGAAACGTAAGTTGAAGGATGGAAGGGGAAATTCTGCTATGGATTGGGCAGGGCTCCCTCGTGATATTCTTGT
CATGATCTTCGGGCAGTTACCTCTAATTGATTGCATGTCAGTTGATAGTGTTTGTAAGCAGTGGAGCAACATCCTTGCTGAACTTCCAAATTGGAAGAGGTGTGGATTTC
CATGGCTTTTGATGTCAGGTCAAAAAGATAGAGAGATGAGAACTTGTGTCAGCATTCTAGACAATCGAATTTGGGAAATGGAGCTTCTCGAGGCGCATGGAGGTTATGTT
TGGGGATCCTTTCAGGACTGGCTAATCCTGGTGAAGGATCTTGGTTGTTATTCTCTTGAAGTTAGTTTGTTGAACCCCTTCTCTATAAGGAAGATTAACCTGCCAAGGCT
TTGGAACTTCTATAACAAGATGGTTCTTTCAGGGTCCCCTGCTGAAGAAAACATGATTTGTGCAGCCATTCATAGTGAGCACCGTGAAATTGCCTTTTGGGTTCAAGGAT
CAGAAGTATGGCACAAGTATAGACTAGAAGAGAATGAATGTGCTACACTCATTTCTGAGATCAAAGCACAGTTTTATGAAGTTAGAAGTCTGGAAATTAACAGTCGGCAT
GTCTTGAGGTATCTTGTGGAGTCCTCAGGGGAGGTTTTGCTGGTTTGTAGATATTTCAGTGAGAAACCCGATGCCGTACTCGAAACAGTAAATTTTGAGATATATTTGCT
TGATACTTCTCAGATGTCGTGGGAAAGGATCTCTTTCTCAACTGAGCTTGGACTTGGGATCACTAACTGTATATATTTCTCTAACGATGATGCTGCTCCTTGGTGGAATG
AATGGGATTCTAATCATTTGAAAGGATTGTCTTCTCGTCTTGGACTAGACAACTCCTCCAGAAAAGATTGGGGAACCTTTCACATCGACAAAGACTACAATGGAAGCTTT
TGTTTTCGTGGTAATCGTGACAATTGGGCACCGATCTGGTTCACTGCACCTCTATGGTGGCGGATCCGGCAATTCCCGTTTGGATTTGGGATAATTCGATCGTCCTCCAT
TTTCTTCGACTTGGAGTTCGCCATCGTTGACTTTGGAGCCGGTGTACGGTTTGGTCGCGGCCGGGGAGATGACCGGTTTTATGATTCGTCGAGGGCGCGCAAGGGCCTTC
TCAGTCGGCAAAATGATAGGCTCTGTAGACCTCAAGAAGACGTTTCGGCTACTCCATCCTGCGCGCCGGTTGTTTCACCGTTGAGTAATCTTGAGCGCTTCTTGCAGTCG
GTTACCCCGTCTGTGCCTGCTCAGTTTTTCTCCAAGAGTACGTTGAGAGGTTGGAAGACGTGCGATTCGGAGATACAACCGTACTTTGTGCTCGGTGATTTGTGGGAGGC
TTTCAAGGAGTGGAGCGCTTATGGGGCAGGAGTGCCTCTTCTGTTGAATAACACTGATGGTGTGGTTCAATATTATGTCCCGTATTTGTCTGGTATACAACTGTATGGTG
TGGAATCGTCTACAAAGCCAAGGCGATGGGGTGAGGAAAGTGACAGTGACTATAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACCAAGAGAAGAATAAAACAC
ACTAGAGAGCCACTCCACCATAATGATCCATCTATCCCAGCTTCTCTTAGAATGGATAGATTGTCTTTGAGGGACCATCACTTGGGACTTCATGAGGACTGCTCCAGTGA
TGAGGCTGAATCTTTCAATTCTCAAGGTCGCCTTCTATTTGAGTATCTTGAAAGAGACCTACCTTATTCACGTGAACCTTTGGCTGACAAGGCAAACGATGGTGCCGATA
AGATTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGGGTCTTCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAATTGGCAAACTCCCTCTTGCAGGCT
GCTGATCACTGGTTAAGACGGGGGGGGGGGGGGTGGTTCACTCGTTTGCTTTCGTTAAGGTGGGGGACTGGAGGAAAGGTTAAGGAATGGGACAGGTCCCAAAATACTGA
AGCTTTAGATGGTCGAAATGTAAGTGTACGGTTGCCAGACCCTACTCTCTGCTTTACACAGGGAGGGACCAGTCATAGGAACAGCGTCGGCTTGCCTGCTGGTCATAACC
AAATTTCTGTGATGGATCACGGTAATGGGAAAGCAGGGAATCAAAACTCCGATCACTTAATCGGGGCCCGGCAGCCATCCCAAGAAAAGCGGCCCCAAAGCAAGCATTTC
AATATCGCCGAATTTCATCAAAAGCAGATAAAGATAATTTCGAATTCTCCCCATTCTAAGCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAAGTGTAAAAGGAGGAATCGAAGGCAGAGAAAGAAACGTAAGTTGAAGGATGGAAGGGGAAATTCTGCTATGGATTGGGCAGGGCTCCCTCGTGATATTCTTGT
CATGATCTTCGGGCAGTTACCTCTAATTGATTGCATGTCAGTTGATAGTGTTTGTAAGCAGTGGAGCAACATCCTTGCTGAACTTCCAAATTGGAAGAGGTGTGGATTTC
CATGGCTTTTGATGTCAGGTCAAAAAGATAGAGAGATGAGAACTTGTGTCAGCATTCTAGACAATCGAATTTGGGAAATGGAGCTTCTCGAGGCGCATGGAGGTTATGTT
TGGGGATCCTTTCAGGACTGGCTAATCCTGGTGAAGGATCTTGGTTGTTATTCTCTTGAAGTTAGTTTGTTGAACCCCTTCTCTATAAGGAAGATTAACCTGCCAAGGCT
TTGGAACTTCTATAACAAGATGGTTCTTTCAGGGTCCCCTGCTGAAGAAAACATGATTTGTGCAGCCATTCATAGTGAGCACCGTGAAATTGCCTTTTGGGTTCAAGGAT
CAGAAGTATGGCACAAGTATAGACTAGAAGAGAATGAATGTGCTACACTCATTTCTGAGATCAAAGCACAGTTTTATGAAGTTAGAAGTCTGGAAATTAACAGTCGGCAT
GTCTTGAGGTATCTTGTGGAGTCCTCAGGGGAGGTTTTGCTGGTTTGTAGATATTTCAGTGAGAAACCCGATGCCGTACTCGAAACAGTAAATTTTGAGATATATTTGCT
TGATACTTCTCAGATGTCGTGGGAAAGGATCTCTTTCTCAACTGAGCTTGGACTTGGGATCACTAACTGTATATATTTCTCTAACGATGATGCTGCTCCTTGGTGGAATG
AATGGGATTCTAATCATTTGAAAGGATTGTCTTCTCGTCTTGGACTAGACAACTCCTCCAGAAAAGATTGGGGAACCTTTCACATCGACAAAGACTACAATGGAAGCTTT
TGTTTTCGTGGTAATCGTGACAATTGGGCACCGATCTGGTTCACTGCACCTCTATGGTGGCGGATCCGGCAATTCCCGTTTGGATTTGGGATAATTCGATCGTCCTCCAT
TTTCTTCGACTTGGAGTTCGCCATCGTTGACTTTGGAGCCGGTGTACGGTTTGGTCGCGGCCGGGGAGATGACCGGTTTTATGATTCGTCGAGGGCGCGCAAGGGCCTTC
TCAGTCGGCAAAATGATAGGCTCTGTAGACCTCAAGAAGACGTTTCGGCTACTCCATCCTGCGCGCCGGTTGTTTCACCGTTGAGTAATCTTGAGCGCTTCTTGCAGTCG
GTTACCCCGTCTGTGCCTGCTCAGTTTTTCTCCAAGAGTACGTTGAGAGGTTGGAAGACGTGCGATTCGGAGATACAACCGTACTTTGTGCTCGGTGATTTGTGGGAGGC
TTTCAAGGAGTGGAGCGCTTATGGGGCAGGAGTGCCTCTTCTGTTGAATAACACTGATGGTGTGGTTCAATATTATGTCCCGTATTTGTCTGGTATACAACTGTATGGTG
TGGAATCGTCTACAAAGCCAAGGCGATGGGGTGAGGAAAGTGACAGTGACTATAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACCAAGAGAAGAATAAAACAC
ACTAGAGAGCCACTCCACCATAATGATCCATCTATCCCAGCTTCTCTTAGAATGGATAGATTGTCTTTGAGGGACCATCACTTGGGACTTCATGAGGACTGCTCCAGTGA
TGAGGCTGAATCTTTCAATTCTCAAGGTCGCCTTCTATTTGAGTATCTTGAAAGAGACCTACCTTATTCACGTGAACCTTTGGCTGACAAGGCAAACGATGGTGCCGATA
AGATTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGGGTCTTCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAATTGGCAAACTCCCTCTTGCAGGCT
GCTGATCACTGGTTAAGACGGGGGGGGGGGGGGTGGTTCACTCGTTTGCTTTCGTTAAGGTGGGGGACTGGAGGAAAGGTTAAGGAATGGGACAGGTCCCAAAATACTGA
AGCTTTAGATGGTCGAAATGTAAGTGTACGGTTGCCAGACCCTACTCTCTGCTTTACACAGGGAGGGACCAGTCATAGGAACAGCGTCGGCTTGCCTGCTGGTCATAACC
AAATTTCTGTGATGGATCACGGTAATGGGAAAGCAGGGAATCAAAACTCCGATCACTTAATCGGGGCCCGGCAGCCATCCCAAGAAAAGCGGCCCCAAAGCAAGCATTTC
AATATCGCCGAATTTCATCAAAAGCAGATAAAGATAATTTCGAATTCTCCCCATTCTAAGCAGTGA
Protein sequenceShow/hide protein sequence
MSKCKRRNRRQRKKRKLKDGRGNSAMDWAGLPRDILVMIFGQLPLIDCMSVDSVCKQWSNILAELPNWKRCGFPWLLMSGQKDREMRTCVSILDNRIWEMELLEAHGGYV
WGSFQDWLILVKDLGCYSLEVSLLNPFSIRKINLPRLWNFYNKMVLSGSPAEENMICAAIHSEHREIAFWVQGSEVWHKYRLEENECATLISEIKAQFYEVRSLEINSRH
VLRYLVESSGEVLLVCRYFSEKPDAVLETVNFEIYLLDTSQMSWERISFSTELGLGITNCIYFSNDDAAPWWNEWDSNHLKGLSSRLGLDNSSRKDWGTFHIDKDYNGSF
CFRGNRDNWAPIWFTAPLWWRIRQFPFGFGIIRSSSIFFDLEFAIVDFGAGVRFGRGRGDDRFYDSSRARKGLLSRQNDRLCRPQEDVSATPSCAPVVSPLSNLERFLQS
VTPSVPAQFFSKSTLRGWKTCDSEIQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGVESSTKPRRWGEESDSDYRDSSSDGSSDSETKRRIKH
TREPLHHNDPSIPASLRMDRLSLRDHHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYSREPLADKANDGADKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLLQA
ADHWLRRGGGGWFTRLLSLRWGTGGKVKEWDRSQNTEALDGRNVSVRLPDPTLCFTQGGTSHRNSVGLPAGHNQISVMDHGNGKAGNQNSDHLIGARQPSQEKRPQSKHF
NIAEFHQKQIKIISNSPHSKQ