; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg009585 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg009585
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationscaffold7:4378374..4382785
RNA-Seq ExpressionSpg009585
SyntenySpg009585
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia]4.7e-19282.23Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   IT AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERDLPY+REPLADK S                 +
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I

Query:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW
           + +V  YPIYRIPTGQTLKDLDACFLTYHSLHTA+GG QSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLAN LSREADKW
Subjt:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW

Query:  LRDLQVNHPDFLFFSRRDATPY
        LR+LQVNHPDF+FFS+RD  PY
Subjt:  LRDLQVNHPDFLFFSRRDATPY

KAG7013005.1 hypothetical protein SDJN02_25760 [Cucurbita argyrosperma subsp. argyrosperma]1.0e-19479.19Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   IT AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKASIFLGFENVFR--------
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERDLPY+REPLADKAS FLGFEN+F         
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKASIFLGFENVFR--------

Query:  ----------------------------------YPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKG
                                          YPIYRIPTGQTLKDLDACFLTYHSLHTA+GG QSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKG
Subjt:  ----------------------------------YPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKG

Query:  SSLWMRNGGVEHQLANNLSREADKWLRDLQVNHPDFLFFSRRDATPY
        SSLWMRNGGVEHQLAN LSREADKWLR+LQVNHPDF+FFS+RD  PY
Subjt:  SSLWMRNGGVEHQLANNLSREADKWLRDLQVNHPDFLFFSRRDATPY

XP_022150656.1 uncharacterized protein LOC111018737 [Momordica charantia]4.1e-19683.49Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPH--PVVSPLSNLERFLQSVT
        MLGAGVRFGRGRGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD SLHS IT    RVASD ATKP+A+ NP+  PVVSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPH--PVVSPLSNLERFLQSVT

Query:  PSVPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSS
        PSVPAQF SKS+LRGWRT D ETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGME S KP       RRWGEESDSDYRDSS
Subjt:  PSVPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSS

Query:  SDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADK----ASIF----------
        SDGSSDS+TKRRIKH RE LHHNDPSITAPLR+DRLSLRDQH+GLHEDCSSDEAESFNS+GRLLFEYLERDLPY+REPLADK    AS F          
Subjt:  SDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADK----ASIF----------

Query:  ---LGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREAD
             + +V  YPIYRIPTGQTLKDLDACFLTYHSLHTAI GPQS QVPFVAYPCKTD AEK+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN+LS+ AD
Subjt:  ---LGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREAD

Query:  KWLRDLQVNHPDFLFFSRRDATPY
         WLR LQVNHPDFLFFSRRDATPY
Subjt:  KWLRDLQVNHPDFLFFSRRDATPY

XP_022945839.1 uncharacterized protein LOC111449961 isoform X2 [Cucurbita moschata]6.2e-19282.23Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   I  AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERD PY+REPLADK S                 +
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I

Query:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW
           + +V  YPIYRIPTGQTLKDLDACFLTYH LHTA+GGPQSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLAN LSREADKW
Subjt:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW

Query:  LRDLQVNHPDFLFFSRRDATPY
        LRDLQVNHPDF+FFSRRD  PY
Subjt:  LRDLQVNHPDFLFFSRRDATPY

XP_023541256.1 uncharacterized protein LOC111801477 [Cucurbita pepo subsp. pepo]4.0e-19181.99Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFG  RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   IT AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERDLPY+REPLADK S                 +
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I

Query:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW
           + +V  YPIYRIPTGQTLKDLDACFLTYHSLHT +GG QSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLAN LSREADKW
Subjt:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW

Query:  LRDLQVNHPDFLFFSRRDATPY
        LR+LQVNHPDF+FFSRRD  PY
Subjt:  LRDLQVNHPDFLFFSRRDATPY

TrEMBL top hitse value%identityAlignment
A0A6J1DC61 uncharacterized protein LOC1110187372.0e-19683.49Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPH--PVVSPLSNLERFLQSVT
        MLGAGVRFGRGRGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD SLHS IT    RVASD ATKP+A+ NP+  PVVSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPH--PVVSPLSNLERFLQSVT

Query:  PSVPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSS
        PSVPAQF SKS+LRGWRT D ETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGME S KP       RRWGEESDSDYRDSS
Subjt:  PSVPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSS

Query:  SDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADK----ASIF----------
        SDGSSDS+TKRRIKH RE LHHNDPSITAPLR+DRLSLRDQH+GLHEDCSSDEAESFNS+GRLLFEYLERDLPY+REPLADK    AS F          
Subjt:  SDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADK----ASIF----------

Query:  ---LGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREAD
             + +V  YPIYRIPTGQTLKDLDACFLTYHSLHTAI GPQS QVPFVAYPCKTD AEK+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN+LS+ AD
Subjt:  ---LGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREAD

Query:  KWLRDLQVNHPDFLFFSRRDATPY
         WLR LQVNHPDFLFFSRRDATPY
Subjt:  KWLRDLQVNHPDFLFFSRRDATPY

A0A6J1G225 uncharacterized protein LOC111449961 isoform X23.0e-19282.23Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   I  AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERD PY+REPLADK S                 +
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I

Query:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW
           + +V  YPIYRIPTGQTLKDLDACFLTYH LHTA+GGPQSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLAN LSREADKW
Subjt:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW

Query:  LRDLQVNHPDFLFFSRRDATPY
        LRDLQVNHPDF+FFSRRD  PY
Subjt:  LRDLQVNHPDFLFFSRRDATPY

A0A6J1G242 uncharacterized protein LOC111449961 isoform X11.3e-19081.65Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   I  AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERD PY+REPLADK S                 +
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I

Query:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG---PQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREA
           + +V  YPIYRIPTGQTLKDLDACFLTYH LHTA+GG   PQSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLAN LSREA
Subjt:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG---PQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREA

Query:  DKWLRDLQVNHPDFLFFSRRDATPY
        DKWLRDLQVNHPDF+FFSRRD  PY
Subjt:  DKWLRDLQVNHPDFLFFSRRDATPY

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X27.4e-19181.52Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   IT AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERD PY+REPLADK S                 +
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I

Query:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW
           + +V  YPIYRIPTGQTLKDLDACFLTYHSLHTA+GG QSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLAN LSREADKW
Subjt:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKW

Query:  LRDLQVNHPDFLFFSRRDATPY
        L++LQVNHPDF+FF RRD  PY
Subjt:  LRDLQVNHPDFLFFSRRDATPY

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X12.4e-18980.94Show/hide
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCAVKDVS+   IT AGDRV SD ATKP+    P P VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD
        VPAQFLSKSALRGW+TSD E QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKP       R+WGEE+DSDYRDSSSD
Subjt:  VPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSD

Query:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I
        GSSDS+TKRRIKHCREP HHNDPSITAPLRMDRLSLRDQHLG  EDCSSDEAES NSQG LLFEYLERD PY+REPLADK S                 +
Subjt:  GSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----------------I

Query:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGP---QSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREA
           + +V  YPIYRIPTGQTLKDLDACFLTYHSLHTA+GG    QSPQ+PFVAYPCKTD A+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLAN LSREA
Subjt:  FLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGP---QSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREA

Query:  DKWLRDLQVNHPDFLFFSRRDATPY
        DKWL++LQVNHPDF+FF RRD  PY
Subjt:  DKWLRDLQVNHPDFLFFSRRDATPY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)2.0e-8452.6Show/hide
Query:  AISNPHPVVSPLSNLERFLQSVTPSVPAQFLSKSALRGWRTSDLETQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYG-MES
        A+   H   +  SN+ERFL SVTPSVPA +LSK+ +R    SD+E+Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y  +++
Subjt:  AISNPHPVVSPLSNLERFLQSVTPSVPAQFLSKSALRGWRTSDLETQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYG-MES

Query:  STKPRRVGGILRRWGEESDSDYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPY
         T   +     RR GEES+SD+RDSSS+GSS S+++R + + +E +           RMD+LSLR +H    ED SSD+ E  +SQGRL+FEYLERDLPY
Subjt:  STKPRRVGGILRRWGEESDSDYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPY

Query:  TREPLADKAS-----------------IFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGP-QSPQVPFVAYPCKTDGAEKVPLRIFGLASY
         REP ADK S                 +   + +V  YPIY+IPTG TLKDLDACFLTYHSLHT   GP  +     V  P   +  EK+ L +FGLASY
Subjt:  TREPLADKAS-----------------IFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGP-QSPQVPFVAYPCKTDGAEKVPLRIFGLASY

Query:  KFKGSSLWMRNGGVEHQLANNLSREADKWLRDLQVNHPDFLFFSRR
        K +G S+W   GG  HQLAN+L + AD WLR  QVNHPDF+FF RR
Subjt:  KFKGSSLWMRNGGVEHQLANNLSREADKWLRDLQVNHPDFLFFSRR

AT2G01260.1 Protein of unknown function (DUF789)2.3e-9149.41Show/hide
Query:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTP
        MLGAG +  RGR G+D FY S++ R+   +++ D+L R Q   S  PS A    S H                +P  +S+        SNL+RFL+SVTP
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTP

Query:  SVPAQFLSKSALRGWRTSD--LETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRVGGILRRWGEESDS
        SVPAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++SS K RR        G+ SDS
Subjt:  SVPAQFLSKSALRGWRTSD--LETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRVGGILRRWGEESDS

Query:  DYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKA-----------
        D+RDSSSD SSDSD++R                    R+D +SLRDQH    ED SSD+ E   SQGRL+FEYLERDLPY REP ADK            
Subjt:  DYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKA-----------

Query:  ------SIFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANN
               +   + +V  YPIYRIPTG TLKDLDACFLTYHSLHT+ GG  S Q   +  P +   +EK+ L +FGLASYKF+G SLW   GG EHQL N+
Subjt:  ------SIFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANN

Query:  LSREADKWLRDLQVNHPDFLFFSRR
        L + ADKWL    V+HPDFLFF RR
Subjt:  LSREADKWLRDLQVNHPDFLFFSRR

AT2G01260.2 Protein of unknown function (DUF789)1.4e-6948.56Show/hide
Query:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTP
        MLGAG +  RGR G+D FY S++ R+   +++ D+L R Q   S  PS A    S H                +P  +S+        SNL+RFL+SVTP
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTP

Query:  SVPAQFLSKSALRGWRTSD--LETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRVGGILRRWGEESDS
        SVPAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++SS K RR        G+ SDS
Subjt:  SVPAQFLSKSALRGWRTSD--LETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRVGGILRRWGEESDS

Query:  DYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKA-----------
        D+RDSSSD SSDSD++R                    R+D +SLRDQH    ED SSD+ E   SQGRL+FEYLERDLPY REP ADK            
Subjt:  DYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKA-----------

Query:  ------SIFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG
               +   + +V  YPIYRIPTG TLKDLDACFLTYHSLHT+ GG
Subjt:  ------SIFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG

AT4G16100.1 Protein of unknown function (DUF789)3.4e-6341.46Show/hide
Query:  RGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSL---HSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPSVPAQFL
        R RGE+RFY+    RK    R+  RL   +       +  + D  +      I    +   SD +      S      +  SNL RFL   TP V  Q L
Subjt:  RGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSL---HSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPSVPAQFL

Query:  SKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDY-RDSSSDGSSDS
          ++ +GWRT + E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY       P R     RR GEESD D  RD SSDGS+D 
Subjt:  SKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDY-RDSSSDGSSDS

Query:  DTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAE-SFNSQGRLLFEYLERDLPYTREPLADKASIF-----------------LGF
                CRE L  N         + R SL ++        SSDE+E S NS G L+FEYLE  +P+ REPL DK S                     +
Subjt:  DTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAE-SFNSQGRLLFEYLERDLPYTREPLADKASIF-----------------LGF

Query:  ENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKWLRDL
         +V  YPIYRIP GQ+L++LDACFLT+HSL T   G  + +    +   K+  + K+PL  FGLASYKFK S     +   E+Q    L R A++WLR L
Subjt:  ENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKWLRDL

Query:  QVNHPDFLFF
        +V  PDF  F
Subjt:  QVNHPDFLFF

AT5G49220.1 Protein of unknown function (DUF789)1.5e-5838.23Show/hide
Query:  RGEDRFYDSSRARKGLLSRQNDRLCRPQE----------------GASATPSCAVKDVSLHSS---ITGAGDRVASDGATKPLAISNPHPVVSPLSNLER
        RGE+RFY+    R+     Q  +  R ++                 A+  P    K + +  S   +  +G  V +  +    + S    V+S  SNL+R
Subjt:  RGEDRFYDSSRARKGLLSRQNDRLCRPQE----------------GASATPSCAVKDVSLHSS---ITGAGDRVASDGATKPLAISNPHPVVSPLSNLER

Query:  FLQSVTPSVPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWG
        FL+  TP VPA+     +    +T + +   YFVL DLWE+F EWSAYGAGV     PL ++  D  VQYYVPYLSGIQLY ++   KPR   G      
Subjt:  FLQSVTPSVPAQFLSKSALRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWG

Query:  EESDSDYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----
                + SS+GSS+S T               P   +   ++R+SL+DQ   +    SS EAE  N QGRLLFEYLE + P+ REPLA+K S     
Subjt:  EESDSDYRDSSSDGSSDSDTKRRIKHCREPLHHNDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKAS-----

Query:  ------------IFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCK-TDGAEKVPLRIFGLASYKFKGSSLWMRNGGVE
                    +   + +V  YPIYRIP G TL++LDACFLT+HSL TA   PQS      A  C  +  + K+PL  FGLASYK K  S+W +N   E
Subjt:  ------------IFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCK-TDGAEKVPLRIFGLASYKFKGSSLWMRNGGVE

Query:  HQLANNLSREADKWLRDLQVNHPDFLFFS
         Q   +L + ADKWL+ LQV+HPD+ FF+
Subjt:  HQLANNLSREADKWLRDLQVNHPDFLFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGAGCGGGTGTACGGTTCGGTCGCGGCAGGGGAGAGGACCGGTTTTACGATTCATCCAGAGCGAGGAAGGGCCTTCTCAGTCGCCAAAACGATAGGCTCTGTAG
ACCTCAAGAAGGCGCTTCGGCTACTCCATCTTGCGCGGTTAAGGATGTTTCCCTGCATTCTTCGATTACAGGGGCTGGGGACCGTGTCGCGTCTGATGGAGCTACTAAAC
CACTTGCCATTTCTAATCCCCACCCGGTTGTTTCTCCCTTAAGTAATCTCGAGCGATTCTTGCAGTCGGTTACTCCGTCTGTGCCTGCTCAGTTTCTCTCTAAGAGTGCG
TTGAGAGGTTGGAGGACCAGCGATTTGGAGACGCAACCTTACTTCGTGCTTGGTGATTTGTGGGAGGCTTTCAAGGAATGGAGCGCGTATGGGGCTGGAGTGCCTCTTTT
ATTGAATAACACCGACGGTGTGGTTCAGTATTATGTTCCGTATTTGTCTGGTATACAATTGTATGGCATGGAATCGTCTACAAAACCGAGGAGGGTCGGAGGGATACTGA
GGCGATGGGGTGAGGAGAGTGACAGTGACTACAGAGATTCAAGTAGTGATGGTAGCAGTGATTCTGATACAAAGAGAAGAATAAAACACTGTAGAGAACCACTCCACCAT
AATGATCCGTCTATCACAGCTCCTCTTAGAATGGATAGATTGTCTTTGAGGGACCAGCATTTGGGACTTCATGAGGATTGCTCCAGTGATGAGGCTGAATCTTTCAATTC
TCAAGGTCGCCTTCTATTTGAGTATCTTGAAAGAGATCTACCGTATACACGTGAACCTTTGGCAGACAAGGCAAGTATTTTTCTGGGTTTTGAGAATGTATTCAGGTACC
CAATTTACAGGATACCAACTGGGCAAACATTAAAGGATCTCGATGCTTGCTTTCTCACATACCATTCTCTACACACAGCAATCGGAGGCCCTCAAAGCCCACAAGTACCA
TTTGTGGCATATCCTTGTAAGACGGATGGTGCCGAAAAGGTTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGGCTCGTCATTGTGGATGCGAAATGGTGG
AGTTGAGCATCAATTGGCAAACAACCTCTCGCGAGAAGCTGATAAGTGGTTAAGAGATCTCCAGGTCAATCACCCAGATTTTCTGTTCTTCAGCCGTCGAGATGCAACAC
CTTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGGAGCGGGTGTACGGTTCGGTCGCGGCAGGGGAGAGGACCGGTTTTACGATTCATCCAGAGCGAGGAAGGGCCTTCTCAGTCGCCAAAACGATAGGCTCTGTAG
ACCTCAAGAAGGCGCTTCGGCTACTCCATCTTGCGCGGTTAAGGATGTTTCCCTGCATTCTTCGATTACAGGGGCTGGGGACCGTGTCGCGTCTGATGGAGCTACTAAAC
CACTTGCCATTTCTAATCCCCACCCGGTTGTTTCTCCCTTAAGTAATCTCGAGCGATTCTTGCAGTCGGTTACTCCGTCTGTGCCTGCTCAGTTTCTCTCTAAGAGTGCG
TTGAGAGGTTGGAGGACCAGCGATTTGGAGACGCAACCTTACTTCGTGCTTGGTGATTTGTGGGAGGCTTTCAAGGAATGGAGCGCGTATGGGGCTGGAGTGCCTCTTTT
ATTGAATAACACCGACGGTGTGGTTCAGTATTATGTTCCGTATTTGTCTGGTATACAATTGTATGGCATGGAATCGTCTACAAAACCGAGGAGGGTCGGAGGGATACTGA
GGCGATGGGGTGAGGAGAGTGACAGTGACTACAGAGATTCAAGTAGTGATGGTAGCAGTGATTCTGATACAAAGAGAAGAATAAAACACTGTAGAGAACCACTCCACCAT
AATGATCCGTCTATCACAGCTCCTCTTAGAATGGATAGATTGTCTTTGAGGGACCAGCATTTGGGACTTCATGAGGATTGCTCCAGTGATGAGGCTGAATCTTTCAATTC
TCAAGGTCGCCTTCTATTTGAGTATCTTGAAAGAGATCTACCGTATACACGTGAACCTTTGGCAGACAAGGCAAGTATTTTTCTGGGTTTTGAGAATGTATTCAGGTACC
CAATTTACAGGATACCAACTGGGCAAACATTAAAGGATCTCGATGCTTGCTTTCTCACATACCATTCTCTACACACAGCAATCGGAGGCCCTCAAAGCCCACAAGTACCA
TTTGTGGCATATCCTTGTAAGACGGATGGTGCCGAAAAGGTTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGGCTCGTCATTGTGGATGCGAAATGGTGG
AGTTGAGCATCAATTGGCAAACAACCTCTCGCGAGAAGCTGATAAGTGGTTAAGAGATCTCCAGGTCAATCACCCAGATTTTCTGTTCTTCAGCCGTCGAGATGCAACAC
CTTACTGA
Protein sequenceShow/hide protein sequence
MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEGASATPSCAVKDVSLHSSITGAGDRVASDGATKPLAISNPHPVVSPLSNLERFLQSVTPSVPAQFLSKSA
LRGWRTSDLETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRVGGILRRWGEESDSDYRDSSSDGSSDSDTKRRIKHCREPLHH
NDPSITAPLRMDRLSLRDQHLGLHEDCSSDEAESFNSQGRLLFEYLERDLPYTREPLADKASIFLGFENVFRYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVP
FVAYPCKTDGAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANNLSREADKWLRDLQVNHPDFLFFSRRDATPY