; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036220 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036220
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold5:45087453..45091664
RNA-Seq ExpressionSpg036220
SyntenySpg036220
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]5.4e-22089.34Show/hide
Query:  MASSSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG
        MASS S SSSCSSCS FVV LLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLKHIR YLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHPELKG
Subjt:  MASSSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG

Query:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        HSPLEPPERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK
        TDQYEFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
        HGHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
Subjt:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ

Query:  GSNSVWGTYFYYGGPGRNVRCP
        GSN+VWGTYFYYGGPGR VRCP
Subjt:  GSNSVWGTYFYYGGPGRNVRCP

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata]4.5e-21989.1Show/hide
Query:  MASSSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG
        MASS S SSSCSSCS FVV LLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLKHIR YLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHPELKG
Subjt:  MASSSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG

Query:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        HSPLEPPERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK
        TDQYEFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSY GKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
        HGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
Subjt:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ

Query:  GSNSVWGTYFYYGGPGRNVRCP
        GSN+VWGTYFYYGGPGR VRCP
Subjt:  GSNSVWGTYFYYGGPGRNVRCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]3.7e-22189.5Show/hide
Query:  SSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
        +SSCSSSCSSCS FVVFLLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLK+IR YLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
Subjt:  SSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP

Query:  LEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LEPPERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH
        YEFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWKDPKHGH
Subjt:  YEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH

Query:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
        WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
Subjt:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN

Query:  SVWGTYFYYGGPGRNVRCP
        +VWGTYFYYGGPGR VRCP
Subjt:  SVWGTYFYYGGPGRNVRCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]6.5e-21888Show/hide
Query:  MASSSSCSSSCSSCS----FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPE
        MASS SCSSS SS S    FVV LLVFTS  S FSTS+AHQMP KNQT FHP KELKKLK+IR YLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHPE
Subjt:  MASSSSCSSSCSSCS----FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPE

Query:  LKGHSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA
        LKGHSPLEPPERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA
Subjt:  LKGHSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA

Query:  PRVTDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWK
        PRVTDQYEFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWK
Subjt:  PRVTDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWK

Query:  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD
        DPKHGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD
Subjt:  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD

Query:  IRQGSNSVWGTYFYYGGPGRNVRCP
        IRQGSN+VWGTYFYYGGPGR VRCP
Subjt:  IRQGSNSVWGTYFYYGGPGRNVRCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]1.8e-22088.86Show/hide
Query:  MASSSSCSSSCSSCSFVVFLLVF-TSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG
        MASSSS SSSCS   FVVFLLVF TSF+SVFS+S++HQ+PPKNQTFFHP KELKKLKHIR YLRKINKPP KTI+SSDGDVIDCVLSHLQPAFDHPELKG
Subjt:  MASSSSCSSSCSSCSFVVFLLVF-TSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG

Query:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        H+PLEPPERPRGN S EE+AENFQLWSASG+FCPEGT+PIRRTTE+DIFRASS RRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK
        TDQYEFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYSGKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
        HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ
Subjt:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ

Query:  GSNSVWGTYFYYGGPGRNVRCP
         +N+VWGTYFYYGGPGRNV+CP
Subjt:  GSNSVWGTYFYYGGPGRNVRCP

TrEMBL top hitse value%identityAlignment
A0A1S3AXP9 uncharacterized protein LOC1034837232.1e-21485.98Show/hide
Query:  MASSSSCSSSCSSCS-----FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHP
        MASSSS SSSCS CS     FVV LLVFTSF+SVFS+S++HQ+P KNQT FHP +ELKKLKHIR YLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHP
Subjt:  MASSSSCSSSCSSCS-----FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHP

Query:  ELKGHSPLEPPERPRG-NKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRR-VRRDSSGNGHEHAVVFVNGEQYYGAKASLN
        +LKGH+PLEPPERPRG N S+EE  ENFQLWS SGEFCPEGT+PIRRTTEKDI+RASS RR+GRKPIRR VRRDSSGNGHEHAVV+VNGEQYYGAKASLN
Subjt:  ELKGHSPLEPPERPRG-NKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRR-VRRDSSGNGHEHAVVFVNGEQYYGAKASLN

Query:  IWAPRVTDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLM
        IWAPRVTDQYEFS+SQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLM
Subjt:  IWAPRVTDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLM

Query:  VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSD
        VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL VLADHSD
Subjt:  VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSD

Query:  CYDIRQGSNSVWGTYFYYGGPGRNVRCP
        CYDIRQ +NSVWGTYFYYGGPGRNV+CP
Subjt:  CYDIRQGSNSVWGTYFYYGGPGRNVRCP

A0A6J1E7H5 uncharacterized protein LOC1114313344.4e-21284.32Show/hide
Query:  MASSSSCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        M SSSSC        FVV LLVFTSF+SVF TS+AH+ PPKN+TFFHP KEL +LKHIR YLRKINKP TKTIQSSDGDVIDCVLSHLQPAFDHP LKGH
Subjt:  MASSSSCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        +PL+PPERPRGN S EE+AE+FQLWSASG+FCPEGT+PIRRTTE+DI+RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKH
        DQ EFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+GKQFD+G+MVWKDPKH
Subjt:  DQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSGLLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNSVWGTYFYYGGPGRNVRCP
        +N  WGTYFYYGGPGRNV+CP
Subjt:  SNSVWGTYFYYGGPGRNVRCP

A0A6J1GRA4 uncharacterized protein LOC1114568102.2e-21989.1Show/hide
Query:  MASSSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG
        MASS S SSSCSSCS FVV LLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLKHIR YLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHPELKG
Subjt:  MASSSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKG

Query:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        HSPLEPPERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  HSPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK
        TDQYEFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSY GKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
        HGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
Subjt:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ

Query:  GSNSVWGTYFYYGGPGRNVRCP
        GSN+VWGTYFYYGGPGR VRCP
Subjt:  GSNSVWGTYFYYGGPGRNVRCP

A0A6J1JZN8 uncharacterized protein LOC1114902771.8e-22189.5Show/hide
Query:  SSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
        +SSCSSSCSSCS FVVFLLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLK+IR YLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
Subjt:  SSSCSSSCSSCS-FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP

Query:  LEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LEPPERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH
        YEFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWKDPKHGH
Subjt:  YEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH

Query:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
        WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
Subjt:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN

Query:  SVWGTYFYYGGPGRNVRCP
        +VWGTYFYYGGPGR VRCP
Subjt:  SVWGTYFYYGGPGRNVRCP

A0A6J1KJ26 uncharacterized protein LOC1114950546.8e-21384.8Show/hide
Query:  MASSSSCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        M SSSSC        FVV LLVFTSF+SVF TS+AH+ PPKNQT+FHP KEL +LKHIR YLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHP LKGH
Subjt:  MASSSSCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        +PL PPERPRGN S EE+AENFQLWSASG+FCPEGT+PIRRTTE+DI+RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKH
        DQ EFSLSQIW                  VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+GKQFD+G+MVWKDPKH
Subjt:  DQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSGLLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNSVWGTYFYYGGPGRNVRCP
        +N VWGTYFYYGGPGRNV+CP
Subjt:  SNSVWGTYFYYGGPGRNVRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)3.8e-16366.26Show/hide
Query:  SCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRGN
        +CS  +  L     +S FS+ L+  + P+NQT   P+ EL KLK I  +LRKINKP  KTI S DGD+IDCVL H QPAFDHP L+G  PL+PPERPRG+
Subjt:  SCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRGN

Query:  KSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW-
               ++FQLW   GE CPEGTVPIRRT E+DI RA+S+  FG+K +R  RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLSQIW 
Subjt:  KSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW-

Query:  -----------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL
                         VSPELYGDN PRFFTYWT DAYQATGCYNLLCSGFVQTN+ IAIGAAISP SSY G QFDI L++WKDPKHG+WWLE+GSG+L
Subjt:  -----------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL

Query:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVWGTYFYYG
        VGYWP+FLF+HL+ HASMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL VLADH +CYDI+ GSN  WG+YFYYG
Subjt:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVWGTYFYYG

Query:  GPGRNVRCP
        GPG+N +CP
Subjt:  GPGRNVRCP

AT1G23340.1 Protein of Unknown Function (DUF239)2.4e-16567.31Show/hide
Query:  SCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        S SSSC   +F++ L +F+S+ S  S S +  +P        P +E++K+K IR  L+KINKP  KTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P G     E  ENFQLWS  GE CPEGT+PIRRTTE+D+ RA+S+RRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW                  +SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVW

Query:  GTYFYYGGPGRNVRCP
        G +FYYGGPG+N +CP
Subjt:  GTYFYYGGPGRNVRCP

AT1G23340.2 Protein of Unknown Function (DUF239)2.4e-16567.31Show/hide
Query:  SCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        S SSSC   +F++ L +F+S+ S  S S +  +P        P +E++K+K IR  L+KINKP  KTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P G     E  ENFQLWS  GE CPEGT+PIRRTTE+D+ RA+S+RRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW                  +SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVW

Query:  GTYFYYGGPGRNVRCP
        G +FYYGGPG+N +CP
Subjt:  GTYFYYGGPGRNVRCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.9e-16266.75Show/hide
Query:  SCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRGN
        S SF+  +L+    +S FS++ +            P +EL+KL  IR  L KINKP  KTIQSSDGD IDCV +H QPAFDHP L+G  PL+PPE P+G 
Subjt:  SCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRGN

Query:  KSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV
           +   EN QLWS SGE CPEGT+PIRRTTE+D+ RASS++RFGRK IRRV+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIWV
Subjt:  KSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV

Query:  ------------------SPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL
                          SPELYGD  PRFFTYWT+DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG L
Subjt:  ------------------SPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL

Query:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVWGTYFYYG
        VGYWPAFLF+HL+ H SMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G+N VWG YFYYG
Subjt:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVWGTYFYYG

Query:  GPGRNVRCP
        GPG+N RCP
Subjt:  GPGRNVRCP

AT5G50150.1 Protein of Unknown Function (DUF239)5.6e-17571.23Show/hide
Query:  MASSSSCSSSCSSCS--FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELK
        MASSSS SS+ S+ +  F+  +L+ +    +   S  H    KNQT F P +E++KL+ +  YL KINKP  KTI S DGDVI+CV SHLQPAFDHP+L+
Subjt:  MASSSSCSSSCSSCS--FVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELK

Query:  GHSPLEPPERP-RGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAP
        G  PL+ P RP +GN++  E + N QLWS SGE CP G++PIR+TT+ D+ RA+S+RRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAP
Subjt:  GHSPLEPPERP-RGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAP

Query:  RVTDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKD
        RVTD YEFSLSQIW                  VSPELYGDN PRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY+G+QFDIGLM+WKD
Subjt:  RVTDQYEFSLSQIW------------------VSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKD

Query:  PKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDI
        PKHGHWWLE G+GLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLHVLADH  CYDI
Subjt:  PKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDI

Query:  RQGSNSVWGTYFYYGGPGRNVRCP
        RQG N+VWGTYFYYGGPGRN RCP
Subjt:  RQGSNSVWGTYFYYGGPGRNVRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTTCCTGTTCTTCTTCTTGTTCTTCTTGTTCTTTTGTTGTTTTCCTTCTGGTTTTTACTTCCTTCACCTCTGTTTTCTCAACTTCCTTAGCCCATCA
AATGCCACCAAAAAACCAAACTTTTTTCCACCCCATCAAAGAGCTGAAGAAACTAAAGCACATCAGAACTTATTTACGCAAAATCAACAAGCCTCCAACCAAGACAATTC
AGAGCTCAGATGGTGATGTCATAGACTGTGTGCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCGCCTGAGAGGCCGAGA
GGGAACAAGTCCATGGAAGAAATGGCAGAGAACTTCCAGTTATGGTCAGCTTCAGGCGAATTTTGCCCTGAAGGAACAGTTCCAATCAGAAGAACAACAGAGAAAGACAT
TTTCAGAGCAAGTTCTCTTCGAAGATTTGGAAGAAAACCCATTAGACGTGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCATGCTGTGGTGTTTGTGAATGGAGAAC
AATATTATGGAGCAAAGGCCAGCTTAAACATATGGGCACCACGTGTAACTGATCAATACGAATTCAGCTTATCACAGATATGGGTAAGTCCTGAACTGTATGGCGACAAC
AATCCTAGATTCTTTACGTACTGGACGACTGATGCTTATCAAGCGACTGGGTGTTATAATCTACTTTGCTCTGGCTTTGTTCAAACGAACAACAGGATCGCCATTGGAGC
TGCGATTTCGCCTGTATCTTCTTACAGTGGGAAACAGTTCGACATTGGTTTGATGGTTTGGAAGGATCCGAAGCACGGGCACTGGTGGCTGGAGTACGGGTCGGGTCTGC
TAGTCGGGTACTGGCCAGCATTTCTGTTCAGCCATCTAAGGAGCCATGCAAGCATGGTGCAATTTGGAGGGGAAATAGTGAACAGCAGATCCTCAGGGTTCCACACAGCC
ACACAAATGGGGAGTGGGCATTTTGCCGAAGAAGGCTTTGGCAAAGCTTCTTACTTCAGGAACTTGCAGGTTGTTGATTGGGACAACAATTTGCTTCCTCTCACAAATCT
TCATGTCTTGGCTGACCATTCTGATTGTTATGATATCAGACAAGGCAGCAATAGTGTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGAAGAAATGTTAGATGCCCTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTCTTCCTGTTCTTCTTCTTGTTCTTCTTGTTCTTTTGTTGTTTTCCTTCTGGTTTTTACTTCCTTCACCTCTGTTTTCTCAACTTCCTTAGCCCATCA
AATGCCACCAAAAAACCAAACTTTTTTCCACCCCATCAAAGAGCTGAAGAAACTAAAGCACATCAGAACTTATTTACGCAAAATCAACAAGCCTCCAACCAAGACAATTC
AGAGCTCAGATGGTGATGTCATAGACTGTGTGCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCGCCTGAGAGGCCGAGA
GGGAACAAGTCCATGGAAGAAATGGCAGAGAACTTCCAGTTATGGTCAGCTTCAGGCGAATTTTGCCCTGAAGGAACAGTTCCAATCAGAAGAACAACAGAGAAAGACAT
TTTCAGAGCAAGTTCTCTTCGAAGATTTGGAAGAAAACCCATTAGACGTGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCATGCTGTGGTGTTTGTGAATGGAGAAC
AATATTATGGAGCAAAGGCCAGCTTAAACATATGGGCACCACGTGTAACTGATCAATACGAATTCAGCTTATCACAGATATGGGTAAGTCCTGAACTGTATGGCGACAAC
AATCCTAGATTCTTTACGTACTGGACGACTGATGCTTATCAAGCGACTGGGTGTTATAATCTACTTTGCTCTGGCTTTGTTCAAACGAACAACAGGATCGCCATTGGAGC
TGCGATTTCGCCTGTATCTTCTTACAGTGGGAAACAGTTCGACATTGGTTTGATGGTTTGGAAGGATCCGAAGCACGGGCACTGGTGGCTGGAGTACGGGTCGGGTCTGC
TAGTCGGGTACTGGCCAGCATTTCTGTTCAGCCATCTAAGGAGCCATGCAAGCATGGTGCAATTTGGAGGGGAAATAGTGAACAGCAGATCCTCAGGGTTCCACACAGCC
ACACAAATGGGGAGTGGGCATTTTGCCGAAGAAGGCTTTGGCAAAGCTTCTTACTTCAGGAACTTGCAGGTTGTTGATTGGGACAACAATTTGCTTCCTCTCACAAATCT
TCATGTCTTGGCTGACCATTCTGATTGTTATGATATCAGACAAGGCAGCAATAGTGTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGAAGAAATGTTAGATGCCCTT
GA
Protein sequenceShow/hide protein sequence
MASSSSCSSSCSSCSFVVFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPPTKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
GNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVSPELYGDN
NPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTA
TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNSVWGTYFYYGGPGRNVRCP