CuGenDBv2

Lag0024517 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024517
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: chr10:3661540..3664450
RNA-Seq Expression: Lag0024517
Synteny: Lag0024517
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004314 - Neprosin; IPR025521 - Neprosin activation peptide


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.8e-232; identity: 92.79%)
Query:  SSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEL
        SSSSCSS   FV  LLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLKHIR YLRKINKP  KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLE 
Subjt:  SSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEL

Query:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
Subjt:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
        EYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW

Query:  GTYFYYGGPGRNVRCP
        GTYFYYGGPGR VRCP
Subjt:  GTYFYYGGPGRNVRCP

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata] (e value: 4.1e-231; identity: 92.55%)
Query:  SSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEL
        SSSSCSS   FV  LLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLKHIR YLRKINKP  KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLE 
Subjt:  SSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEL

Query:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
Subjt:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSY GKQFDIGLMVWKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
        EYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW

Query:  GTYFYYGGPGRNVRCP
        GTYFYYGGPGR VRCP
Subjt:  GTYFYYGGPGRNVRCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima] (e value: 6.3e-232; identity: 92.36%)
Query:  SSSCSSSCS----FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
        +SSCSSSCS    FV FLLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLK+IR YLRKINKP  KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
Subjt:  SSSCSSSCS----FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP

Query:  LELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LE PERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH
        YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWKDPKHGH
Subjt:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH

Query:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
        WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
Subjt:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN

Query:  NVWGTYFYYGGPGRNVRCP
        NVWGTYFYYGGPGR VRCP
Subjt:  NVWGTYFYYGGPGRNVRCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo] (e value: 2.0e-230; identity: 92.58%)
Query:  ASSSSCSSSCS-FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL
        +SSSS SSSCS FV  LLVFTS  S FSTS+AHQMP KNQT FHP KELKKLK+IR YLRKINKP  KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL
Subjt:  ASSSSCSSSCS-FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL

Query:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY
        E PERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY
Subjt:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY

Query:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW
        EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWKDPKHGHW
Subjt:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW

Query:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN
        WLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN
Subjt:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN

Query:  VWGTYFYYGGPGRNVRCP
        VWGTYFYYGGPGR VRCP
Subjt:  VWGTYFYYGGPGRNVRCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida] (e value: 4.4e-233; identity: 92.82%)
Query:  ASSSSCSSSCSFVAFLLVF-TSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL
        +SSSSCS SC FV FLLVF TSF+SVFS+S++HQ+PPKNQTFFHP KELKKLKHIR YLRKINKP  KTI+SSDGDVIDCVLSHLQPAFDHPELKGH+PL
Subjt:  ASSSSCSSSCSFVAFLLVF-TSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL

Query:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY
        E PERPRGN S EE+AENFQLWSASG+FCPEGT+PIRRTTE+DIFRASS RRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY
Subjt:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY

Query:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW
        EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYSGKQFDIGLMVWKDPKHGHW
Subjt:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW

Query:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN
        WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +NN
Subjt:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN

Query:  VWGTYFYYGGPGRNVRCP
        VWGTYFYYGGPGRNV+CP
Subjt:  VWGTYFYYGGPGRNVRCP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C958 uncharacterized protein LOC111008593 (e value: 2.3e-224; identity: 91.48%)
Query:  SSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLELPERPR
        SSSC FV FLLVFTS TSVFSTS    MPPKNQT FHP KEL+KLKHIR YLRKINKP+TKTI+SSDGDVIDCV+SHLQPAFDHPELKGH+PLE PERPR
Subjt:  SSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLELPERPR

Query:  GNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GN S E +AE+FQLWS SGEFCPEGT+PIRRT E DI RASS+RRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
Subjt:  GNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGD+NPRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP+SSY+GKQFDIGLMVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFY
        LLVGYWPAFLFSHLRSH SMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+NNVWGTYFY
Subjt:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFY

Query:  YGGPGRNVRCP
        YGGPGRNV+CP
Subjt:  YGGPGRNVRCP

A0A6J1E7H5 uncharacterized protein LOC111431334 (e value: 9.5e-226; identity: 88.76%)
Query:  MASSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL
        M SSSSC     FV  LLVFTSF+SVF TS+AH+ PPKN+TFFHP KEL +LKHIR YLRKINKP TKTIQSSDGDVIDCVLSHLQPAFDHP LKGH+PL
Subjt:  MASSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL

Query:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY
        + PERPRGN S EE+AE+FQLWSASG+FCPEGT+PIRRTTE+DI+RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ 
Subjt:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY

Query:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW
        EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+GKQFD+G+MVWKDPKHGHW
Subjt:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW

Query:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN
        WLEYGSGLLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG+N+
Subjt:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN

Query:  VWGTYFYYGGPGRNVRCP
         WGTYFYYGGPGRNV+CP
Subjt:  VWGTYFYYGGPGRNVRCP

A0A6J1GRA4 uncharacterized protein LOC111456810 (e value: 2.0e-231; identity: 92.55%)
Query:  SSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEL
        SSSSCSS   FV  LLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLKHIR YLRKINKP  KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLE 
Subjt:  SSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEL

Query:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
Subjt:  PERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSY GKQFDIGLMVWKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
        EYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW

Query:  GTYFYYGGPGRNVRCP
        GTYFYYGGPGR VRCP
Subjt:  GTYFYYGGPGRNVRCP

A0A6J1JZN8 uncharacterized protein LOC111490277 (e value: 3.1e-232; identity: 92.36%)
Query:  SSSCSSSCS----FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
        +SSCSSSCS    FV FLLVFTS  SVFSTS+AHQMP KNQT FHP KELKKLK+IR YLRKINKP  KTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
Subjt:  SSSCSSSCS----FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP

Query:  LELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LE PERPR NKSMEE+A+N QLWSASGEFCPEGT+PIRRTTEKDIFRA+S+RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH
        YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY GKQFDIGLMVWKDPKHGH
Subjt:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGH

Query:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
        WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN
Subjt:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSN

Query:  NVWGTYFYYGGPGRNVRCP
        NVWGTYFYYGGPGR VRCP
Subjt:  NVWGTYFYYGGPGRNVRCP

A0A6J1KJ26 uncharacterized protein LOC111495054 (e value: 1.6e-225; identity: 89%)
Query:  MASSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL
        M SSSSC     FV  LLVFTSF+SVF TS+AH+ PPKNQT+FHP KEL +LKHIR YLRKINKP TKTIQSSDGDVIDCVLSHLQPAFDHP LKGH+PL
Subjt:  MASSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPL

Query:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY
          PERPRGN S EE+AENFQLWSASG+FCPEGT+PIRRTTE+DI+RASS RRFGRKPIR +RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ 
Subjt:  ELPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY

Query:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW
        EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+GKQFD+G+MVWKDPKHGHW
Subjt:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHW

Query:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN
        WLEYGSGLLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG+N+
Subjt:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNN

Query:  VWGTYFYYGGPGRNVRCP
        VWGTYFYYGGPGRNV+CP
Subjt:  VWGTYFYYGGPGRNVRCP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G10750.1 Protein of Unknown Function (DUF239) (e value: 3.3e-178; identity: 70.7%)
Query:  SCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLELPER
        +CSS+  F++ LL+ +SF+SV S +L+    P+NQT   P+ EL KLK I  +LRKINKP+ KTI S DGD+IDCVL H QPAFDHP L+G  PL+ PER
Subjt:  SCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLELPER

Query:  PRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLS
        PRG+       ++FQLW   GE CPEGTVPIRRT E+DI RA+S+  FG+K +R  RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLS
Subjt:  PRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLS

Query:  QIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYG
        QIW+ISGSFGNDLNTIEAGWQVSPELYGDN PRFFTYWT DAYQATGCYNLLCSGFVQTN+ IAIGAAISP SSY G QFDI L++WKDPKHG+WWLE+G
Subjt:  QIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYG

Query:  SGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTY
        SG+LVGYWP+FLF+HL+ HASMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL VLADH +CYDI+ GSN  WG+Y
Subjt:  SGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTY

Query:  FYYGGPGRNVRCP
        FYYGGPG+N +CP
Subjt:  FYYGGPGRNVRCP

AT1G23340.1 Protein of Unknown Function (DUF239) (e value: 5.6e-178; identity: 70.74%)
Query:  SCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTF----FHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLE
        S SSSC F  F+L+ + F+S  S        P N T       P +E++K+K IR  L+KINKPA KTI SSDGD IDCV SH QPAFDHP L+G  P++
Subjt:  SCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTF----FHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLE

Query:  LPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
         PE P G     E  ENFQLWS  GE CPEGT+PIRRTTE+D+ RA+S+RRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYE
Subjt:  LPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWW
        FSLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY G QFDI L++WKDPKHGHWW
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWW

Query:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNV
        L++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N V
Subjt:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNV

Query:  WGTYFYYGGPGRNVRCP
        WG +FYYGGPG+N +CP
Subjt:  WGTYFYYGGPGRNVRCP

AT1G23340.2 Protein of Unknown Function (DUF239) (e value: 5.6e-178; identity: 70.74%)
Query:  SCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTF----FHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLE
        S SSSC F  F+L+ + F+S  S        P N T       P +E++K+K IR  L+KINKPA KTI SSDGD IDCV SH QPAFDHP L+G  P++
Subjt:  SCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTF----FHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLE

Query:  LPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
         PE P G     E  ENFQLWS  GE CPEGT+PIRRTTE+D+ RA+S+RRFGRK IRRVRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYE
Subjt:  LPERPRGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWW
        FSLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY G QFDI L++WKDPKHGHWW
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWW

Query:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNV
        L++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N V
Subjt:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNV

Query:  WGTYFYYGGPGRNVRCP
        WG +FYYGGPG+N +CP
Subjt:  WGTYFYYGGPGRNVRCP

AT1G70550.2 Protein of Unknown Function (DUF239) (e value: 5.8e-175; identity: 70.17%)
Query:  SCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLELPERPRGN
        S SF+  +L+    +S FS++ +            P +EL+KL  IR  L KINKPA KTIQSSDGD IDCV +H QPAFDHP L+G  PL+ PE P+G 
Subjt:  SCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLELPERPRGN

Query:  KSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV
           +   EN QLWS SGE CPEGT+PIRRTTE+D+ RASS++RFGRK IRRV+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIWV
Subjt:  KSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV

Query:  ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL
        I+GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG L
Subjt:  ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL

Query:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYG
        VGYWPAFLF+HL+ H SMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G+N VWG YFYYG
Subjt:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYG

Query:  GPGRNVRCP
        GPG+N RCP
Subjt:  GPGRNVRCP

AT5G50150.1 Protein of Unknown Function (DUF239) (e value: 8.3e-190; identity: 75.24%)
Query:  MASSSSCSSSCS-----FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELK
        MASSSS SS+ S     F+  +L+ +    +   S  H    KNQT F P +E++KL+ +  YL KINKP+ KTI S DGDVI+CV SHLQPAFDHP+L+
Subjt:  MASSSSCSSSCS-----FVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELK

Query:  GHSPLELPERP-RGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAP
        G  PL+ P RP +GN++  E + N QLWS SGE CP G++PIR+TT+ D+ RA+S+RRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAP
Subjt:  GHSPLELPERP-RGNKSMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAP

Query:  RVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKD
        RVTD YEFSLSQIW+ISGSFG+DLNTIEAGWQVSPELYGDN PRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY+G+QFDIGLM+WKD
Subjt:  RVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKD

Query:  PKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDI
        PKHGHWWLE G+GLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLHVLADH  CYDI
Subjt:  PKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDI

Query:  RQGSNNVWGTYFYYGGPGRNVRCP
        RQG NNVWGTYFYYGGPGRN RCP
Subjt:  RQGSNNVWGTYFYYGGPGRNVRCP


Sequences
CDS sequence
ATGGCTTCTTCTTCTTCCTGTTCTTCTTCTTGTTCTTTTGTTGCTTTCCTTCTGGTTTTTACTTCCTTCACCTCTGTTTTCTCAACTTCCTTAGCCCATCAAATGCCACC
AAAAAACCAAACTTTTTTCCACCCCATCAAAGAGCTGAAGAAACTAAAGCACATCAGAACTTATTTACGCAAAATCAACAAGCCTGCAACCAAGACAATTCAGAGCTCAG
ATGGTGATGTCATAGACTGTGTGCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAGCTGCCTGAGAGGCCAAGAGGGAACAAG
TCCATGGAAGAAATGGCAGAGAACTTCCAGTTGTGGTCAGCTTCAGGCGAATTTTGCCCTGAAGGAACAGTTCCAATCAGAAGAACAACAGAGAAAGACATTTTCAGAGC
AAGTTCTCTTCGAAGATTTGGAAGAAAACCCATTAGACGTGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCATGCTGTGGTGTTTGTGAATGGAGAACAATATTATG
GAGCAAAGGCCAGCTTAAACATATGGGCACCACGTGTAACTGATCAATACGAATTCAGCTTATCACAGATATGGGTAATTTCAGGCTCCTTTGGCAATGATTTAAACACC
ATTGAAGCTGGATGGCAGGTTAGTCCTGAACTGTATGGCGACAACAATCCTAGATTCTTTACGTACTGGACGACTGATGCTTATCAAGCGACTGGGTGTTATAATCTACT
TTGCTCTGGCTTTGTTCAAACGAACAACAGGATCGCCATTGGAGCAGCGATTTCGCCTGTATCCTCTTACAGTGGGAAGCAGTTCGACATTGGTTTGATGGTTTGGAAGG
ATCCGAAGCACGGGCACTGGTGGCTGGAGTACGGGTCGGGTCTGCTAGTCGGGTACTGGCCAGCATTTCTGTTCAGCCATCTAAGGAGCCATGCAAGCATGGTGCAATTT
GGAGGGGAAATAGTGAACAGCAGATCCTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCATTTTGCTGAAGAAGGATTTGGCAAAGCTTCTTATTTCAGGAACTT
GCAGGTTGTTGATTGGGACAACAATTTGCTTCCTCTCACAAATCTTCATGTCTTGGCTGACCATTCTGATTGTTATGATATCAGACAAGGCAGCAATAATGTTTGGGGAA
CTTATTTTTACTATGGAGGTCCTGGAAGAAATGTTAGATGCCCTTGA
mRNA sequence
ATGGCTTCTTCTTCTTCCTGTTCTTCTTCTTGTTCTTTTGTTGCTTTCCTTCTGGTTTTTACTTCCTTCACCTCTGTTTTCTCAACTTCCTTAGCCCATCAAATGCCACC
AAAAAACCAAACTTTTTTCCACCCCATCAAAGAGCTGAAGAAACTAAAGCACATCAGAACTTATTTACGCAAAATCAACAAGCCTGCAACCAAGACAATTCAGAGCTCAG
ATGGTGATGTCATAGACTGTGTGCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAGCTGCCTGAGAGGCCAAGAGGGAACAAG
TCCATGGAAGAAATGGCAGAGAACTTCCAGTTGTGGTCAGCTTCAGGCGAATTTTGCCCTGAAGGAACAGTTCCAATCAGAAGAACAACAGAGAAAGACATTTTCAGAGC
AAGTTCTCTTCGAAGATTTGGAAGAAAACCCATTAGACGTGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCATGCTGTGGTGTTTGTGAATGGAGAACAATATTATG
GAGCAAAGGCCAGCTTAAACATATGGGCACCACGTGTAACTGATCAATACGAATTCAGCTTATCACAGATATGGGTAATTTCAGGCTCCTTTGGCAATGATTTAAACACC
ATTGAAGCTGGATGGCAGGTTAGTCCTGAACTGTATGGCGACAACAATCCTAGATTCTTTACGTACTGGACGACTGATGCTTATCAAGCGACTGGGTGTTATAATCTACT
TTGCTCTGGCTTTGTTCAAACGAACAACAGGATCGCCATTGGAGCAGCGATTTCGCCTGTATCCTCTTACAGTGGGAAGCAGTTCGACATTGGTTTGATGGTTTGGAAGG
ATCCGAAGCACGGGCACTGGTGGCTGGAGTACGGGTCGGGTCTGCTAGTCGGGTACTGGCCAGCATTTCTGTTCAGCCATCTAAGGAGCCATGCAAGCATGGTGCAATTT
GGAGGGGAAATAGTGAACAGCAGATCCTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCATTTTGCTGAAGAAGGATTTGGCAAAGCTTCTTATTTCAGGAACTT
GCAGGTTGTTGATTGGGACAACAATTTGCTTCCTCTCACAAATCTTCATGTCTTGGCTGACCATTCTGATTGTTATGATATCAGACAAGGCAGCAATAATGTTTGGGGAA
CTTATTTTTACTATGGAGGTCCTGGAAGAAATGTTAGATGCCCTTGA
Protein sequence
MASSSSCSSSCSFVAFLLVFTSFTSVFSTSLAHQMPPKNQTFFHPIKELKKLKHIRTYLRKINKPATKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLELPERPRGNK
SMEEMAENFQLWSASGEFCPEGTVPIRRTTEKDIFRASSLRRFGRKPIRRVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNT
IEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQF
GGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRNVRCP
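
The protein sequence should be the translation of the CDS under the standard genetic code. The sketch below is one way to check that, assuming the standard code applies to this nuclear gene. The helper translate and the two excerpt variables are illustrative names, not part of the database. The excerpts are the first 21 and last 24 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Whitespace and line breaks are ignored, a trailing partial codon is
    dropped, and any codon with a non-ACGT base becomes 'X'.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


cds_start = "ATGGCTTCTTCTTCTTCCTGT"    # first 21 nt of the CDS
cds_end = "GGAAGAAATGTTAGATGCCCTTGA"   # last 24 nt, in frame
print(translate(cds_start))  # MASSSSC
print(translate(cds_end))    # GRNVRCP*
```

Both excerpts agree with the start (MASSSSC) and end (GRNVRCP) of the protein sequence above. Passing the full CDS, with its line breaks, should yield the whole protein followed by a single terminal "*".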