; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012640 (gene) of Snake gourd v1 genome

Gene IDTan0012640
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG05:69783764..69787423
RNA-Seq ExpressionTan0012640
SyntenyTan0012640
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]6.2e-23593.11Show/hide
Query:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASS SSSS S  SCFVV LLVFTS  SVFST+I+HQMP KNQT FHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPR NKSMEEVA+  QLWSASGEFCPEGTIPIRRTTEKDIFRA+S RRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH
        DQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG
        GHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRNVKCP
        SNNVWGTYFYYGGPGR V+CP
Subjt:  SNNVWGTYFYYGGPGRNVKCP

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata]5.2e-23492.87Show/hide
Query:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASS SSSS S  SCFVV LLVFTS  SVFST+I+HQMP KNQT FHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPR NKSMEEVA+  QLWSASGEFCPEGTIPIRRTTEKDIFRA+S RRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH
        DQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG
        GHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRNVKCP
        SNNVWGTYFYYGGPGR V+CP
Subjt:  SNNVWGTYFYYGGPGRNVKCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]6.2e-23593.35Show/hide
Query:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASS SSS SSC SCFVVFLLVFTS  SVFST+I+HQMP KNQT FHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPR NKSMEEVA+  QLWSASGEFCPEGTIPIRRTTEKDIFRA+S RRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH
        DQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG
        GHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRNVKCP
        SNNVWGTYFYYGGPGR V+CP
Subjt:  SNNVWGTYFYYGGPGRNVKCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]1.2e-23393.56Show/hide
Query:  SSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
        SSSSSSSSSC SCFVV LLVFTS  S FST+I+HQMP KNQT FHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP
Subjt:  SSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSP

Query:  LEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LEPPERPR NKSMEEVA+  QLWSASGEFCPEGTIPIRRTTEKDIFRA+S RRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH
        YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHGH
Subjt:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH

Query:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSN
        WWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQGSN
Subjt:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSN

Query:  NVWGTYFYYGGPGRNVKCP
        NVWGTYFYYGGPGR V+CP
Subjt:  NVWGTYFYYGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]5.9e-23895.02Show/hide
Query:  MASSSSSSSSSCSSCFVVFLLVF-TSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKG
        MASSSSSSS SC SCFVVFLLVF TSF+SVFS++ISHQ+PPKNQTFFHP KELKKLKHIR YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG
Subjt:  MASSSSSSSSSCSSCFVVFLLVF-TSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKG

Query:  HSPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        H+PLEPPERPRGN S EEVAE  QLWSASG+FCPEGTIPIRRTTE+DIFRASSFRRFGRKPIRR RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  HSPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPK
        TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ
        HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ
Subjt:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ

Query:  GSNNVWGTYFYYGGPGRNVKCP
         +NNVWGTYFYYGGPGRNVKCP
Subjt:  GSNNVWGTYFYYGGPGRNVKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LTK3 Uncharacterized protein3.0e-22790.78Show/hide
Query:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSSSSSSSSC SCFVV LLVFTSF+SV S++ISHQ+P KNQT FHP KELKKLKHIR YLRKINKPPIK IQSSDGDVIDCVLSHLQPAFDHP+LKGH
Subjt:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGN-KSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        SPLEPPERPRGN  S EE  E  QLWS SGEFCPEGTIPIRRTTEKDI+RASS+RR+GRKPI+  +RDSSGNGHEHAVV+VNGEQYYGAKASLNIWAPRV
Subjt:  SPLEPPERPRGN-KSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPK
        TDQYEFS+SQIWVISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIR
        HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRS SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHSDCYDIR
Subjt:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIR

Query:  QGSNNVWGTYFYYGGPGRNVKCP
        Q +NNVWGTYFYYGGPGRNVKCP
Subjt:  QGSNNVWGTYFYYGGPGRNVKCP

A0A6J1E7H5 uncharacterized protein LOC1114313347.9e-22890.75Show/hide
Query:  SCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
        S SSCFVV LLVFTSF+SVF T+I+H+ PPKN+TFFHP+KEL +LKHIRAYLRKINKP  KTIQSSDGDVIDCVLSHLQPAFDHP LKGH+PL+PPERPR
Subjt:  SCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR

Query:  GNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GN S EEVAE  QLWSASG+FCPEGTIPIRRTTE+DI+RASSFRRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY+GKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFY
        LLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+N+ WGTYFY
Subjt:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

A0A6J1GRA4 uncharacterized protein LOC1114568102.5e-23492.87Show/hide
Query:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASS SSSS S  SCFVV LLVFTS  SVFST+I+HQMP KNQT FHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPR NKSMEEVA+  QLWSASGEFCPEGTIPIRRTTEKDIFRA+S RRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH
        DQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG
        GHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRNVKCP
        SNNVWGTYFYYGGPGR V+CP
Subjt:  SNNVWGTYFYYGGPGRNVKCP

A0A6J1JZN8 uncharacterized protein LOC1114902773.0e-23593.35Show/hide
Query:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASS SSS SSC SCFVVFLLVFTS  SVFST+I+HQMP KNQT FHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPR NKSMEEVA+  QLWSASGEFCPEGTIPIRRTTEKDIFRA+S RRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH
        DQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG
        GHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRNVKCP
        SNNVWGTYFYYGGPGR V+CP
Subjt:  SNNVWGTYFYYGGPGRNVKCP

A0A6J1KJ26 uncharacterized protein LOC1114950546.0e-22890.75Show/hide
Query:  SCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
        S SSCFVV LLVFTSF+SVF T+I+H+ PPKNQT+FHP+KEL +LKHIRAYLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHP LKGH+PL PPERPR
Subjt:  SCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR

Query:  GNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        GN S EEVAE  QLWSASG+FCPEGTIPIRRTTE+DI+RASSFRRFGRKPIR  RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  GNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSG
        WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+GKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFY
        LLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQG+N+VWGTYFY
Subjt:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPGRNVKCP
Subjt:  YGGPGRNVKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)4.0e-17670.32Show/hide
Query:  SCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
        +CSS  +   L+  S  S FS+ +S  + P+NQT   P  EL KLK I  +LRKINKP IKTI S DGD+IDCVL H QPAFDHP L+G  PL+PPERPR
Subjt:  SCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR

Query:  GNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
        G+       +  QLW   GE CPEGT+PIRRT E+DI RA+S   FG+K +R  RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLSQI
Subjt:  GNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSG
        W+ISGSFGNDLNTIEAGWQVSPELYGDN PRFFTYWT DAYQATGCYNLLCSGFVQTN+ IAIGAAISP SSY G QFDI L++WKDPKHG+WWLE+GSG
Subjt:  WVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFY
        +LVGYWP+FLF+HL+ HASMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL +LADH +CYDI+ GSN  WG+YFY
Subjt:  LLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFY

Query:  YGGPGRNVKCP
        YGGPG+N KCP
Subjt:  YGGPGRNVKCP

AT1G23340.1 Protein of Unknown Function (DUF239)5.3e-17670.19Show/hide
Query:  SSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        SSSSS     F++ L +F+S+ S  S + S  +P        P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P G     E  E  QLWS  GE CPEGTIPIRRTTE+D+ RA+S RRFGRK IRR RRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR G N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239)5.3e-17670.19Show/hide
Query:  SSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        SSSSS     F++ L +F+S+ S  S + S  +P        P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P G     E  E  QLWS  GE CPEGTIPIRRTTE+D+ RA+S RRFGRK IRR RRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR G N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G70550.2 Protein of Unknown Function (DUF239)3.8e-17469.93Show/hide
Query:  SSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRGN
        SS F+  +L+    +S FS+  S            P +EL+KL  IR  L KINKP +KTIQSSDGD IDCV +H QPAFDHP L+G  PL+PPE P+G 
Subjt:  SSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRGN

Query:  KSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV
           +   E  QLWS SGE CPEGTIPIRRTTE+D+ RASS +RFGRK IRR +RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIWV
Subjt:  KSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWV

Query:  ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL
        I+GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG L
Subjt:  ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLL

Query:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFYYG
        VGYWPAFLF+HL+ H SMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G+N VWG YFYYG
Subjt:  VGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFYYG

Query:  GPGRNVKCP
        GPG+N +CP
Subjt:  GPGRNVKCP

AT5G50150.1 Protein of Unknown Function (DUF239)1.2e-18875.47Show/hide
Query:  MASSSSSSS--SSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK
        MASSSSSSS  S+ +S F+  +L+ +    +   +  H    KNQT F P +E++KL+ + AYL KINKP IKTI S DGDVI+CV SHLQPAFDHP+L+
Subjt:  MASSSSSSS--SSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK

Query:  GHSPLEPPERP-RGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAP
        G  PL+ P RP +GN++  E +   QLWS SGE CP G+IPIR+TT+ D+ RA+S RRFGRK  R  RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAP
Subjt:  GHSPLEPPERP-RGNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAP

Query:  RVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKD
        RVTD YEFSLSQIW+ISGSFG+DLNTIEAGWQVSPELYGDN PRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY+G+QFDIGLM+WKD
Subjt:  RVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKD

Query:  PKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDI
        PKHGHWWLE G+GLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLH+LADH  CYDI
Subjt:  PKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDI

Query:  RQGSNNVWGTYFYYGGPGRNVKCP
        RQG NNVWGTYFYYGGPGRN +CP
Subjt:  RQGSNNVWGTYFYYGGPGRNVKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTTCTTCTTCTTCTTCCTCCTGTTCTTCTTGTTTTGTTGTTTTCCTTCTGGTTTTTACTTCTTTCACCTCTGTTTTCTCAACTGCCATATCCCATCA
AATGCCACCAAAAAACCAAACTTTTTTCCACCCCACCAAGGAGCTGAAGAAACTAAAGCACATCAGAGCTTACTTACGCAAAATCAACAAGCCTCCAATCAAGACAATTC
AGAGTTCAGATGGTGATGTCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCGCCTGAGAGGCCGAGA
GGGAACAAATCCATGGAAGAAGTGGCAGAGAAGTTGCAGTTATGGTCAGCTTCAGGCGAATTTTGCCCTGAAGGGACAATTCCGATCAGAAGAACAACAGAGAAAGACAT
TTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTCGACGTACGAGGAGAGATTCTTCTGGCAATGGCCATGAGCATGCAGTGGTGTTTGTAAATGGAGAAC
AATATTATGGAGCAAAGGCCAGTTTAAACATATGGGCACCACGTGTAACTGATCAATACGAATTCAGCTTATCACAGATTTGGGTCATTTCAGGCTCATTTGGCAATGAT
TTAAACACCATTGAAGCTGGATGGCAGGTTAGTCCTGAACTTTATGGCGACAACAATCCTAGGTTCTTTACGTACTGGACGACTGATGCTTATCAAGCTACTGGGTGTTA
TAATCTACTTTGCTCTGGCTTCGTTCAAACCAATAATAGGATCGCCATTGGAGCTGCAATTTCGCCTATTTCCTCTTACAGTGGGAAACAATTTGATATTGGTTTGATGG
TTTGGAAGGATCCGAAGCACGGGCACTGGTGGCTCGAGTACGGGTCGGGTCTGCTCGTCGGGTACTGGCCAGCATTTCTGTTCAGCCATTTAAGGAGTCATGCAAGCATG
GTACAATTTGGAGGGGAAATAGTGAACAGCAGATCCTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCATTTTGCTGAAGAAGGCTTTGGCAAAGCTTCTTATTT
CAGAAATTTGCAAGTGGTTGATTGGGACAACAATTTGCTTCCTCTTACAAATCTTCACCTCTTGGCTGACCATTCTGATTGCTATGATATTAGACAAGGCAGCAATAATG
TTTGGGGCACTTATTTTTACTATGGAGGCCCTGGTAGGAATGTTAAATGCCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTCTTCTTCTTCTTCTTCCTCCTGTTCTTCTTGTTTTGTTGTTTTCCTTCTGGTTTTTACTTCTTTCACCTCTGTTTTCTCAACTGCCATATCCCATCA
AATGCCACCAAAAAACCAAACTTTTTTCCACCCCACCAAGGAGCTGAAGAAACTAAAGCACATCAGAGCTTACTTACGCAAAATCAACAAGCCTCCAATCAAGACAATTC
AGAGTTCAGATGGTGATGTCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCGCCTGAGAGGCCGAGA
GGGAACAAATCCATGGAAGAAGTGGCAGAGAAGTTGCAGTTATGGTCAGCTTCAGGCGAATTTTGCCCTGAAGGGACAATTCCGATCAGAAGAACAACAGAGAAAGACAT
TTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTCGACGTACGAGGAGAGATTCTTCTGGCAATGGCCATGAGCATGCAGTGGTGTTTGTAAATGGAGAAC
AATATTATGGAGCAAAGGCCAGTTTAAACATATGGGCACCACGTGTAACTGATCAATACGAATTCAGCTTATCACAGATTTGGGTCATTTCAGGCTCATTTGGCAATGAT
TTAAACACCATTGAAGCTGGATGGCAGGTTAGTCCTGAACTTTATGGCGACAACAATCCTAGGTTCTTTACGTACTGGACGACTGATGCTTATCAAGCTACTGGGTGTTA
TAATCTACTTTGCTCTGGCTTCGTTCAAACCAATAATAGGATCGCCATTGGAGCTGCAATTTCGCCTATTTCCTCTTACAGTGGGAAACAATTTGATATTGGTTTGATGG
TTTGGAAGGATCCGAAGCACGGGCACTGGTGGCTCGAGTACGGGTCGGGTCTGCTCGTCGGGTACTGGCCAGCATTTCTGTTCAGCCATTTAAGGAGTCATGCAAGCATG
GTACAATTTGGAGGGGAAATAGTGAACAGCAGATCCTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCATTTTGCTGAAGAAGGCTTTGGCAAAGCTTCTTATTT
CAGAAATTTGCAAGTGGTTGATTGGGACAACAATTTGCTTCCTCTTACAAATCTTCACCTCTTGGCTGACCATTCTGATTGCTATGATATTAGACAAGGCAGCAATAATG
TTTGGGGCACTTATTTTTACTATGGAGGCCCTGGTAGGAATGTTAAATGCCCTTGA
Protein sequenceShow/hide protein sequence
MASSSSSSSSSCSSCFVVFLLVFTSFTSVFSTAISHQMPPKNQTFFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
GNKSMEEVAEKLQLWSASGEFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRRTRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGND
LNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASM
VQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQGSNNVWGTYFYYGGPGRNVKCP