; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g045000 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g045000
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCsor_Chr18:1003225..1005765
RNA-Seq ExpressionCsor.00g045000
SyntenyCsor.00g045000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]0.0100Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRTVRCP
        SNNVWGTYFYYGGPGRTVRCP
Subjt:  SNNVWGTYFYYGGPGRTVRCP

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata]0.099.52Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRTVRCP
        SNNVWGTYFYYGGPGRTVRCP
Subjt:  SNNVWGTYFYYGGPGRTVRCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]0.099.29Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSS CSSCSCFVV LLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRTVRCP
        SNNVWGTYFYYGGPGRTVRCP
Subjt:  SNNVWGTYFYYGGPGRTVRCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]3.43e-31497.88Show/hide
Query:  MASSCS----SSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPE
        MASSCS    SSS SSCSCFVVVLLVFTSLAS FSTSIAHQMPLKNQTLFHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPE
Subjt:  MASSCS----SSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPE

Query:  LKGHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA
        LKGHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA
Subjt:  LKGHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA

Query:  PRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWK
        PRVTDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWK
Subjt:  PRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWK

Query:  DPKHGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD
        DPKHGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD
Subjt:  DPKHGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD

Query:  IRQGSNNVWGTYFYYGGPGRTVRCP
        IRQGSNNVWGTYFYYGGPGRTVRCP
Subjt:  IRQGSNNVWGTYFYYGGPGRTVRCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]1.27e-29291Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVF-TSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKG
        MASS SSSSCS CSCFVV LLVF TS +SVFS+SI+HQ+P KNQT FHP KELKKLKHIR YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG
Subjt:  MASSCSSSSCSSCSCFVVVLLVF-TSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKG

Query:  HSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        H+PLEPPERPR N S EEVA+N QLWSASG+FCPEGTIPIRRTTE+DIFRA+S RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  HSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPK
        TDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
        HGHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ
Subjt:  HGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ

Query:  GSNNVWGTYFYYGGPGRTVRCP
         +NNVWGTYFYYGGPGR V+CP
Subjt:  GSNNVWGTYFYYGGPGRTVRCP

TrEMBL top hitse value%identityAlignment
A0A0A0LTK3 Uncharacterized protein1.60e-28689.6Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASS SSSS SSCSCFVV+LLVFTS +SV S+SI+HQ+P KNQTLFHP KELKKLKHIR YLRKINKPPIK IQSSDGDVIDCVLSHLQPAFDHP+LKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNK-SMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        SPLEPPERPR N  S EE  +N QLWS SGEFCPEGTIPIRRTTEKDI+RA+S RR+GRKPI+HV+RDSSGNGHEHAVV+VNGEQYYGAKASLNIWAPRV
Subjt:  SPLEPPERPRSNK-SMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPK
        TDQYEFS+SQIW+ISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRA-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR
        HGHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGEVVN R+ SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL VLADHSDCYDIR
Subjt:  HGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRA-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR

Query:  QGSNNVWGTYFYYGGPGRTVRCP
        Q +NNVWGTYFYYGGPGR V+CP
Subjt:  QGSNNVWGTYFYYGGPGRTVRCP

A0A1S3AXP9 uncharacterized protein LOC1034837232.01e-28688.94Show/hide
Query:  SSCSSSSCS----SCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK
        SS SSSSCS    SCSCFVV+LLVFTS +SVFS+SI+HQ+P KNQT FHP +ELKKLKHIR YLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHP+LK
Subjt:  SSCSSSSCS----SCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK

Query:  GHSPLEPPERPR-SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIR-HVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA
        GH+PLEPPERPR +N S+EE  +N QLWS SGEFCPEGTIPIRRTTEKDI+RA+S RR+GRKPIR HVRRDSSGNGHEHAVV+VNGEQYYGAKASLNIWA
Subjt:  GHSPLEPPERPR-SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIR-HVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA

Query:  PRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWK
        PRVTDQYEFS+SQIW+ISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWK
Subjt:  PRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWK

Query:  DPKHGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD
        DPKHGHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGEVVN R+SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL VLADHSDCYD
Subjt:  DPKHGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD

Query:  IRQGSNNVWGTYFYYGGPGRTVRCP
        IRQ +N+VWGTYFYYGGPGR V+CP
Subjt:  IRQGSNNVWGTYFYYGGPGRTVRCP

A0A6J1GRA4 uncharacterized protein LOC1114568100.099.52Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRTVRCP
        SNNVWGTYFYYGGPGRTVRCP
Subjt:  SNNVWGTYFYYGGPGRTVRCP

A0A6J1JZN8 uncharacterized protein LOC1114902770.099.29Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSS CSSCSCFVV LLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYGGPGRTVRCP
        SNNVWGTYFYYGGPGRTVRCP
Subjt:  SNNVWGTYFYYGGPGRTVRCP

A0A6J1KJ26 uncharacterized protein LOC1114950545.41e-28589.05Show/hide
Query:  SSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
        SS SCFVV+LLVFTS +SVF TSIAH+ P KNQT FHP+KEL +LKHIRAYLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHP LKGH+PL PPERPR
Subjt:  SSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR

Query:  SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
         N S EEVA+N QLWSASG+FCPEGTIPIRRTTE+DI+RA+S RRFGRKPIRH+RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSG
        W+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  MLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFY
        +LVGYWPAFLFSHLRSH SMVQFGGE+VN R SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG+N+VWGTYFY
Subjt:  MLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFY

Query:  YGGPGRTVRCP
        YGGPGR V+CP
Subjt:  YGGPGRTVRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)9.0e-17669.98Show/hide
Query:  SCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPER
        +CSS   F+ +LL+ +S +SV S +++     +NQTL  P  EL KLK I  +LRKINKP IKTI S DGD+IDCVL H QPAFDHP L+G  PL+PPER
Subjt:  SCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPER

Query:  PRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLS
        PR +        + QLW   GE CPEGT+PIRRT E+DI RANS+  FG+K +RH RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLS
Subjt:  PRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLS

Query:  QIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYG
        QIWIISGSFGNDLNTIEAGWQVSPELYGDN PRFFTYWT DAYQATGCYNLLCSGFVQTN+ IAIGAAISP SSY+G QFDI L++WKDPKHG+WWLE+G
Subjt:  QIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYG

Query:  SGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTY
        SG+LVGYWP+FLF+HL+ HASMVQ+GGE+VN    G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL VLADH +CYDI+ GSN  WG+Y
Subjt:  SGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTY

Query:  FYYGGPGRTVRCP
        FYYGGPG+  +CP
Subjt:  FYYGGPGRTVRCP

AT1G23340.1 Protein of Unknown Function (DUF239)4.8e-17770.91Show/hide
Query:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        SS SSC  F  +LL  +F+S AS  S S +  +PL+      P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P       E  +N QLWS  GE CPEGTIPIRRTTE+D+ RANS+RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIWII+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY+G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGE+VN R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N VW
Subjt:  EYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW

Query:  GTYFYYGGPGRTVRCP
        G +FYYGGPG+  +CP
Subjt:  GTYFYYGGPGRTVRCP

AT1G23340.2 Protein of Unknown Function (DUF239)4.8e-17770.91Show/hide
Query:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        SS SSC  F  +LL  +F+S AS  S S +  +PL+      P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P       E  +N QLWS  GE CPEGTIPIRRTTE+D+ RANS+RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIWII+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSY+G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGE+VN R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N VW
Subjt:  EYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW

Query:  GTYFYYGGPGRTVRCP
        G +FYYGGPG+  +CP
Subjt:  GTYFYYGGPGRTVRCP

AT1G70550.2 Protein of Unknown Function (DUF239)8.4e-17468.63Show/hide
Query:  SCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNK
        S F+ ++L+   ++S FS++ +            P +EL+KL  IR  L KINKP +KTIQSSDGD IDCV +H QPAFDHP L+G  PL+PPE P+   
Subjt:  SCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNK

Query:  SMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWII
          +   +N QLWS SGE CPEGTIPIRRTTE+D+ RA+S++RFGRK IR V+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIW+I
Subjt:  SMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWII

Query:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSGMLV
        +GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY+G QFDI L++WKDPKHGHWWL++GSG LV
Subjt:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSGMLV

Query:  GYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGG
        GYWPAFLF+HL+ H SMVQFGGE+VN R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G+N VWG YFYYGG
Subjt:  GYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGG

Query:  PGRTVRCP
        PG+  RCP
Subjt:  PGRTVRCP

AT5G50150.1 Protein of Unknown Function (DUF239)2.9e-19075.18Show/hide
Query:  MASSCSSSSCSS--CSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK
        MASS SSSS +S   S F+ ++L+ +    +   S  H   LKNQT F P +E++KL+ + AYL KINKP IKTI S DGDVI+CV SHLQPAFDHP+L+
Subjt:  MASSCSSSSCSS--CSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK

Query:  GHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR
        G  PL+ P RP            +QLWS SGE CP G+IPIR+TT+ D+ RANS+RRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPR
Subjt:  GHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR

Query:  VTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDP
        VTD YEFSLSQIW+ISGSFG+DLNTIEAGWQVSPELYGDN PRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY G+QFDIGLM+WKDP
Subjt:  VTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDP

Query:  KHGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR
        KHGHWWLE G+G+LVGYWPAFLFSHLRSHASMVQFGGEVVN R+SG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLHVLADH  CYDIR
Subjt:  KHGHWWLEYGSGMLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR

Query:  QGSNNVWGTYFYYGGPGRTVRCP
        QG NNVWGTYFYYGGPGR  RCP
Subjt:  QGSNNVWGTYFYYGGPGRTVRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTGTTCTTCTTCTTCTTGTTCTTCCTGTTCTTGTTTTGTTGTTGTCCTTCTGGTTTTTACTTCTTTGGCTTCTGTTTTCTCCACTTCCATAGCC
CATCAAATGCCACTAAAAAACCAAACTTTATTCCACCCCACCAAAGAGCTGAAGAAGCTCAAGCACATCAGAGCTTATTTACGAAAAATCAACAAGCCTCCAATC
AAGACAATTCAGAGCTCAGATGGTGATGTCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCCTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCG
CCTGAGAGGCCGAGAAGCAACAAATCCATGGAGGAAGTGGCGGATAACCACCAGTTATGGTCAGCTTCCGGCGAGTTTTGCCCGGAAGGGACAATTCCGATCAGA
AGAACAACAGAAAAAGACATTTTCAGAGCAAATTCTATTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCAT
GCAGTGGTGTTCGTTAATGGAGAACAATATTATGGAGCAAAGGCAAGCTTGAACATATGGGCACCTCGTGTAACAGATCAATATGAATTCAGCTTATCACAGATA
TGGATCATTTCAGGCTCATTTGGCAATGATTTGAACACCATAGAAGCTGGATGGCAGGTTAGTCCTGAACTCTATGGCGACAACAATCCTAGATTCTTCACATAT
TGGACAACTGATGCTTATCAAGCCACTGGTTGTTACAATCTGCTTTGTTCAGGATTTGTTCAAACCAATAACAGGATAGCCATTGGAGCAGCAATTTCGCCTGTT
TCCTCTTACAGAGGGAAACAGTTTGATATTGGTTTAATGGTTTGGAAGGATCCAAAGCACGGGCACTGGTGGCTCGAGTACGGGTCGGGTATGCTCGTCGGGTAC
TGGCCAGCATTTCTATTCAGCCATTTAAGGAGCCATGCAAGCATGGTACAATTTGGAGGTGAAGTAGTGAACAGAAGAGCTTCAGGGTTCCACACAGCCACACAA
ATGGGGAGTGGCCACTTTGCTGAAGAAGGCTTTGGCAAAGCCTCTTATTTCAGGAACTTACAAGTGGTTGATTGGGACAACAACTTGCTTCCTCTCACAAATCTT
CATGTCTTGGCCGACCATTCTGACTGCTATGATATCAGACAAGGCAGCAATAACGTTTGGGGCACTTACTTTTACTATGGAGGTCCTGGCAGGACCGTTAGATGC
CCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTGTTCTTCTTCTTCTTGTTCTTCCTGTTCTTGTTTTGTTGTTGTCCTTCTGGTTTTTACTTCTTTGGCTTCTGTTTTCTCCACTTCCATAGCC
CATCAAATGCCACTAAAAAACCAAACTTTATTCCACCCCACCAAAGAGCTGAAGAAGCTCAAGCACATCAGAGCTTATTTACGAAAAATCAACAAGCCTCCAATC
AAGACAATTCAGAGCTCAGATGGTGATGTCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCCTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCG
CCTGAGAGGCCGAGAAGCAACAAATCCATGGAGGAAGTGGCGGATAACCACCAGTTATGGTCAGCTTCCGGCGAGTTTTGCCCGGAAGGGACAATTCCGATCAGA
AGAACAACAGAAAAAGACATTTTCAGAGCAAATTCTATTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCAT
GCAGTGGTGTTCGTTAATGGAGAACAATATTATGGAGCAAAGGCAAGCTTGAACATATGGGCACCTCGTGTAACAGATCAATATGAATTCAGCTTATCACAGATA
TGGATCATTTCAGGCTCATTTGGCAATGATTTGAACACCATAGAAGCTGGATGGCAGGTTAGTCCTGAACTCTATGGCGACAACAATCCTAGATTCTTCACATAT
TGGACAACTGATGCTTATCAAGCCACTGGTTGTTACAATCTGCTTTGTTCAGGATTTGTTCAAACCAATAACAGGATAGCCATTGGAGCAGCAATTTCGCCTGTT
TCCTCTTACAGAGGGAAACAGTTTGATATTGGTTTAATGGTTTGGAAGGATCCAAAGCACGGGCACTGGTGGCTCGAGTACGGGTCGGGTATGCTCGTCGGGTAC
TGGCCAGCATTTCTATTCAGCCATTTAAGGAGCCATGCAAGCATGGTACAATTTGGAGGTGAAGTAGTGAACAGAAGAGCTTCAGGGTTCCACACAGCCACACAA
ATGGGGAGTGGCCACTTTGCTGAAGAAGGCTTTGGCAAAGCCTCTTATTTCAGGAACTTACAAGTGGTTGATTGGGACAACAACTTGCTTCCTCTCACAAATCTT
CATGTCTTGGCCGACCATTCTGACTGCTATGATATCAGACAAGGCAGCAATAACGTTTGGGGCACTTACTTTTACTATGGAGGTCCTGGCAGGACCGTTAGATGC
CCTTAA
Protein sequenceShow/hide protein sequence
MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
WIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSGMLVGY
WPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGGPGRTVRC
P