; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G001400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G001400
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCmo_Chr18:1001358..1004925
RNA-Seq ExpressionCmoCh18G001400
SyntenyCmoCh18G001400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]5.1e-24499.51Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYG
        SNNVWGTYFYYG
Subjt:  SNNVWGTYFYYG

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata]4.6e-245100Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYG
        SNNVWGTYFYYG
Subjt:  SNNVWGTYFYYG

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]8.9e-24198.79Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSC SSSCSSCSCFVV LLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYG
        SNNVWGTYFYYG
Subjt:  SNNVWGTYFYYG

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]2.0e-24097.84Show/hide
Query:  MASSC----SSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPE
        MASSC    SSSS SSCSCFVVVLLVFTSLAS FSTSIAHQMPLKNQTLFHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPE
Subjt:  MASSC----SSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPE

Query:  LKGHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA
        LKGHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA
Subjt:  LKGHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWA

Query:  PRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWK
        PRVTDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSYRGKQFDIGLMVWK
Subjt:  PRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWK

Query:  DPKHGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD
        DPKHGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD
Subjt:  DPKHGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYD

Query:  IRQGSNNVWGTYFYYG
        IRQGSNNVWGTYFYYG
Subjt:  IRQGSNNVWGTYFYYG

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]9.9e-22491.04Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVF-TSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKG
        MASS SSSSC SCSCFVV LLVF TS +SVFS+SI+HQ+P KNQT FHP KELKKLKHIR YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG
Subjt:  MASSCSSSSCSSCSCFVVVLLVF-TSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKG

Query:  HSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        H+PLEPPERPR N S EEVA+N QLWSASG+FCPEGTIPIRRTTE+DIFRA+S RRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  HSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPK
        TDQYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ
        HGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ
Subjt:  HGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQ

Query:  GSNNVWGTYFYYG
         +NNVWGTYFYYG
Subjt:  GSNNVWGTYFYYG

TrEMBL top hitse value%identityAlignment
A0A0A0LTK3 Uncharacterized protein2.7e-21989.61Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASS SSSS SSCSCFVV+LLVFTS +SV S+SI+HQ+P KNQTLFHP KELKKLKHIR YLRKINKPPIK IQSSDGDVIDCVLSHLQPAFDHP+LKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSN-KSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        SPLEPPERPR N  S EE  +N QLWS SGEFCPEGTIPIRRTTEKDI+RA+S RR+GRKPI+HV+RDSSGNGHEHAVV+VNGEQYYGAKASLNIWAPRV
Subjt:  SPLEPPERPRSN-KSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPK
        TDQYEFS+SQIW+ISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSYRGKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRA-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR
        HGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVN R+ SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL VLADHSDCYDIR
Subjt:  HGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRA-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR

Query:  QGSNNVWGTYFYYG
        Q +NNVWGTYFYYG
Subjt:  QGSNNVWGTYFYYG

A0A1S3AXP9 uncharacterized protein LOC1034837232.7e-21988.73Show/hide
Query:  ASSCSSSSCS----SCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPEL
        +SS SSSSCS    SCSCFVV+LLVFTS +SVFS+SI+HQ+P KNQT FHP +ELKKLKHIR YLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHP+L
Subjt:  ASSCSSSSCS----SCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPEL

Query:  KGHSPLEPPERPR-SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIW
        KGH+PLEPPERPR +N S+EE  +N QLWS SGEFCPEGTIPIRRTTEKDI+RA+S RR+GRKPI RHVRRDSSGNGHEHAVV+VNGEQYYGAKASLNIW
Subjt:  KGHSPLEPPERPR-SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIW

Query:  APRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVW
        APRVTDQYEFS+SQIW+ISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSYRGKQFDIGLMVW
Subjt:  APRVTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVW

Query:  KDPKHGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCY
        KDPKHGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVN R+SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL VLADHSDCY
Subjt:  KDPKHGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCY

Query:  DIRQGSNNVWGTYFYYG
        DIRQ +N+VWGTYFYYG
Subjt:  DIRQGSNNVWGTYFYYG

A0A6J1GRA4 uncharacterized protein LOC1114568102.2e-245100Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYG
        SNNVWGTYFYYG
Subjt:  SNNVWGTYFYYG

A0A6J1JZN8 uncharacterized protein LOC1114902774.3e-24198.79Show/hide
Query:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSC SSSCSSCSCFVV LLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLK+IRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  SPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
        DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSYRGKQFDIGLMVWKDPKH
Subjt:  DQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
        GHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG
Subjt:  GHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG

Query:  SNNVWGTYFYYG
        SNNVWGTYFYYG
Subjt:  SNNVWGTYFYYG

A0A6J1KJ26 uncharacterized protein LOC1114950546.7e-21889.05Show/hide
Query:  SSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
        SS SCFVV+LLVFTS +SVF TSIAH+ P KNQT FHP+KEL +LKHIRAYLRKINKPP KTIQSSDGDVIDCVLSHLQPAFDHP LKGH+PL PPERPR
Subjt:  SSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR

Query:  SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI
         N S EEVA+N QLWSASG+FCPEGTIPIRRTTE+DI+RA+S RRFGRKPIRH+RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQI
Subjt:  SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQI

Query:  WIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSG
        W+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFD+G+MVWKDPKHGHWWLEYGSG
Subjt:  WIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSG

Query:  TLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFY
         LVGYWPAFLFSHLRSH SMVQFGGE+VN R SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQG+N+VWGTYFY
Subjt:  TLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFY

Query:  YG
        YG
Subjt:  YG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)3.6e-17170.54Show/hide
Query:  SCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPER
        +CSS   F+ +LL+ +S +SV S +++     +NQTL  P  EL KLK I  +LRKINKP IKTI S DGD+IDCVL H QPAFDHP L+G  PL+PPER
Subjt:  SCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPER

Query:  PRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLS
        PR +        + QLW   GE CPEGT+PIRRT E+DI RANS+  FG+K +RH RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLS
Subjt:  PRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLS

Query:  QIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYG
        QIWIISGSFGNDLNTIEAGWQVSPELYGDN PRFFTYWT DAYQATGCYNLLCSGFVQTNS IAIGAAISP SSY+G QFDI L++WKDPKHG+WWLE+G
Subjt:  QIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYG

Query:  SGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTY
        SG LVGYWP+FLF+HL+ HASMVQ+GGE+VN    G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL VLADH +CYDI+ GSN  WG+Y
Subjt:  SGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTY

Query:  FYYG
        FYYG
Subjt:  FYYG

AT1G23340.1 Protein of Unknown Function (DUF239)2.5e-17271.25Show/hide
Query:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        SS SSC  F  +LL  +F+S AS  S S +  +PL+      P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P       E  +N QLWS  GE CPEGTIPIRRTTE+D+ RANS+RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIWII+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSY+G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
        ++GSGTLVGYWP  LF+HLR H +MVQFGGE+VN R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N VW
Subjt:  EYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW

Query:  GTYFYYG
        G +FYYG
Subjt:  GTYFYYG

AT1G23340.2 Protein of Unknown Function (DUF239)2.5e-17271.25Show/hide
Query:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP
        SS SSC  F  +LL  +F+S AS  S S +  +PL+      P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P++P
Subjt:  SSCSSCSCFVVVLL--VFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEP

Query:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P       E  +N QLWS  GE CPEGTIPIRRTTE+D+ RANS+RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIWII+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTN+RIAIGAAISPVSSY+G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW
        ++GSGTLVGYWP  LF+HLR H +MVQFGGE+VN R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL VLADH +CYDIR G N VW
Subjt:  EYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVW

Query:  GTYFYYG
        G +FYYG
Subjt:  GTYFYYG

AT1G70550.2 Protein of Unknown Function (DUF239)9.7e-16968.67Show/hide
Query:  SCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNK
        S F+ ++L+   ++S FS++ +            P +EL+KL  IR  L KINKP +KTIQSSDGD IDCV +H QPAFDHP L+G  PL+PPE P+   
Subjt:  SCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPRSNK

Query:  SMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWII
          +   +N QLWS SGE CPEGTIPIRRTTE+D+ RA+S++RFGRK IR V+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIW+I
Subjt:  SMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWII

Query:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSGTLV
        +GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY+G QFDI L++WKDPKHGHWWL++GSG LV
Subjt:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSGTLV

Query:  GYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYG
        GYWPAFLF+HL+ H SMVQFGGE+VN R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR G+N VWG YFYYG
Subjt:  GYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYG

AT5G50150.1 Protein of Unknown Function (DUF239)8.2e-18474.88Show/hide
Query:  MASSCSSSSCSS--CSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK
        MASS SSSS +S   S F+ ++L+ +    +   S  H   LKNQT F P +E++KL+ + AYL KINKP IKTI S DGDVI+CV SHLQPAFDHP+L+
Subjt:  MASSCSSSSCSS--CSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELK

Query:  GHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR
        G  PL+ P RP            +QLWS SGE CP G+IPIR+TT+ D+ RANS+RRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPR
Subjt:  GHSPLEPPERPRSNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR

Query:  VTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDP
        VTD YEFSLSQIW+ISGSFG+DLNTIEAGWQVSPELYGDN PRFFTYWTTDAYQATGCYNLLCSGFVQTN++IAIGAAISP SSY G+QFDIGLM+WKDP
Subjt:  VTDQYEFSLSQIWIISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDP

Query:  KHGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR
        KHGHWWLE G+G LVGYWPAFLFSHLRSHASMVQFGGEVVN R+SG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLHVLADH  CYDIR
Subjt:  KHGHWWLEYGSGTLVGYWPAFLFSHLRSHASMVQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIR

Query:  QGSNNVWGTYFYYG
        QG NNVWGTYFYYG
Subjt:  QGSNNVWGTYFYYG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTGTTCTTCTTCTTCTTGTTCTTCCTGTTCTTGTTTTGTTGTTGTCCTTCTGGTTTTTACTTCTTTGGCTTCTGTTTTCTCCACTTCCATAGCCCATCA
AATGCCACTAAAAAACCAAACTTTATTCCACCCCACCAAAGAGCTGAAGAAGCTCAAGCACATCAGAGCTTATTTACGAAAAATCAACAAGCCTCCAATCAAGACAATTC
AGAGCTCAGATGGTGATGTCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCCTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCGCCTGAGAGGCCGAGA
AGCAACAAATCCATGGAGGAAGTGGCGGATAACCACCAGTTATGGTCAGCTTCCGGCGAGTTTTGTCCTGAAGGGACAATTCCGATCAGAAGAACAACAGAAAAAGACAT
TTTCAGAGCAAATTCTATTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCATGCAGTGGTGTTCGTTAATGGAGAAC
AATATTATGGAGCAAAGGCAAGCTTGAACATATGGGCACCTCGTGTAACAGATCAATATGAATTCAGCTTATCACAGATATGGATCATTTCAGGCTCATTTGGCAATGAT
TTGAACACCATAGAAGCTGGATGGCAGGTTAGTCCTGAACTCTATGGCGACAACAATCCTAGATTCTTCACATACTGGACAACTGATGCTTATCAAGCCACTGGTTGTTA
CAATCTGCTTTGTTCAGGATTTGTTCAAACCAATAGCAGGATAGCCATTGGAGCAGCAATTTCGCCTGTTTCCTCTTACAGAGGGAAACAGTTTGATATTGGTTTAATGG
TTTGGAAGGATCCAAAACACGGGCACTGGTGGCTCGAGTACGGGTCGGGTACGCTCGTCGGGTACTGGCCAGCATTTCTATTCAGCCATTTAAGGAGCCATGCAAGCATG
GTACAATTTGGAGGTGAAGTAGTGAACAGAAGAGCTTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCACTTTGCTGAAGAAGGCTTTGGCAAAGCCTCTTATTT
CAGGAACTTACAAGTGGTTGATTGGGACAACAACTTGCTTCCTCTCACAAATCTTCATGTCTTGGCCGACCATTCTGACTGCTATGATATCAGACAAGGCAGCAATAACG
TTTGGGGCACTTACTTTTACTATGGAGAACATTATAGGATAGAACGTAGTGTCTTGGGCAGCGATTTGGGTCGAGAGTGGTTGGTAAGGGATAATGAGTTATTCTGGTAC
AGACACGATGAGCAGTGTGGGACAGACCTACCTATCCTAATAGGAACTCGTAAAGGAACGAGAGCTGTTGAGAAGTTCCATATTGTCAAGTCGAAGAGTGCGAATGCGGG
TTTTGTTACTCGTGAGAGTTTGTTTATATTTTTTTTCGGTTTGGTTCGGGTTTTTTATCTTTGTGTTTTGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTGTTCTTCTTCTTCTTGTTCTTCCTGTTCTTGTTTTGTTGTTGTCCTTCTGGTTTTTACTTCTTTGGCTTCTGTTTTCTCCACTTCCATAGCCCATCA
AATGCCACTAAAAAACCAAACTTTATTCCACCCCACCAAAGAGCTGAAGAAGCTCAAGCACATCAGAGCTTATTTACGAAAAATCAACAAGCCTCCAATCAAGACAATTC
AGAGCTCAGATGGTGATGTCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCCTTTGACCATCCTGAGCTCAAAGGACACTCCCCATTGGAACCGCCTGAGAGGCCGAGA
AGCAACAAATCCATGGAGGAAGTGGCGGATAACCACCAGTTATGGTCAGCTTCCGGCGAGTTTTGTCCTGAAGGGACAATTCCGATCAGAAGAACAACAGAAAAAGACAT
TTTCAGAGCAAATTCTATTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGCAATGGCCATGAGCATGCAGTGGTGTTCGTTAATGGAGAAC
AATATTATGGAGCAAAGGCAAGCTTGAACATATGGGCACCTCGTGTAACAGATCAATATGAATTCAGCTTATCACAGATATGGATCATTTCAGGCTCATTTGGCAATGAT
TTGAACACCATAGAAGCTGGATGGCAGGTTAGTCCTGAACTCTATGGCGACAACAATCCTAGATTCTTCACATACTGGACAACTGATGCTTATCAAGCCACTGGTTGTTA
CAATCTGCTTTGTTCAGGATTTGTTCAAACCAATAGCAGGATAGCCATTGGAGCAGCAATTTCGCCTGTTTCCTCTTACAGAGGGAAACAGTTTGATATTGGTTTAATGG
TTTGGAAGGATCCAAAACACGGGCACTGGTGGCTCGAGTACGGGTCGGGTACGCTCGTCGGGTACTGGCCAGCATTTCTATTCAGCCATTTAAGGAGCCATGCAAGCATG
GTACAATTTGGAGGTGAAGTAGTGAACAGAAGAGCTTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCACTTTGCTGAAGAAGGCTTTGGCAAAGCCTCTTATTT
CAGGAACTTACAAGTGGTTGATTGGGACAACAACTTGCTTCCTCTCACAAATCTTCATGTCTTGGCCGACCATTCTGACTGCTATGATATCAGACAAGGCAGCAATAACG
TTTGGGGCACTTACTTTTACTATGGAGAACATTATAGGATAGAACGTAGTGTCTTGGGCAGCGATTTGGGTCGAGAGTGGTTGGTAAGGGATAATGAGTTATTCTGGTAC
AGACACGATGAGCAGTGTGGGACAGACCTACCTATCCTAATAGGAACTCGTAAAGGAACGAGAGCTGTTGAGAAGTTCCATATTGTCAAGTCGAAGAGTGCGAATGCGGG
TTTTGTTACTCGTGAGAGTTTGTTTATATTTTTTTTCGGTTTGGTTCGGGTTTTTTATCTTTGTGTTTTGTCCTAG
Protein sequenceShow/hide protein sequence
MASSCSSSSCSSCSCFVVVLLVFTSLASVFSTSIAHQMPLKNQTLFHPTKELKKLKHIRAYLRKINKPPIKTIQSSDGDVIDCVLSHLQPAFDHPELKGHSPLEPPERPR
SNKSMEEVADNHQLWSASGEFCPEGTIPIRRTTEKDIFRANSIRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWIISGSFGND
LNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNSRIAIGAAISPVSSYRGKQFDIGLMVWKDPKHGHWWLEYGSGTLVGYWPAFLFSHLRSHASM
VQFGGEVVNRRASGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHVLADHSDCYDIRQGSNNVWGTYFYYGEHYRIERSVLGSDLGREWLVRDNELFWY
RHDEQCGTDLPILIGTRKGTRAVEKFHIVKSKSANAGFVTRESLFIFFFGLVRVFYLCVLS