; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020873 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020873
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr05:3155822..3159037
RNA-Seq ExpressionHG10020873
SyntenyHG10020873
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]6.8e-23491.67Show/hide
Query:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT
        +SS SSSS  SCSCFVV LL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLKH+RAYLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +
Subjt:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT

Query:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
        PLEPPERPR NKSMEEVA+N QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
Subjt:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD

Query:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
        QYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Subjt:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG

Query:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS
        HWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ S
Subjt:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS

Query:  NNVWGTYFYYGGPGRNVKCP
        NNVWGTYFYYGGPGR V+CP
Subjt:  NNVWGTYFYYGGPGRNVKCP

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata]5.8e-23391.43Show/hide
Query:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT
        +SS SSSS  SCSCFVV LL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLKH+RAYLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +
Subjt:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT

Query:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
        PLEPPERPR NKSMEEVA+N QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
Subjt:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD

Query:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
        QYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Subjt:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG

Query:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS
        HWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ S
Subjt:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS

Query:  NNVWGTYFYYGGPGRNVKCP
        NNVWGTYFYYGGPGR V+CP
Subjt:  NNVWGTYFYYGGPGRNVKCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]3.4e-23391.41Show/hide
Query:  SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTP
        +S+ SSS  SCSCFVVFLL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLK++RAYLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +P
Subjt:  SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTP

Query:  LEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LEPPERPR NKSMEEVA+N QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH
        YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHGH
Subjt:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH

Query:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASN
        WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SN
Subjt:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASN

Query:  NVWGTYFYYGGPGRNVKCP
        NVWGTYFYYGGPGR V+CP
Subjt:  NVWGTYFYYGGPGRNVKCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]5.8e-23391.67Show/hide
Query:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT
        S S+SSSSS SCSCFVV LL+FTS +S FSTSI+HQ+P KNQT FHP KELKKLK++RAYLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +
Subjt:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT

Query:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
        PLEPPERPR NKSMEEVA+N QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
Subjt:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD

Query:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
        QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Subjt:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG

Query:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS
        HWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ S
Subjt:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS

Query:  NNVWGTYFYYGGPGRNVKCP
        NNVWGTYFYYGGPGR V+CP
Subjt:  NNVWGTYFYYGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]1.0e-24296.43Show/hide
Query:  SSTSSSSSCSCSCFVVFLLLF-TSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT
        +S+SSSSSCSCSCFVVFLL+F TSFSSVFS+SISHQIPPKNQTFFHPAKELKKLKH+R YLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKG T
Subjt:  SSTSSSSSCSCSCFVVFLLLF-TSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT

Query:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
        PLEPPERPRGN S EEVAENFQLWS SGDFCPEGTIPIRRTTE+DIFRASSFRRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
Subjt:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD

Query:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
        QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
Subjt:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG

Query:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS
        HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQA+
Subjt:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS

Query:  NNVWGTYFYYGGPGRNVKCP
        NNVWGTYFYYGGPGRNVKCP
Subjt:  NNVWGTYFYYGGPGRNVKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LTK3 Uncharacterized protein2.4e-22990.74Show/hide
Query:  SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTP
        +S+SSSSS SCSCFVV LL+FTSFSSV S+SISHQIP KNQT FHPAKELKKLKH+R YLRKINKPPIK I+SSDGDVIDCVLSHLQPAFDHP+LKG +P
Subjt:  SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTP

Query:  LEPPERPRGN-KSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
        LEPPERPRGN  S EE  ENFQLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI+HV+RDSSGNGHEHAVV+VNGEQYYGAKASLNIWAPRVTD
Subjt:  LEPPERPRGN-KSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD

Query:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
        QYEFS+SQIWVISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLMVWKDPKHG
Subjt:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG

Query:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQA
        HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRS SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHSDCYDIRQ 
Subjt:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQA

Query:  SNNVWGTYFYYGGPGRNVKCP
        +NNVWGTYFYYGGPGRNVKCP
Subjt:  SNNVWGTYFYYGGPGRNVKCP

A0A1S3AXP9 uncharacterized protein LOC1034837235.3e-23291.12Show/hide
Query:  MASSSTSSSS----SCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHP
        MASSS+SSSS    SCSCSCFVV LL+FTSFSSVFS+SISHQIP KNQT FHPA+ELKKLKH+R YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHP
Subjt:  MASSSTSSSS----SCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHP

Query:  ELKGQTPLEPPERPRG-NKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASLN
        +LKG TPLEPPERPRG N S+EE  ENFQLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI RHVRRDSSGNGHEHAVV+VNGEQYYGAKASLN
Subjt:  ELKGQTPLEPPERPRG-NKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASLN

Query:  IWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLM
        IWAPRVTDQYEFS+SQIWVISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLM
Subjt:  IWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLM

Query:  VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSD
        VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHSD
Subjt:  VWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSD

Query:  CYDIRQASNNVWGTYFYYGGPGRNVKCP
        CYDIRQ +N+VWGTYFYYGGPGRNVKCP
Subjt:  CYDIRQASNNVWGTYFYYGGPGRNVKCP

A0A6J1GRA4 uncharacterized protein LOC1114568102.8e-23391.43Show/hide
Query:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT
        +SS SSSS  SCSCFVV LL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLKH+RAYLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +
Subjt:  SSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQT

Query:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
        PLEPPERPR NKSMEEVA+N QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD
Subjt:  PLEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTD

Query:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG
        QYEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSY GKQFDIGLMVWKDPKHG
Subjt:  QYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHG

Query:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS
        HWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ S
Subjt:  HWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQAS

Query:  NNVWGTYFYYGGPGRNVKCP
        NNVWGTYFYYGGPGR V+CP
Subjt:  NNVWGTYFYYGGPGRNVKCP

A0A6J1JZN8 uncharacterized protein LOC1114902771.6e-23391.41Show/hide
Query:  SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTP
        +S+ SSS  SCSCFVVFLL+FTS +SVFSTSI+HQ+P KNQT FHP KELKKLK++RAYLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKG +P
Subjt:  SSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTP

Query:  LEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LEPPERPR NKSMEEVA+N QLWS SG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LEPPERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH
        YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFDIGLMVWKDPKHGH
Subjt:  YEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGH

Query:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASN
        WWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ SN
Subjt:  WWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASN

Query:  NVWGTYFYYGGPGRNVKCP
        NVWGTYFYYGGPGR V+CP
Subjt:  NVWGTYFYYGGPGRNVKCP

A0A6J1KJ26 uncharacterized protein LOC1114950542.4e-22990.98Show/hide
Query:  SCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRG
        S SCFVV LL+FTSFSSVF TSI+H+ PPKNQT+FHP+KEL +LKH+RAYLRKINKPP KTI+SSDGDVIDCVLSHLQPAFDHP LKG TPL PPERPRG
Subjt:  SCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRG

Query:  NKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW
        N S EEVAENFQLWS SGDFCPEGTIPIRRTTE+DI+RASSFRRFGRKPIRH+RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQIW
Subjt:  NKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW

Query:  VISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGL
        VISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+GKQFD+G+MVWKDPKHGHWWLEYGSGL
Subjt:  VISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGL

Query:  LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYY
        LVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +N+VWGTYFYY
Subjt:  LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYY

Query:  GGPGRNVKCP
        GGPGRNVKCP
Subjt:  GGPGRNVKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)5.1e-17970.73Show/hide
Query:  SCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRG
        +CS  ++FL L    SS FS+ +S  + P+NQT   P  EL KLK +  +LRKINKP IKTI S DGD+IDCVL H QPAFDHP L+GQ PL+PPERPRG
Subjt:  SCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRG

Query:  NKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW
        +       ++FQLW + G+ CPEGT+PIRRT E+DI RA+S   FG+K +RH RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLSQIW
Subjt:  NKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW

Query:  VISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGL
        +ISGSFGNDLNTIEAGWQVSPELYGDN PRFFTYWT DAYQATGCYNLLCSGFVQTN+ IAIGAAISP SSY G QFDI L++WKDPKHG+WWLE+GSG+
Subjt:  VISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGL

Query:  LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYY
        LVGYWP+FLF+HL+ HASMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL +LADH +CYDI+  SN  WG+YFYY
Subjt:  LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYY

Query:  GGPGRNVKCP
        GGPG+N KCP
Subjt:  GGPGRNVKCP

AT1G23340.1 Protein of Unknown Function (DUF239)1.3e-17970.91Show/hide
Query:  SSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEP
        SSSSSC    F++ L LF+S++S  S S S  +P        P +E++K+K +R  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+GQ P++P
Subjt:  SSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEP

Query:  PERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P G     E  ENFQLWS+ G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR   N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239)1.3e-17970.91Show/hide
Query:  SSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEP
        SSSSSC    F++ L LF+S++S  S S S  +P        P +E++K+K +R  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+GQ P++P
Subjt:  SSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEP

Query:  PERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
        PE P G     E  ENFQLWS+ G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  PERPRGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR   N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.0e-17469.85Show/hide
Query:  SCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNK
        S F+  +LL    SS FS++ S            P +EL+KL  +R  L KINKP +KTI+SSDGD IDCV +H QPAFDHP L+GQ PL+PPE P+G  
Subjt:  SCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERPRGNK

Query:  SMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVI
          +   EN QLWS+SG+ CPEGTIPIRRTTE+D+ RASS +RFGRK IR V+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIWVI
Subjt:  SMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVI

Query:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLV
        +GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY G QFDI L++WKDPKHGHWWL++GSG LV
Subjt:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLV

Query:  GYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGG
        GYWPAFLF+HL+ H SMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR  +N VWG YFYYGG
Subjt:  GYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGG

Query:  PGRNVKCP
        PG+N +CP
Subjt:  PGRNVKCP

AT5G50150.1 Protein of Unknown Function (DUF239)2.9e-19075.41Show/hide
Query:  MASSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKG
        MASSS+SSS++ + +   + L+L  S        +   I  KNQT F P +E++KL+ V AYL KINKP IKTI S DGDVI+CV SHLQPAFDHP+L+G
Subjt:  MASSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKG

Query:  QTPLEPPERP-RGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR
        Q PL+ P RP +GN++  E + N QLWS+SG+ CP G+IPIR+TT+ D+ RA+S RRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPR
Subjt:  QTPLEPPERP-RGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPR

Query:  VTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDP
        VTD YEFSLSQIW+ISGSFG+DLNTIEAGWQVSPELYGDN PRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY+G+QFDIGLM+WKDP
Subjt:  VTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDP

Query:  KHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIR
        KHGHWWLE G+GLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLH+LADH  CYDIR
Subjt:  KHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIR

Query:  QASNNVWGTYFYYGGPGRNVKCP
        Q  NNVWGTYFYYGGPGRN +CP
Subjt:  QASNNVWGTYFYYGGPGRNVKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCAACTTCTTCTTCTTCTTCTTGTTCTTGTTCTTGTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATATCCCA
TCAAATCCCACCAAAAAACCAAACTTTTTTCCACCCAGCTAAAGAGCTCAAGAAACTCAAGCACGTCAGAGCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACAA
TTCGGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCAAACTCCATTGGAACCACCTGAGAGGCCA
AGAGGGAACAAATCCATGGAAGAAGTAGCAGAAAATTTCCAATTATGGTCAGTTTCAGGGGATTTTTGCCCTGAAGGAACAATTCCAATAAGAAGAACAACAGAAAAAGA
CATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGGAATGGTCATGAGCATGCTGTGGTATTTGTGAATGGAG
AACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACTGATCAATACGAATTTAGTTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAAT
GATTTGAACACCATTGAAGCTGGATGGCAGGTTAGTCCTGAACTGTATGGCGACAACAATCCTAGATTCTTTACGTATTGGACAACTGATGCTTATCAAGCCACTGGGTG
TTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAATAATAGGATCGCCATTGGAGCAGCAATTTCGCCTATATCTTCTTACAGTGGCAAGCAATTCGATATTGGTTTAA
TGGTTTGGAAGGATCCAAAGCACGGGCATTGGTGGTTGGAATACGGGTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTATTCAGCCATTTACGGAGCCATGCTAGC
ATGGTACAATTTGGAGGGGAAATAGTGAACAGCAGATCTTCAGGGTTTCATACAGCCACTCAAATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTA
TTTCAGGAACTTGCAAGTAGTTGATTGGGATAACAATTTGCTTCCTCTTACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATTAGACAAGCCTCTAATA
ATGTTTGGGGCACTTATTTTTACTATGGAGGGCCTGGTAGAAATGTCAAATGCCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTCAACTTCTTCTTCTTCTTCTTGTTCTTGTTCTTGTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATATCCCA
TCAAATCCCACCAAAAAACCAAACTTTTTTCCACCCAGCTAAAGAGCTCAAGAAACTCAAGCACGTCAGAGCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACAA
TTCGGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCAAACTCCATTGGAACCACCTGAGAGGCCA
AGAGGGAACAAATCCATGGAAGAAGTAGCAGAAAATTTCCAATTATGGTCAGTTTCAGGGGATTTTTGCCCTGAAGGAACAATTCCAATAAGAAGAACAACAGAAAAAGA
CATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGATTCTTCTGGGAATGGTCATGAGCATGCTGTGGTATTTGTGAATGGAG
AACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACTGATCAATACGAATTTAGTTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAAT
GATTTGAACACCATTGAAGCTGGATGGCAGGTTAGTCCTGAACTGTATGGCGACAACAATCCTAGATTCTTTACGTATTGGACAACTGATGCTTATCAAGCCACTGGGTG
TTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAATAATAGGATCGCCATTGGAGCAGCAATTTCGCCTATATCTTCTTACAGTGGCAAGCAATTCGATATTGGTTTAA
TGGTTTGGAAGGATCCAAAGCACGGGCATTGGTGGTTGGAATACGGGTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTATTCAGCCATTTACGGAGCCATGCTAGC
ATGGTACAATTTGGAGGGGAAATAGTGAACAGCAGATCTTCAGGGTTTCATACAGCCACTCAAATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTA
TTTCAGGAACTTGCAAGTAGTTGATTGGGATAACAATTTGCTTCCTCTTACAAATCTTCATCTCTTGGCTGACCATTCTGATTGCTATGATATTAGACAAGCCTCTAATA
ATGTTTGGGGCACTTATTTTTACTATGGAGGGCCTGGTAGAAATGTCAAATGCCCTTAG
Protein sequenceShow/hide protein sequence
MASSSTSSSSSCSCSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTFFHPAKELKKLKHVRAYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGQTPLEPPERP
RGNKSMEEVAENFQLWSVSGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGN
DLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYSGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHAS
MVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQASNNVWGTYFYYGGPGRNVKCP