; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G013250 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G013250
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCicolChr01:25246421..25249778
RNA-Seq ExpressionCcUC01G013250
SyntenyCcUC01G013250
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]4.8e-23292.09Show/hide
Query:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        +SSS SS SCFVV LL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLKHIR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LE
Subjt:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW
        FSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWKDPKHGHWW
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW

Query:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV
        LEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +NNV
Subjt:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV

Query:  WGTYFYYGGPGRNVKCP
        WGTYFYYGGPGR V+CP
Subjt:  WGTYFYYGGPGRNVKCP

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata]4.1e-23191.85Show/hide
Query:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        +SSS SS SCFVV LL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLKHIR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LE
Subjt:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW
        FSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSYRGKQFDIGLMVWKDPKHGHWW
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW

Query:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV
        LEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +NNV
Subjt:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV

Query:  WGTYFYYGGPGRNVKCP
        WGTYFYYGGPGR V+CP
Subjt:  WGTYFYYGGPGRNVKCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]6.3e-23292.31Show/hide
Query:  SSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP
        SSS SS SCFVVFLL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLK+IR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LEP
Subjt:  SSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP

Query:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
         ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
Subjt:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW
        EYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +NNVW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW

Query:  GTYFYYGGPGRNVKCP
        GTYFYYGGPGR V+CP
Subjt:  GTYFYYGGPGRNVKCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]4.1e-23192.09Show/hide
Query:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        +SSSSSS SCFVV LL+FTS +S FSTSI+HQ+P KNQTLFHPTKELKKLK+IR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LE
Subjt:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW
        FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWKDPKHGHWW
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW

Query:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV
        LEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +NNV
Subjt:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV

Query:  WGTYFYYGGPGRNVKCP
        WGTYFYYGGPGR V+CP
Subjt:  WGTYFYYGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]7.7e-23895.73Show/hide
Query:  MASSSSSSS---SCFVVFLLLF-TSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSSSSSS   SCFVVFLL+F TSFSSVFS+SISHQIPPKNQT FHP KELKKLKHIR+YLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSS---SCFVVFLLLF-TSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  TPILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        TP LEP ERPRGNNS EEVAEN QLWSASGDFCPEGTIPIRRTTE+DIFRASSFRRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  TPILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPK
        TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLMVWKDPK
Subjt:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPK

Query:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ
        HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ
Subjt:  HGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQ

Query:  ATNNVWGTYFYYGGPGRNVKCP
        A NNVWGTYFYYGGPGRNVKCP
Subjt:  ATNNVWGTYFYYGGPGRNVKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LTK3 Uncharacterized protein1.6e-22891.41Show/hide
Query:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        +SSSSSS SCFVV LL+FTSFSSV S+SISHQIP KNQTLFHP KELKKLKHIR+YLRKINKPPIK I+SSDGDVIDCVLSHLQPAFDHP+LKGH+P LE
Subjt:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGN-NSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY
        P ERPRGN NS EE  EN QLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI+HV+RDSSGNGHEHAVV+VNGEQYYGAKASLNIWAPRVTDQY
Subjt:  PGERPRGN-NSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQY

Query:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHW
        EFS+SQIWVISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHW
Subjt:  EFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHW

Query:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATN
        WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRS SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHSDCYDIRQ TN
Subjt:  WLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATN

Query:  NVWGTYFYYGGPGRNVKCP
        NVWGTYFYYGGPGRNVKCP
Subjt:  NVWGTYFYYGGPGRNVKCP

A0A1S3AXP9 uncharacterized protein LOC1034837239.2e-22990.44Show/hide
Query:  MASSSSSSS---------SCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHP
        MASSSSSSS         SCFVV LL+FTSFSSVFS+SISHQIP KNQT FHP +ELKKLKHIR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHP
Subjt:  MASSSSSSS---------SCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHP

Query:  ELKGHTPILEPGERPRG-NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASL
        +LKGHTP LEP ERPRG NNS+EE  EN QLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI RHVRRDSSGNGHEHAVV+VNGEQYYGAKASL
Subjt:  ELKGHTPILEPGERPRG-NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASL

Query:  NIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGL
        NIWAPRVTDQYEFS+SQIWVISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGL
Subjt:  NIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGL

Query:  MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHS
        MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHS
Subjt:  MVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHS

Query:  DCYDIRQATNNVWGTYFYYGGPGRNVKCP
        DCYDIRQ TN+VWGTYFYYGGPGRNVKCP
Subjt:  DCYDIRQATNNVWGTYFYYGGPGRNVKCP

A0A6J1GRA4 uncharacterized protein LOC1114568102.0e-23191.85Show/hide
Query:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        +SSS SS SCFVV LL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLKHIR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LE
Subjt:  ASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW
        FSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSYRGKQFDIGLMVWKDPKHGHWW
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWW

Query:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV
        LEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +NNV
Subjt:  LEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNV

Query:  WGTYFYYGGPGRNVKCP
        WGTYFYYGGPGR V+CP
Subjt:  WGTYFYYGGPGRNVKCP

A0A6J1JZN8 uncharacterized protein LOC1114902773.1e-23292.31Show/hide
Query:  SSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP
        SSS SS SCFVVFLL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLK+IR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LEP
Subjt:  SSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP

Query:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
         ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
Subjt:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW
        EYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ +NNVW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW

Query:  GTYFYYGGPGRNVKCP
        GTYFYYGGPGR V+CP
Subjt:  GTYFYYGGPGRNVKCP

A0A6J1KJ26 uncharacterized protein LOC1114950545.4e-22991.5Show/hide
Query:  SSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP
        SSSSCFVV LL+FTSFSSVF TSI+H+ PPKNQT FHP+KEL +LKHIR+YLRKINKPP KTI+SSDGDVIDCVLSHLQPAFDHP LKGHTP L P ERP
Subjt:  SSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP

Query:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
        RGNNS EEVAEN QLWSASGDFCPEGTIPIRRTTE+DI+RASSFRRFGRKPIRH+RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQ
Subjt:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWLEYGS
        IWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFD+G+MVWKDPKHGHWWLEYGS
Subjt:  IWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWLEYGS

Query:  GLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYF
        GLLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYDIRQ TN+VWGTYF
Subjt:  GLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYF

Query:  YYGGPGRNVKCP
        YYGGPGRNVKCP
Subjt:  YYGGPGRNVKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)1.4e-17670.87Show/hide
Query:  SSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP
        SS+  F+  LLL +SFSSV S ++S    P+NQTL  P  EL KLK I  +LRKINKP IKTI S DGD+IDCVL H QPAFDHP L+G  P L+P ERP
Subjt:  SSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP

Query:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
        RG+N      ++ QLW   G+ CPEGT+PIRRT E+DI RA+S   FG+K +RH RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLSQ
Subjt:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWLEYGS
        IW+ISGSFGNDLNTIEAGWQVSPELYGDN PRFFTYWT DAYQATGCYNLLCSGFVQTN+ IAIGAAISP SSY+G QFDI L++WKDPKHG+WWLE+GS
Subjt:  IWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWLEYGS

Query:  GLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYF
        G+LVGYWP+FLF+HL+ HASMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL +LADH +CYDI+  +N  WG+YF
Subjt:  GLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYF

Query:  YYGGPGRNVKCP
        YYGGPG+N KCP
Subjt:  YYGGPGRNVKCP

AT1G23340.1 Protein of Unknown Function (DUF239)4.0e-17670.43Show/hide
Query:  SSSSSSCFVVFLLLFTSFSSVF--STSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP
        SSSSS  F  F+LL + FSS    S S S  +P        P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P ++P
Subjt:  SSSSSSCFVVFLLLFTSFSSVF--STSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP

Query:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
         E P G +   E  EN QLWS  G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR   N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239)4.0e-17670.43Show/hide
Query:  SSSSSSCFVVFLLLFTSFSSVF--STSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP
        SSSSS  F  F+LL + FSS    S S S  +P        P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P ++P
Subjt:  SSSSSSCFVVFLLLFTSFSSVF--STSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEP

Query:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF
         E P G +   E  EN QLWS  G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYEF
Subjt:  GERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEF

Query:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+G QFDI L++WKDPKHGHWWL
Subjt:  SLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWL

Query:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH +CYDIR   N VW
Subjt:  EYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G70550.2 Protein of Unknown Function (DUF239)5.4e-17369.76Show/hide
Query:  SSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERPRG
        SS F+  +LL    SS FS++ S            P +EL+KL  IR  L KINKP +KTI+SSDGD IDCV +H QPAFDHP L+G  P L+P E P+G
Subjt:  SSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERPRG

Query:  NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW
         +  +   EN QLWS SG+ CPEGTIPIRRTTE+D+ RASS +RFGRK IR V+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIW
Subjt:  NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW

Query:  VISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWLEYGSGL
        VI+GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY+G QFDI L++WKDPKHGHWWL++GSG 
Subjt:  VISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWLEYGSGL

Query:  LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYFYY
        LVGYWPAFLF+HL+ H SMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR  TN VWG YFYY
Subjt:  LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYFYY

Query:  GGPGRNVKCP
        GGPG+N +CP
Subjt:  GGPGRNVKCP

AT5G50150.1 Protein of Unknown Function (DUF239)4.6e-18874.58Show/hide
Query:  MASSSSSSSSCFVV---FLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT
        MASSSSSSS+   +   F+ L    S      +   I  KNQT F P +E++KL+ + +YL KINKP IKTI S DGDVI+CV SHLQPAFDHP+L+G  
Subjt:  MASSSSSSSSCFVV---FLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT

Query:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        P+  P    +GN +  E + N QLWS SG+ CP G+IPIR+TT+ D+ RA+S RRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPRVT
Subjt:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKH
        D YEFSLSQIW+ISGSFG+DLNTIEAGWQVSPELYGDN PRFFTYWTTDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY G+QFDIGLM+WKDPKH
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKH

Query:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQA
        GHWWLE G+GLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLH+LADH  CYDIRQ 
Subjt:  GHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQA

Query:  TNNVWGTYFYYGGPGRNVKCP
         NNVWGTYFYYGGPGRN +CP
Subjt:  TNNVWGTYFYYGGPGRNVKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTTCTTCTTCTTCTTCTTGTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATTTCCCATCAAATCCCACCAAA
AAACCAAACTCTTTTCCATCCAACCAAAGAGCTGAAGAAACTAAAACACATCAGATCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACAATTCGGAGTTCAGATG
GTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCATACTCCAATATTGGAACCAGGTGAGAGGCCAAGAGGGAACAAC
TCCATGGAAGAAGTAGCAGAAAATTTGCAATTATGGTCAGCTTCAGGGGATTTTTGCCCAGAAGGAACAATTCCAATAAGAAGAACAACAGAAAAAGACATTTTCAGAGC
AAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGACTCTTCTGGCAATGGCCACGAGCATGCTGTTGTATTTGTGAATGGAGAACAATATTATG
GAGCGAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACGGATCAATATGAATTTAGTTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAATGATTTGAACACC
ATTGAAGCTGGATGGCAGGTTAGTCCTGAACTGTATGGAGACAACAATCCTAGATTCTTTACGTATTGGACAACCGATGCTTATCAAGCGACTGGGTGTTATAATTTACT
TTGTTCTGGGTTCGTTCAAACCAACAATAGGATCGCCATTGGAGCAGCAATCTCGCCTATATCTTCTTATAGAGGAAAGCAATTCGATATTGGTTTAATGGTTTGGAAGG
ATCCGAAGCACGGGCATTGGTGGTTGGAATATGGGTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTGTTCAGCCATTTAAGAAGTCATGCTAGCATGGTACAATTT
GGAGGAGAAATAGTGAACAGCAGATCATCAGGGTTTCACACGGCGACGCAGATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTATTTCAGGAACTT
GCAAGTGGTTGATTGGGATAATAATTTGCTTCCTCTTACAAATCTTCATCTTTTGGCTGACCATTCTGATTGTTATGATATTAGACAAGCCACTAATAATGTTTGGGGCA
CTTATTTTTACTATGGAGGGCCTGGTAGAAATGTCAAATGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
AGGCAAAGCCGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTATTTTTATATTATATGTTGGGAATGGTGAAGCAGAGAAGAAGAAGAAGAAGAAGA
AGAAGAAGAAAACAAAACAAAAGTTTGGGTTTTTGTAAGTGAGAATTGATATTAGAGAGAAAATGAAGTAAATGATTTCTTTTCTTTTTTATTCTTTTTTCTCTTCATCC
ATATACAACCAGACAACCTCAATACTTACAAATCTCACATCATTACACTAAAACAAACGACTTCCCCGAAAACATTCTAACAAAATGGCTTCTTCTTCTTCTTCTTCTTC
TTCTTGTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATTTCCCATCAAATCCCACCAAAAAACCAAACTCTTTTCCATCCAACCA
AAGAGCTGAAGAAACTAAAACACATCAGATCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACAATTCGGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCT
CATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCATACTCCAATATTGGAACCAGGTGAGAGGCCAAGAGGGAACAACTCCATGGAAGAAGTAGCAGAAAATTT
GCAATTATGGTCAGCTTCAGGGGATTTTTGCCCAGAAGGAACAATTCCAATAAGAAGAACAACAGAAAAAGACATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAA
AACCCATTAGACATGTGAGAAGAGACTCTTCTGGCAATGGCCACGAGCATGCTGTTGTATTTGTGAATGGAGAACAATATTATGGAGCGAAGGCGAGTTTAAACATATGG
GCGCCACGTGTAACGGATCAATATGAATTTAGTTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAATGATTTGAACACCATTGAAGCTGGATGGCAGGTTAGTCC
TGAACTGTATGGAGACAACAATCCTAGATTCTTTACGTATTGGACAACCGATGCTTATCAAGCGACTGGGTGTTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAACA
ATAGGATCGCCATTGGAGCAGCAATCTCGCCTATATCTTCTTATAGAGGAAAGCAATTCGATATTGGTTTAATGGTTTGGAAGGATCCGAAGCACGGGCATTGGTGGTTG
GAATATGGGTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTGTTCAGCCATTTAAGAAGTCATGCTAGCATGGTACAATTTGGAGGAGAAATAGTGAACAGCAGATC
ATCAGGGTTTCACACGGCGACGCAGATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTATTTCAGGAACTTGCAAGTGGTTGATTGGGATAATAATT
TGCTTCCTCTTACAAATCTTCATCTTTTGGCTGACCATTCTGATTGTTATGATATTAGACAAGCCACTAATAATGTTTGGGGCACTTATTTTTACTATGGAGGGCCTGGT
AGAAATGTCAAATGCCCTTAAAATTATTTCTTCATTATTATTATTATTGGTAATGTTTTGAAAAGGTTTTAGGATATAATGTAATTTATAATTGTTTTTTGTTTGGGGTT
TTTTATTTTTGTTTATTTTATTTTATTTT
Protein sequenceShow/hide protein sequence
MASSSSSSSSCFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERPRGNN
SMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIWVISGSFGNDLNT
IEAGWQVSPELYGDNNPRFFTYWTTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQF
GGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP