; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg18599 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg18599
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of unknown function (DUF1005)
Genome locationCarg_Chr12:11755202..11757825
RNA-Seq ExpressionCarg18599
SyntenyCarg18599
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586452.1 hypothetical protein SDJN03_19185, partial [Cucurbita argyrosperma subsp. sororia]2.3e-23799.76Show/hide
Query:  MVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGNTCGVNSGKL
        MVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGNTCGVNSGKL
Subjt:  MVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGNTCGVNSGKL

Query:  LGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNST
        LGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNST
Subjt:  LGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNST

Query:  KGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVAD
        KGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVAD
Subjt:  KGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVAD

Query:  TGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRR
        TGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRR
Subjt:  TGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRR

Query:  ELCHDEHDSSFL
        ELCHDEHDSSFL
Subjt:  ELCHDEHDSSFL

KAG7021307.1 hypothetical protein SDJN02_17996 [Cucurbita argyrosperma subsp. argyrosperma]2.5e-244100Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_022938284.1 uncharacterized protein LOC111444418 [Cucurbita moschata]4.0e-24299.05Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRIAISIDGAENKPKTFQNGWVNLGKDE+KTSARLHLLVRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_022965765.1 uncharacterized protein LOC111465558 [Cucurbita maxima]4.7e-24399.52Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_023536792.1 uncharacterized protein LOC111798069 [Cucurbita pepo subsp. pepo]2.3e-24299.29Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRD SPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

TrEMBL top hitse value%identityAlignment
A0A0A0LIQ0 Uncharacterized protein9.9e-23193.59Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAE+KPK FQNGWV LGK EDK SARLHL+VRSE DPRFVFQFG EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRD + NS+S +KG+FVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A1S3C9N5 uncharacterized protein LOC1034982202.8e-23394.3Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAENKPK FQNGWV LGKDEDK SARLHL+VRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVAD+GLATGIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+S +KG+FVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A5A7V777 DUF1005 domain-containing protein9.6e-23494.54Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAENKPK FQNGWV LGKDEDK SARLHL+VRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+S +KG+FVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A6J1FDM1 uncharacterized protein LOC1114444181.9e-24299.05Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRIAISIDGAENKPKTFQNGWVNLGKDE+KTSARLHLLVRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A6J1HPN4 uncharacterized protein LOC1114655582.3e-24399.52Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
        MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSS SGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGN

Query:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSE DPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)5.5e-12549.79Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIM---CLS--VFA
        MDPCPF+RL + +L+L +P A +   + VHPS++PCFCKI +KNFP QTA +P         P+    +A FHL  + ++RL+ + +     CL   ++ 
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIM---CLS--VFA

Query:  GRMGNTCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDK--TSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---
        GR G  CGV+SG+LL +V + + + G ++KP  F NGW+++GK   K  +SA+ HL V++E DPRFVFQF GEPECSP V QIQGNIRQPVF+CKFS   
Subjt:  GRMGNTCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDK--TSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PG+ERKGW I V+DLSGS VA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS-----------------------------------------
        AWRER G  DGLGY+FEL+ D     GI +AE+T+S  +GG+F I+  +    SP+S S                                         
Subjt:  AWRER-GPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS-----------------------------------------

Query:  -NIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD
         N+   FVM++SVEGEGK SKP V+V VQHV+CM DAA +VAL+AAIDLSMDACR F Q++R+ELCH+
Subjt:  -NIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD

AT1G50040.1 Protein of unknown function (DUF1005)1.1e-8543.4Show/hide
Query:  MDPCPFVRLMVESLSLNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPL----------SSASGDSPPDSAASSAGFHLDPTSLRRL
        MDPC FVR++V +L++  P+       ++  +G +V   S+  C+CKI  K+FP Q   +P+             SG+    +A  S       TSL++ 
Subjt:  MDPCPFVRLMVESLSLNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPL----------SSASGDSPPDSAASSAGFHLDPTSLRRL

Query:  SGKPVIMCLSVFAGRMGNTCG---VNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLG---KDEDKTSA--RLHLLVRSELDPRFVFQFGGEPECSPVVF
          K  ++ + V++ R   +CG    +  KL+GR ++ + +  AE+K     NGWV+LG   K+  K+ +   LH+ VR E D RFVFQF GEPECSP VF
Subjt:  SGKPVIMCLSVFAGRMGNTCG---VNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLG---KDEDKTSA--RLHLLVRSELDPRFVFQFGGEPECSPVVF

Query:  QIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRP
        Q+QGN +Q VF+CKF   RNS  R+L    S + T GK         E+  +ERKGW I ++DLSGS VA ASM+TPFVPSPG++RVSRS+PGAWLILRP
Subjt:  QIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRP

Query:  HGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVR-----------DFSPNSRSNIKGN-----------
         G+   +WKPW RL+AWRE G  D LGY+FEL  D G+A  +  A +++S K GG F ID  T              F  +S S+I+ +           
Subjt:  HGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRKTVR-----------DFSPNSRSNIKGN-----------

Query:  -----------FVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                   FVM++ V+G  K SKP V+VGV+HVTC  DAA  VALAAA+DLSMDACR F+QKLR EL
Subjt:  -----------FVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005)1.2e-9243.76Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAG--------FHLDPTSLRRLSGK
        MDPC FVR++V +L++  P ++        P+ + ++P+   C+CKI  KNFP +   +P+   + +S  ++  SS+G        F L    +     K
Subjt:  MDPCPFVRLMVESLSLNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAG--------FHLDPTSLRRLSGK

Query:  PVIMCLSVFAGRMGN----------TCGVNSG--KLLGRVRIAISIDGAENKPKTFQNGWVNL--GKDEDKTSA--RLHLLVRSELDPRFVFQFGGEPEC
        P    LSV A   GN          +CG+ +   KLLGR  +++ +  AE K     NGWV L   K + KT +   LH+ VR E DPRFVFQF GEPEC
Subjt:  PVIMCLSVFAGRMGN----------TCGVNSG--KLLGRVRIAISIDGAENKPKTFQNGWVNL--GKDEDKTSA--RLHLLVRSELDPRFVFQFGGEPEC

Query:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSN
        SP VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P +ERKGW I V+DLSGS VA ASM+TPFVPSPG++RV+RS+
Subjt:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSN

Query:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCID------------------------------R
        PGAWLILRP G    +WKPWGRLEAWRE G  D LGY+FEL  D G+AT +  A +++S+K GG F ID                              R
Subjt:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCID------------------------------R

Query:  KTVR-------DFS------PNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
           R       DF       P++ +  +G FVM+++VEG GK SKP V+VGV HVTC  DAA  VALAAA+DLS+DACR F+ KLR+EL
Subjt:  KTVR-------DFS------PNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005)1.3e-16167.22Show/hide
Query:  MDPCPFVRLMVESLSLNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSAS-GDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGR
        MDPCPFVRL ++SL+L LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS S   SPP+S+ S+ GFHLD  ++RR+SGK + + +SV+AGR
Subjt:  MDPCPFVRLMVESLSLNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSAS-GDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGR

Query:  MGNTCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         G+TCGV SGKLLG+V +A+ +  A ++   F NGW  LG D DK SARLHLLV +E DPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGNTCGVNSGKLLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  RERKGWMI ++DLSGS VAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRK-TVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAID
         IDGLGYKFELV D   +TGIPIAE TMS K+GG+F IDR+ + +  SP   S +KG FVM SSVEGEGKVSKP+V VG QHVTCMADAALFVAL+AA+D
Subjt:  PIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDRK-TVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAID

Query:  LSMDACRHFTQKLRRELCHDEHDS
        LS+DAC+ F++KLR+ELCHD+  S
Subjt:  LSMDACRHFTQKLRRELCHDEHDS

AT5G17640.1 Protein of unknown function (DUF1005)4.3e-8540.91Show/hide
Query:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRL------SGKPVIMCL
        MDP  F+RL V SL+L +P+    + +  +     ++ C C+I ++ FP QT  +PL   S D+ PD  + S  F+L+ + LR L            + +
Subjt:  MDPCPFVRLMVESLSLNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRL------SGKPVIMCL

Query:  SVFAGRMGNTCGVNSGK-LLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS
        SVF G+    CGV   +  +G  ++ +  +  E KP    NGW+++GK +   +A LHL V+ + DPR+VFQF      SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGNTCGVNSGK-LLGRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   RERKGW + ++DLSGSAVAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVADTGLATG-IPIAEATMSVKKGGQFCIDR---------------KTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVG
        AWRERG  D +  +F L+++ GL  G + ++E  +S +KGG+F ID                ++  DFS   +    G FVM+S V+GEGK SKP+VQ+ 
Subjt:  AWRERGPIDGLGYKFELVADTGLATG-IPIAEATMSVKKGGQFCIDR---------------KTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVG

Query:  VQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
        ++HVTC+ DAA+F+ALAAA+DLS+ AC+ F +  RR   H
Subjt:  VQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCTGTCCGTTTGTCCGACTGATGGTCGAATCCCTTTCTCTCAACCTTCCTCAGGCAACTCGCCCCGCCGGCGCCGCCGTCCACCCGTCGACAACGCCCTGCTT
CTGTAAGATCGCGATCAAGAATTTCCCTTCGCAAACGGCGCTTCTTCCTCTTTCATCTGCCTCCGGCGACTCACCGCCAGACTCCGCCGCGTCGTCCGCCGGTTTCCACC
TCGACCCGACATCTCTACGCCGGCTCTCTGGTAAGCCAGTTATAATGTGTCTTTCGGTTTTCGCCGGTCGGATGGGTAACACGTGTGGGGTTAATTCTGGCAAATTGCTC
GGCCGGGTTCGGATCGCTATCTCGATCGACGGCGCTGAGAATAAACCGAAAACGTTTCAGAATGGGTGGGTGAACTTGGGAAAAGATGAGGATAAAACATCTGCACGACT
TCACTTGCTTGTCCGGTCTGAACTGGACCCCCGATTCGTGTTTCAGTTCGGCGGCGAACCGGAATGTAGTCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGG
TTTTCAGTTGCAAGTTCAGTGCTGATCGGAACTCGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGAAAATGGATGAGAACTTTTTCAGGCGAA
AGAGAGAAGCCCGGGAGAGAAAGAAAGGGTTGGATGATCATGGTTTACGACCTCTCGGGGTCCGCTGTCGCAGCGGCCTCCATGATTACGCCGTTCGTCCCTTCTCCGGG
CACGGATCGTGTCTCCCGCTCCAACCCTGGTGCCTGGCTCATCCTCCGCCCCCATGGCTTCTCTGTTAGCAGTTGGAAGCCATGGGGCCGCCTTGAGGCTTGGCGTGAGC
GGGGACCGATTGATGGTCTCGGCTACAAGTTCGAGCTCGTGGCTGATACTGGACTAGCAACTGGCATTCCCATTGCTGAAGCTACCATGAGCGTGAAAAAAGGTGGCCAA
TTTTGTATTGACCGTAAGACCGTGAGAGATTTTAGTCCAAACTCTCGATCCAACATTAAAGGTAATTTTGTAATGGCTTCGAGTGTGGAAGGAGAAGGGAAGGTAAGCAA
GCCTATCGTACAAGTCGGGGTTCAGCACGTGACGTGCATGGCGGATGCTGCTTTATTTGTCGCACTCGCAGCGGCCATTGATCTAAGCATGGATGCTTGTAGACATTTTA
CACAAAAATTAAGGAGGGAGCTATGTCACGACGAACATGATTCCAGTTTTCTCTAA
mRNA sequenceShow/hide mRNA sequence
CAATTCCCCAACTTAGATGTTCACCAATTTTTCTTCTTCTTCCTCTACCCACATTTCCGCCGCCGGCCACTTTTCCGATGGATCCCTGTCCGTTTGTCCGACTGATGGTC
GAATCCCTTTCTCTCAACCTTCCTCAGGCAACTCGCCCCGCCGGCGCCGCCGTCCACCCGTCGACAACGCCCTGCTTCTGTAAGATCGCGATCAAGAATTTCCCTTCGCA
AACGGCGCTTCTTCCTCTTTCATCTGCCTCCGGCGACTCACCGCCAGACTCCGCCGCGTCGTCCGCCGGTTTCCACCTCGACCCGACATCTCTACGCCGGCTCTCTGGTA
AGCCAGTTATAATGTGTCTTTCGGTTTTCGCCGGTCGGATGGGTAACACGTGTGGGGTTAATTCTGGCAAATTGCTCGGCCGGGTTCGGATCGCTATCTCGATCGACGGC
GCTGAGAATAAACCGAAAACGTTTCAGAATGGGTGGGTGAACTTGGGAAAAGATGAGGATAAAACATCTGCACGACTTCACTTGCTTGTCCGGTCTGAACTGGACCCCCG
ATTCGTGTTTCAGTTCGGCGGCGAACCGGAATGTAGTCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGGTTTTCAGTTGCAAGTTCAGTGCTGATCGGAACT
CGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGAAAATGGATGAGAACTTTTTCAGGCGAAAGAGAGAAGCCCGGGAGAGAAAGAAAGGGTTGG
ATGATCATGGTTTACGACCTCTCGGGGTCCGCTGTCGCAGCGGCCTCCATGATTACGCCGTTCGTCCCTTCTCCGGGCACGGATCGTGTCTCCCGCTCCAACCCTGGTGC
CTGGCTCATCCTCCGCCCCCATGGCTTCTCTGTTAGCAGTTGGAAGCCATGGGGCCGCCTTGAGGCTTGGCGTGAGCGGGGACCGATTGATGGTCTCGGCTACAAGTTCG
AGCTCGTGGCTGATACTGGACTAGCAACTGGCATTCCCATTGCTGAAGCTACCATGAGCGTGAAAAAAGGTGGCCAATTTTGTATTGACCGTAAGACCGTGAGAGATTTT
AGTCCAAACTCTCGATCCAACATTAAAGGTAATTTTGTAATGGCTTCGAGTGTGGAAGGAGAAGGGAAGGTAAGCAAGCCTATCGTACAAGTCGGGGTTCAGCACGTGAC
GTGCATGGCGGATGCTGCTTTATTTGTCGCACTCGCAGCGGCCATTGATCTAAGCATGGATGCTTGTAGACATTTTACACAAAAATTAAGGAGGGAGCTATGTCACGACG
AACATGATTCCAGTTTTCTCTAA
Protein sequenceShow/hide protein sequence
MDPCPFVRLMVESLSLNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSASGDSPPDSAASSAGFHLDPTSLRRLSGKPVIMCLSVFAGRMGNTCGVNSGKLL
GRVRIAISIDGAENKPKTFQNGWVNLGKDEDKTSARLHLLVRSELDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGRERKGWMIMVYDLSGSAVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQ
FCIDRKTVRDFSPNSRSNIKGNFVMASSVEGEGKVSKPIVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHDEHDSSFL