; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003325 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003325
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationscaffold4:48682831..48686478
RNA-Seq ExpressionSpg003325
SyntenySpg003325
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa]3.9e-23795.49Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKV+QNGWVKLGKDEDK SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo]1.1e-23695.25Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKV+QNGWVKLGKDEDK SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLATGIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

XP_022965765.1 uncharacterized protein LOC111465558 [Cucurbita maxima]5.6e-23695.01Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI ++IDGAENKPK +QNGWV LGKDEDKTSARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS IKGNFVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

XP_023536792.1 uncharacterized protein LOC111798069 [Cucurbita pepo subsp. pepo]2.8e-23594.77Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI ++IDGAENKPK +QNGWV LGKDEDKTSARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRD SPNSRS IKGNFVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida]3.6e-23594.54Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SLALNLPQATRPAGAAVHPS TPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDPSSLRR+SG PVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKP+V+QNGWVKLGKD+DK SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRDFS NS+S +KGNFVMASSVEGEGKVSKP++QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

TrEMBL top hitse value%identityAlignment
A0A0A0LIQ0 Uncharacterized protein1.9e-23494.54Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAE+KPKV+QNGWVKLGK EDK SARLHL+VRSEPDPRFVFQFG EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRD + NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

A0A1S3C9N5 uncharacterized protein LOC1034982205.4e-23795.25Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKV+QNGWVKLGKDEDK SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLATGIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

A0A5A7V777 DUF1005 domain-containing protein1.9e-23795.49Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKV+QNGWVKLGKDEDK SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

A0A6J1FDM1 uncharacterized protein LOC1114444182.3e-23594.54Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRI ++IDGAENKPK +QNGWV LGKDE+KTSARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS IKGNFVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

A0A6J1HPN4 uncharacterized protein LOC1114655582.7e-23695.01Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASS+GFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI ++IDGAENKPK +QNGWV LGKDEDKTSARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS IKGNFVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDYAFL
        RHFTQKLRRELCHDEHD +FL
Subjt:  RHFTQKLRRELCHDEHDYAFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.3e-12650.86Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVM---CLS--VFA
        MDPCPF+RL + +LAL +P A +   + VHPS++PCFCKI +KNFP QTA +P   +     P+    ++ FHL  S ++RL+ + +     CL   ++ 
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVM---CLS--VFA

Query:  GRMGHTCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDK--TSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---
        GR G  CGV+SG+LL +V + + + G ++KP V+ NGW+ +GK   K  +SA+ HL V++EPDPRFVFQF GEPECSP V QIQGNIRQPVF+CKFS   
Subjt:  GRMGHTCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDK--TSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PGKERKGW I V+DLSGSPVA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID-RKTVRDFSP-----NSRSTIKGN------------------------------
        AWRER G  DGLGY+FEL+ +     GI +AE+T+S  +GG+F I+   +    SP      SRS   G+                              
Subjt:  AWRER-GPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID-RKTVRDFSP-----NSRSTIKGN------------------------------

Query:  --FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD
          FVM++SVEGEGK SKP ++V VQHV+CM DAA +VAL+AAIDLSMDACR F Q++R+ELCH+
Subjt:  --FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD

AT1G50040.1 Protein of unknown function (DUF1005)2.4e-8844.66Show/hide
Query:  MDPCPFVRLIVDSLALNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSG-------FHLDPSSLRRLSGK
        MDPC FVR+IV +LA+  P+       ++  +G +V   S+  C+CKI  K+FP Q   +P+  +  +S  +S   S         F L  S +     K
Subjt:  MDPCPFVRLIVDSLALNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSG-------FHLDPSSLRRLSGK

Query:  PVVMCLSV-FAGRMGHTCG---VNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLG---KDEDKTSA--RLHLIVRSEPDPRFVFQFGGEPECSPVVFQI
             LSV    R   +CG    +  KL+GR ++T+ +  AE+K  +  NGWV LG   K+  K+ +   LH+ VR EPD RFVFQF GEPECSP VFQ+
Subjt:  PVVMCLSV-FAGRMGHTCG---VNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLG---KDEDKTSA--RLHLIVRSEPDPRFVFQFGGEPECSPVVFQI

Query:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG
        QGN +Q VF+CKF   RNS  R+L    S + T GK         E+  KERKGW I ++DLSGSPVA ASM+TPFVPSPG++RVSRS+PGAWLILRP G
Subjt:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG

Query:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVR-----------DFSPNSRSTIKGN-------------
        +   +WKPW RL+AWRE G  D LGY+FEL  + G+A  +  A +++S K GG F ID  T              F  +S S+I+ +             
Subjt:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRKTVR-----------DFSPNSRSTIKGN-------------

Query:  ---------FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                 FVM++ V+G  K SKP ++VGV+HVTC  DAA  VALAAA+DLSMDACR F+QKLR EL
Subjt:  ---------FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005)3.5e-9544.58Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSG--------FHLDPSSLRRLSGK
        MDPC FVR+IV +LA+  P ++        P+ + ++P+   C+CKI  KNFP +   +P+     +S  ++  SSSG        F L  + +     K
Subjt:  MDPCPFVRLIVDSLALNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSG--------FHLDPSSLRRLSGK

Query:  PVVMCLSVFA----------GRMGHTCGVNSG--KLLGRVRITVAIDGAENKPKVYQNGWVKL--GKDEDKTSA--RLHLIVRSEPDPRFVFQFGGEPEC
        P    LSV A          G  G +CG+ +   KLLGR  +++ +  AE K  +  NGWV L   K + KT +   LH+ VR EPDPRFVFQF GEPEC
Subjt:  PVVMCLSVFA----------GRMGHTCGVNSG--KLLGRVRITVAIDGAENKPKVYQNGWVKL--GKDEDKTSA--RLHLIVRSEPDPRFVFQFGGEPEC

Query:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN
        SP VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P KERKGW I V+DLSGSPVA ASM+TPFVPSPG++RV+RS+
Subjt:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN

Query:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID------------------------------R
        PGAWLILRP G    +WKPWGRLEAWRE G  D LGY+FEL  + G+AT +  A +++S+K GG F ID                              R
Subjt:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID------------------------------R

Query:  KTVR-------DFS------PNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
           R       DF       P++ +  +G FVM+++VEG GK SKP ++VGV HVTC  DAA  VALAAA+DLS+DACR F+ KLR+EL
Subjt:  KTVR-------DFS------PNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005)7.9e-16467.7Show/hide
Query:  MDPCPFVRLIVDSLALNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVS-GDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGR
        MDPCPFVRL +DSLAL LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS S   SPP+S+ S+ GFHLD  ++RR+SGK + + +SV+AGR
Subjt:  MDPCPFVRLIVDSLALNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVS-GDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGR

Query:  MGHTCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         GHTCGV SGKLLG+V + V +  A ++   + NGW KLG D DK SARLHL+V +EPDPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGHTCGVNSGKLLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  +ERKGWMI ++DLSGSPVAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRK-TVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAID
         IDGLGYKFELV +   +TGIPIAE TMS K+GG+F IDR+ + +  SP   S +KG FVM SSVEGEGKVSKP++ VG QHVTCMADAALFVAL+AA+D
Subjt:  PIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDRK-TVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAID

Query:  LSMDACRHFTQKLRRELCHDE
        LS+DAC+ F++KLR+ELCHD+
Subjt:  LSMDACRHFTQKLRRELCHDE

AT5G17640.1 Protein of unknown function (DUF1005)2.7e-8741.36Show/hide
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRL------SGKPVVMCL
        MDP  F+RL V SLAL +P+    + +  +     ++ C C+I ++ FP QT  +PL   S D+ PD  + S+ F+L+ S LR L            + +
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRL------SGKPVVMCL

Query:  SVFAGRMGHTCGVNSGK-LLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS
        SVF G+    CGV   +  +G  ++ V  +  E KP +  NGW+ +GK +   +A LHL V+ +PDPR+VFQF      SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGHTCGVNSGK-LLGRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   +ERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVANTGLATG-IPIAEATMSVKKGGQFCIDR---------------KTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVG
        AWRERG  D +  +F L++N GL  G + ++E  +S +KGG+F ID                ++  DFS   +    G FVM+S V+GEGK SKP++Q+ 
Subjt:  AWRERGPIDGLGYKFELVANTGLATG-IPIAEATMSVKKGGQFCIDR---------------KTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVG

Query:  VQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
        ++HVTC+ DAA+F+ALAAA+DLS+ AC+ F +  RR   H
Subjt:  VQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCTGTCCCTTCGTCCGCCTCATCGTCGACTCCCTCGCCCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGCGCCGCCGTCCACCCCTCCACCACGCCCTGCTT
CTGTAAGATCGCCATCAAGAACTTCCCTTCGCAGACCGCGCTTCTTCCTCTCTCCTCCGTCTCCGGCGACTCGCCCCCGGACTCCGCCGCGTCCTCCTCCGGTTTCCACC
TCGACCCCTCCTCTCTCCGCCGCCTCTCCGGTAAGCCGGTTGTAATGTGTCTGTCTGTCTTCGCCGGCCGGATGGGCCACACGTGTGGGGTTAATTCCGGCAAGTTGCTC
GGCCGGGTTCGGATCACTGTCGCGATCGACGGCGCTGAGAATAAACCGAAAGTGTATCAGAATGGGTGGGTGAAATTGGGGAAAGATGAGGATAAAACCTCGGCTCGGTT
GCATTTGATCGTCCGGTCTGAACCGGACCCCCGGTTCGTGTTCCAGTTCGGCGGCGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGG
TTTTCAGCTGCAAGTTCAGTGCCGATCGAAACTCGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACTTTTTCAGGGGAG
AGAGAGAAGCCAGGGAAAGAAAGAAAGGGTTGGATGATCATGGTTTATGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCATGATCACACCGTTCGTCCCTTCCCCGGG
CACCGACCGCGTCTCCCGCTCCAACCCCGGTGCGTGGCTCATCCTCCGCCCCCACGGCTTCTCCGTTAGCAGTTGGAAGCCATGGGGCCGCCTCGAGGCTTGGCGCGAGC
GGGGACCGATCGATGGCCTCGGCTACAAGTTCGAGCTCGTTGCCAACACCGGACTAGCCACTGGCATTCCCATTGCCGAAGCTACCATGAGCGTGAAAAAAGGTGGTCAA
TTTTGCATCGACCGTAAAACCGTGAGAGATTTTAGTCCAAACTCCAGATCCACTATTAAAGGCAACTTTGTAATGGCTTCGAGTGTGGAAGGAGAAGGAAAGGTGAGTAA
GCCAATTATACAAGTCGGAGTTCAGCACGTGACGTGCATGGCGGACGCTGCTCTATTTGTAGCACTTGCTGCAGCCATTGACCTAAGCATGGATGCTTGTAGACATTTTA
CACAAAAGCTAAGGAGGGAGCTTTGTCACGACGAACACGATTATGCTTTTCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCCTGTCCCTTCGTCCGCCTCATCGTCGACTCCCTCGCCCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGCGCCGCCGTCCACCCCTCCACCACGCCCTGCTT
CTGTAAGATCGCCATCAAGAACTTCCCTTCGCAGACCGCGCTTCTTCCTCTCTCCTCCGTCTCCGGCGACTCGCCCCCGGACTCCGCCGCGTCCTCCTCCGGTTTCCACC
TCGACCCCTCCTCTCTCCGCCGCCTCTCCGGTAAGCCGGTTGTAATGTGTCTGTCTGTCTTCGCCGGCCGGATGGGCCACACGTGTGGGGTTAATTCCGGCAAGTTGCTC
GGCCGGGTTCGGATCACTGTCGCGATCGACGGCGCTGAGAATAAACCGAAAGTGTATCAGAATGGGTGGGTGAAATTGGGGAAAGATGAGGATAAAACCTCGGCTCGGTT
GCATTTGATCGTCCGGTCTGAACCGGACCCCCGGTTCGTGTTCCAGTTCGGCGGCGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGG
TTTTCAGCTGCAAGTTCAGTGCCGATCGAAACTCGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACTTTTTCAGGGGAG
AGAGAGAAGCCAGGGAAAGAAAGAAAGGGTTGGATGATCATGGTTTATGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCATGATCACACCGTTCGTCCCTTCCCCGGG
CACCGACCGCGTCTCCCGCTCCAACCCCGGTGCGTGGCTCATCCTCCGCCCCCACGGCTTCTCCGTTAGCAGTTGGAAGCCATGGGGCCGCCTCGAGGCTTGGCGCGAGC
GGGGACCGATCGATGGCCTCGGCTACAAGTTCGAGCTCGTTGCCAACACCGGACTAGCCACTGGCATTCCCATTGCCGAAGCTACCATGAGCGTGAAAAAAGGTGGTCAA
TTTTGCATCGACCGTAAAACCGTGAGAGATTTTAGTCCAAACTCCAGATCCACTATTAAAGGCAACTTTGTAATGGCTTCGAGTGTGGAAGGAGAAGGAAAGGTGAGTAA
GCCAATTATACAAGTCGGAGTTCAGCACGTGACGTGCATGGCGGACGCTGCTCTATTTGTAGCACTTGCTGCAGCCATTGACCTAAGCATGGATGCTTGTAGACATTTTA
CACAAAAGCTAAGGAGGGAGCTTTGTCACGACGAACACGATTATGCTTTTCTCTAA
Protein sequenceShow/hide protein sequence
MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSSGFHLDPSSLRRLSGKPVVMCLSVFAGRMGHTCGVNSGKLL
GRVRITVAIDGAENKPKVYQNGWVKLGKDEDKTSARLHLIVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQ
FCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHDEHDYAFL