CuGenDBv2

Lag0007014 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007014
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF1005)
Genome location: chr6:48040563..48043444
RNA-Seq Expression: Lag0007014
Synteny: Lag0007014
Gene Ontology terms: NA
InterPro domains: IPR010410 - Protein of unknown function DUF1005


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa] | e value: 1.6e-238 | % identity: 96.2
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

XP_004141026.2 uncharacterized protein LOC101219082 [Cucumis sativus] | e value: 1.6e-235 | % identity: 95.25
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAE+KPKVFQNGWVKLGK EDKISARLHLVVRSEPDPRFVFQFG EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRD + NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo] | e value: 4.6e-238 | % identity: 95.96
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLA+GIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

XP_022965765.1 uncharacterized protein LOC111465558 [Cucurbita maxima] | e value: 7.3e-236 | % identity: 95.01
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI ++IDGAENKPK FQNGWV LGKDEDK SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS IKGNFVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida] | e value: 1.5e-236 | % identity: 95.25
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SLALNLPQATRPAGAAVHPS TPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRR+SG PVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKP+VFQNGWVKLGKD+DKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRDFS NS+S +KGNFVMASSVEGEGKVSKP++QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LIQ0 Uncharacterized protein | e value: 7.8e-236 | % identity: 95.25
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAE+KPKVFQNGWVKLGK EDKISARLHLVVRSEPDPRFVFQFG EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRD + NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

A0A1S3C9N5 uncharacterized protein LOC103498220 | e value: 2.2e-238 | % identity: 95.96
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLA+GIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

A0A5A7V777 DUF1005 domain-containing protein | e value: 7.6e-239 | % identity: 96.2
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+VDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITV+IDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRDF+ NS+ST+KG+FVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

A0A6J1FDM1 uncharacterized protein LOC111444418 | e value: 3.0e-235 | % identity: 94.54
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRI ++IDGAENKPK FQNGWV LGKDE+K SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS IKGNFVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

A0A6J1HPN4 uncharacterized protein LOC111465558 | e value: 3.5e-236 | % identity: 95.01
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRL+V+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI ++IDGAENKPK FQNGWV LGKDEDK SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPG+ERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLA+GIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRS IKGNFVMASSVEGEGKVSKPI+QVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDQHDSAFL
        RHFTQKLRRELCHD+HDS+FL
Subjt:  RHFTQKLRRELCHDQHDSAFL

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G10020.1 Protein of unknown function (DUF1005) | e value: 4.5e-127 | % identity: 51.29
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVM---CLS--VFA
        MDPCPF+RL + +LAL +P A +   + VHPS++PCFCKI +KNFP QTA +P   +     P+    +A FHL  S ++RL+ + +     CL   ++ 
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVM---CLS--VFA

Query:  GRMGHTCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDK--ISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---
        GR G  CGV+SG+LL +V + + + G ++KP VF NGW+ +GK   K   SA+ HL V++EPDPRFVFQF GEPECSP V QIQGNIRQPVF+CKFS   
Subjt:  GRMGHTCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDK--ISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PGKERKGW I V+DLSGSPVA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCID-RKTVRDFSP-----NSRSTIKGN------------------------------
        AWRER G  DGLGY+FEL+ +    +GI +AE+T+S  +GG+F I+   +    SP      SRS   G+                              
Subjt:  AWRER-GPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCID-RKTVRDFSP-----NSRSTIKGN------------------------------

Query:  --FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD
          FVM++SVEGEGK SKP ++V VQHV+CM DAA +VAL+AAIDLSMDACR F Q++R+ELCH+
Subjt:  --FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD

AT1G50040.1 Protein of unknown function (DUF1005) | e value: 5.4e-88 | % identity: 44.75
Query:  MDPCPFVRLIVDSLALNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPL------SSVSGDSPPDSAASSAGFHLDPSSLRRLSGKP
        MDPC FVR+IV +LA+  P+       ++  +G +V   S+  C+CKI  K+FP Q   +P+       S S     + +  +A F L  S +     K 
Subjt:  MDPCPFVRLIVDSLALNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPL------SSVSGDSPPDSAASSAGFHLDPSSLRRLSGKP

Query:  VVMCLSV-FAGRMGHTCG---VNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLG---KDEDKISA--RLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQ
            LSV    R   +CG    +  KL+GR ++T+ +  AE+K  +  NGWV LG   K+  K  +   LH+ VR EPD RFVFQF GEPECSP VFQ+Q
Subjt:  VVMCLSV-FAGRMGHTCG---VNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLG---KDEDKISA--RLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQ

Query:  GNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGF
        GN +Q VF+CKF   RNS  R+L    S + T GK         E+  KERKGW I ++DLSGSPVA ASM+TPFVPSPG++RVSRS+PGAWLILRP G+
Subjt:  GNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGF

Query:  SVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVR-----------DFSPNSRSTIKGN--------------
           +WKPW RL+AWRE G  D LGY+FEL  + G+A  +  A +++S K GG F ID  T              F  +S S+I+ +              
Subjt:  SVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRKTVR-----------DFSPNSRSTIKGN--------------

Query:  --------FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                FVM++ V+G  K SKP ++VGV+HVTC  DAA  VALAAA+DLSMDACR F+QKLR EL
Subjt:  --------FVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005) | e value: 5.1e-94 | % identity: 43.76
Query:  MDPCPFVRLIVDSLALNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAG--------FHLDPSSLRRLSGK
        MDPC FVR+IV +LA+  P ++        P+ + ++P+   C+CKI  KNFP +   +P+     +S  ++  SS+G        F L  + +     K
Subjt:  MDPCPFVRLIVDSLALNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAG--------FHLDPSSLRRLSGK

Query:  PVVMCLSVFA----------GRMGHTCGVNSG--KLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISA----RLHLVVRSEPDPRFVFQFGGEPEC
        P    LSV A          G  G +CG+ +   KLLGR  +++ +  AE K  +  NGWV L   + K        LH+ VR EPDPRFVFQF GEPEC
Subjt:  PVVMCLSVFA----------GRMGHTCGVNSG--KLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISA----RLHLVVRSEPDPRFVFQFGGEPEC

Query:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN
        SP VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P KERKGW I V+DLSGSPVA ASM+TPFVPSPG++RV+RS+
Subjt:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN

Query:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCID------------------------------R
        PGAWLILRP G    +WKPWGRLEAWRE G  D LGY+FEL  + G+A+ +  A +++S+K GG F ID                              R
Subjt:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCID------------------------------R

Query:  KTVR-------DFS------PNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
           R       DF       P++ +  +G FVM+++VEG GK SKP ++VGV HVTC  DAA  VALAAA+DLS+DACR F+ KLR+EL
Subjt:  KTVR-------DFS------PNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005) | e value: 1.3e-163 | % identity: 67.45
Query:  MDPCPFVRLIVDSLALNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVS-GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGR
        MDPCPFVRL +DSLAL LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS S   SPP+S+ S+ GFHLD  ++RR+SGK + + +SV+AGR
Subjt:  MDPCPFVRLIVDSLALNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVS-GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGR

Query:  MGHTCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         GHTCGV SGKLLG+V + V +  A ++   F NGW KLG D DK SARLHL+V +EPDPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGHTCGVNSGKLLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  +ERKGWMI ++DLSGSPVAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRK-TVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAID
         IDGLGYKFELV +   ++GIPIAE TMS K+GG+F IDR+ + +  SP   S +KG FVM SSVEGEGKVSKP++ VG QHVTCMADAALFVAL+AA+D
Subjt:  PIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCIDRK-TVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAID

Query:  LSMDACRHFTQKLRRELCHDQHDS
        LS+DAC+ F++KLR+ELCHD   S
Subjt:  LSMDACRHFTQKLRRELCHDQHDS

AT5G17640.1 Protein of unknown function (DUF1005) | e value: 4.6e-87 | % identity: 40.77
Query:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRL------SGKPVVMCL
        MDP  F+RL V SLAL +P+    + +  +     ++ C C+I ++ FP QT  +PL   S D+ PD  + S  F+L+ S LR L            + +
Subjt:  MDPCPFVRLIVDSLALNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRL------SGKPVVMCL

Query:  SVFAGRMGHTCGVNSGK-LLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS
        SVF G+    CGV   +  +G  ++ V  +  E KP +  NGW+ +GK +   +A LHL V+ +PDPR+VFQF      SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGHTCGVNSGK-LLGRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   +ERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCIDR---------------KTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGV
        AWRERG  D +  +F L++N      + ++E  +S +KGG+F ID                ++  DFS   +    G FVM+S V+GEGK SKP++Q+ +
Subjt:  AWRERGPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQFCIDR---------------KTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGV

Query:  QHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
        +HVTC+ DAA+F+ALAAA+DLS+ AC+ F +  RR   H
Subjt:  QHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
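
In the alignments above, the middle line of each block marks an identical residue with its letter, a conservative substitution with `+`, and any other mismatch with a space. The following is a minimal sketch of how an identity percentage could be derived from such a middle line. It assumes the line has been extracted without its leading indentation, and the function name `percent_identity` is hypothetical. The database's own % identity figures cover the full alignment, so the value for a single block will generally differ.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return the percentage of aligned columns marked identical in the midline."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not midline:
        raise ValueError("alignment block is empty")
    # A letter in the midline means the query and subject residues are identical;
    # '+' (conservative substitution) and ' ' (mismatch) are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Final block of the first GenBank alignment on this page (indentation removed).
query = "RHFTQKLRRELCHDQHDSAFL"
midline = "RHFTQKLRRELCHD+HDS+FL"
print(f"{percent_identity(query, midline):.2f}")
```

Joining the middle lines of all blocks of one hit before calling the function gives a figure for the whole alignment rather than one block.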


Sequences
CDS sequence
ATGGATCCCTGTCCCTTCGTCCGCCTCATCGTCGACTCCCTCGCCCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGCGCCGCCGTCCACCCCTCCACCACGCCCTGCTT
CTGTAAGATCGCCATCAAGAACTTCCCTTCTCAGACCGCGCTTCTTCCTCTCTCCTCCGTCTCCGGCGACTCGCCTCCCGACTCCGCCGCGTCCTCCGCCGGTTTCCACC
TCGACCCCTCCTCTCTCCGCCGGCTCTCCGGCAAGCCGGTTGTAATGTGTCTGTCGGTCTTCGCCGGCCGGATGGGCCACACGTGTGGGGTTAATTCCGGCAAGTTGCTC
GGCCGGGTTCGGATCACTGTCGCGATCGACGGCGCTGAGAATAAACCGAAAGTGTTTCAGAATGGGTGGGTGAAATTGGGGAAGGATGAGGATAAAATCTCGGCTCGGTT
GCATTTGGTCGTCCGGTCTGAACCGGACCCCCGGTTCGTGTTCCAGTTCGGCGGCGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGG
TTTTCAGCTGCAAGTTCAGTGCTGATCGAAACTCGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACTTTTTCAGGGGAG
AGAGAGAAGCCAGGGAAAGAAAGAAAGGGTTGGATGATCATGGTTTATGACCTCTCCGGCTCCCCGGTCGCGGCCGCCTCCATGATCACACCGTTCGTCCCTTCCCCGGG
CACCGACCGCGTCTCGCGCTCCAACCCCGGTGCGTGGCTCATCCTCCGCCCCCACGGCTTCTCCGTTAGCAGTTGGAAGCCATGGGGCCGCCTCGAGGCTTGGCGCGAGC
GGGGACCGATCGATGGCCTCGGCTACAAGTTCGAGCTCGTGGCCAACACCGGACTAGCCAGTGGCATTCCCATTGCCGAAGCTACCATGAGCGTGAAGAAAGGTGGCCAG
TTTTGCATCGACCGTAAAACCGTGAGAGACTTTAGTCCAAACTCTAGATCCACTATTAAAGGCAACTTTGTAATGGCTTCGAGTGTGGAAGGAGAAGGAAAGGTGAGCAA
ACCAATCATACAAGTCGGAGTTCAGCACGTGACGTGCATGGCGGACGCTGCTCTATTTGTAGCACTTGCTGCAGCCATTGACCTAAGCATGGATGCTTGTAGACATTTTA
CACAAAAGCTAAGGAGGGAGCTTTGTCACGACCAACACGACTCCGCTTTTCTCTAA
mRNA sequence
ATGGATCCCTGTCCCTTCGTCCGCCTCATCGTCGACTCCCTCGCCCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGCGCCGCCGTCCACCCCTCCACCACGCCCTGCTT
CTGTAAGATCGCCATCAAGAACTTCCCTTCTCAGACCGCGCTTCTTCCTCTCTCCTCCGTCTCCGGCGACTCGCCTCCCGACTCCGCCGCGTCCTCCGCCGGTTTCCACC
TCGACCCCTCCTCTCTCCGCCGGCTCTCCGGCAAGCCGGTTGTAATGTGTCTGTCGGTCTTCGCCGGCCGGATGGGCCACACGTGTGGGGTTAATTCCGGCAAGTTGCTC
GGCCGGGTTCGGATCACTGTCGCGATCGACGGCGCTGAGAATAAACCGAAAGTGTTTCAGAATGGGTGGGTGAAATTGGGGAAGGATGAGGATAAAATCTCGGCTCGGTT
GCATTTGGTCGTCCGGTCTGAACCGGACCCCCGGTTCGTGTTCCAGTTCGGCGGCGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGG
TTTTCAGCTGCAAGTTCAGTGCTGATCGAAACTCGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACTTTTTCAGGGGAG
AGAGAGAAGCCAGGGAAAGAAAGAAAGGGTTGGATGATCATGGTTTATGACCTCTCCGGCTCCCCGGTCGCGGCCGCCTCCATGATCACACCGTTCGTCCCTTCCCCGGG
CACCGACCGCGTCTCGCGCTCCAACCCCGGTGCGTGGCTCATCCTCCGCCCCCACGGCTTCTCCGTTAGCAGTTGGAAGCCATGGGGCCGCCTCGAGGCTTGGCGCGAGC
GGGGACCGATCGATGGCCTCGGCTACAAGTTCGAGCTCGTGGCCAACACCGGACTAGCCAGTGGCATTCCCATTGCCGAAGCTACCATGAGCGTGAAGAAAGGTGGCCAG
TTTTGCATCGACCGTAAAACCGTGAGAGACTTTAGTCCAAACTCTAGATCCACTATTAAAGGCAACTTTGTAATGGCTTCGAGTGTGGAAGGAGAAGGAAAGGTGAGCAA
ACCAATCATACAAGTCGGAGTTCAGCACGTGACGTGCATGGCGGACGCTGCTCTATTTGTAGCACTTGCTGCAGCCATTGACCTAAGCATGGATGCTTGTAGACATTTTA
CACAAAAGCTAAGGAGGGAGCTTTGTCACGACCAACACGACTCCGCTTTTCTCTAA
Protein sequence
MDPCPFVRLIVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGHTCGVNSGKLL
GRVRITVAIDGAENKPKVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGKERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLASGIPIAEATMSVKKGGQ
FCIDRKTVRDFSPNSRSTIKGNFVMASSVEGEGKVSKPIIQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHDQHDSAFL
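
The protein sequence corresponds to a codon-by-codon translation of the CDS under the standard genetic code. As a hedged sketch of how that relationship could be checked, the plain-Python function below translates a DNA string and stops at the first stop codon. The names `translate` and `CODON_TABLE` are illustrative only and are not part of CuGenDB or any library. The two inputs are the first 36 and last 12 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 nt of the CDS: should give the first 12 residues of the protein.
print(translate("ATGGATCCCTGTCCCTTCGTCCGCCTCATCGTCGAC"))
# Last 12 nt of the CDS: three codons followed by the TAA stop codon.
print(translate("GCTTTTCTCTAA"))
```

Passing the full CDS, with its line breaks left in, should return a string that can be compared directly against the protein sequence once that sequence's own line breaks are removed.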