CuGenDBv2

Tan0012816 (gene) of Snake gourd v1 genome

Gene ID: Tan0012816
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF1005)
Genome location: LG08:921547..925171
RNA-Seq Expression: Tan0012816
Synteny: Tan0012816
Gene Ontology terms: NA
InterPro domains: IPR010410 - Protein of unknown function DUF1005
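
The genome location above uses the form `sequence:start..end`. A minimal sketch of how such a string could be split into its parts is given below. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

# Pattern for "seqid:start..end", e.g. "LG08:921547..925171".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG08:921547..925171")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 3625 bp on LG08. A 0-based or half-open convention would change that figure by one.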


Homology
GenBank top hits | e value | % identity | Alignment
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa] | 9.2e-239 | 96.44
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDP+SLRRLSGKPV++CLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHL+ RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF  NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo] | 2.7e-238 | 96.2
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDP+SLRRLSGKPV++CLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHL+ RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVAD+GLATGIPIAEATMSVKKGGQFCID KTVRDF  NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_022965765.1 uncharacterized protein LOC111465558 [Cucurbita maxima] | 6.6e-237 | 95.96
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDPTSLRRLSGKPVI+CLSVFAGRMG+
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAENKPK FQNGWV LGKDEDK SARLHLL RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF+PNS+S IKGNFVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_023536792.1 uncharacterized protein LOC111798069 [Cucurbita pepo subsp. pepo] | 3.3e-236 | 95.72
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDPTSLRRLSGKPVI+CLSVFAGRMG+
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAENKPK FQNGWV LGKDEDK SARLHLL RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRD +PNS+S IKGNFVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida] | 8.6e-237 | 95.72
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPS TPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDP+SLRR+SG PV++CLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAENKP+VFQNGWVKLGKD+DKISARLHL+ RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF+ NSKS +KGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL
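
The % identity column can be related to the alignment blocks shown above. In BLAST-style pairwise output, the unlabeled middle line repeats the residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities from that middle line for one block. `midline_identity` is a hypothetical helper, not part of any BLAST toolkit, and it assumes the query and middle lines have already been stripped of their `Query:` label and leading indentation.

```python
def midline_identity(query, midline):
    """Return (identities, aligned_length, percent_identity) for one block.

    `query` is the residue string of a Query line; `midline` is the match
    line below it. Trailing blanks lost in copying are padded back.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query segment")
    midline = midline.ljust(len(query))
    # A letter in the midline that equals the query residue marks an identity;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    length = len(query)
    return identities, length, 100.0 * identities / length


# A short excerpt taken from the first GenBank hit above.
ident, length, pct = midline_identity(
    "CIDHKTVRDFNPNSKSTIKGNF",
    "CID KTVRDF  NSKST+KG+F",
)
print(ident, length, round(pct, 2))
```

To compare against the reported value for a whole hit, the identities and aligned lengths of all blocks would be summed before taking the ratio. Gap columns count toward the aligned length.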

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LIQ0 Uncharacterized protein | 4.6e-236 | 95.49
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDP+SLRRLSGKPV++CLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAE+KPKVFQNGWVKLGK EDKISARLHL+ RSEPDPRFVFQFG EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRD   NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A1S3C9N5 uncharacterized protein LOC103498220 | 1.3e-238 | 96.2
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDP+SLRRLSGKPV++CLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHL+ RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVAD+GLATGIPIAEATMSVKKGGQFCID KTVRDF  NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A5A7V777 DUF1005 domain-containing protein | 4.4e-239 | 96.44
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS TPCFCKI+IKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDP+SLRRLSGKPV++CLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHL+ RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF  NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A6J1FDM1 uncharacterized protein LOC111444418 | 2.7e-236 | 95.49
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDPTSLRRLSGKPVI+CLSVFAGRMG+
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRI +SIDGAENKPK FQNGWV LGKDE+K SARLHLL RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF+PNS+S IKGNFVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A6J1HPN4 uncharacterized protein LOC111465558 | 3.2e-237 | 95.96
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGF+LDPTSLRRLSGKPVI+CLSVFAGRMG+
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAENKPK FQNGWV LGKDEDK SARLHLL RSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF+PNS+S IKGNFVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G10020.1 Protein of unknown function (DUF1005) | 1.1e-125 | 49.79
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVII---CLS--VFA
        MDPCPF+RL + +LAL +P A +   + VHPS++PCFCKI +KNFP QTA +P   +     P+    +A F+L  + ++RL+ + +     CL   ++ 
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVII---CLS--VFA

Query:  GRMGHTCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDK--ISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---
        GR G  CGV+SG+LL +V + + + G ++KP VF NGW+ +GK   K   SA+ HL  ++EPDPRFVFQF GEPECSP V QIQGNIRQPVF+CKFS   
Subjt:  GRMGHTCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDK--ISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PG+ERKGW I V+DLSGSPVA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKS-----------------------------------------
        AWRER G  DGLGY+FEL+ D     GI +AE+T+S  +GG+F I+  +    +P+S S                                         
Subjt:  AWRER-GPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFNPNSKS-----------------------------------------

Query:  -TIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD
          +   FVM++SVEGEGK SKP V+V VQHV+CM DAA +VAL+AAIDLSMDACR F Q++R+ELCH+
Subjt:  -TIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD

AT1G50040.1 Protein of unknown function (DUF1005) | 4.1e-88 | 44.04
Query:  MDPCPFVRLMVDSLALNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPL----------SSVSGDSPPDSAASSAGFYLDPTSLRRL
        MDPC FVR++V +LA+  P+       ++  +G +V   S+  C+CKI  K+FP Q   +P+             SG+    +A  S       TSL++ 
Subjt:  MDPCPFVRLMVDSLALNLPQ-------ATRPAGAAVHP-STTPCFCKIAIKNFPSQTALLPL----------SSVSGDSPPDSAASSAGFYLDPTSLRRL

Query:  SGKPVIICLSVFAGRMGHTCG---VNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLG---KDEDKISA--RLHLLARSEPDPRFVFQFGGEPECSPVVF
          K  ++ + V++ R   +CG    +  KL+GR ++T+ +  AE+K  +  NGWV LG   K+  K  +   LH+  R EPD RFVFQF GEPECSP VF
Subjt:  SGKPVIICLSVFAGRMGHTCG---VNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLG---KDEDKISA--RLHLLARSEPDPRFVFQFGGEPECSPVVF

Query:  QIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRP
        Q+QGN +Q VF+CKF   RNS  R+L    S + T GK         E+  +ERKGW I ++DLSGSPVA ASM+TPFVPSPG++RVSRS+PGAWLILRP
Subjt:  QIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRP

Query:  HGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVR-----------DFNPNSKSTIKGN-----------
         G+   +WKPW RL+AWRE G  D LGY+FEL  D G+A  +  A +++S K GG F ID  T              F+ +S S+I+ +           
Subjt:  HGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVR-----------DFNPNSKSTIKGN-----------

Query:  -----------FVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                   FVM++ V+G  K SKP V+VGV+HVTC  DAA  VALAAA+DLSMDACR F+QKLR EL
Subjt:  -----------FVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005) | 3.0e-94 | 43.35
Query:  MDPCPFVRLMVDSLALNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAG--------FYLDPTSLRRLSGK
        MDPC FVR++V +LA+  P ++        P+ + ++P+   C+CKI  KNFP +   +P+     +S  ++  SS+G        F L    +     K
Subjt:  MDPCPFVRLMVDSLALNLPQATR-------PAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAG--------FYLDPTSLRRLSGK

Query:  PVIICLSVFA----------GRMGHTCGVNSG--KLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISA----RLHLLARSEPDPRFVFQFGGEPEC
        P    LSV A          G  G +CG+ +   KLLGR  +++ +  AE K  +  NGWV L   + K        LH+  R EPDPRFVFQF GEPEC
Subjt:  PVIICLSVFA----------GRMGHTCGVNSG--KLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISA----RLHLLARSEPDPRFVFQFGGEPEC

Query:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN
        SP VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P +ERKGW I V+DLSGSPVA ASM+TPFVPSPG++RV+RS+
Subjt:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN

Query:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCID-------------------------------
        PGAWLILRP G    +WKPWGRLEAWRE G  D LGY+FEL  D G+AT +  A +++S+K GG F ID                               
Subjt:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCID-------------------------------

Query:  ------HKTVRDF------NPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                +  DF      +P++ +  +G FVM+++VEG GK SKP V+VGV HVTC  DAA  VALAAA+DLS+DACR F+ KLR+EL
Subjt:  ------HKTVRDF------NPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005) | 4.6e-164 | 67.92
Query:  MDPCPFVRLMVDSLALNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVS-GDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGR
        MDPCPFVRL +DSLAL LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS S   SPP+S+ S+ GF+LD  ++RR+SGK + + +SV+AGR
Subjt:  MDPCPFVRLMVDSLALNLPQ--ATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVS-GDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGR

Query:  MGHTCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         GHTCGV SGKLLG+V + V +  A ++   F NGW KLG D DK SARLHLL  +EPDPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGHTCGVNSGKLLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  RERKGWMI ++DLSGSPVAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHK-TVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAID
         IDGLGYKFELV D   +TGIPIAE TMS K+GG+F ID + + +  +P   S +KG FVM SSVEGEGKVSKPVV VG QHVTCMADAALFVAL+AA+D
Subjt:  PIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHK-TVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAID

Query:  LSMDACRHFTQKLRRELCHDEHDS
        LS+DAC+ F++KLR+ELCHD+  S
Subjt:  LSMDACRHFTQKLRRELCHDEHDS

AT5G17640.1 Protein of unknown function (DUF1005) | 1.0e-86 | 41.36
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRL------SGKPVIICL
        MDP  F+RL V SLAL +P+    + +  +     ++ C C+I ++ FP QT  +PL   S D+ PD  + S  FYL+ + LR L            + +
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS---TTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRL------SGKPVIICL

Query:  SVFAGRMGHTCGVNSGK-LLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS
        SVF G+    CGV   +  +G  ++ V  +  E KP +  NGW+ +GK +   +A LHL  + +PDPR+VFQF      SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGHTCGVNSGK-LLGRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   RERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVADTGLATG-IPIAEATMSVKKGGQFCID---------------HKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVG
        AWRERG  D +  +F L+++ GL  G + ++E  +S +KGG+F ID                ++  DF+   +    G FVM+S V+GEGK SKPVVQ+ 
Subjt:  AWRERGPIDGLGYKFELVADTGLATG-IPIAEATMSVKKGGQFCID---------------HKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVG

Query:  VQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
        ++HVTC+ DAA+F+ALAAA+DLS+ AC+ F +  RR   H
Subjt:  VQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH


Sequences
CDS sequence
ATGGATCCCTGTCCGTTCGTCCGGCTGATGGTCGATTCCCTCGCTCTCAACCTCCCTCAAGCGACCCGACCCGCCGGCGCCGCCGTCCACCCGTCCACGACGCCCTGCTT
CTGTAAGATCGCGATCAAGAATTTCCCTTCGCAGACGGCGCTTCTTCCTCTTTCCTCTGTCTCCGGCGACTCGCCGCCGGACTCCGCCGCGTCGTCCGCCGGTTTCTACC
TCGACCCCACGTCTCTCCGCCGTCTCTCCGGTAAGCCGGTTATAATTTGTCTGTCGGTTTTCGCCGGCCGGATGGGCCACACGTGTGGGGTCAATTCCGGCAAATTGCTC
GGCCGGGTTCGGATCACTGTCTCGATCGACGGCGCTGAGAATAAACCGAAAGTGTTTCAGAATGGGTGGGTGAAATTGGGGAAGGATGAGGATAAAATCTCGGCTCGGCT
TCACTTGCTTGCCCGGTCTGAACCGGACCCGCGGTTCGTGTTCCAGTTCGGTGGCGAACCGGAATGCAGTCCGGTGGTTTTTCAGATCCAAGGAAATATCCGTCAGCCAG
TTTTCAGCTGCAAGTTCAGTGCCGATCGAAACTCAAGAACACGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACATTTTCAGGGGAG
AGGGAGAAGCCAGGGAGAGAGAGAAAGGGTTGGATGATCATGGTCTACGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCATGATAACCCCGTTCGTCCCCTCCCCGGG
CACGGACCGTGTCTCCCGCTCCAACCCCGGTGCCTGGCTCATCCTTCGACCCCATGGCTTCTCCGTTAGCAGTTGGAAGCCATGGGGCCGCCTCGAGGCTTGGCGCGAGC
GGGGACCAATCGATGGCCTCGGCTACAAGTTCGAGCTCGTCGCTGACACTGGACTAGCCACCGGCATTCCCATCGCCGAAGCTACCATGAGCGTAAAAAAAGGTGGCCAA
TTTTGCATTGACCATAAAACCGTGAGAGATTTCAATCCAAATTCTAAATCCACTATTAAAGGCAACTTTGTAATGGCTTCGAGCGTGGAAGGGGAAGGGAAAGTGAGCAA
GCCTGTCGTACAAGTCGGGGTTCAACACGTGACGTGCATGGCGGATGCTGCTTTATTTGTAGCGCTCGCAGCGGCAATTGATCTGAGCATGGATGCTTGTAGACATTTTA
CACAAAAGCTAAGGAGGGAGCTTTGTCACGACGAACACGATTCGAGTTTTCTCTAA
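
The CDS above and the protein sequence listed on this page are related by translation. The sketch below shows one way to check that relationship. It assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base. `translate` is a hypothetical helper written for this illustration, not a function from any particular library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over
# T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    can be pasted in as-is. Codons with unexpected characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS above.
print(translate("ATGGATCCCTGTCCGTTC"))
```

Applied to the first 18 bases of the CDS, this gives `MDPCPF`, which matches the start of the protein sequence on this page. The final codons `TTT CTC TAA` translate to `FL` followed by a stop, in agreement with the protein's C-terminus.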
mRNA sequence
GGGGAGCTCATTCTCCATCTGTCACGAGAGATAGAAACAGAATAAAAAGCAAAACCTTCAATCCATCAATGGCCGACAGTTCACACTCCAAATCCCCAACTTAGATGTTC
GCCAATTTTTCTTTTCCTTCTTCTTCTTTCTCAGTCTACATTTCCGCCGCCGGCGACTTTTCCGATGGATCCCTGTCCGTTCGTCCGGCTGATGGTCGATTCCCTCGCTC
TCAACCTCCCTCAAGCGACCCGACCCGCCGGCGCCGCCGTCCACCCGTCCACGACGCCCTGCTTCTGTAAGATCGCGATCAAGAATTTCCCTTCGCAGACGGCGCTTCTT
CCTCTTTCCTCTGTCTCCGGCGACTCGCCGCCGGACTCCGCCGCGTCGTCCGCCGGTTTCTACCTCGACCCCACGTCTCTCCGCCGTCTCTCCGGTAAGCCGGTTATAAT
TTGTCTGTCGGTTTTCGCCGGCCGGATGGGCCACACGTGTGGGGTCAATTCCGGCAAATTGCTCGGCCGGGTTCGGATCACTGTCTCGATCGACGGCGCTGAGAATAAAC
CGAAAGTGTTTCAGAATGGGTGGGTGAAATTGGGGAAGGATGAGGATAAAATCTCGGCTCGGCTTCACTTGCTTGCCCGGTCTGAACCGGACCCGCGGTTCGTGTTCCAG
TTCGGTGGCGAACCGGAATGCAGTCCGGTGGTTTTTCAGATCCAAGGAAATATCCGTCAGCCAGTTTTCAGCTGCAAGTTCAGTGCCGATCGAAACTCAAGAACACGGTC
ACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACATTTTCAGGGGAGAGGGAGAAGCCAGGGAGAGAGAGAAAGGGTTGGATGATCATGGTCT
ACGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCATGATAACCCCGTTCGTCCCCTCCCCGGGCACGGACCGTGTCTCCCGCTCCAACCCCGGTGCCTGGCTCATCCTT
CGACCCCATGGCTTCTCCGTTAGCAGTTGGAAGCCATGGGGCCGCCTCGAGGCTTGGCGCGAGCGGGGACCAATCGATGGCCTCGGCTACAAGTTCGAGCTCGTCGCTGA
CACTGGACTAGCCACCGGCATTCCCATCGCCGAAGCTACCATGAGCGTAAAAAAAGGTGGCCAATTTTGCATTGACCATAAAACCGTGAGAGATTTCAATCCAAATTCTA
AATCCACTATTAAAGGCAACTTTGTAATGGCTTCGAGCGTGGAAGGGGAAGGGAAAGTGAGCAAGCCTGTCGTACAAGTCGGGGTTCAACACGTGACGTGCATGGCGGAT
GCTGCTTTATTTGTAGCGCTCGCAGCGGCAATTGATCTGAGCATGGATGCTTGTAGACATTTTACACAAAAGCTAAGGAGGGAGCTTTGTCACGACGAACACGATTCGAG
TTTTCTCTAAAATCTTATCAATTTTTTCGAGATTTTTTTTTTTGTTTGTTTTTCTTTGCAATGATAACCCAGTTATCATCCAAGGAACAACCGGATTTGAGCAGAGAAAT
CCTAGATTTATGTAATCTTTAATTTTCTCCTCTTCATTGATTCTTTAATTCTCTTTTTTGTTTTATTAATTTCCACATTTTATTTATTCGTAGTTTTGGTTTCAGGTTCA
AAAGCTTTCTGTGTGTAAGTTTGAATTTGGCCTAACCAAAAGTGTACAGACAGTAAACGAGAATTTGTGTTTAATATATATTTTCCTACTTTGTTTTGTGATT
Protein sequence
MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSTTPCFCKIAIKNFPSQTALLPLSSVSGDSPPDSAASSAGFYLDPTSLRRLSGKPVIICLSVFAGRMGHTCGVNSGKLL
GRVRITVSIDGAENKPKVFQNGWVKLGKDEDKISARLHLLARSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQ
FCIDHKTVRDFNPNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHDEHDSSFL