; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G002150 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G002150
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationchr11:2202770..2207136
RNA-Seq ExpressionLsi11G002150
SyntenyLsi11G002150
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa]3.7e-24097.39Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAEN+P+VFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF+ NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALSAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_004141026.2 uncharacterized protein LOC101219082 [Cucumis sativus]1.0e-23796.44Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAE++P+VFQNGWVKLGK EDKISARLHLVVRSEPDPRFVFQF  EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRD ++NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALSAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo]1.1e-23997.15Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAEN+P+VFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVAD+GLATGIPIAEATMSVKKGGQFCID KTVRDF+ NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALSAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_022965765.1 uncharacterized protein LOC111465558 [Cucurbita maxima]6.8e-23494.54Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPS TPCFCKIAIKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAEN+P+ FQNGWV LGKDEDK SARLHL+VRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDFS NS+S IKGNFVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida]6.4e-24097.62Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDPSSLRR+SG PVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAEN+PRVFQNGWVKLGKD+DKISARLHLVVRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDFSLNSKS +KGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

TrEMBL top hitse value%identityAlignment
A0A0A0LIQ0 Uncharacterized protein4.9e-23896.44Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAE++P+VFQNGWVKLGK EDKISARLHLVVRSEPDPRFVFQF  EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRD ++NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALSAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A1S3C9N5 uncharacterized protein LOC1034982205.2e-24097.15Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAEN+P+VFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVAD+GLATGIPIAEATMSVKKGGQFCID KTVRDF+ NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALSAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A5A7V777 DUF1005 domain-containing protein1.8e-24097.39Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRITVSIDGAEN+P+VFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDF+ NSKST+KG+FVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVALSAAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A6J1DL04 uncharacterized protein LOC111021889 isoform X21.3e-23394.06Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSS++GDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA++RPRVF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVA+TGLATGI IAEATMSVKKGGQFCID +T+RDFS NS+S IKGNFVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

A0A6J1HPN4 uncharacterized protein LOC1114655583.3e-23494.54Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH
        MDPCPFVRLMV+SL+LNLPQATRPAGAAVHPS TPCFCKIAIKNFPSQTALLPLSSV+GDSPPDSAASSAGFHLDP+SLRRLSGKPV+MCLSVFAGRMG+
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGH

Query:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKLLGRVRI +SIDGAEN+P+ FQNGWV LGKDEDK SARLHL+VRSEPDPRFVFQF GEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC
        GYKFELVADTGLATGIPIAEATMSVKKGGQFCID KTVRDFS NS+S IKGNFVMASSVEGEGKVSKP+VQVGVQHVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDAC

Query:  RHFTQKLRRELCHDEHDSSFL
        RHFTQKLRRELCHDEHDSSFL
Subjt:  RHFTQKLRRELCHDEHDSSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)7.0e-12851.08Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVM---CLS--VFA
        MDPCPF+RL + +LAL +P A +   + VHPS++PCFCKI +KNFP QTA +P   +     P+    +A FHL  S ++RL+ + +     CL   ++ 
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVM---CLS--VFA

Query:  GRMGHTCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDK--ISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFS---
        GR G  CGV+SG+LL +V + + + G +++P VF NGW+ +GK   K   SA+ HL V++EPDPRFVFQFDGEPECSP V QIQGNIRQPVF+CKFS   
Subjt:  GRMGHTCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDK--ISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PG+ERKGW I V+DLSGSPVA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFS-----LNSKSTIKG--------------------------------
        AWRER G  DGLGY+FEL+ D     GI +AE+T+S  +GG+F I+  +    S     +N   + +G                                
Subjt:  AWRER-GPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFS-----LNSKSTIKG--------------------------------

Query:  -NFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRRELCHD
          FVM++SVEGEGK SKP V+V VQHV+CM DAA +VALSAAIDLSMDACR F Q++R+ELCH+
Subjt:  -NFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRRELCHD

AT1G50040.1 Protein of unknown function (DUF1005)1.7e-8944.87Show/hide
Query:  MDPCPFVRLMVDSLALNLPQ-------ATRPAGAAVHP-SATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASS-------AGFHLDPSSLRRLSGK
        MDPC FVR++V +LA+  P+       ++  +G +V   S+  C+CKI  K+FP Q   +P+  +  +S  +S   S       A F L  S +     K
Subjt:  MDPCPFVRLMVDSLALNLPQ-------ATRPAGAAVHP-SATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASS-------AGFHLDPSSLRRLSGK

Query:  PVVMCLSV-FAGRMGHTCG---VNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLG---KDEDKISA--RLHLVVRSEPDPRFVFQFDGEPECSPVVFQI
             LSV    R   +CG    +  KL+GR ++T+ +  AE++  +  NGWV LG   K+  K  +   LH+ VR EPD RFVFQFDGEPECSP VFQ+
Subjt:  PVVMCLSV-FAGRMGHTCG---VNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLG---KDEDKISA--RLHLVVRSEPDPRFVFQFDGEPECSPVVFQI

Query:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG
        QGN +Q VF+CKF   RNS  R+L    S + T GK         E+  +ERKGW I ++DLSGSPVA ASM+TPFVPSPG++RVSRS+PGAWLILRP G
Subjt:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG

Query:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVR-----------DFSLNSKSTIKGN-------------
        +   +WKPW RL+AWRE G  D LGY+FEL  D G+A  +  A +++S K GG F ID  T              F L+S S+I+ +             
Subjt:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVR-----------DFSLNSKSTIKGN-------------

Query:  ---------FVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRREL
                 FVM++ V+G  K SKP V+VGV+HVTC  DAA  VAL+AA+DLSMDACR F+QKLR EL
Subjt:  ---------FVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005)7.8e-9543.44Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATR-------PAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAG--------FHLDPSSLRRLSGK
        MDPC FVR++V +LA+  P ++        P+ + ++P+A  C+CKI  KNFP +   +P+     +S  ++  SS+G        F L  + +     K
Subjt:  MDPCPFVRLMVDSLALNLPQATR-------PAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAG--------FHLDPSSLRRLSGK

Query:  PVVMCLSVFA----------GRMGHTCGVNSG--KLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISA----RLHLVVRSEPDPRFVFQFDGEPEC
        P    LSV A          G  G +CG+ +   KLLGR  +++ +  AE +  +  NGWV L   + K        LH+ VR EPDPRFVFQFDGEPEC
Subjt:  PVVMCLSVFA----------GRMGHTCGVNSG--KLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISA----RLHLVVRSEPDPRFVFQFDGEPEC

Query:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN
        SP VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P +ERKGW I V+DLSGSPVA ASM+TPFVPSPG++RV+RS+
Subjt:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN

Query:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCID-------------------------------
        PGAWLILRP G    +WKPWGRLEAWRE G  D LGY+FEL  D G+AT +  A +++S+K GG F ID                               
Subjt:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCID-------------------------------

Query:  ------HKTVRDFSL-----NSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRREL
                +  DF        S +     FVM+++VEG GK SKP V+VGV HVTC  DAA  VAL+AA+DLS+DACR F+ KLR+EL
Subjt:  ------HKTVRDFSL-----NSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005)1.8e-16367.61Show/hide
Query:  MDPCPFVRLMVDSLALNLPQ--ATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVA-GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGR
        MDPCPFVRL +DSLAL LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS +   SPP+S+ S+ GFHLD  ++RR+SGK + + +SV+AGR
Subjt:  MDPCPFVRLMVDSLALNLPQ--ATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVA-GDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGR

Query:  MGHTCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         GHTCGV SGKLLG+V + V +  A +R   F NGW KLG D DK SARLHL+V +EPDPRFVFQF GEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGHTCGVNSGKLLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  RERKGWMI ++DLSGSPVAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDL
         IDGLGYKFELV D   +TGIPIAE TMS K+GG+F ID +        + S+    FVM SSVEGEGKVSKPVV VG QHVTCMADAALFVALSAA+DL
Subjt:  PIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQFCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDL

Query:  SMDACRHFTQKLRRELCHDEHDS
        S+DAC+ F++KLR+ELCHD+  S
Subjt:  SMDACRHFTQKLRRELCHDEHDS

AT5G17640.1 Protein of unknown function (DUF1005)1.7e-8641.14Show/hide
Query:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS---ATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRL------SGKPVVMCL
        MDP  F+RL V SLAL +P+    + +  +     ++ C C+I ++ FP QT  +PL   + D+ PD  + S  F+L+ S LR L            + +
Subjt:  MDPCPFVRLMVDSLALNLPQATRPAGAAVHPS---ATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRL------SGKPVVMCL

Query:  SVFAGRMGHTCGVNSGK-LLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFS
        SVF G+    CGV   +  +G  ++ V  +  E +P +  NGW+ +GK +   +A LHL V+ +PDPR+VFQF+     SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGHTCGVNSGK-LLGRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   RERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVADTGLATG-IPIAEATMSVKKGGQFCID---------------HKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVG
        AWRERG  D +  +F L+++ GL  G + ++E  +S +KGG+F ID                ++  DFS   +    G FVM+S V+GEGK SKPVVQ+ 
Subjt:  AWRERGPIDGLGYKFELVADTGLATG-IPIAEATMSVKKGGQFCID---------------HKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVG

Query:  VQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRRELCH
        ++HVTC+ DAA+F+AL+AA+DLS+ AC+ F +  RR   H
Subjt:  VQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRRELCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCATGTCCGTTCGTCCGGCTAATGGTAGACTCACTCGCTCTCAACCTCCCTCAGGCCACCCGACCCGCTGGCGCCGCCGTTCACCCATCGGCCACGCCGTGCTT
CTGTAAGATCGCGATCAAGAATTTCCCTTCGCAGACCGCGCTTCTTCCTCTTTCCTCCGTCGCCGGCGATTCGCCGCCGGACTCCGCCGCGTCATCCGCCGGATTCCATC
TCGACCCGTCATCTCTTCGACGGCTTTCCGGTAAGCCGGTCGTGATGTGTCTGTCGGTTTTCGCCGGCCGGATGGGCCACACGTGTGGTGTTAATTCCGGCAAGTTGCTG
GGCCGGGTTCGGATCACCGTCTCAATCGACGGCGCTGAGAATAGACCGAGAGTTTTTCAGAATGGGTGGGTGAAATTGGGGAAAGATGAGGATAAAATCTCGGCTCGGCT
TCACTTGGTTGTCCGGTCTGAACCGGACCCTCGATTTGTGTTCCAGTTCGATGGCGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGG
TTTTCAGCTGCAAGTTCAGTGCCGATCGGAACTCGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACTTTTTCAGGTGAG
AGAGAGAAGCCAGGGAGAGAAAGAAAGGGTTGGATGATCATGGTTTACGACCTCTCCGGCTCCCCCGTTGCGGCCGCCTCCATGATCACACCGTTCGTCCCTTCCCCAGG
CACGGACCGTGTCTCCCGCTCCAACCCGGGCGCATGGCTCATCCTCCGCCCCCACGGCTTCTCCGTAAGTAGTTGGAAGCCATGGGGTCGCCTCGAGGCATGGCGCGAGC
GGGGACCAATAGATGGCCTCGGTTACAAGTTCGAGCTCGTCGCGGACACTGGACTAGCCACCGGCATTCCTATCGCCGAAGCCACTATGAGTGTAAAAAAGGGTGGTCAA
TTTTGCATTGACCATAAAACCGTGAGAGATTTCAGTCTAAACTCGAAATCCACTATTAAAGGTAACTTTGTAATGGCTTCGAGCGTGGAGGGAGAAGGGAAGGTGAGCAA
GCCCGTCGTACAAGTCGGAGTTCAACACGTGACGTGCATGGCGGATGCTGCTTTATTTGTGGCACTTTCAGCAGCAATTGATCTAAGCATGGATGCTTGTAGACATTTTA
CACAAAAGCTAAGGAGAGAGCTTTGTCATGATGAACACGATTCTAGTTTTCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATAATATTTGGGTGTAAAAGTCAAGAGTAGCTCATTCTCCATCTGTCACGAGCCAGAGAAACAGAATAAAAACCAAAACCTTCAATCCAGCCATGGCCGACAATTCACAC
TCCAAATCCCCAACTTAGATGTTCGCCATTTTTTCTTCTCCTTCTTCCTCAAGCCACATTTCCGCCGCCGGTGACTTTTCCGATGGATCCATGTCCGTTCGTCCGGCTAA
TGGTAGACTCACTCGCTCTCAACCTCCCTCAGGCCACCCGACCCGCTGGCGCCGCCGTTCACCCATCGGCCACGCCGTGCTTCTGTAAGATCGCGATCAAGAATTTCCCT
TCGCAGACCGCGCTTCTTCCTCTTTCCTCCGTCGCCGGCGATTCGCCGCCGGACTCCGCCGCGTCATCCGCCGGATTCCATCTCGACCCGTCATCTCTTCGACGGCTTTC
CGGTAAGCCGGTCGTGATGTGTCTGTCGGTTTTCGCCGGCCGGATGGGCCACACGTGTGGTGTTAATTCCGGCAAGTTGCTGGGCCGGGTTCGGATCACCGTCTCAATCG
ACGGCGCTGAGAATAGACCGAGAGTTTTTCAGAATGGGTGGGTGAAATTGGGGAAAGATGAGGATAAAATCTCGGCTCGGCTTCACTTGGTTGTCCGGTCTGAACCGGAC
CCTCGATTTGTGTTCCAGTTCGATGGCGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGACAGCCGGTTTTCAGCTGCAAGTTCAGTGCCGATCG
GAACTCGAGAACCCGGTCACTGCCATCAGATTTCAGCTTCAACAGCACCAAAGGGAAATGGATGAGAACTTTTTCAGGTGAGAGAGAGAAGCCAGGGAGAGAAAGAAAGG
GTTGGATGATCATGGTTTACGACCTCTCCGGCTCCCCCGTTGCGGCCGCCTCCATGATCACACCGTTCGTCCCTTCCCCAGGCACGGACCGTGTCTCCCGCTCCAACCCG
GGCGCATGGCTCATCCTCCGCCCCCACGGCTTCTCCGTAAGTAGTTGGAAGCCATGGGGTCGCCTCGAGGCATGGCGCGAGCGGGGACCAATAGATGGCCTCGGTTACAA
GTTCGAGCTCGTCGCGGACACTGGACTAGCCACCGGCATTCCTATCGCCGAAGCCACTATGAGTGTAAAAAAGGGTGGTCAATTTTGCATTGACCATAAAACCGTGAGAG
ATTTCAGTCTAAACTCGAAATCCACTATTAAAGGTAACTTTGTAATGGCTTCGAGCGTGGAGGGAGAAGGGAAGGTGAGCAAGCCCGTCGTACAAGTCGGAGTTCAACAC
GTGACGTGCATGGCGGATGCTGCTTTATTTGTGGCACTTTCAGCAGCAATTGATCTAAGCATGGATGCTTGTAGACATTTTACACAAAAGCTAAGGAGAGAGCTTTGTCA
TGATGAACACGATTCTAGTTTTCTCTAAAATCTCATCAATTTTTTCAACAAAGATTTTCGATCTATTTTTCTTTGCAATGATAACCCAGTTATCATCTAAATTACAACCG
GATTTGAGCAGAGAAACGGTATATTTATGTAATATTCAATTCCCTTTTTGTTTTTTGTTTTTTTTTTCTTTTCAAGATCCTTAATTTTCTCTATTATTCTTCATTGATTC
TTTAATTCCCTTTTGTTTTATTATTTTCACATAATTTAGATTTTTTGAATTGTAGTTTTTGTTTGGGGCTCAAAAGCTTTCTCTTTGTAAGTTTTAATTTGACCTAATTA
TCAAAAGTGTATAGACAGACAGTAAACGGGAATTTGTAATTTTAATTTTTTTTTCCGACACGAATTATGTAATTTGATTTCATTGCTCTCTTTATTATGAATTCTTTTTA
AAGTAAAAGTGCATTTATTTTATTTAATTATGTAATTTGATTTGGGTTTCACTTTGGAATTTTTTAATTTAATGAAGTACTTTGATTTATTTCAAGTTTATTCTTTTTTC
CATTATGAACTAATTCC
Protein sequenceShow/hide protein sequence
MDPCPFVRLMVDSLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSVAGDSPPDSAASSAGFHLDPSSLRRLSGKPVVMCLSVFAGRMGHTCGVNSGKLL
GRVRITVSIDGAENRPRVFQNGWVKLGKDEDKISARLHLVVRSEPDPRFVFQFDGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVADTGLATGIPIAEATMSVKKGGQ
FCIDHKTVRDFSLNSKSTIKGNFVMASSVEGEGKVSKPVVQVGVQHVTCMADAALFVALSAAIDLSMDACRHFTQKLRRELCHDEHDSSFL