; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g2530 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g2530
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationMC08:34055682..34058721
RNA-Seq ExpressionMC08g2530
SyntenyMC08g2530
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa]1.06e-28693.19Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +KG+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo]4.30e-28692.94Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +KG+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

XP_022154692.1 uncharacterized protein LOC111021889 isoform X1 [Momordica charantia]3.49e-29999.51Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK-GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK-GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRREL
        CRHFTQKLRREL
Subjt:  CRHFTQKLRREL

XP_022154693.1 uncharacterized protein LOC111021889 isoform X2 [Momordica charantia]4.98e-30199.76Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida]7.44e-28793.67Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRR+SG P+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++PRVF NGWVKLGK++DKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDFS NS+S +KGNFVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

TrEMBL top hitse value%identityAlignment
A0A1S3C9N5 uncharacterized protein LOC1034982202.08e-28692.94Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA++GLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +KG+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

A0A5A7V777 DUF1005 domain-containing protein5.12e-28793.19Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +KG+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

A0A6J1DL04 uncharacterized protein LOC111021889 isoform X22.41e-30199.76Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

A0A6J1DMB6 uncharacterized protein LOC111021889 isoform X11.69e-29999.51Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK-GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK-GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRREL
        CRHFTQKLRREL
Subjt:  CRHFTQKLRREL

A0A6J1FDM1 uncharacterized protein LOC1114444183.45e-28592.46Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESL+LNLPQATRPAGAAVHPS TPCFCKIAIKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDP+SLRRLSGKP++MCLSVFAGRMG+
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRI +++DGA+++P+ F NGWV LGK+E+K SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDFSPNSRSNIKGNFVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVALAAAIDLSMDAC
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDAC

Query:  RHFTQKLRREL
        RHFTQKLRREL
Subjt:  RHFTQKLRREL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)1.9e-12551.19Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVM---CLS--VFA
        MDPCPF+RL + +LAL +P A +   + VHPS++PCFCKI +KNFP QTA +P   +     P+    ++ FHL  S ++RL+ + +     CL   ++ 
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVM---CLS--VFA

Query:  GRMGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDK--ISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---
        GR G  CGV+SG+ L +V + + L G  S+P VFHNGW+ +GK   K   SA+ HL V++EPDPRFVFQF GEPECSP V QIQGNIRQPVF+CKFS   
Subjt:  GRMGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDK--ISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PG+ERKGW I V+DLSGSPVA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID----------------SRTIRDFSPNSRS----------------------NIK
        AWRER G  DGLGY+FEL+ +     GI +AE+T+S  +GG+F I+                SR+ R  S  S                        N+ 
Subjt:  AWRER-GPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID----------------SRTIRDFSPNSRS----------------------NIK

Query:  GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
          FVM++SVEGEGK SKP VEV V+HV+CM DAA +VAL+AAIDLSMDACR F Q++R+EL
Subjt:  GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT1G50040.1 Protein of unknown function (DUF1005)8.2e-8944.44Show/hide
Query:  MDPCPFVRLMVESLALNLPQ-------ATRPAGAAVHP-SATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG-------FHLDPSSLRRLSGK
        MDPC FVR++V +LA+  P+       ++  +G +V   S+  C+CKI  K+FP Q   +P+  +  +S  +S   S         F L  S +     K
Subjt:  MDPCPFVRLMVESLALNLPQ-------ATRPAGAAVHP-SATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG-------FHLDPSSLRRLSGK

Query:  PLVMCLSV-FAGRMGHTCG---VNSGKFLGRVRITVALDGADSRPRVFHNGWVKLG-----KEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQI
             LSV    R   +CG    +  K +GR ++T+ L  A+S+  + HNGWV LG      ++      LH+ VR EPD RFVFQF GEPECSP VFQ+
Subjt:  PLVMCLSV-FAGRMGHTCG---VNSGKFLGRVRITVALDGADSRPRVFHNGWVKLG-----KEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQI

Query:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG
        QGN +Q VF+CKF   RNS  R+L    S + T GK         E+  +ERKGW I ++DLSGSPVA ASM+TPFVPSPG++RVSRS+PGAWLILRP G
Subjt:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG

Query:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIR-----------DFSPNSRSNIKGN-------------
        +   +WKPW RL+AWRE G  D LGY+FEL  + G+A  +  A +++S K GG F ID  T              F  +S S+I+ +             
Subjt:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIR-----------DFSPNSRSNIKGN-------------

Query:  ---------FVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                 FVM++ V+G  K SKP VEVGVKHVTC  DAA  VALAAA+DLSMDACR F+QKLR EL
Subjt:  ---------FVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005)6.9e-9643.97Show/hide
Query:  MDPCPFVRLMVESLALNLPQATR-------PAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG--------FHLDPSSLRRLSGK
        MDPC FVR++V +LA+  P ++        P+ + ++P+A  C+CKI  KNFP +   +P+     +S  ++  SSSG        F L  + +     K
Subjt:  MDPCPFVRLMVESLALNLPQATR-------PAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG--------FHLDPSSLRRLSGK

Query:  PLVMCLSVFA----------GRMGHTCGVNSG--KFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISA----RLHLVVRSEPDPRFVFQFGGEPEC
        P    LSV A          G  G +CG+ +   K LGR  +++ L  A+++  + HNGWV L  ++ K        LH+ VR EPDPRFVFQF GEPEC
Subjt:  PLVMCLSVFA----------GRMGHTCGVNSG--KFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISA----RLHLVVRSEPDPRFVFQFGGEPEC

Query:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN
        SP VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P +ERKGW I V+DLSGSPVA ASM+TPFVPSPG++RV+RS+
Subjt:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN

Query:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID-------------------------------
        PGAWLILRP G    +WKPWGRLEAWRE G  D LGY+FEL  + G+AT +  A +++S+K GG F ID                               
Subjt:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID-------------------------------

Query:  ------SRTIRDFS------PNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
              S +  DF       P++ +  +G FVM+++VEG GK SKP VEVGV HVTC  DAA  VALAAA+DLS+DACR F+ KLR+EL
Subjt:  ------SRTIRDFS------PNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005)1.8e-16068.35Show/hide
Query:  MDPCPFVRLMVESLALNLPQ--ATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSIS-GDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGR
        MDPCPFVRL ++SLAL LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS S   SPP+S+ S+ GFHLD  ++RR+SGK + + +SV+AGR
Subjt:  MDPCPFVRLMVESLALNLPQ--ATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSIS-GDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGR

Query:  MGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         GHTCGV SGK LG+V + V L  A SR   FHNGW KLG + DK SARLHL+V +EPDPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  RERKGWMI ++DLSGSPVAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTI-RDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAID
         IDGLGYKFELV +   +TGIPIAE TMS K+GG+F ID R   +  SP   S +KG FVM SSVEGEGKVSKP+V VG +HVTCMADAALFVAL+AA+D
Subjt:  PIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTI-RDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAID

Query:  LSMDACRHFTQKLRREL
        LS+DAC+ F++KLR+EL
Subjt:  LSMDACRHFTQKLRREL

AT5G17640.1 Protein of unknown function (DUF1005)6.5e-8641.28Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPS---ATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRL------SGKPLVMCL
        MDP  F+RL V SLAL +P+    + +  +     ++ C C+I ++ FP QT  +PL   S D+ PD  + S+ F+L+ S LR L            + +
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPS---ATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRL------SGKPLVMCL

Query:  SVFAGRMGHTCGVNSGK-FLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS
        SVF G+    CGV   +  +G  ++ V  +  + +P +  NGW+ +GK +   +A LHL V+ +PDPR+VFQF      SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGHTCGVNSGK-FLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   RERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVANTGLATG-IPIAEATMSVKKGGQFCIDS---------------RTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVG
        AWRERG  D +  +F L++N GL  G + ++E  +S +KGG+F ID+               ++  DFS   +    G FVM+S V+GEGK SKP+V++ 
Subjt:  AWRERGPIDGLGYKFELVANTGLATG-IPIAEATMSVKKGGQFCIDS---------------RTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVG

Query:  VKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRR
        ++HVTC+ DAA+F+ALAAA+DLS+ AC+ F +  RR
Subjt:  VKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCGTGCCCCTTCGTCCGGCTGATGGTCGAATCGCTCGCTCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGGGCCGCCGTCCACCCCTCCGCCACGCCCTGCTT
CTGCAAGATCGCGATCAAGAATTTCCCCTCCCAAACGGCGCTTCTTCCGCTTTCCTCAATTTCCGGCGACTCACCTCCGGACTCCGCCGCGTCGTCCTCCGGCTTCCACC
TGGACCCGTCGTCTCTCCGTCGCCTCTCTGGCAAGCCTCTCGTGATGTGTCTCTCGGTTTTCGCGGGGCGGATGGGGCACACGTGCGGGGTGAATTCTGGCAAGTTCCTC
GGCCGGGTTCGGATCACGGTGGCTCTCGACGGCGCTGACAGTAGACCCAGAGTATTCCACAACGGGTGGGTTAAATTGGGGAAAGAAGAGGATAAAATCTCGGCCCGCCT
CCACTTGGTTGTCCGGTCCGAACCCGACCCGCGCTTCGTTTTCCAGTTCGGTGGAGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGGCAACCGG
TTTTCAGCTGCAAGTTCAGTGCCGATCGGAATTCGAGGACCCGGTCACTGCCGTCGGATTTCAGCTTCAACAGTACTAAAGGAAAATGGATGAGAACTTTTTCAGGGGAG
AGAGAGAAGCCGGGGAGGGAGAGAAAGGGGTGGATGATCATGGTCTATGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCATGATAACGCCGTTCGTCCCTTCCCCGGG
CACCGACCGTGTCTCTCGCTCCAACCCTGGTGCTTGGCTCATCCTCCGCCCCCATGGCTTCTCCGTCAGCAGTTGGAAGCCATGGGGACGCCTCGAGGCTTGGCGCGAGC
GGGGACCGATCGATGGCCTAGGCTACAAGTTCGAGCTCGTCGCCAACACTGGACTAGCCACTGGCATCCCCATCGCAGAAGCTACCATGAGCGTAAAAAAGGGTGGCCAA
TTTTGCATCGACAGTAGAACTATAAGAGATTTCAGTCCAAACTCTAGATCCAACATTAAAGGGAACTTTGTAATGGCTTCGAGCGTGGAGGGAGAAGGGAAGGTGAGCAA
GCCAATGGTAGAAGTGGGAGTTAAGCACGTGACATGCATGGCGGATGCTGCTTTATTTGTAGCTCTAGCAGCCGCCATTGATCTTAGCATGGACGCTTGCAGACACTTCA
CACAGAAGCTGAGAAGAGAACTC
mRNA sequenceShow/hide mRNA sequence
GTGTGATGTTAATAAATGAGAAATTATTGTTGGTTATAAGTATTTAGTAAACAATATTTCGGTACATTGAAAAGTCAAGAGGAGCTCATTCTCTATCTGTCATGAGAGAG
AGAAACAGAATAAAGAGCAAAACCTTCAATCCAGCCATGGCCGACAATCCTCACTAGAAAAAAACGCCCAACTTAAATGTTCTTCGCCAATCTTTCTTCTTTCTTACTCC
ACATTCCCGCCGCCGCCAAATTTTCCGATGGATCCGTGCCCCTTCGTCCGGCTGATGGTCGAATCGCTCGCTCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGGGCCGC
CGTCCACCCCTCCGCCACGCCCTGCTTCTGCAAGATCGCGATCAAGAATTTCCCCTCCCAAACGGCGCTTCTTCCGCTTTCCTCAATTTCCGGCGACTCACCTCCGGACT
CCGCCGCGTCGTCCTCCGGCTTCCACCTGGACCCGTCGTCTCTCCGTCGCCTCTCTGGCAAGCCTCTCGTGATGTGTCTCTCGGTTTTCGCGGGGCGGATGGGGCACACG
TGCGGGGTGAATTCTGGCAAGTTCCTCGGCCGGGTTCGGATCACGGTGGCTCTCGACGGCGCTGACAGTAGACCCAGAGTATTCCACAACGGGTGGGTTAAATTGGGGAA
AGAAGAGGATAAAATCTCGGCCCGCCTCCACTTGGTTGTCCGGTCCGAACCCGACCCGCGCTTCGTTTTCCAGTTCGGTGGAGAACCGGAATGTAGCCCGGTGGTTTTCC
AGATCCAAGGCAATATCCGGCAACCGGTTTTCAGCTGCAAGTTCAGTGCCGATCGGAATTCGAGGACCCGGTCACTGCCGTCGGATTTCAGCTTCAACAGTACTAAAGGA
AAATGGATGAGAACTTTTTCAGGGGAGAGAGAGAAGCCGGGGAGGGAGAGAAAGGGGTGGATGATCATGGTCTATGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCAT
GATAACGCCGTTCGTCCCTTCCCCGGGCACCGACCGTGTCTCTCGCTCCAACCCTGGTGCTTGGCTCATCCTCCGCCCCCATGGCTTCTCCGTCAGCAGTTGGAAGCCAT
GGGGACGCCTCGAGGCTTGGCGCGAGCGGGGACCGATCGATGGCCTAGGCTACAAGTTCGAGCTCGTCGCCAACACTGGACTAGCCACTGGCATCCCCATCGCAGAAGCT
ACCATGAGCGTAAAAAAGGGTGGCCAATTTTGCATCGACAGTAGAACTATAAGAGATTTCAGTCCAAACTCTAGATCCAACATTAAAGGGAACTTTGTAATGGCTTCGAG
CGTGGAGGGAGAAGGGAAGGTGAGCAAGCCAATGGTAGAAGTGGGAGTTAAGCACGTGACATGCATGGCGGATGCTGCTTTATTTGTAGCTCTAGCAGCCGCCATTGATC
TTAGCATGGACGCTTGCAGACACTTCACACAGAAGCTGAGAAGAGAACTC
Protein sequenceShow/hide protein sequence
MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGHTCGVNSGKFL
GRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQ
FCIDSRTIRDFSPNSRSNIKGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL