; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023725 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023725
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1005)
Genome locationscaffold570:545692..548473
RNA-Seq ExpressionMS023725
SyntenyMS023725
Gene Ontology termsNA
InterPro domainsIPR010410 - Protein of unknown function DUF1005


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa]4.6e-22793Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +K G+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo]1.4e-22692.75Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVA++GLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +K G+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

XP_022154692.1 uncharacterized protein LOC111021889 isoform X1 [Momordica charantia]8.2e-24099.76Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

XP_022154693.1 uncharacterized protein LOC111021889 isoform X2 [Momordica charantia]5.8e-23899.52Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida]3.6e-22793.48Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRR+SG P+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++PRVF NGWVKLGK++DKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDFS NS+S +K GNFVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

TrEMBL top hitse value%identityAlignment
A0A1S3C9N5 uncharacterized protein LOC1034982206.5e-22792.75Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVA++GLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +K G+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

A0A5A7V777 DUF1005 domain-containing protein2.2e-22793Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMV+SLALNLPQATRPAGAAVHPSATPCFCKI+IKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDPSSLRRLSGKP+VMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGK LGRVRITV++DGA+++P+VF NGWVKLGK+EDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDF+ NS+S +K G+FVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVAL+AAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

A0A6J1DL04 uncharacterized protein LOC111021889 isoform X22.8e-23899.52Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIK GNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

A0A6J1DMB6 uncharacterized protein LOC111021889 isoform X13.9e-24099.76Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVANTGLATGI IAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

A0A6J1FDM1 uncharacterized protein LOC1114444185.5e-22692.27Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH
        MDPCPFVRLMVESL+LNLPQATRPAGAAVHPS TPCFCKIAIKNFPSQTALLPLSS+SGDSPPDSAASS+GFHLDP+SLRRLSGKP++MCLSVFAGRMG+
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGH

Query:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
        TCGVNSGKFLGRVRI +++DGA+++P+ F NGWV LGK+E+K SARLHL+VRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL
Subjt:  TCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSL

Query:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
        PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGS VAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL
Subjt:  PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGL

Query:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA
        GYKFELVA+TGLATGIPIAEATMSVKKGGQFCID +T+RDFSPNSRSNIK GNFVMASSVEGEGKVSKP+V+VGV+HVTCMADAALFVALAAAIDLSMDA
Subjt:  GYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCH
        CRHFTQKLRRELCH
Subjt:  CRHFTQKLRRELCH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10020.1 Protein of unknown function (DUF1005)2.6e-12751.4Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVM---CLS--VFA
        MDPCPF+RL + +LAL +P A +   + VHPS++PCFCKI +KNFP QTA +P   +     P+    ++ FHL  S ++RL+ + +     CL   ++ 
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVM---CLS--VFA

Query:  GRMGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDK--ISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---
        GR G  CGV+SG+ L +V + + L G  S+P VFHNGW+ +GK   K   SA+ HL V++EPDPRFVFQF GEPECSP V QIQGNIRQPVF+CKFS   
Subjt:  GRMGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDK--ISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS---

Query:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
          DR  R+RSLP++    S    W+ +F  ERE+PG+ERKGW I V+DLSGSPVA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRLE
Subjt:  -ADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRER-GPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID----------------SRTIRDFSPNSRSNIKVGN-------------------
        AWRER G  DGLGY+FEL+ +     GI +AE+T+S  +GG+F I+                SR+ R  S  S       N                   
Subjt:  AWRER-GPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID----------------SRTIRDFSPNSRSNIKVGN-------------------

Query:  --FVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
          FVM++SVEGEGK SKP VEV V+HV+CM DAA +VAL+AAIDLSMDACR F Q++R+ELCH
Subjt:  --FVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH

AT1G50040.1 Protein of unknown function (DUF1005)1.4e-8844.44Show/hide
Query:  MDPCPFVRLMVESLALNLPQ-------ATRPAGAAVHP-SATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG-------FHLDPSSLRRLSGK
        MDPC FVR++V +LA+  P+       ++  +G +V   S+  C+CKI  K+FP Q   +P+  +  +S  +S   S         F L  S +     K
Subjt:  MDPCPFVRLMVESLALNLPQ-------ATRPAGAAVHP-SATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG-------FHLDPSSLRRLSGK

Query:  PLVMCLSV-FAGRMGHTCG---VNSGKFLGRVRITVALDGADSRPRVFHNGWVKLG-----KEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQI
             LSV    R   +CG    +  K +GR ++T+ L  A+S+  + HNGWV LG      ++      LH+ VR EPD RFVFQF GEPECSP VFQ+
Subjt:  PLVMCLSV-FAGRMGHTCG---VNSGKFLGRVRITVALDGADSRPRVFHNGWVKLG-----KEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQI

Query:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG
        QGN +Q VF+CKF   RNS  R+L    S + T GK         E+  +ERKGW I ++DLSGSPVA ASM+TPFVPSPG++RVSRS+PGAWLILRP G
Subjt:  QGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG

Query:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIR-----------DFSPNSRSNIKVG-------------
        +   +WKPW RL+AWRE G  D LGY+FEL  + G+A  +  A +++S K GG F ID  T              F  +S S+I+               
Subjt:  FSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTIR-----------DFSPNSRSNIKVG-------------

Query:  --------NFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                 FVM++ V+G  K SKP VEVGVKHVTC  DAA  VALAAA+DLSMDACR F+QKLR EL
Subjt:  --------NFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005)6.9e-9644.06Show/hide
Query:  MDPCPFVRLMVESLALNLPQATR-------PAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG--------FHLDPSSLRRLSGK
        MDPC FVR++V +LA+  P ++        P+ + ++P+A  C+CKI  KNFP +   +P+     +S  ++  SSSG        F L  + +     K
Subjt:  MDPCPFVRLMVESLALNLPQATR-------PAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSG--------FHLDPSSLRRLSGK

Query:  PLVMCLSVFA----------GRMGHTCGVNSG--KFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISA----RLHLVVRSEPDPRFVFQFGGEPEC
        P    LSV A          G  G +CG+ +   K LGR  +++ L  A+++  + HNGWV L  ++ K        LH+ VR EPDPRFVFQF GEPEC
Subjt:  PLVMCLSVFA----------GRMGHTCGVNSG--KFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISA----RLHLVVRSEPDPRFVFQFGGEPEC

Query:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN
        SP VFQ+QGN +Q VF+CKF S + NS  R+L    S  S  S+    + + + E+E+P +ERKGW I V+DLSGSPVA ASM+TPFVPSPG++RV+RS+
Subjt:  SPVVFQIQGNIRQPVFSCKF-SADRNSRTRSL---PSDFSFNSTKGKWMRTFSGEREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSN

Query:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID-----------------------SRTIRDFS
        PGAWLILRP G    +WKPWGRLEAWRE G  D LGY+FEL  + G+AT +  A +++S+K GG F ID                       S +     
Subjt:  PGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCID-----------------------SRTIRDFS

Query:  PNSRSNIKVGN------------------FVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
        P SR     G+                  FVM+++VEG GK SKP VEVGV HVTC  DAA  VALAAA+DLS+DACR F+ KLR+EL
Subjt:  PNSRSNIKVGN------------------FVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005)1.2e-16168.1Show/hide
Query:  MDPCPFVRLMVESLALNLPQ--ATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSIS-GDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGR
        MDPCPFVRL ++SLAL LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS S   SPP+S+ S+ GFHLD  ++RR+SGK + + +SV+AGR
Subjt:  MDPCPFVRLMVESLALNLPQ--ATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSIS-GDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGR

Query:  MGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         GHTCGV SGK LG+V + V L  A SR   FHNGW KLG + DK SARLHL+V +EPDPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGHTCGVNSGKFLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG
        RSLPS F++ S++G   RT SG++  +K  RERKGWMI ++DLSGSPVAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWRERG
Subjt:  RSLPSDFSFNSTKGKWMRTFSGER--EKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERG

Query:  PIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTI-RDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAI
         IDGLGYKFELV +   +TGIPIAE TMS K+GG+F ID R   +  SP   S +K   FVM SSVEGEGKVSKP+V VG +HVTCMADAALFVAL+AA+
Subjt:  PIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQFCIDSRTI-RDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAI

Query:  DLSMDACRHFTQKLRRELCH
        DLS+DAC+ F++KLR+ELCH
Subjt:  DLSMDACRHFTQKLRRELCH

AT5G17640.1 Protein of unknown function (DUF1005)6.5e-8641.04Show/hide
Query:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPS---ATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRL------SGKPLVMCL
        MDP  F+RL V SLAL +P+    + +  +     ++ C C+I ++ FP QT  +PL   S D+ PD  + S+ F+L+ S LR L            + +
Subjt:  MDPCPFVRLMVESLALNLPQATRPAGAAVHPS---ATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRL------SGKPLVMCL

Query:  SVFAGRMGHTCGVNSGK-FLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS
        SVF G+    CGV   +  +G  ++ V  +  + +P +  NGW+ +GK +   +A LHL V+ +PDPR+VFQF      SP + Q++G+++QP+FSCKFS
Subjt:  SVFAGRMGHTCGVNSGK-FLGRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS

Query:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE
         DR S+   L          G W  +  G E E   RERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+LE
Subjt:  ADRNSRTRSLPSDFSFNSTKGKWMRTFSG-EREKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLE

Query:  AWRERGPIDGLGYKFELVANTGLATG-IPIAEATMSVKKGGQFCIDS---------------RTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEV
        AWRERG  D +  +F L++N GL  G + ++E  +S +KGG+F ID+               ++  DFS   +  +  G FVM+S V+GEGK SKP+V++
Subjt:  AWRERGPIDGLGYKFELVANTGLATG-IPIAEATMSVKKGGQFCIDS---------------RTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEV

Query:  GVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
         ++HVTC+ DAA+F+ALAAA+DLS+ AC+ F +  RR   H
Subjt:  GVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCGTGCCCCTTCGTCCGGCTGATGGTCGAATCGCTCGCTCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGGGCCGCCGTCCACCCCTCCGCCACACCCTGCTT
CTGCAAGATCGCGATCAAGAATTTCCCCTCCCAAACGGCGCTTCTTCCGCTTTCCTCAATTTCCGGCGACTCGCCTCCGGACTCCGCCGCGTCGTCCTCCGGCTTCCACC
TGGACCCGTCGTCTCTCCGTCGCCTCTCTGGCAAGCCTCTCGTGATGTGTCTCTCGGTTTTCGCGGGGCGGATGGGGCACACGTGCGGGGTGAATTCTGGCAAGTTCCTC
GGCCGGGTTCGGATCACGGTGGCTCTCGACGGCGCTGACAGTAGACCCAGAGTATTCCACAACGGGTGGGTTAAATTGGGGAAAGAAGAGGATAAAATCTCGGCCCGCCT
CCACTTGGTTGTCCGGTCCGAACCCGACCCGCGCTTCGTTTTCCAGTTCGGTGGAGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGGCAACCGG
TTTTCAGCTGCAAGTTCAGTGCCGATCGGAATTCGAGGACCCGGTCACTGCCGTCGGATTTCAGCTTCAACAGTACTAAAGGAAAATGGATGAGAACTTTTTCAGGGGAG
AGAGAGAAGCCGGGGAGGGAGAGAAAGGGGTGGATGATCATGGTCTATGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCATGATAACGCCGTTCGTCCCTTCCCCGGG
CACCGACCGTGTCTCTCGCTCCAACCCTGGTGCTTGGCTCATCCTCCGCCCCCATGGCTTCTCCGTCAGCAGTTGGAAGCCATGGGGACGCCTCGAGGCTTGGCGCGAGC
GGGGACCGATCGATGGCCTAGGCTACAAGTTCGAGCTCGTCGCCAACACTGGACTAGCCACTGGCATCCCCATCGCAGAAGCTACCATGAGCGTAAAAAAGGGTGGCCAA
TTTTGCATCGACAGTAGAACTATAAGAGATTTCAGTCCAAACTCTAGATCCAACATTAAAGTAGGGAACTTTGTAATGGCTTCGAGCGTGGAGGGAGAAGGGAAGGTGAG
CAAGCCAATGGTAGAAGTGGGAGTTAAGCACGTGACATGCATGGCAGATGCTGCTTTATTTGTAGCTCTAGCAGCCGCCATTGATCTTAGCATGGACGCTTGCAGACACT
TCACACAGAAGCTGAGAAGAGAACTCTGTCAC
mRNA sequenceShow/hide mRNA sequence
ATGGATCCGTGCCCCTTCGTCCGGCTGATGGTCGAATCGCTCGCTCTCAACCTCCCTCAGGCCACCCGCCCCGCCGGGGCCGCCGTCCACCCCTCCGCCACACCCTGCTT
CTGCAAGATCGCGATCAAGAATTTCCCCTCCCAAACGGCGCTTCTTCCGCTTTCCTCAATTTCCGGCGACTCGCCTCCGGACTCCGCCGCGTCGTCCTCCGGCTTCCACC
TGGACCCGTCGTCTCTCCGTCGCCTCTCTGGCAAGCCTCTCGTGATGTGTCTCTCGGTTTTCGCGGGGCGGATGGGGCACACGTGCGGGGTGAATTCTGGCAAGTTCCTC
GGCCGGGTTCGGATCACGGTGGCTCTCGACGGCGCTGACAGTAGACCCAGAGTATTCCACAACGGGTGGGTTAAATTGGGGAAAGAAGAGGATAAAATCTCGGCCCGCCT
CCACTTGGTTGTCCGGTCCGAACCCGACCCGCGCTTCGTTTTCCAGTTCGGTGGAGAACCGGAATGTAGCCCGGTGGTTTTCCAGATCCAAGGCAATATCCGGCAACCGG
TTTTCAGCTGCAAGTTCAGTGCCGATCGGAATTCGAGGACCCGGTCACTGCCGTCGGATTTCAGCTTCAACAGTACTAAAGGAAAATGGATGAGAACTTTTTCAGGGGAG
AGAGAGAAGCCGGGGAGGGAGAGAAAGGGGTGGATGATCATGGTCTATGACCTCTCCGGCTCCCCCGTCGCGGCCGCCTCCATGATAACGCCGTTCGTCCCTTCCCCGGG
CACCGACCGTGTCTCTCGCTCCAACCCTGGTGCTTGGCTCATCCTCCGCCCCCATGGCTTCTCCGTCAGCAGTTGGAAGCCATGGGGACGCCTCGAGGCTTGGCGCGAGC
GGGGACCGATCGATGGCCTAGGCTACAAGTTCGAGCTCGTCGCCAACACTGGACTAGCCACTGGCATCCCCATCGCAGAAGCTACCATGAGCGTAAAAAAGGGTGGCCAA
TTTTGCATCGACAGTAGAACTATAAGAGATTTCAGTCCAAACTCTAGATCCAACATTAAAGTAGGGAACTTTGTAATGGCTTCGAGCGTGGAGGGAGAAGGGAAGGTGAG
CAAGCCAATGGTAGAAGTGGGAGTTAAGCACGTGACATGCATGGCAGATGCTGCTTTATTTGTAGCTCTAGCAGCCGCCATTGATCTTAGCATGGACGCTTGCAGACACT
TCACACAGAAGCTGAGAAGAGAACTCTGTCAC
Protein sequenceShow/hide protein sequence
MDPCPFVRLMVESLALNLPQATRPAGAAVHPSATPCFCKIAIKNFPSQTALLPLSSISGDSPPDSAASSSGFHLDPSSLRRLSGKPLVMCLSVFAGRMGHTCGVNSGKFL
GRVRITVALDGADSRPRVFHNGWVKLGKEEDKISARLHLVVRSEPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSLPSDFSFNSTKGKWMRTFSGE
REKPGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRERGPIDGLGYKFELVANTGLATGIPIAEATMSVKKGGQ
FCIDSRTIRDFSPNSRSNIKVGNFVMASSVEGEGKVSKPMVEVGVKHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH