; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0334 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0334
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionGOLD domain-containing protein
Genome locationMC01:10046289..10059977
RNA-Seq ExpressionMC01g0334
SyntenyMC01g0334
Gene Ontology termsNA
InterPro domainsIPR009038 - GOLD domain
IPR036598 - GOLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus]1.14e-28289.43Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKY FTPLSD VSRLSTEMLA+ NSLLDELPPTSEESTLLDEA +HPPHKIDENMWKNRENVEEIL L EKS WPQEVQ++
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATGESEL  I+GKLEEK RN+L+ L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRLCISLSWRRFDA KL EHL ILEQAVDVYTSE+ERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFISADVA AA+ R ++SYKEISV AGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVE ESPGG K LILPHKRYESDQGNFCTCMAGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KLIWDNTYSTFFKKVLRYKVDCIPPV EPVQP  E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo]5.66e-28389.43Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKY F PLSD VSRLSTEMLA+ NSLLDELPPTSEESTLLDEA +HPPHKIDENMWKNRENVEEIL LLEKS WPQEVQ++
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATG+SEL  I+GKLEEK RN+L++L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRLCISLSWRRFDA KL EHL ILEQAVDVYTSE+ERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFISADVA AA+ R ++SYKEISV AGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVE ESPGG KTLILPHKRYESDQGNFCTCMAGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KLIWDNTYSTFFKKVLRYKVDCIPPV EPVQP  E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

XP_022156250.1 uncharacterized protein LOC111023184 [Momordica charantia]0.099.77Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAM NSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTEG
        KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTEG
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTEG

XP_022924994.1 uncharacterized protein LOC111432376 isoform X1 [Cucurbita moschata]7.21e-27186.44Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYY+KY FTPLSDD+SRLSTEMLA+ N LLDELPPT EESTLLDEA   PPHKIDENMWKNRENVEEIL LLEKS WPQEVQ++
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATGESEL  ILGKLEEK +N+L +L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDARKL +HLAILEQAVDVY SELERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFI ADV      R  +SYKEISV AGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVE ESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KLIWDNTYSTFFKKV+RYKVDCIPPV EP+Q   E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

XP_038881964.1 uncharacterized protein LOC120073287 isoform X1 [Benincasa hispida]1.67e-27988.97Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKY FTPL D VSRLSTEMLA+ NSLLDELPPTSEES LLDEA +HPPHKIDENMWKNRENVEEIL LLEKS WPQEVQ +
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATGESEL  I+GKLEEK RN+L+ L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRY LLW QQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRLCISLSWRRFDA K  EHL ILEQAVDVYTSELERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFISADVA AAD R ++SYKEISV AGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVE ESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KL+WDNTYSTFFKKVLRYKVDCIPPV EPVQP  E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

TrEMBL top hitse value%identityAlignment
A0A0A0L040 GOLD domain-containing protein5.52e-28389.43Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKY FTPLSD VSRLSTEMLA+ NSLLDELPPTSEESTLLDEA +HPPHKIDENMWKNRENVEEIL L EKS WPQEVQ++
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATGESEL  I+GKLEEK RN+L+ L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRLCISLSWRRFDA KL EHL ILEQAVDVYTSE+ERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFISADVA AA+ R ++SYKEISV AGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVE ESPGG K LILPHKRYESDQGNFCTCMAGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KLIWDNTYSTFFKKVLRYKVDCIPPV EPVQP  E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X12.74e-28389.43Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKY F PLSD VSRLSTEMLA+ NSLLDELPPTSEESTLLDEA +HPPHKIDENMWKNRENVEEIL LLEKS WPQEVQ++
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATG+SEL  I+GKLEEK RN+L++L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRLCISLSWRRFDA KL EHL ILEQAVDVYTSE+ERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFISADVA AA+ R ++SYKEISV AGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVE ESPGG KTLILPHKRYESDQGNFCTCMAGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KLIWDNTYSTFFKKVLRYKVDCIPPV EPVQP  E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein2.74e-28389.43Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKY F PLSD VSRLSTEMLA+ NSLLDELPPTSEESTLLDEA +HPPHKIDENMWKNRENVEEIL LLEKS WPQEVQ++
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATG+SEL  I+GKLEEK RN+L++L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRLCISLSWRRFDA KL EHL ILEQAVDVYTSE+ERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFISADVA AA+ R ++SYKEISV AGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVE ESPGG KTLILPHKRYESDQGNFCTCMAGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KLIWDNTYSTFFKKVLRYKVDCIPPV EPVQP  E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

A0A6J1DPR8 uncharacterized protein LOC1110231840.099.77Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAM NSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTEG
        KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTEG
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTEG

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X13.49e-27186.44Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MASMEGLVPITRHFLASYY+KY FTPLSDD+SRLSTEMLA+ N LLDELPPT EESTLLDEA   PPHKIDENMWKNRENVEEIL LLEKS WPQEVQ++
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        SATGESEL  ILGKLEEK +N+L +L  FQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI+++NDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDARKL +HLAILEQAVDVY SELERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREV

Query:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY
        FNN+PFFI ADV      R  +SYKEISV AGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVE ESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE
        KLIWDNTYSTFFKKV+RYKVDCIPPV EP+Q   E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink).1.1e-17369.07Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MAS EGL+PITR FLASYYDKY F+PLSDDVSRLS++M +++  L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEIL LL  S WP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        S + ++E   IL  L++ F N+   +  FQ KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD Y LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIR++NDDDGPMEEQR+RYGPPLY+LT MV+ IR+ ++L W R+D  KLS + + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE

Query:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGD
        VF NSPFFISAD AG    R+NE YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVEY +  GEKTLILP++RYE+DQGNF T MAG+
Subjt:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGD

Query:  YKLIWDNTYSTFFKKVLRYKVDCIPPVAEP
        YKL+WDN+YSTFFKK LRYKVDCI PV EP
Subjt:  YKLIWDNTYSTFFKKVLRYKVDCIPPVAEP

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink).1.2e-16763.06Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MAS EGL+PITR FLASYYDKY F+PLSDDVSRLS++M +++  L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEIL LL  S WP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        S + ++E   IL  L++ F N+   +  FQ KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD Y LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIR++NDDDGPMEEQR+RYGPPLY+LT MV+ IR+ ++L W R+D  KLS + + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE

Query:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNM--------------------------------------
        VF NSPFFISAD AG    R+NE YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVAEP
           DIGFSVEY +  GEKTLILP++RYE+DQGNF T MAG+YKL+WDN+YSTFFKK LRYKVDCI PV EP
Subjt:  ---DIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVAEP

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink).2.1e-16768.74Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MAS EGL+PITR FLASYYDKY F+PLSDDVSRLS++M +++  L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEIL LL  S WP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        S + ++E   IL  L++ F N+   +  FQ KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD Y LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIR++NDDDGPMEEQR+RYGPPLY+LT MV+ IR+ ++L W R+D  KLS + + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE

Query:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGD
        VF NSPFFISAD AG    R+NE YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVEY +  GEKTLILP++RYE+DQGNF T MAG+
Subjt:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGD

Query:  YKLIWDNTYSTFFKKVLRY
        YKL+WDN+YSTFFKKV RY
Subjt:  YKLIWDNTYSTFFKKVLRY

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038).3.8e-16161.57Show/hide
Query:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ
        MAS EGL+PITR FLASYYDKY F+PLSDDVSRLS++M +++  L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEIL LL  S WP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQ

Query:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL
        S + ++E   IL  L++ F N+   +  FQ KNSE +F+T       DFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD Y LLWKQQMERRRQLAQL
Subjt:  SATGESELDIILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIR++NDDDGPMEEQR+RYGPPLY+LT MV+ IR+ ++L W R+D  KLS + + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLS-EHLAILEQAVDVYTSELERFLGFIRE

Query:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNM--------------------------------------
        VF NSPFFISAD AG    R+NE YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNSPFFISADVAGAADVRNNESYKEISVLAGKTYEVSLSVESINSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVAEP
           DIGFSVEY +  GEKTLILP++RYE+DQGNF T MAG+YKL+WDN+YSTFFKK LRYKVDCI PV EP
Subjt:  ---DIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVAEP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATGGAGGGTCTGGTGCCTATAACCAGGCATTTCCTAGCTTCGTATTACGATAAGTATACATTTACGCCTCTCTCGGATGACGTCTCTCGCCTTTCCACTGA
GATGCTCGCCATGGTGAACAGTTTGCTCGATGAACTCCCGCCTACTTCAGAGGAAAGCACCCTACTTGATGAAGCAAAACGTCATCCTCCTCATAAAATTGATGAGAATA
TGTGGAAGAATCGGGAAAATGTGGAGGAAATTTTGCTTCTGCTTGAAAAATCTCATTGGCCTCAAGAGGTTCAGCAGCAGTCTGCAACTGGTGAATCCGAACTTGATATT
ATTCTAGGAAAGCTAGAAGAAAAATTCCGGAATTCCTTAAACATGTTGGCAGTTTTCCAAGCTAAAAATTCTGAGAATGTGTTCAACACAGTTATGACCTACATGCCTCA
AGATTTTCGAGGAACAATAATTAGACAGCAAAGAGAGCGATCAGAGAGGAATAAGCAAGCAGAGGTTGATGCTCTGATTAATTCTGGAGGAAGTATACGTGATCGATATG
TTCTCTTATGGAAGCAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTA
TTACTAGAATTCATTCGACGACTAAATGATGATGATGGGCCAATGGAAGAACAACGACAGCGCTATGGACCACCTTTGTATAACCTTACAACAATGGTCCTCCTTATTCG
ACTCTGTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAACTAAGCGAGCATCTAGCCATTTTGGAGCAAGCTGTTGATGTGTACACCTCTGAGCTTGAGAGGTTCC
TCGGGTTCATTCGGGAGGTCTTCAACAATTCTCCATTCTTTATTTCAGCAGATGTGGCTGGTGCAGCAGATGTGAGGAATAATGAGAGCTACAAAGAGATTAGTGTTCTA
GCTGGGAAGACCTATGAGGTTTCATTAAGTGTGGAGTCAATCAATTCATATATTGCCTGGGATTTCTCACTGGTTCAAGGCAAGATGAATATGGATATTGGATTCAGTGT
GGAGTATGAAAGTCCTGGAGGGGAAAAGACTTTGATATTGCCTCACAAACGGTACGAGTCCGATCAAGGAAACTTCTGCACTTGCATGGCTGGAGACTACAAGCTGATTT
GGGACAATACATATTCAACTTTTTTTAAGAAGGTTTTGCGCTACAAGGTCGACTGCATACCTCCGGTCGCAGAGCCGGTGCAACCCACCACCGAAGGTTGA
mRNA sequenceShow/hide mRNA sequence
CAGGAATGTCCCGGTTGTACCCGGTTGGCGTTTTAAGCTCGTTCTGGCCTCGTTATAGGTCAGAACTCATCCCGGTAAGGTCCTGGCTCATCGTAGGAGCAGGACGATCG
AGTGACCGAGAACCAAGATGGTTGAAAACAAGTTGCGACGTCGCGTTCAGAAGACCTTGGCCTAGAGTAGAGGCAAGGTAGAGCCGTGGGACTTGGCCTAGTGTAGAGGC
AAGCCGGCTAGAGTCGCGTTAGAGTTTAAATGCACGTACGGGCGACGTGCATTGCCGAGTCAAGGGAGAATTCCGCTGTAAGTGAAACTCGGTCGAGGAAGAAAGAACCT
CGCCTAAGGTCCGACGAAGCCACGAAAGGGCGTGAAAAGTGGATTGGAGCTAGAGTTCCTTGTAAGGCTGGCCATGGTCTAGCTTAGATCATGGCGCCGCACGGTTGAAA
AATGTACGACCGTGACACGTAGCGGCACAGAGCCGTCTTTTGCTGTTCTACGACTATTTTTGACAATCAAAGGCTACCAGAAGAAAATCCAGAGACCTTATTCACGGTGG
AAGTGTTTCAGTTTGCTATTTTTGCGCGAACGATTACTCGATCGGAGGAGGAAGTTTCACAGTGGAGACGACAACTGAAAGATTTCAATTCGGAGAAGAGAACCAAGGTA
ACTAAGGCGGAAATGGCTTCCATGGAGGGTCTGGTGCCTATAACCAGGCATTTCCTAGCTTCGTATTACGATAAGTATACATTTACGCCTCTCTCGGATGACGTCTCTCG
CCTTTCCACTGAGATGCTCGCCATGGTGAACAGTTTGCTCGATGAACTCCCGCCTACTTCAGAGGAAAGCACCCTACTTGATGAAGCAAAACGTCATCCTCCTCATAAAA
TTGATGAGAATATGTGGAAGAATCGGGAAAATGTGGAGGAAATTTTGCTTCTGCTTGAAAAATCTCATTGGCCTCAAGAGGTTCAGCAGCAGTCTGCAACTGGTGAATCC
GAACTTGATATTATTCTAGGAAAGCTAGAAGAAAAATTCCGGAATTCCTTAAACATGTTGGCAGTTTTCCAAGCTAAAAATTCTGAGAATGTGTTCAACACAGTTATGAC
CTACATGCCTCAAGATTTTCGAGGAACAATAATTAGACAGCAAAGAGAGCGATCAGAGAGGAATAAGCAAGCAGAGGTTGATGCTCTGATTAATTCTGGAGGAAGTATAC
GTGATCGATATGTTCTCTTATGGAAGCAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGA
GTTCCAGAGGTATTACTAGAATTCATTCGACGACTAAATGATGATGATGGGCCAATGGAAGAACAACGACAGCGCTATGGACCACCTTTGTATAACCTTACAACAATGGT
CCTCCTTATTCGACTCTGTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAACTAAGCGAGCATCTAGCCATTTTGGAGCAAGCTGTTGATGTGTACACCTCTGAGC
TTGAGAGGTTCCTCGGGTTCATTCGGGAGGTCTTCAACAATTCTCCATTCTTTATTTCAGCAGATGTGGCTGGTGCAGCAGATGTGAGGAATAATGAGAGCTACAAAGAG
ATTAGTGTTCTAGCTGGGAAGACCTATGAGGTTTCATTAAGTGTGGAGTCAATCAATTCATATATTGCCTGGGATTTCTCACTGGTTCAAGGCAAGATGAATATGGATAT
TGGATTCAGTGTGGAGTATGAAAGTCCTGGAGGGGAAAAGACTTTGATATTGCCTCACAAACGGTACGAGTCCGATCAAGGAAACTTCTGCACTTGCATGGCTGGAGACT
ACAAGCTGATTTGGGACAATACATATTCAACTTTTTTTAAGAAGGTTTTGCGCTACAAGGTCGACTGCATACCTCCGGTCGCAGAGCCGGTGCAACCCACCACCGAAGGT
TGAGTGTGCCTCCACAGCAGCCTCAACAAATTGTAGACGATTTTTTAGGTTGTAACATACATTCATATCGAAGTTTGTTATTGCATTGAGGTCATATTAGAATTTGCTGC
TAGAGAAATCCTGTAAACAATGCAATTAACCGAAAAAAGCATTCACTGTAATTATGGAAATATGTTATTTTCTTAATCGACCTCAGCTTGTGTAATACCTTCTGTTCTCT
CATAAATGTCACAGAAAATTGAACAAATAGAAAAGATTGCAATCTTTCAAACGACAATAAGCTTATAATCTTCTTCCACGACTTAAAGCGGTCGGTCCAGTTGTACAAGG
TCCTATTGTCTTCTTTGTTTGACAATATGTTCGGAGTGTATCGTCTTCGTGAACAGATGAATGTGTCAAGGAAACCAAAACTTCAGAAGAGTGCATTATGCTGGAATAAT
GTTGGGGTTTGCATGTAGTTATACACTCGCTTGGCAGATTCTAGTGTGTTGTGCAAGAGCATAGCAACCGTCATTGGTCCAACTCCGCCAGGAACTGGTGTAATGGCAGA
TGCCACTCTTGATGCCTCGTCGTAGCAAACATCTCCGATTAGTCGATAGCCATGCTCGCAGCTAGGATCCTGGCAATT
Protein sequenceShow/hide protein sequence
MASMEGLVPITRHFLASYYDKYTFTPLSDDVSRLSTEMLAMVNSLLDELPPTSEESTLLDEAKRHPPHKIDENMWKNRENVEEILLLLEKSHWPQEVQQQSATGESELDI
ILGKLEEKFRNSLNMLAVFQAKNSENVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYVLLWKQQMERRRQLAQLGSATGVYKTLVKYLVGVPEV
LLEFIRRLNDDDGPMEEQRQRYGPPLYNLTTMVLLIRLCISLSWRRFDARKLSEHLAILEQAVDVYTSELERFLGFIREVFNNSPFFISADVAGAADVRNNESYKEISVL
AGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVEYESPGGEKTLILPHKRYESDQGNFCTCMAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVAEPVQPTTEG