CuGenDBv2

ClCG04G005100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG04G005100
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: GOLD domain-containing protein
Genome location: CG_Chr04:18688310..18704582
RNA-Seq Expression: ClCG04G005100
Synteny: ClCG04G005100
Gene Ontology terms: NA
InterPro domains: IPR009038 - GOLD domain
IPR036598 - GOLD domain superfamily
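
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("CG_Chr04:18688310..18704582")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region covers 16273 bp, considerably longer than the 1311 nt CDS listed under Sequences.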


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus] | e value: 1.2e-238 | % identity: 95.41
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLEEK RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREHL ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ KM+MDIGFSVECESPGGVK LILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo] | e value: 9.5e-239 | % identity: 95.41
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLEEK RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREHL ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ KM+MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

XP_023517640.1 uncharacterized protein LOC111781338 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 2.8e-222 | % identity: 89.91
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD +SRLSTEMLALAN L+DELPPT EESTLLDEANQ PPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANI+GKLEEK++NTL  LV FQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRL ISLSWRRFDA KLR+HL ILEQAVDVY+SELERFL FIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQ KM+MDIGFSVECESPGG KTLILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

XP_038881964.1 uncharacterized protein LOC120073287 isoform X1 [Benincasa hispida] | e value: 6.8e-237 | % identity: 95.4
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPL DHVSRLSTEMLALANSLLDELPPTSEES LLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQ E
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLEEKVRNTLH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLW QQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA K REHLVILEQAVDVY+SELERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ KM+MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAE
        KL+WDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAE

XP_038881965.1 uncharacterized protein LOC120073287 isoform X2 [Benincasa hispida] | e value: 1.5e-223 | % identity: 91.72
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPL DHVSRLSTEMLALANSLLDELPPTSEES LLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQ E
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLEEKVRNTLH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLW QQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA K REHLVILEQAVDVY+SELERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ KM+M                LILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAE
        KL+WDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L040 GOLD domain-containing protein | e value: 6.0e-239 | % identity: 95.41
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLEEK RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREHL ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ KM+MDIGFSVECESPGGVK LILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

A0A1S3CJ00 uncharacterized protein LOC103501503 isoform X2 | e value: 5.1e-222 | % identity: 90.14
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLEEK RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREHL ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYE                         DIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X1 | e value: 4.6e-239 | % identity: 95.41
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLEEK RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREHL ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ KM+MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein | e value: 4.6e-239 | % identity: 95.41
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLEEK RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREHL ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ KM+MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X1 | e value: 2.3e-222 | % identity: 89.91
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD +SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANI+GKLEEK++NTL  LV FQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRL ISLSWRRFDA KLR+HL ILEQAVDVY+SELERFL FIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQ KM+MDIGFSVECESPGG KTLILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink). | e value: 6.5e-169 | % identity: 67.91
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L++   N    +++FQ+KNSE +F+TVMTYMPQDFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q K+SMDIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG 
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE

Query:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
        YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | e value: 6.9e-163 | % identity: 62
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L++   N    +++FQ+KNSE +F+TVMTYMPQDFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q K+SM                                      
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSM--------------------------------------

Query:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | e value: 2.0e-162 | % identity: 67.3
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L++   N    +++FQ+KNSE +F+TVMTYMPQDFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q K+SMDIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG 
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE

Query:  YKLIWDNTYSTFFKKVLRY
        YKL+WDN+YSTFFKKV RY
Subjt:  YKLIWDNTYSTFFKKVLRY

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038). | e value: 2.2e-156 | % identity: 60.51
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L++   N    +++FQ+KNSE +F+T       DFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKL-REHLVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q K+SM                                      
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKMSM--------------------------------------

Query:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP


Sequences
CDS sequence
ATGGCTTCCATGGAGGGTCTGGTGCCTATTACCAGGCATTTCCTTGCTTCGTACTATGATAAGTACCCATTTACGCCTCTATCTGACCATGTCTCTCGCCTTTCGACTGA
GATGCTCGCTCTGGCGAACAGTTTGCTCGATGAACTACCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACCAACATCCTCCTCATAAAATTGACGAGAATT
TGTGGAAGAATCGGGAAAATGTGGAAGAAATTCTCTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGTGAATCTGAACTTGCTAAT
ATTATAGGAAAGCTAGAAGAAAAAGTTCGGAATACCTTACACACGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACAGTTATGACATACATGCCTCA
AGATTTTCGAGGAACGTTAATTAGACAGCAAAGAGAACGATCAGAGAGAAATAAGCAAGCAGAGGTTGATGCTTTGATTAATTCTGGAGGAAGTATACGTGAACGGTATG
CTCTCTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCATCAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTA
TTGCTAGAATTCATTCAGAAAATAAATGATGACGATGGACCAATGGAAGAACAAAGATACCGCTATGGACCACCTCTGTATAAACTTACAACAATGGTCTGCCTTATTCG
ACTCTGTATTTCATTATCATGGAGACGTTTTGATGCGGCAAAACTAAGGGAGCATCTTGTTATTTTGGAGCAAGCTGTTGATGTGTACTCCTCTGAGCTTGAGCGGTTCC
TCGGGTTCATTCGCGAGGTCTTCAACAATGCTCCGTTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGAGAGGAAAAGTGATAGCTACAAAGAGATTAGTGTTCCA
GCTGGGAAGACTTACGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTTTCATTGGTTCAAAGCAAGATGAGTATGGATATTGGATTCAGTGT
GGAGTGTGAAAGTCCTGGAGGGGTAAAGACGTTGATATTGCCTCACAGACGTTACGAGTCTGATCAAGGAAACTTCTGCACTTGCATGGCTGGGGAATACAAGCTGATTT
GGGACAATACATATTCAACTTTTTTTAAGAAGGTTTTGCGCTATAAGGTTGACTGCATACCTCCCGTGGTAGAGCCGGTGCAACCCGCTGCAGAAGAATAA
mRNA sequence
ATGGCTTCCATGGAGGGTCTGGTGCCTATTACCAGGCATTTCCTTGCTTCGTACTATGATAAGTACCCATTTACGCCTCTATCTGACCATGTCTCTCGCCTTTCGACTGA
GATGCTCGCTCTGGCGAACAGTTTGCTCGATGAACTACCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACCAACATCCTCCTCATAAAATTGACGAGAATT
TGTGGAAGAATCGGGAAAATGTGGAAGAAATTCTCTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGTGAATCTGAACTTGCTAAT
ATTATAGGAAAGCTAGAAGAAAAAGTTCGGAATACCTTACACACGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACAGTTATGACATACATGCCTCA
AGATTTTCGAGGAACGTTAATTAGACAGCAAAGAGAACGATCAGAGAGAAATAAGCAAGCAGAGGTTGATGCTTTGATTAATTCTGGAGGAAGTATACGTGAACGGTATG
CTCTCTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCATCAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTA
TTGCTAGAATTCATTCAGAAAATAAATGATGACGATGGACCAATGGAAGAACAAAGATACCGCTATGGACCACCTCTGTATAAACTTACAACAATGGTCTGCCTTATTCG
ACTCTGTATTTCATTATCATGGAGACGTTTTGATGCGGCAAAACTAAGGGAGCATCTTGTTATTTTGGAGCAAGCTGTTGATGTGTACTCCTCTGAGCTTGAGCGGTTCC
TCGGGTTCATTCGCGAGGTCTTCAACAATGCTCCGTTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGAGAGGAAAAGTGATAGCTACAAAGAGATTAGTGTTCCA
GCTGGGAAGACTTACGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTTTCATTGGTTCAAAGCAAGATGAGTATGGATATTGGATTCAGTGT
GGAGTGTGAAAGTCCTGGAGGGGTAAAGACGTTGATATTGCCTCACAGACGTTACGAGTCTGATCAAGGAAACTTCTGCACTTGCATGGCTGGGGAATACAAGCTGATTT
GGGACAATACATATTCAACTTTTTTTAAGAAGGTTTTGCGCTATAAGGTTGACTGCATACCTCCCGTGGTAGAGCCGGTGCAACCCGCTGCAGAAGAATAA
Protein sequence
MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKESATGESELAN
IIGKLEEKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQLGSASGVYKTLVKYLVGVPEV
LLEFIQKINDDDGPMEEQRYRYGPPLYKLTTMVCLIRLCISLSWRRFDAAKLREHLVILEQAVDVYSSELERFLGFIREVFNNAPFFISADVACAADERKSDSYKEISVP
AGKTYEVSLSVESINSYIAWDFSLVQSKMSMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQPAAEE
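
The CDS and protein sequences above are related through the standard genetic code. The following is a minimal sketch of that translation, applied only to the first 30 nucleotides of the CDS for brevity. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB, and using the standard code for this nuclear gene is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
cds_prefix = "ATGGCTTCCATGGAGGGTCTGGTGCCTATT"
print(translate(cds_prefix))
```

The result can be compared with the first ten residues of the protein sequence, MASMEGLVPI. Passing the full CDS to the same function should allow a check against the complete protein.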