CuGenDBv2

CcUC04G064190 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC04G064190
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: GOLD domain-containing protein
Genome location: CicolChr04:20554835..20571855
RNA-Seq Expression: CcUC04G064190
Synteny: CcUC04G064190
Gene Ontology terms: NA
InterPro domains: IPR009038 - GOLD domain; IPR036598 - GOLD domain superfamily
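The genome location string above has the form seqid:start..end. A minimal Python sketch for splitting it and computing the locus span follows. The function name parse_location is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CicolChr04:20554835..20571855")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that coordinate assumption the locus spans 17,021 bp, which is much longer than the mRNA listed under Sequences and would be consistent with a gene model containing introns.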


Homology
GenBank top hits | e value | % identity | alignment
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus] | e value 4.0e-237 | identity 94.5%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLE+K RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREH+ ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ K++MDIGFSVECESPGGVK LILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo] | e value 3.1e-237 | identity 94.5%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLE+K RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREH+ ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ K++MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

XP_023517640.1 uncharacterized protein LOC111781338 isoform X1 [Cucurbita pepo subsp. pepo] | e value 8.1e-222 | identity 89.22%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD +SRLSTEMLALAN L+DELPPT EESTLLDEANQ PPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANI+GKLE+K++NTL  LV FQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRL ISLSWRRFDA KLR+H+ ILEQAVDVY+SELERFL FIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQ K++MDIGFSVECESPGG KTLILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

XP_038881964.1 uncharacterized protein LOC120073287 isoform X1 [Benincasa hispida] | e value 2.2e-235 | identity 94.48%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPL DHVSRLSTEMLALANSLLDELPPTSEES LLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQ E
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLE+KVRNTLH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLW QQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA K REH+VILEQAVDVY+SELERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ K++MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAE
        KL+WDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAE

XP_038881965.1 uncharacterized protein LOC120073287 isoform X2 [Benincasa hispida] | e value 4.7e-222 | identity 90.8%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPL DHVSRLSTEMLALANSLLDELPPTSEES LLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQ E
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLE+KVRNTLH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLW QQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA K REH+VILEQAVDVY+SELERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ K++M                LILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAE
        KL+WDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAE

TrEMBL top hits | e value | % identity | alignment
A0A0A0L040 GOLD domain-containing protein | e value 1.9e-237 | identity 94.5%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANIIGKLE+K RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREH+ ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ K++MDIGFSVECESPGGVK LILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

A0A1S3CJ00 uncharacterized protein LOC103501503 isoform X2 | e value 5.6e-221 | identity 89.45%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLE+K RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREH+ ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYE                         DIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X1 | e value 1.5e-237 | identity 94.5%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLE+K RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREH+ ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ K++MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein | e value 1.5e-237 | identity 94.5%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYDKYPF PLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANIIGKLE+K RN LH LVAFQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRLCISLSWRRFDA KLREH+ ILEQAVDVY+SE+ERFLGFIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFISADVACAA+ERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQ K++MDIGFSVECESPGGVKTLILPH+RYESDQGNFCTCMAG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQ AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X1 | e value 6.7e-222 | identity 89.22%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD +SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDEN+WKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANI+GKLE+K++NTL  LV FQSKNSEHVFNTVMTYMPQDFRGT+IRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV
        GSA+GVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQR RYGPPLYKLTTMV LIRL ISLSWRRFDA KLR+H+ ILEQAVDVY+SELERFL FIREV
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREV

Query:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQ K++MDIGFSVECESPGG KTLILPH+RYESDQGNFCTC+AG+Y
Subjt:  FNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE

SwissProt top hits | e value | % identity | alignment
No hits found
Arabidopsis top hits | e value | % identity | alignment
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink). | e value 6.5e-169 | identity 68.14%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L+    N    +++FQ+KNSE +F+TVMTYMPQDFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q KISMDIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG 
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE

Query:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
        YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | e value 6.9e-163 | identity 62.21%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L+    N    +++FQ+KNSE +F+TVMTYMPQDFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q KISM                                      
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISM--------------------------------------

Query:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | e value 2.6e-162 | identity 67.54%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L+    N    +++FQ+KNSE +F+TVMTYMPQDFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q KISMDIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG 
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGE

Query:  YKLIWDNTYSTFFKKVLRY
        YKL+WDN+YSTFFKKV RY
Subjt:  YKLIWDNTYSTFFKKVLRY

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038). | e value 2.2e-156 | identity 60.72%
Query:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYYDKYPF+PLSD VSRLS++M +L   L  + PP+  E++L+DEAN+ PPHKIDEN+WKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+I+  L+    N    +++FQ+KNSE +F+T       DFRGTLIRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANIIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE
        GSA+GVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR RYGPPLY LT MV  IR+ ++L W R+D  KL ++ + +L +A  VY+SE ERF+ FI +
Subjt:  GSASGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKL-REHIVILEQAVDVYSSELERFLGFIRE

Query:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+Q KISM                                      
Subjt:  VFNNAPFFISADVACAADERKSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQSKISM--------------------------------------

Query:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP+RRYE+DQGNF T MAG YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP


Sequences
CDS sequence
ATGGCTTCCATGGAGGGTCTGGTGCCTATTACCAGGCATTTCCTTGCTTCGTACTATGATAAGTACCCATTTACGCCTCTATCTGACCATGTCTCTCGCCTTTCGACTGA
GATGCTTGCTTTGGCGAACAGTTTGCTCGATGAACTACCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACCAACATCCTCCTCATAAAATTGACGAGAATT
TGTGGAAGAATCGGGAAAATGTGGAAGAAATTCTCTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGTGAATCTGAACTTGCTAAT
ATTATTGGAAAGCTAGAAAAAAAAGTTCGGAATACCTTACACACGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACAGTTATGACATACATGCCTCA
AGATTTTCGAGGAACTTTAATTAGACAGCAAAGAGAACGATCAGAGAGAAATAAGCAAGCAGAGGTTGATGCTTTGATTAATTCTGGAGGAAGTATACGTGAACGGTATG
CTCTCTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCATCAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTA
TTGCTAGAATTCATTCAGAAAATAAATGATGACGATGGACCAATGGAAGAACAAAGACACCGCTATGGACCACCTCTGTATAAACTTACAACAATGGTCCACCTTATTCG
ACTCTGTATTTCATTATCATGGAGACGTTTTGATGCGGCAAAACTAAGGGAGCATATTGTTATTTTGGAGCAAGCTGTTGATGTGTACTCCTCTGAGCTTGAGCGGTTCC
TCGGGTTCATTCGCGAGGTCTTCAACAATGCTCCGTTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGAGAGGAAAAGTGATAGCTACAAAGAGATTAGTGTTCCA
GCTGGGAAGACTTACGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTTTCATTGGTTCAAAGCAAGATAAGTATGGATATTGGATTCAGTGT
GGAGTGTGAAAGTCCTGGAGGGGTAAAGACGTTGATATTGCCTCACAGACGTTACGAGTCTGATCAAGGAAACTTCTGCACTTGCATGGCTGGGGAATACAAGCTGATTT
GGGACAATACATATTCAACTTTTTTTAAGAAGGTGTTGCGCTATAAGGTCGACTGCATACCTCCCGTGGTAGAGCCGGTGCAAGCCGCTGCAGAAGAATAA
mRNA sequence
AAAGGGGTGAAAGGTGAACGACAAGTCGATAAGAATATATAGCATATATATCAAAATGGATGATCGATGACGACAAGTCGACCGAAGAACAGGGGCGTAGCGGCGTAGAG
CCGTCTTCTTGTTGCTCTTCCTCTACGAGTACTTTGACGATCGACCAAAGGCTTACCAGAAGAACTAGGAGAATTTCTTCACGGTGGAAGTATTTTTGTTTGCTAACTTG
CTTTGACGATACACATCTTCTTCTTCGGCTTCAAACGATCGGAGGAGGAAATTTCAGAGTGGAGCAGTTGTACTGAAAGATTTTGATTCGGTGAAGAGAATCAAGTTCAA
GGCGAAAATGGCTTCCATGGAGGGTCTGGTGCCTATTACCAGGCATTTCCTTGCTTCGTACTATGATAAGTACCCATTTACGCCTCTATCTGACCATGTCTCTCGCCTTT
CGACTGAGATGCTTGCTTTGGCGAACAGTTTGCTCGATGAACTACCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACCAACATCCTCCTCATAAAATTGAC
GAGAATTTGTGGAAGAATCGGGAAAATGTGGAAGAAATTCTCTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGTGAATCTGAACT
TGCTAATATTATTGGAAAGCTAGAAAAAAAAGTTCGGAATACCTTACACACGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACAGTTATGACATACA
TGCCTCAAGATTTTCGAGGAACTTTAATTAGACAGCAAAGAGAACGATCAGAGAGAAATAAGCAAGCAGAGGTTGATGCTTTGATTAATTCTGGAGGAAGTATACGTGAA
CGGTATGCTCTCTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCATCAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCC
AGAGGTATTGCTAGAATTCATTCAGAAAATAAATGATGACGATGGACCAATGGAAGAACAAAGACACCGCTATGGACCACCTCTGTATAAACTTACAACAATGGTCCACC
TTATTCGACTCTGTATTTCATTATCATGGAGACGTTTTGATGCGGCAAAACTAAGGGAGCATATTGTTATTTTGGAGCAAGCTGTTGATGTGTACTCCTCTGAGCTTGAG
CGGTTCCTCGGGTTCATTCGCGAGGTCTTCAACAATGCTCCGTTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGAGAGGAAAAGTGATAGCTACAAAGAGATTAG
TGTTCCAGCTGGGAAGACTTACGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTTTCATTGGTTCAAAGCAAGATAAGTATGGATATTGGAT
TCAGTGTGGAGTGTGAAAGTCCTGGAGGGGTAAAGACGTTGATATTGCCTCACAGACGTTACGAGTCTGATCAAGGAAACTTCTGCACTTGCATGGCTGGGGAATACAAG
CTGATTTGGGACAATACATATTCAACTTTTTTTAAGAAGGTGTTGCGCTATAAGGTCGACTGCATACCTCCCGTGGTAGAGCCGGTGCAAGCCGCTGCAGAAGAATAAGT
GTCTTGGGCGACCACTCCATTGTAGACAACAATGTTTAGATTGTAACATATACATTCATTTTCTTTATAGTTTCGTCATCTCAAAGAGTTTGATCGCACGTTGAGATCCT
ATTCTTTGCAAGATTTGCTGATAGATGAATCTTGTAAACAGTACAATTAACCCAAAAAAGAAAATCATTTGTAATTATGGAAATACGTCATTTTCTTGTATAATGCTTAG
AATCAATCACCTTGTGAAATGCCTATTCTCTCATCGTTC
Protein sequence
MASMEGLVPITRHFLASYYDKYPFTPLSDHVSRLSTEMLALANSLLDELPPTSEESTLLDEANQHPPHKIDENLWKNRENVEEILFLLEKSRWPQEVQKESATGESELAN
IIGKLEKKVRNTLHTLVAFQSKNSEHVFNTVMTYMPQDFRGTLIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQLGSASGVYKTLVKYLVGVPEV
LLEFIQKINDDDGPMEEQRHRYGPPLYKLTTMVHLIRLCISLSWRRFDAAKLREHIVILEQAVDVYSSELERFLGFIREVFNNAPFFISADVACAADERKSDSYKEISVP
AGKTYEVSLSVESINSYIAWDFSLVQSKISMDIGFSVECESPGGVKTLILPHRRYESDQGNFCTCMAGEYKLIWDNTYSTFFKKVLRYKVDCIPPVVEPVQAAAEE
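
The protein sequence above is expected to be the translation of the CDS sequence. A minimal Python sketch using the standard genetic code follows. The function name translate is hypothetical, and only the first six codons of the CDS are used here for brevity.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTTCCATGGAGGGT"))  # first six codons of the CDS above
```

Joining the CDS lines into one string and passing it to translate should reproduce the protein sequence if the two records are consistent.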