CuGenDBv2

CmoCh07G012300 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh07G012300
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: GOLD domain-containing protein
Genome location: Cmo_Chr07:6704533..6714312
RNA-Seq Expression: CmoCh07G012300
Synteny: CmoCh07G012300
Gene Ontology terms: NA
InterPro domains: IPR009038 - GOLD domain
                  IPR036598 - GOLD domain superfamily
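
As a small illustration of how the genome location string in this record can be read, the following sketch splits it into chromosome, start and end, then computes the span. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Cmo_Chr07:6704533..6714312")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

The span covers the whole gene region, introns included, so it is expected to be longer than the mRNA and CDS sequences listed further down.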


Homology
GenBank top hits | e value | %identity | Alignment
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus] | 2.7e-225 | 91.28
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD +SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATGESELANI+GKLEEK +N L  LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLR+HL ILEQAVDVY SE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGG K LILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo] | 9.1e-226 | 91.51
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPF PLSD +SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATG+SELANI+GKLEEK +N L VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLR+HL ILEQAVDVY SE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE

XP_022924994.1 uncharacterized protein LOC111432376 isoform X1 [Cucurbita moschata] | 1.2e-246 | 100
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
        FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
Subjt:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN

Query:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
        TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
Subjt:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED

XP_022966319.1 uncharacterized protein LOC111466012 isoform X1 [Cucurbita maxima] | 1.3e-243 | 98.61
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLA ANILLDELPPTPEESTL DEANHPPPHKIDENMWKNRENVEEILFLLEKS WPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLI+LFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
        FNNAPFFIPADVRKGDSYKEISVPAGKTYEVS+SVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
Subjt:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN

Query:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
        TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
Subjt:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED

XP_023517640.1 uncharacterized protein LOC111781338 isoform X1 [Cucurbita pepo subsp. pepo] | 6.7e-245 | 99.07
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANIL+DELPPTPEESTLLDEAN PPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
        FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSL+VES+NSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
Subjt:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN

Query:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
        TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
Subjt:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
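
The %identity values in this table are reported by the database. As a rough illustration of how identity can be counted from a BLAST-style midline, the sketch below tallies one alignment block: the final block of the XP_004147520.1 hit. It assumes the usual midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap). The helper name `midline_stats` is hypothetical.

```python
def midline_stats(midline):
    """Count identical and positive columns in a BLAST-style midline."""
    identical = sum(1 for c in midline if c.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positive = identical + midline.count("+")
    return identical, positive, len(midline)


# Midline of the last block of the XP_004147520.1 alignment, leading padding removed.
midline = "KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE"
identical, positive, length = midline_stats(midline)
percent_identity = 100.0 * identical / length
print(identical, positive, length, round(percent_identity, 2))
```

A per-block figure like this is not the hit-level %identity, which is computed over every block of the alignment.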

TrEMBL top hits | e value | %identity | Alignment
A0A0A0L040 GOLD domain-containing protein | 1.3e-225 | 91.28
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD +SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATGESELANI+GKLEEK +N L  LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLR+HL ILEQAVDVY SE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGG K LILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X1 | 4.4e-226 | 91.51
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPF PLSD +SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATG+SELANI+GKLEEK +N L VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLR+HL ILEQAVDVY SE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein | 4.4e-226 | 91.51
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPF PLSD +SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATG+SELANI+GKLEEK +N L VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLR+HL ILEQAVDVY SE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEP+Q AAEE
Subjt:  KLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEE

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X1 | 5.9e-247 | 100
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
        FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
Subjt:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN

Query:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
        TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
Subjt:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED

A0A6J1HTF5 uncharacterized protein LOC111466012 isoform X1 | 6.1e-244 | 98.61
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLA ANILLDELPPTPEESTL DEANHPPPHKIDENMWKNRENVEEILFLLEKS WPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLI+LFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREV

Query:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
        FNNAPFFIPADVRKGDSYKEISVPAGKTYEVS+SVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN
Subjt:  FNNAPFFIPADVRKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDN

Query:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
        TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED
Subjt:  TYSTFFKKVVRYKVDCIPPVVEPLQTAAEED

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink). | 3.4e-170 | 67.91
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDD+SRLS++M +L  +L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    ++ FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDALV+SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL +D + +L +A  VY SE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE

Query:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGD
        VF N+PFFI AD       R  + YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+
Subjt:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGD

Query:  YKLIWDNTYSTFFKKVVRYKVDCIPPVVEP
        YKL+WDN+YSTFFKK +RYKVDCI PVVEP
Subjt:  YKLIWDNTYSTFFKKVVRYKVDCIPPVVEP

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | 3.6e-164 | 62
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDD+SRLS++M +L  +L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    ++ FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDALV+SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL +D + +L +A  VY SE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE

Query:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNM--------------------------------------
        VF N+PFFI AD       R  + YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDNTYSTFFKKVVRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+YKL+WDN+YSTFFKK +RYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDNTYSTFFKKVVRYKVDCIPPVVEP

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | 4.7e-164 | 67.54
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDD+SRLS++M +L  +L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    ++ FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDALV+SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL +D + +L +A  VY SE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE

Query:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGD
        VF N+PFFI AD       R  + YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+
Subjt:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGD

Query:  YKLIWDNTYSTFFKKVVRY
        YKL+WDN+YSTFFKKV RY
Subjt:  YKLIWDNTYSTFFKKVVRY

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038). | 1.1e-157 | 60.51
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDD+SRLS++M +L  +L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    ++ FQ+KNSE +F+T       DFRGT+IRQQ+ERSERNKQAEVDALV+SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL +D + +L +A  VY SE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-RDHLAILEQAVDVYASELERFLVFIRE

Query:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNM--------------------------------------
        VF N+PFFI AD       R  + YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNAPFFIPADV------RKGDSYKEISVPAGKTYEVSLSVESLNSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDNTYSTFFKKVVRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+YKL+WDN+YSTFFKK +RYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDNTYSTFFKKVVRYKVDCIPPVVEP


Sequences
CDS sequence
ATGGCTTCCATGGAGGGTCTGGTGCCAATAACCAGGCATTTCCTGGCTTCCTACTATGAAAAGTACCCATTTACGCCTCTATCTGACGATATCTCTCGCCTCTCAACTGA
GATGCTCGCCTTGGCGAACATTTTGCTTGATGAACTACCGCCTACTCCAGAGGAAAGCACCCTTCTTGATGAAGCAAATCATCCTCCTCCTCATAAAATTGATGAGAATA
TGTGGAAGAATCGGGAGAACGTGGAAGAAATCCTGTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTCCAGAAGGAGTCTGCAACTGGCGAGTCTGAACTTGCTAAT
ATTCTAGGAAAGCTAGAAGAAAAATTGAAGAATACCTTAAAAGTGTTGGTGGATTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACAGTAATGACCTACATGCCTCA
AGATTTTCGAGGAACAATAATTCGTCAGCAAAGAGAGCGATCAGAGAGGAATAAGCAAGCAGAGGTTGATGCTTTGGTTAATTCTGGAGGAAGCATACGTGATCGATATG
CCCTTTTATGGAAACAACAGATGGAGAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTCGTGAAGTATTTGGTTGGAGTTCCAGAGGTA
TTGCTAGAATTCATTCAGAAAATAAATGATGATGATGGGCCAATGGAAGAACAACGACAACGCTATGGACCACCTTTGTATAAGCTTACAACAATGGTCCGCCTTATTCG
ACTCTTTATTTCATTATCATGGAGACGCTTTGATGCTAGGAAACTAAGGGATCATCTTGCTATTTTGGAACAAGCCGTTGATGTGTATGCCTCTGAGCTTGAGAGGTTCC
TCGTCTTCATTCGCGAGGTTTTCAACAATGCTCCATTCTTTATTCCAGCAGATGTGAGGAAAGGTGATAGCTACAAAGAGATTAGTGTTCCAGCTGGAAAGACTTATGAG
GTTTCATTAAGTGTGGAATCTCTCAATTCATATATTGCCTGGGATTTCTCGTTGGTTCAAGGCAAGATGAATATGGACATTGGATTCAGTGTGGAGTGTGAAAGCCCTGG
TGGGGGAAAGACTTTGATATTGCCACACAAACGTTACGAGTCTGATCAGGGAAACTTCTGCACTTGCATTGCTGGGGACTACAAGCTGATTTGGGACAATACATATTCAA
CCTTTTTTAAGAAGGTTGTGCGCTATAAGGTCGATTGCATACCTCCTGTGGTAGAGCCATTGCAAACTGCTGCAGAAGAAGATTAA
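
To show how the CDS above relates to the protein sequence listed in this record, here is a minimal translation sketch using the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database. The sketch assumes an upper- or lower-case A/C/G/T sequence read in frame from its first base. Only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Raises KeyError on ambiguous bases such as N; this sketch does not handle them.
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGCTTCCATGGAGGGTCTGGTGCCAATA"  # first 30 nt of the CDS
last_codons = "GCTGCAGAAGAAGATTAA"               # last 18 nt, ending in the TAA stop
print(translate(first_codons), translate(last_codons))
```

Translating the full CDS, joined into a single string without line breaks, in the same way should reproduce the protein sequence in this record.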
mRNA sequence
GTTTTGTAGCGGAGTAGAGCCGTCTTCTTGTTGCTCTACGACTCTTTTGACGATCAAAGGCTACCAGAAGAGCTACTGGGAGAATTCCTTCGACGGTGGAAGTTTTCCGG
TGTGCTAATTTGCTTCAACGATACGCAAACGCATCTTCTTCCGCTGCAAACAATCGGAGGAGGAATTTTCAGCAGGGAAAACTTCAACTGAGATTTTGAATCGGAGAAGA
GAGCCAAGTTAACGGCGGAAATGGCTTCCATGGAGGGTCTGGTGCCAATAACCAGGCATTTCCTGGCTTCCTACTATGAAAAGTACCCATTTACGCCTCTATCTGACGAT
ATCTCTCGCCTCTCAACTGAGATGCTCGCCTTGGCGAACATTTTGCTTGATGAACTACCGCCTACTCCAGAGGAAAGCACCCTTCTTGATGAAGCAAATCATCCTCCTCC
TCATAAAATTGATGAGAATATGTGGAAGAATCGGGAGAACGTGGAAGAAATCCTGTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTCCAGAAGGAGTCTGCAACTG
GCGAGTCTGAACTTGCTAATATTCTAGGAAAGCTAGAAGAAAAATTGAAGAATACCTTAAAAGTGTTGGTGGATTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACA
GTAATGACCTACATGCCTCAAGATTTTCGAGGAACAATAATTCGTCAGCAAAGAGAGCGATCAGAGAGGAATAAGCAAGCAGAGGTTGATGCTTTGGTTAATTCTGGAGG
AAGCATACGTGATCGATATGCCCTTTTATGGAAACAACAGATGGAGAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTCGTGAAGTATT
TGGTTGGAGTTCCAGAGGTATTGCTAGAATTCATTCAGAAAATAAATGATGATGATGGGCCAATGGAAGAACAACGACAACGCTATGGACCACCTTTGTATAAGCTTACA
ACAATGGTCCGCCTTATTCGACTCTTTATTTCATTATCATGGAGACGCTTTGATGCTAGGAAACTAAGGGATCATCTTGCTATTTTGGAACAAGCCGTTGATGTGTATGC
CTCTGAGCTTGAGAGGTTCCTCGTCTTCATTCGCGAGGTTTTCAACAATGCTCCATTCTTTATTCCAGCAGATGTGAGGAAAGGTGATAGCTACAAAGAGATTAGTGTTC
CAGCTGGAAAGACTTATGAGGTTTCATTAAGTGTGGAATCTCTCAATTCATATATTGCCTGGGATTTCTCGTTGGTTCAAGGCAAGATGAATATGGACATTGGATTCAGT
GTGGAGTGTGAAAGCCCTGGTGGGGGAAAGACTTTGATATTGCCACACAAACGTTACGAGTCTGATCAGGGAAACTTCTGCACTTGCATTGCTGGGGACTACAAGCTGAT
TTGGGACAATACATATTCAACCTTTTTTAAGAAGGTTGTGCGCTATAAGGTCGATTGCATACCTCCTGTGGTAGAGCCATTGCAAACTGCTGCAGAAGAAGATTAAGTGT
CTCGAGCAGTCACTTCACTCCATTGTAGACAACGTTTTTTAGATTGTAACATACATTCATTGTTTTTATAGTATCTTCGTCTCGAAGTTTGTGTTTGTTTGTTCGCACGT
GGACTAAATCTTGTAAACAATGCAACCAACCAGTGAAAGGAATTGCTGTAATTAAGGAAATATGCCATTTAGTTACATGATAATAGCATCAAAAGTGCTTCTAATCCTTC
TCTAGTTGATGAAAAGCACTTTTTATCCTTCA
Protein sequence
MASMEGLVPITRHFLASYYEKYPFTPLSDDISRLSTEMLALANILLDELPPTPEESTLLDEANHPPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKESATGESELAN
ILGKLEEKLKNTLKVLVDFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALVNSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGVPEV
LLEFIQKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLRDHLAILEQAVDVYASELERFLVFIREVFNNAPFFIPADVRKGDSYKEISVPAGKTYE
VSLSVESLNSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCIAGDYKLIWDNTYSTFFKKVVRYKVDCIPPVVEPLQTAAEED