CuGenDBv2

HG10020115 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020115
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Iron-sulfur cluster biosynthesis family protein
Genome location: Chr04:28917238..28921890
RNA-Seq Expression: HG10020115
Synteny: HG10020115
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR044200 - Uncharacterized protein At5g03900-like
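
The genome location string above can be split into its parts programmatically. The following is a minimal Python sketch and is not part of CuGenDB; the names Location and parse_location are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'Chr:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr04:28917238..28921890'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end.
        start, end = end, start
    return Location(chrom, start, end)


loc = parse_location("Chr04:28917238..28921890")
print(loc.chrom, loc.start, loc.end, loc.length)  # Chr04 28917238 28921890 4653
```

Under the inclusive-coordinate assumption the gene spans 4653 bp, which is longer than the 1518-nt CDS listed under Sequences.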


Homology
GenBank top hits (e value, % identity, alignment)
XP_004149205.1 uncharacterized protein At5g03900, chloroplastic [Cucumis sativus] (e value: 1.7e-268; % identity: 93.6)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MA+IST FAISQSSR YFHPLITLKPSICVKPS ITFPAL TRIAPP+SRARG    VRAGIDIPSDIRPG+ VESDKLPSDVRKR MEAV+ACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADT+GFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRR RSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYI+SNGGVV AEELAPYLDVSERN
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGGIEK FKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFI+QKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQ+TVIGQDRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

XP_008442842.1 PREDICTED: uncharacterized protein At5g03900, chloroplastic [Cucumis melo] (e value: 7.1e-270; % identity: 93.8)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MA+ISTCFAISQSSR YFHPLITLKPSICVKPSPITFPAL TRIAPP+SR+RG   +VRAGIDIPSDIRPGS VESDKLPSDVRKR MEAV+ACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADT+GFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRR RSYDSGFTFY SPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYI+SNGGVVTAEELAPYLDVSERN
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGGIEK FKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQ+TVI  DRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

XP_022144706.1 uncharacterized protein At5g03900, chloroplastic isoform X1 [Momordica charantia] (e value: 1.9e-267; % identity: 92.64)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MATISTCFAISQSSRFYFHPLITLKPSI +KP P+TFPA+QTRI+PP+SR RG   VVRAGIDIPSDIRPG+AVESDKLPSDVRKRAMEAVDACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIK EPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDV E N
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
         DDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGG+EK FKEKKWVFSKT++SERAMAIGLGGLNLFGVI+LG MLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVA+KPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRN+ARQKRAQALELPDV+LRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQ+Y
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

XP_022925696.1 uncharacterized protein At5g03900-like [Cucurbita moschata] (e value: 3.3e-267; % identity: 93.22)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MATIST FAISQSSRFYFHPLITLKPSIC+KPSP+TFPA+ TRI+P D RARG   VVRAGIDIPSDIRPG +VESDKLPSDVRKRAMEAVDACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRRGRSYDSGFTFYLSP DLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDV+E N
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGGIEK FKEK+WVFSKTS SERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFF IPLVRWFI QKRNAEIGKRN+ARQKRAQALELPDV+LRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

XP_038904835.1 uncharacterized protein At5g03900, chloroplastic isoform X1 [Benincasa hispida] (e value: 2.2e-271; % identity: 94.96)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MATISTCFAISQSSRFYFHPLITLKPSI VKPS ITFPALQTRIA P+SRARGL  VVRAGI+IPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIAL+SSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRRGRSYDSGFTFY SPTDLFWYWDPYYYRRR+LQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVS+ N
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGG+EK FKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LE13 Uncharacterized protein (e value: 8.4e-269; % identity: 93.6)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MA+IST FAISQSSR YFHPLITLKPSICVKPS ITFPAL TRIAPP+SRARG    VRAGIDIPSDIRPG+ VESDKLPSDVRKR MEAV+ACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADT+GFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRR RSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYI+SNGGVV AEELAPYLDVSERN
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGGIEK FKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFI+QKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQ+TVIGQDRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

A0A1S3B7F8 uncharacterized protein At5g03900, chloroplastic (e value: 3.4e-270; % identity: 93.8)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MA+ISTCFAISQSSR YFHPLITLKPSICVKPSPITFPAL TRIAPP+SR+RG   +VRAGIDIPSDIRPGS VESDKLPSDVRKR MEAV+ACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADT+GFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRR RSYDSGFTFY SPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYI+SNGGVVTAEELAPYLDVSERN
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGGIEK FKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQ+TVI  DRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

A0A6J1CU13 uncharacterized protein At5g03900, chloroplastic isoform X1 (e value: 9.3e-268; % identity: 92.64)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MATISTCFAISQSSRFYFHPLITLKPSI +KP P+TFPA+QTRI+PP+SR RG   VVRAGIDIPSDIRPG+AVESDKLPSDVRKRAMEAVDACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIK EPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDV E N
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
         DDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGG+EK FKEKKWVFSKT++SERAMAIGLGGLNLFGVI+LG MLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVA+KPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRN+ARQKRAQALELPDV+LRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQ+Y
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

A0A6J1ECX3 uncharacterized protein At5g03900-like (e value: 1.6e-267; % identity: 93.22)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MATIST FAISQSSRFYFHPLITLKPSIC+KPSP+TFPA+ TRI+P D RARG   VVRAGIDIPSDIRPG +VESDKLPSDVRKRAMEAVDACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRRGRSYDSGFTFYLSP DLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDV+E N
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGGIEK FKEK+WVFSKTS SERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFF IPLVRWFI QKRNAEIGKRN+ARQKRAQALELPDV+LRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

A0A6J1L381 uncharacterized protein At5g03900, chloroplastic-like (e value: 7.9e-267; % identity: 93.22)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI
        MATIST FAISQSSRFYFHPLITLKPSIC+KPSP+TF A+ TRI+P D RARG   VVRAGIDIPSDIRPG +VESDKLPSDVRKRAMEAVDACGGRVTI
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTI

Query:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
        GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS
Subjt:  GDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRS

Query:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN
        EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDV+E N
Subjt:  EEDNRGRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERN

Query:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK
        TDDESY LPVLLRFDGQPEIDEE           RTASSQRSGRKEYVGRKWADWVGGIEK FKEKKWVFSKTS SERAMAIGLGGLNLFGVIVLGAMLK
Subjt:  TDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLK

Query:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
        DVAVKPSGLIKFVSDIFPLLQIYAGSFF IPLVRWFI QKRNAEIGKRN+ARQK AQALELPDV+LRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY
Subjt:  DVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNY

Query:  ELQEWERKFREIEKSD
        ELQEWERKFREIEKSD
Subjt:  ELQEWERKFREIEKSD

SwissProt top hits (e value, % identity, alignment)
Q8GW20 Uncharacterized protein At5g03900, chloroplastic (e value: 1.1e-180; % identity: 64.71)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPI---TFPALQTRIAPPDSR---ARGLTWVVRAGID-IPSDIRPGSAVESDKLPSDVRKRAMEAVDA
        MA +STC  +  S R     L + KP +    SP+   +FP + T       R     G+  V  A +D +   I+PG  VESDKLP+DVRKRAM+AVD 
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPI---TFPALQTRIAPPDSR---ARGLTWVVRAGID-IPSDIRPGSAVESDKLPSDVRKRAMEAVDA

Query:  CGGRVTIGDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTII
        CG RVT+GDVASR GLK+ EAQ ALQA+AADT+GFLEVSDEGDVLYVFP+DYR+KLAAKS  I+ EP +EK+K A +YL RVSFGTALIASIV+VYT+II
Subjt:  CGGRVTIGDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTII

Query:  ALISSRSEEDNR-GRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNK-MNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELA
        AL+SS+SE+DNR  RRGRSYDSGF FY++P DL WYWDP YY RRR + ++ K MNFIES+FSFVFGDGDPNQGIEEERW++IGQYITS GGVV A+ELA
Subjt:  ALISSRSEEDNR-GRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNK-MNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELA

Query:  PYLDV--SERNTDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNL
        PYLDV  S+   +DESY LPVLLRFDGQPE+DEE           RTAS   S RKEYVG KW DWV  +EK FKEKKW FSKTS SERA+ IGLG +NL
Subjt:  PYLDV--SERNTDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNL

Query:  FGVIVLGAMLKDVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVY
        FGVIVL  +L +++V+P G + FV +I+PLLQIYAGSFFTIPL+RWF I+++N +I  RN+AR + A+ALE PD+ LRRKLLSARDMAQ+TVIG+DRIVY
Subjt:  FGVIVLGAMLKDVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVY

Query:  STDRDLIEQNYELQEWERKFREIEKSD
        STDRD++EQNYE  EW+R+F+E+EKSD
Subjt:  STDRDLIEQNYELQEWERKFREIEKSD
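
Assuming the alignments follow the usual BLAST-style convention, the middle line of each block repeats the residue letter where query and hit are identical and shows "+" for a conservative substitution. As a rough illustration, the Python sketch below counts both for a single block. The function name block_stats is hypothetical, and because it covers only one block, the result will not reproduce the whole-alignment % identity reported for each hit.

```python
from typing import Tuple


def block_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Hypothetical helper: assumes the midline repeats the residue letter
    for an identity, uses '+' for a conservative substitution, and a
    space for a mismatch or gap column.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the Q8GW20 alignment, copied from the lines above.
query = "STDRDLIEQNYELQEWERKFREIEKSD"
midline = "STDRD++EQNYE  EW+R+F+E+EKSD"
print(block_stats(query, midline))  # (19, 25, 27)
```

The midline must keep its internal spaces exactly as shown, since each character position corresponds to one alignment column.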

Arabidopsis top hits (e value, % identity, alignment)
AT5G03900.1 Iron-sulphur cluster biosynthesis family protein (e value: 8.6e-141; % identity: 63.05)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPI---TFPALQTRIAPPDSR---ARGLTWVVRAGID-IPSDIRPGSAVESDKLPSDVRKRAMEAVDA
        MA +STC  +  S R     L + KP +    SP+   +FP + T       R     G+  V  A +D +   I+PG  VESDKLP+DVRKRAM+AVD 
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPI---TFPALQTRIAPPDSR---ARGLTWVVRAGID-IPSDIRPGSAVESDKLPSDVRKRAMEAVDA

Query:  CGGRVTIGDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTII
        CG RVT+GDVASR GLK+ EAQ ALQA+AADT+GFLEVSDEGDVLYVFP+DYR+KLAAKS  I+ EP +EK+K A +YL RVSFGTALIASIV+VYT+II
Subjt:  CGGRVTIGDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTII

Query:  ALISSRSEEDNR-GRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNK-MNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELA
        AL+SS+SE+DNR  RRGRSYDSGF FY++P DL WYWDP YY RRR + ++ K MNFIES+FSFVFGDGDPNQGIEEERW++IGQYITS GGVV A+ELA
Subjt:  ALISSRSEEDNR-GRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNK-MNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELA

Query:  PYLDV--SERNTDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNL
        PYLDV  S+   +DESY LPVLLRFDGQPE+DEE           RTAS   S RKEYVG KW DWV  +EK FKEKKW FSKTS SERA+ IGLG +NL
Subjt:  PYLDV--SERNTDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNL

Query:  FGVIVLGAMLKDVAVKPSGLIKFVSDIFPLLQI
        FGVIVL  +L +++V+P G + FV +I+PLLQ+
Subjt:  FGVIVLGAMLKDVAVKPSGLIKFVSDIFPLLQI

AT5G03900.2 Iron-sulphur cluster biosynthesis family protein (e value: 7.7e-182; % identity: 64.71)
Query:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPI---TFPALQTRIAPPDSR---ARGLTWVVRAGID-IPSDIRPGSAVESDKLPSDVRKRAMEAVDA
        MA +STC  +  S R     L + KP +    SP+   +FP + T       R     G+  V  A +D +   I+PG  VESDKLP+DVRKRAM+AVD 
Subjt:  MATISTCFAISQSSRFYFHPLITLKPSICVKPSPI---TFPALQTRIAPPDSR---ARGLTWVVRAGID-IPSDIRPGSAVESDKLPSDVRKRAMEAVDA

Query:  CGGRVTIGDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTII
        CG RVT+GDVASR GLK+ EAQ ALQA+AADT+GFLEVSDEGDVLYVFP+DYR+KLAAKS  I+ EP +EK+K A +YL RVSFGTALIASIV+VYT+II
Subjt:  CGGRVTIGDVASRAGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTII

Query:  ALISSRSEEDNR-GRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNK-MNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELA
        AL+SS+SE+DNR  RRGRSYDSGF FY++P DL WYWDP YY RRR + ++ K MNFIES+FSFVFGDGDPNQGIEEERW++IGQYITS GGVV A+ELA
Subjt:  ALISSRSEEDNR-GRRGRSYDSGFTFYLSPTDLFWYWDPYYYRRRRLQTEDNK-MNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELA

Query:  PYLDV--SERNTDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNL
        PYLDV  S+   +DESY LPVLLRFDGQPE+DEE           RTAS   S RKEYVG KW DWV  +EK FKEKKW FSKTS SERA+ IGLG +NL
Subjt:  PYLDV--SERNTDDESYTLPVLLRFDGQPEIDEE-----------RTASSQRSGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNL

Query:  FGVIVLGAMLKDVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVY
        FGVIVL  +L +++V+P G + FV +I+PLLQIYAGSFFTIPL+RWF I+++N +I  RN+AR + A+ALE PD+ LRRKLLSARDMAQ+TVIG+DRIVY
Subjt:  FGVIVLGAMLKDVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEARQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVY

Query:  STDRDLIEQNYELQEWERKFREIEKSD
        STDRD++EQNYE  EW+R+F+E+EKSD
Subjt:  STDRDLIEQNYELQEWERKFREIEKSD


Sequences
CDS sequence
ATGGCGACAATCTCTACTTGTTTCGCCATTTCCCAGAGTTCCCGCTTCTATTTCCACCCTCTCATCACCCTTAAACCCTCAATTTGCGTAAAGCCGTCTCCCATTACTTT
TCCGGCCCTGCAGACGCGGATTGCGCCGCCGGATTCTAGGGCAAGGGGTTTGACTTGGGTTGTAAGAGCGGGCATTGACATTCCTTCCGATATCAGACCTGGAAGTGCGG
TGGAGAGCGATAAATTGCCGTCGGATGTGAGAAAGAGAGCGATGGAAGCTGTGGACGCGTGTGGAGGAAGAGTGACCATTGGAGACGTTGCAAGCAGGGCGGGACTTAAG
CTCAACGAAGCTCAGAAGGCTTTGCAGGCTCTAGCTGCTGATACTAATGGTTTTTTGGAGGTTTCTGATGAGGGCGACGTTCTCTACGTTTTCCCCAAAGATTATCGTTC
AAAGCTCGCTGCCAAATCGTTTTGGATCAAGTTTGAACCTCTTATAGAGAAGTCGAAGGCCGCTGCCGAATATCTTGTCAGGGTTTCATTTGGAACGGCACTAATTGCTT
CAATCGTTCTTGTATATACTACAATTATTGCTCTAATTTCAAGCAGAAGTGAAGAGGATAATCGTGGAAGACGCGGCAGGTCATATGATTCAGGATTCACGTTCTATTTA
AGTCCAACTGATCTTTTCTGGTACTGGGATCCATACTACTATAGGAGACGCCGACTTCAAACAGAAGATAATAAGATGAACTTCATTGAATCTATTTTCTCATTTGTTTT
TGGCGATGGTGATCCGAATCAAGGAATTGAAGAAGAGAGATGGAAGTTGATTGGGCAATACATCACCTCCAATGGTGGTGTCGTAACTGCTGAAGAACTTGCACCATATC
TGGATGTGTCAGAGAGGAACACGGATGATGAGTCATACACTTTACCAGTTCTTTTACGGTTTGATGGCCAACCTGAAATTGATGAAGAGCGCACAGCTTCTTCTCAGCGG
AGTGGAAGGAAGGAGTATGTGGGTAGAAAATGGGCTGACTGGGTTGGAGGGATTGAAAAAAATTTCAAAGAGAAGAAATGGGTCTTTAGTAAAACAAGTAATTCAGAGAG
AGCAATGGCCATTGGATTGGGAGGGCTTAATCTGTTTGGTGTTATAGTCCTCGGAGCCATGTTGAAGGATGTTGCTGTTAAACCAAGTGGACTTATTAAATTTGTATCAG
ATATATTTCCTCTACTGCAGATATACGCTGGTTCTTTCTTCACAATTCCACTGGTCCGTTGGTTCATTATCCAAAAGAGAAATGCTGAAATAGGAAAACGAAACGAAGCA
AGGCAAAAGCGAGCTCAAGCCCTTGAACTGCCAGACGTGACACTCAGACGCAAGCTTCTCAGTGCTCGAGACATGGCGCAAAGAACTGTAATTGGTCAGGATCGAATTGT
ATATAGTACTGATCGAGATTTAATCGAGCAAAATTATGAGCTCCAAGAATGGGAAAGGAAGTTTCGAGAGATAGAAAAATCAGATTAG
mRNA sequence
ATGGCGACAATCTCTACTTGTTTCGCCATTTCCCAGAGTTCCCGCTTCTATTTCCACCCTCTCATCACCCTTAAACCCTCAATTTGCGTAAAGCCGTCTCCCATTACTTT
TCCGGCCCTGCAGACGCGGATTGCGCCGCCGGATTCTAGGGCAAGGGGTTTGACTTGGGTTGTAAGAGCGGGCATTGACATTCCTTCCGATATCAGACCTGGAAGTGCGG
TGGAGAGCGATAAATTGCCGTCGGATGTGAGAAAGAGAGCGATGGAAGCTGTGGACGCGTGTGGAGGAAGAGTGACCATTGGAGACGTTGCAAGCAGGGCGGGACTTAAG
CTCAACGAAGCTCAGAAGGCTTTGCAGGCTCTAGCTGCTGATACTAATGGTTTTTTGGAGGTTTCTGATGAGGGCGACGTTCTCTACGTTTTCCCCAAAGATTATCGTTC
AAAGCTCGCTGCCAAATCGTTTTGGATCAAGTTTGAACCTCTTATAGAGAAGTCGAAGGCCGCTGCCGAATATCTTGTCAGGGTTTCATTTGGAACGGCACTAATTGCTT
CAATCGTTCTTGTATATACTACAATTATTGCTCTAATTTCAAGCAGAAGTGAAGAGGATAATCGTGGAAGACGCGGCAGGTCATATGATTCAGGATTCACGTTCTATTTA
AGTCCAACTGATCTTTTCTGGTACTGGGATCCATACTACTATAGGAGACGCCGACTTCAAACAGAAGATAATAAGATGAACTTCATTGAATCTATTTTCTCATTTGTTTT
TGGCGATGGTGATCCGAATCAAGGAATTGAAGAAGAGAGATGGAAGTTGATTGGGCAATACATCACCTCCAATGGTGGTGTCGTAACTGCTGAAGAACTTGCACCATATC
TGGATGTGTCAGAGAGGAACACGGATGATGAGTCATACACTTTACCAGTTCTTTTACGGTTTGATGGCCAACCTGAAATTGATGAAGAGCGCACAGCTTCTTCTCAGCGG
AGTGGAAGGAAGGAGTATGTGGGTAGAAAATGGGCTGACTGGGTTGGAGGGATTGAAAAAAATTTCAAAGAGAAGAAATGGGTCTTTAGTAAAACAAGTAATTCAGAGAG
AGCAATGGCCATTGGATTGGGAGGGCTTAATCTGTTTGGTGTTATAGTCCTCGGAGCCATGTTGAAGGATGTTGCTGTTAAACCAAGTGGACTTATTAAATTTGTATCAG
ATATATTTCCTCTACTGCAGATATACGCTGGTTCTTTCTTCACAATTCCACTGGTCCGTTGGTTCATTATCCAAAAGAGAAATGCTGAAATAGGAAAACGAAACGAAGCA
AGGCAAAAGCGAGCTCAAGCCCTTGAACTGCCAGACGTGACACTCAGACGCAAGCTTCTCAGTGCTCGAGACATGGCGCAAAGAACTGTAATTGGTCAGGATCGAATTGT
ATATAGTACTGATCGAGATTTAATCGAGCAAAATTATGAGCTCCAAGAATGGGAAAGGAAGTTTCGAGAGATAGAAAAATCAGATTAG
Protein sequence
MATISTCFAISQSSRFYFHPLITLKPSICVKPSPITFPALQTRIAPPDSRARGLTWVVRAGIDIPSDIRPGSAVESDKLPSDVRKRAMEAVDACGGRVTIGDVASRAGLK
LNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVSFGTALIASIVLVYTTIIALISSRSEEDNRGRRGRSYDSGFTFYL
SPTDLFWYWDPYYYRRRRLQTEDNKMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYITSNGGVVTAEELAPYLDVSERNTDDESYTLPVLLRFDGQPEIDEERTASSQR
SGRKEYVGRKWADWVGGIEKNFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLKDVAVKPSGLIKFVSDIFPLLQIYAGSFFTIPLVRWFIIQKRNAEIGKRNEA
RQKRAQALELPDVTLRRKLLSARDMAQRTVIGQDRIVYSTDRDLIEQNYELQEWERKFREIEKSD
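
The CDS and protein sequences listed above can be checked against each other by translating codons. The following is a minimal Python sketch that assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; translate is a hypothetical helper. To keep the example short, only the first 30 and the last 9 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (assumed here; the page does not say which table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # Codons with unexpected characters are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


cds_start = "ATGGCGACAATCTCTACTTGTTTCGCCATT"  # first 30 nt of the CDS
cds_end = "TCAGATTAG"                         # last 9 nt of the CDS
print(translate(cds_start))  # MATISTCFAI
print(translate(cds_end))    # SD*
```

Both fragments agree with the protein sequence, which begins with MATISTCFAI and ends with SD; the final TAG codon is the stop.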