CuGenDBv2

Sed0017962 (gene) of Chayote v1 genome

Gene ID: Sed0017962
Organism: Sechium edule (Chayote v1)
Description: GOLD domain-containing protein
Genome location: LG03:20220391..20246315
RNA-Seq Expression: Sed0017962
Synteny: Sed0017962
Gene Ontology terms: NA
InterPro domains: IPR009038 - GOLD domain
                  IPR036598 - GOLD domain superfamily
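The genome location above can be split into its parts with a few lines of Python. This is a minimal sketch, not part of the CuGenDB record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Pattern for strings such as "LG03:20220391..20246315"
# (sequence ID, colon, start, two dots, end).
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(location: str) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1


if __name__ == "__main__":
    loc = "LG03:20220391..20246315"
    print(parse_location(loc))
    print(span_length(loc), "bp (under the 1-based inclusive assumption)")
```

Under that assumption the gene spans considerably more sequence than the mRNA listed further down, which is what one would expect if the locus contains introns.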


Homology
GenBank top hits | e value | % identity | Alignment
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus] | 6.6e-224 | 89.91
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYY+ YPFTPLSD VSRLS +MLA AN+LLDEL PTSEE TL DEAN HPPHKIDENMWKNRENVEEILFL EKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATG+SELAN +GKL++K +N L+ LVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDALINSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KL+EHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFISADVACAA+ RKSDSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQGKMNMDIGFSVECESPGG K LILPHKRYE+DQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo] | 2.7e-225 | 90.6
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYY+ YPF PLSD VSRLS +MLA AN+LLDEL PTSEE TL DEAN HPPHKIDENMWKNRENVEEILFLLEKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATGKSELAN +GKL++K +N L+VLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDALINSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KL+EHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFISADVACAA+ RKSDSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYE+DQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_022924994.1 uncharacterized protein LOC111432376 isoform X1 [Cucurbita moschata] | 2.5e-223 | 89.91
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYYE YPFTPLSDD+SRLS +MLA AN LLDEL PT EE TL DEANH PPHKIDENMWKNRENVEEILFLLEKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATG+SELAN LGKL++K+KNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDAL+NSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRLFISLSWRRFDARKL++HL ILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSL+VES+NSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYE+DQGNFCTCIAGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_022966319.1 uncharacterized protein LOC111466012 isoform X1 [Cucurbita maxima] | 6.6e-224 | 90.14
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYYE YPFTPLSDD+SRLS +MLASAN LLDEL PT EE TLFDEANH PPHKIDENMWKNRENVEEILFLLEKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATG+SELAN LGKL++K+KNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDAL+NSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLI+LFISLSWRRFDARKL++HL ILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVS++VESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYE+DQGNFCTCIAGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_038881964.1 uncharacterized protein LOC120073287 isoform X1 [Benincasa hispida] | 2.3e-224 | 90.8
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYY+ YPFTPL D VSRLS +MLA AN+LLDEL PTSEE  L DEAN HPPHKIDENMWKNRENVEEILFLLEKS WP+EVQ E
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATG+SELAN +GKL++K++NTL+ LVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDALINSGGSIRDRYALLWNQQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA K +EHLVILEQAVDVYTSELERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFISADVACAAD RKSDSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYE+DQGNFCTCIAGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVE
        KL+WDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L040 GOLD domain-containing protein | 3.2e-224 | 89.91
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYY+ YPFTPLSD VSRLS +MLA AN+LLDEL PTSEE TL DEAN HPPHKIDENMWKNRENVEEILFL EKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATG+SELAN +GKL++K +N L+ LVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDALINSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KL+EHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFISADVACAA+ RKSDSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQGKMNMDIGFSVECESPGG K LILPHKRYE+DQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X1 | 1.3e-225 | 90.6
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYY+ YPF PLSD VSRLS +MLA AN+LLDEL PTSEE TL DEAN HPPHKIDENMWKNRENVEEILFLLEKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATGKSELAN +GKL++K +N L+VLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDALINSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KL+EHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFISADVACAA+ RKSDSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYE+DQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein | 1.3e-225 | 90.6
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYY+ YPF PLSD VSRLS +MLA AN+LLDEL PTSEE TL DEAN HPPHKIDENMWKNRENVEEILFLLEKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATGKSELAN +GKL++K +N L+VLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDALINSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KL+EHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFISADVACAA+ RKSDSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYE+DQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X1 | 1.2e-223 | 89.91
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYYE YPFTPLSDD+SRLS +MLA AN LLDEL PT EE TL DEANH PPHKIDENMWKNRENVEEILFLLEKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATG+SELAN LGKL++K+KNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDAL+NSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRLFISLSWRRFDARKL++HL ILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVSL+VES+NSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYE+DQGNFCTCIAGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A6J1HTF5 uncharacterized protein LOC111466012 isoform X1 | 3.2e-224 | 90.14
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MASMEGLVPITR FLASYYE YPFTPLSDD+SRLS +MLASAN LLDEL PT EE TLFDEANH PPHKIDENMWKNRENVEEILFLLEKS WP+EVQKE
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        SATG+SELAN LGKL++K+KNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQA VDAL+NSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLI+LFISLSWRRFDARKL++HL ILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY
        FNNAPFFI ADV      RK DSYKEISVPAGKTYEVS++VESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYE+DQGNFCTCIAGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink). | 4.6e-167 | 67.44
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MAS EGL+PITR FLASYY+ YPF+PLSDDVSRLS+ M +    L  +  P+  E +L DEAN  PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        S +  +E A+ L  L     N    +++FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQA VDAL++SGGSIRD YALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL K+ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGD
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYEADQGNF T +AG+
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGD

Query:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
        YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | 5.0e-161 | 61.57
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MAS EGL+PITR FLASYY+ YPF+PLSDDVSRLS+ M +    L  +  P+  E +L DEAN  PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        S +  +E A+ L  L     N    +++FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQA VDAL++SGGSIRD YALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL K+ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP++RYEADQGNF T +AG+YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | 1.9e-160 | 66.83
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MAS EGL+PITR FLASYY+ YPF+PLSDDVSRLS+ M +    L  +  P+  E +L DEAN  PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        S +  +E A+ L  L     N    +++FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQA VDAL++SGGSIRD YALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL K+ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGD
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYEADQGNF T +AG+
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGD

Query:  YKLIWDNTYSTFFKKVLRY
        YKL+WDN+YSTFFKKV RY
Subjt:  YKLIWDNTYSTFFKKVLRY

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038). | 1.5e-154 | 60.08
Query:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE
        MAS EGL+PITR FLASYY+ YPF+PLSDDVSRLS+ M +    L  +  P+  E +L DEAN  PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKE

Query:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL
        S +  +E A+ L  L     N    +++FQ+KNSE +F+T       DFRGT+IRQQ+ERSERNKQA VDAL++SGGSIRD YALLW QQMERRRQLAQL
Subjt:  SATGKSELANTLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL K+ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-KEHLVILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEISVPAGKTYEVSLTVESINSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP++RYEADQGNF T +AG+YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP


Sequences
CDS sequence
ATGGCTTCCATGGAGGGTCTGGTGCCTATAACCAGGCGTTTCCTGGCTTCCTACTATGAAAACTACCCATTTACGCCTCTATCCGACGATGTCTCTCGCCTTTCCGCCCA
GATGCTCGCCTCGGCCAACACCTTGCTCGATGAACTCTCGCCCACTTCAGAGGAAGTCACCCTTTTTGATGAAGCAAACCATCATCCTCCTCATAAAATTGATGAGAACA
TGTGGAAGAATCGGGAAAATGTGGAAGAAATCCTGTTTCTACTTGAAAAATCTCATTGGCCTCGAGAGGTTCAAAAGGAGTCTGCAACTGGAAAATCTGAACTTGCTAAT
ACTCTTGGAAAGCTAGATAAAAAAATGAAGAATACCTTAAATGTGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCATGTATTCAACACAGTTATGACCTACATGCCTCA
AGATTTTCGAGGAACTATTATTCGACAGCAAAGGGAGCGATCGGAGAGGAATAAGCAAGCAGTGGTTGATGCTTTGATTAATTCTGGAGGAAGCATACGTGATCGATATG
CTCTCTTATGGAACCAACAGATGGAAAGGAGGAGACAGTTAGCACAACTGGGTTCTGCTACAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTA
TTGCTAGAATTCATTCAAAAAATAAATGATGATGATGGGCCAATGGAAGAGCAACGGCAACGCTATGGACCACCTTTATATGAACTTACAACAATGGTCCGCCTTATTCG
ACTCTTTATTTCATTGTCATGGAGACGCTTTGATGCTAGGAAACTAAAGGAGCATCTTGTTATTTTGGAGCAAGCTGTTGATGTGTATACCTCTGAGCTTGAGAGGTTCC
TCGTCTTCATTCGCGAGGTCTTCAACAATGCACCATTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGGAAGGAAAAGTGATAGCTACAAGGAGATTAGTGTTCCA
GCTGGGAAAACTTATGAGGTTTCATTAACTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTCTCGTTGGTTCAAGGCAAGATGAATATGGATATTGGATTCAGCGT
GGAGTGTGAAAGTCCTGGAGGAGGAAAGACTTTGATATTGCCGCACAAACGTTATGAGGCTGATCAGGGAAATTTCTGCACTTGCATCGCTGGGGACTACAAGCTGATTT
GGGACAATACATATTCAACTTTTTTTAAGAAGGTTTTGCGCTATAAGGTCGATTGCATACCTCCTGTGGTAGAGCCGTTGCAGCCCGCTGTAGAAGATTGA
mRNA sequence
AAAAAAGAAAAAAAGAAAAACGGACGGAAAAGGAAGAAAATATAGCCGACAGATCGATAAGAAGATATTAGAATGGGTCGTTGACGACGAATCGAACGAAGAGCAGTAGC
TTAGCGGCGTGGAGCCGTCTTTTTGTTGGTCTACCAATTATTCTGACGTTCAAAGTCTACCCGAAGAAGAAGAACTCAGAGAATTTCTTCAACTGTTCAAGTTTTCCAGT
TCGCTGATTTGCTTTCTCCTTACGCTACCAATCGATCGGGGAAGAATTTTTCAGAGAGTATTTGAACTGAAAGATTTCGATTCGGCGAAAATGGCTTCCATGGAGGGTCT
GGTGCCTATAACCAGGCGTTTCCTGGCTTCCTACTATGAAAACTACCCATTTACGCCTCTATCCGACGATGTCTCTCGCCTTTCCGCCCAGATGCTCGCCTCGGCCAACA
CCTTGCTCGATGAACTCTCGCCCACTTCAGAGGAAGTCACCCTTTTTGATGAAGCAAACCATCATCCTCCTCATAAAATTGATGAGAACATGTGGAAGAATCGGGAAAAT
GTGGAAGAAATCCTGTTTCTACTTGAAAAATCTCATTGGCCTCGAGAGGTTCAAAAGGAGTCTGCAACTGGAAAATCTGAACTTGCTAATACTCTTGGAAAGCTAGATAA
AAAAATGAAGAATACCTTAAATGTGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCATGTATTCAACACAGTTATGACCTACATGCCTCAAGATTTTCGAGGAACTATTA
TTCGACAGCAAAGGGAGCGATCGGAGAGGAATAAGCAAGCAGTGGTTGATGCTTTGATTAATTCTGGAGGAAGCATACGTGATCGATATGCTCTCTTATGGAACCAACAG
ATGGAAAGGAGGAGACAGTTAGCACAACTGGGTTCTGCTACAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTATTGCTAGAATTCATTCAAAA
AATAAATGATGATGATGGGCCAATGGAAGAGCAACGGCAACGCTATGGACCACCTTTATATGAACTTACAACAATGGTCCGCCTTATTCGACTCTTTATTTCATTGTCAT
GGAGACGCTTTGATGCTAGGAAACTAAAGGAGCATCTTGTTATTTTGGAGCAAGCTGTTGATGTGTATACCTCTGAGCTTGAGAGGTTCCTCGTCTTCATTCGCGAGGTC
TTCAACAATGCACCATTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGGAAGGAAAAGTGATAGCTACAAGGAGATTAGTGTTCCAGCTGGGAAAACTTATGAGGT
TTCATTAACTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTCTCGTTGGTTCAAGGCAAGATGAATATGGATATTGGATTCAGCGTGGAGTGTGAAAGTCCTGGAG
GAGGAAAGACTTTGATATTGCCGCACAAACGTTATGAGGCTGATCAGGGAAATTTCTGCACTTGCATCGCTGGGGACTACAAGCTGATTTGGGACAATACATATTCAACT
TTTTTTAAGAAGGTTTTGCGCTATAAGGTCGATTGCATACCTCCTGTGGTAGAGCCGTTGCAGCCCGCTGTAGAAGATTGAAGTTCTTCCAGCGGCCACGGACAAGGGTT
TCTAGATTGTAACATACATTCATTGTCTTTGTAGTTTCTTCATCTCAAAGTTTGTTTGCATGTGGAGATCTTGTAAACTATGCCAATTAACCAAAAGGAAGCAATCGCTG
TAATTATGAATAGAACCATTCAGATTATGAAACGCTTGTTTTTATGCCTCTTGAAAGTGCTTTTAATCCCTTCTAATTAGTAATTGATAAAGCACTTTTGTTTTTACTCT
TCACTACATTGTTTTGAATGCCATTTTAAAATTTCAATCGAAAAGACTGCTCTTTGGTGAACATGGAAACAAGGTCTCATTGTAATCTTTTTGTACAATAAAACACCGTT
C
Protein sequence
MASMEGLVPITRRFLASYYENYPFTPLSDDVSRLSAQMLASANTLLDELSPTSEEVTLFDEANHHPPHKIDENMWKNRENVEEILFLLEKSHWPREVQKESATGKSELAN
TLGKLDKKMKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAVVDALINSGGSIRDRYALLWNQQMERRRQLAQLGSATGVYKTLVKYLVGVPEV
LLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLKEHLVILEQAVDVYTSELERFLVFIREVFNNAPFFISADVACAADGRKSDSYKEISVP
AGKTYEVSLTVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYEADQGNFCTCIAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
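The relationship between the CDS and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch using only the Python standard library. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear plant gene but is not stated in the record, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence copied as
    wrapped lines can be passed in directly. Codons containing symbols
    other than A, C, G, T are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS listed above.
    print(translate("ATGGCTTCCATGGAGGGTCTGGTGCCTATA"))
```

Applied to the first ten codons of the CDS, the sketch yields the same residues that begin the protein sequence (MASMEGLVPI), and the final codons GAA GAT TGA give the terminal ED followed by a stop.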