CuGenDBv2

Sgr026686 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr026686
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: GOLD domain-containing protein
Genome location: tig00153033:2556095..2593564
RNA-Seq Expression: Sgr026686
Synteny: Sgr026686
Gene Ontology terms: NA
InterPro domains: IPR036598 - GOLD domain superfamily
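The genome location field packs a contig name and a coordinate range into one string. The following is a minimal sketch of splitting it and computing the span of the locus. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end).

    Hypothetical helper; the 'contig:start..end' layout is taken from the
    Genome location field of this record.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


contig, start, end = parse_location("tig00153033:2556095..2593564")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(contig, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 37470 bp on contig tig00153033.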


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus] (e value: 1.9e-192; % identity: 87.59)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKYPFTPLS  VSRLSTEML +ANSLLDELPPT+EESTLLDEA +HPPHKIDENMWKNRENVEEILFL EKS WP+EVQ++
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+GESELA I+GKLEEK RN+L+ LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDA K+ EHL ILEQAVDVYTSE+ERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFISADVA AA+ RKSDSYKEISVPAGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+E ESPGG K LILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo] (e value: 1.9e-192; % identity: 87.59)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKYPF PLS  VSRLSTEML +ANSLLDELPPT+EESTLLDEA +HPPHKIDENMWKNRENVEEILFLLEKS WP+EVQ++
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+G+SELA I+GKLEEK RN+L+ LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDA K+ EHL ILEQAVDVYTSE+ERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFISADVA AA+ RKSDSYKEISVPAGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+E ESPGG KTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

XP_022156250.1 uncharacterized protein LOC111023184 [Momordica charantia] (e value: 8.1e-199; % identity: 90.63)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKY FTPLS DVSRLSTEML MANSLLDELPPT+EESTLLDEA RHPPHKIDENMWKNRENVEEIL LLEKS+WP+EVQQQ
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+GESEL IILGKLEEKFRNSLN L VFQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIR++NDDDGPMEEQRQRYGPPLYNLTTMVLLIRL ISLSWRRFDARK+SEHLAILEQAVDVYTSELERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNNSPFFISADVA AAD R ++SYKEISV AGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+EYESPGGEKTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

XP_022924994.1 uncharacterized protein LOC111432376 isoform X1 [Cucurbita moschata] (e value: 1.3e-188; % identity: 86.08)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYY+KYPFTPLS D+SRLSTEML +AN LLDELPPT EESTLLDEA   PPHKIDENMWKNRENVEEILFLLEKS WP+EVQ++
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+GESELA ILGKLEEK +N+L  LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DAL+NSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRLFISLSWRRFDARK+ +HLAILEQAVDVY SELERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFI ADV      RK DSYKEISVPAGKTYEVS SVES+NSYIAWDFSLVQGKMNMDIGFS+E ESPGG KTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

XP_038881964.1 uncharacterized protein LOC120073287 isoform X1 [Benincasa hispida] (e value: 1.1e-190; % identity: 87.85)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKYPFTPL   VSRLSTEML +ANSLLDELPPT+EES LLDEA +HPPHKIDENMWKNRENVEEILFLLEKS WP+EV Q+
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+GESELA I+GKLEEK RN+L+ LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRYALLW QQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDA K  EHL ILEQAVDVYTSELERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFISADVA AAD RKSDSYKEISVPAGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+E ESPGG KTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L040 GOLD domain-containing protein (e value: 9.4e-193; % identity: 87.59)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKYPFTPLS  VSRLSTEML +ANSLLDELPPT+EESTLLDEA +HPPHKIDENMWKNRENVEEILFL EKS WP+EVQ++
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+GESELA I+GKLEEK RN+L+ LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDA K+ EHL ILEQAVDVYTSE+ERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFISADVA AA+ RKSDSYKEISVPAGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+E ESPGG K LILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X1 (e value: 9.4e-193; % identity: 87.59)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKYPF PLS  VSRLSTEML +ANSLLDELPPT+EESTLLDEA +HPPHKIDENMWKNRENVEEILFLLEKS WP+EVQ++
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+G+SELA I+GKLEEK RN+L+ LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDA K+ EHL ILEQAVDVYTSE+ERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFISADVA AA+ RKSDSYKEISVPAGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+E ESPGG KTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein (e value: 9.4e-193; % identity: 87.59)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKYPF PLS  VSRLSTEML +ANSLLDELPPT+EESTLLDEA +HPPHKIDENMWKNRENVEEILFLLEKS WP+EVQ++
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+G+SELA I+GKLEEK RN+L+ LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRL ISLSWRRFDA K+ EHL ILEQAVDVYTSE+ERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFISADVA AA+ RKSDSYKEISVPAGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+E ESPGG KTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

A0A6J1DPR8 uncharacterized protein LOC111023184 (e value: 3.9e-199; % identity: 90.63)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYYDKY FTPLS DVSRLSTEML MANSLLDELPPT+EESTLLDEA RHPPHKIDENMWKNRENVEEIL LLEKS+WP+EVQQQ
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+GESEL IILGKLEEKFRNSLN L VFQ+KNSE+VFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DALINSGGSIRDRY LLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIR++NDDDGPMEEQRQRYGPPLYNLTTMVLLIRL ISLSWRRFDARK+SEHLAILEQAVDVYTSELERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNNSPFFISADVA AAD R ++SYKEISV AGKTYEVS SVESINSYIAWDFSLVQGKMNMDIGFS+EYESPGGEKTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X1 (e value: 6.3e-189; % identity: 86.08)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MASMEGLVPITR+FLASYY+KYPFTPLS D+SRLSTEML +AN LLDELPPT EESTLLDEA   PPHKIDENMWKNRENVEEILFLLEKS WP+EVQ++
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        SA+GESELA ILGKLEEK +N+L  LV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAE+DAL+NSGGSIRDRYALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV
        GSATGVYKTLVKYLVGVPEVLLEFI++INDDDGPMEEQRQRYGPPLY LTTMV LIRLFISLSWRRFDARK+ +HLAILEQAVDVY SELERF+ FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREV

Query:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC
        FNN+PFFI ADV      RK DSYKEISVPAGKTYEVS SVES+NSYIAWDFSLVQGKMNMDIGFS+E ESPGG KTLILPH+RYESDQ +   C
Subjt:  FNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFC

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink). (e value: 4.2e-153; % identity: 68.21)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MAS EGL+PITR FLASYYDKYPF+PLS DVSRLS++M ++   L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        S S ++E A IL  L++ F N+   ++ FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAE+DAL++SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIRQINDDDGPMEEQR+RYGPPLY+LT MV+ IR+F++L W R+D  K+S + + +L +A  VYTSE ERF+TFI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE

Query:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQ
        VF NSPFFISAD A    +R ++ YKEI V AG+TYE+S  VES NSYIAWDFSL+QGK++MDIGFS+EY +  GEKTLILP++RYE+DQ
Subjt:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQ

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). (e value: 4.5e-147; % identity: 61.72)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MAS EGL+PITR FLASYYDKYPF+PLS DVSRLS++M ++   L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        S S ++E A IL  L++ F N+   ++ FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAE+DAL++SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIRQINDDDGPMEEQR+RYGPPLY+LT MV+ IR+F++L W R+D  K+S + + +L +A  VYTSE ERF+TFI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE

Query:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNM--------------------------------------
        VF NSPFFISAD A    +R ++ YKEI V AG+TYE+S  VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSLEYESPGGEKTLILPHQRYESDQ
           DIGFS+EY +  GEKTLILP++RYE+DQ
Subjt:  ---DIGFSLEYESPGGEKTLILPHQRYESDQ

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). (e value: 4.2e-153; % identity: 68.21)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MAS EGL+PITR FLASYYDKYPF+PLS DVSRLS++M ++   L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        S S ++E A IL  L++ F N+   ++ FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAE+DAL++SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIRQINDDDGPMEEQR+RYGPPLY+LT MV+ IR+F++L W R+D  K+S + + +L +A  VYTSE ERF+TFI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE

Query:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQ
        VF NSPFFISAD A    +R ++ YKEI V AG+TYE+S  VES NSYIAWDFSL+QGK++MDIGFS+EY +  GEKTLILP++RYE+DQ
Subjt:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQ

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038). (e value: 1.4e-140; % identity: 60.09)
Query:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ
        MAS EGL+PITR FLASYYDKYPF+PLS DVSRLS++M ++   L  + PP+  E++L+DEA R PPHKIDENMWKNRE +EEILFLL  S WP ++++ 
Subjt:  MASMEGLVPITRNFLASYYDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQ

Query:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL
        S S ++E A IL  L++ F N+   ++ FQ+KNSE +F+T       DFRGT+IRQQ+ERSERNKQAE+DAL++SGGSIRD YALLWKQQMERRRQLAQL
Subjt:  SASGESELAIILGKLEEKFRNSLNTLVVFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE
        GSATGVYKTLVKYLVGVP+VLL+FIRQINDDDGPMEEQR+RYGPPLY+LT MV+ IR+F++L W R+D  K+S + + +L +A  VYTSE ERF+TFI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQRQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVS-EHLAILEQAVDVYTSELERFITFIRE

Query:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNM--------------------------------------
        VF NSPFFISAD A    +R ++ YKEI V AG+TYE+S  VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYIAWDFSLVQGKMNM--------------------------------------

Query:  ---DIGFSLEYESPGGEKTLILPHQRYESDQ
           DIGFS+EY +  GEKTLILP++RYE+DQ
Subjt:  ---DIGFSLEYESPGGEKTLILPHQRYESDQ
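In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of counting identities and positives from one such block. The function name `midline_stats` is hypothetical, and the percentage it returns covers only the fragment passed in, so it need not match the % identity reported for a whole hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, float]:
    """Count identities and positives in one BLAST-style alignment block.

    Hypothetical helper. `query` and `midline` are the aligned strings with
    the 'Query:' label and its padding already removed. Trailing spaces on
    the midline are often lost in copy/paste, so it is padded back to the
    query length before counting.
    """
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identical + midline.count("+")        # '+' = conservative substitution
    percent_identity = 100.0 * identical / len(query)
    return identical, positives, percent_identity


# Last block of the AT5G01010 alignment, leading gap columns omitted.
query = "DIGFSLEYESPGGEKTLILPHQRYESDQ"
midline = "DIGFS+EY +  GEKTLILP++RYE+DQ"

identical, positives, pct = midline_stats(query, midline)
print(identical, positives, round(pct, 2))
```

Counting from the midline rather than comparing the Query and Subjt lines directly is a deliberate choice here, since the midline is the only line in these blocks that records where the two sequences differ.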


Sequences
CDS sequence
ATGACAAGTGAAAAGTTCATTGCATACCTGATGGAATTTCCGGCTAAGAATAGCATCTCTTGCCTTCTCAGCAATCTCTATAGCCTTCATCTTTGGCTGAACATTAAAGG
TGATCCCAGAATCACTAGGAATCTCCACATATTCCTCCATTTCTGGATTAAAATATCCAGACCGGTTTCCATTCCAGAAGAAAGTAACATGACCAAATTTGACAGTCTCA
CTAGAGAATTGTCGTGTACAAGAATGAAGTTAAGTACACAGATGCAGACATTCGCTATTGCTCACCTGCAGGCAAAGGTCCGAACACCATTGTGCACAAGATATTCCCCC
GATGTTCTCTCAATCTCTGGGGGAGAAACAAGCACCAAGCGCGTTATGTCCGACCTCACTGTTGCCCATGTCATCTTCTGTAGGAAGTCCTACTGCCTTTCCATGTGCCT
TGAGCAACCTCCATTTCTCGGGAGCGCCCTAAGTGCTGCAGTTTGCTGTTTTGCGTGGACGATACTAATCTTCTTCCGCTGCAAACGATCGGAGGAGGAAGTTTCAGAGT
GGAGAAGTCAACTGAAAGATTTTGATTTGGAAAACAGAACCAAGATAAAGGAGATTATGGCTTCCATGGAGGGTCTGGTGCCTATAACCAGGAATTTCCTGGCTTCGTAC
TACGATAAGTACCCATTTACGCCTCTCTCTGGCGACGTCTCTCGCCTTTCGACTGAGATGCTCACCATGGCCAACAGTTTGCTTGATGAACTCCCCCCTACTACAGAGGA
AAGCACCCTACTTGATGAAGCAACCCGTCATCCTCCTCATAAAATTGATGAGAATATGTGGAAGAATCGGGAAAATGTCGAGGAAATTCTGTTTCTGCTTGAAAAATCTA
ATTGGCCTCGAGAGGTTCAGCAGCAGTCTGCATCTGGTGAATCTGAACTTGCTATTATTCTGGGAAAGCTAGAAGAAAAATTCCGGAATAGCTTAAACACGTTGGTGGTT
TTTCAATCTAAAAATTCAGAGCATGTGTTCAACACAGTTATGACCTACATGCCTCAAGATTTTCGAGGAACAATAATTAGACAGCAAAGAGAGCGATCGGAGAGGAATAA
GCAAGCAGAGATTGATGCTTTGATTAATTCTGGAGGAAGTATACGTGATCGATATGCTCTTTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTT
CTGCAACAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTATTGCTAGAATTCATTCGACAAATAAATGATGATGATGGGCCAATGGAAGAACAA
CGACAACGCTATGGACCACCTTTGTATAACCTTACAACAATGGTCCTCCTTATTCGACTCTTTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAAGTAAGTGAGCA
TCTAGCTATTTTGGAGCAAGCTGTTGATGTGTACACCTCTGAGCTTGAGAGGTTCATCACATTCATTCGCGAAGTCTTCAATAATTCTCCATTCTTTATTTCAGCGGATG
TGGCCTATGCAGCAGATGCGAGGAAAAGTGATAGCTACAAAGAGATTAGTGTTCCAGCTGGGAAGACTTATGAGGTTTCATTTAGTGTGGAGTCAATCAATTCATATATT
GCCTGGGATTTCTCATTGGTTCAAGGCAAGATGAATATGGATATTGGATTCAGTTTGGAATATGAAAGTCCTGGAGGGGAAAAGACTTTGATACTGCCTCACCAACGTTA
CGAGTCTGATCAAGTGAGTTTATGGTTTTGCGCTATAGAGTCGATTGCATACCTCCCATGGCAGAGCCAGTGCAACCGGTTGCAGAAGTTTGAGTGCCTCCAGCAGCCAC
TCCACTGTGGACGGCCTTCAAATATGAATGTGCCAAGGAAATCGAAACTGCAGAAGAGTGTGTTATGTTGGAATGATATTGATCGAGGTTTTAGCCAATTACCACGAACC
AGATTAGGAACTCCAGCAGCAGCAATCACTATGTCAGCTTCGCGAGTAATCTGTTCAGGATTCCTAGTAAATGCATGTACAATACTGACAGTAGCATGCAAGGACGTGGG
TAATCCAACAATATTACTTCTTCCAATCACCACGGCTTTCTTCCCCGAGATTTCTACACCGGATCTGATCAACAACTCAATGCAGCCTTTTGGAGTACAGGGCAAAGTAG
CAATTTTCGACTTGATTCCGGCTTCTTCACATGCAGCTATCTTGGAGAATCCGAAACAAACAACCATACATCACCTCGACCCTACATCAGCAAATTGTATGAAAGCATTC
GTGCTTCAGACGTCTCCTGACTTGAAATCTTTCCTCCATCGCCGCCGCCAGTGGTTGCCGTGGCCTCCACTATTCTTGATTTTTCTCACTGAGCTAAGCAAATCAAAGGA
TTTACTCTACAACTTGACAGTTCCAAAGAGGATGGAGCGAACAGAGTAA
mRNA sequence
ATGACAAGTGAAAAGTTCATTGCATACCTGATGGAATTTCCGGCTAAGAATAGCATCTCTTGCCTTCTCAGCAATCTCTATAGCCTTCATCTTTGGCTGAACATTAAAGG
TGATCCCAGAATCACTAGGAATCTCCACATATTCCTCCATTTCTGGATTAAAATATCCAGACCGGTTTCCATTCCAGAAGAAAGTAACATGACCAAATTTGACAGTCTCA
CTAGAGAATTGTCGTGTACAAGAATGAAGTTAAGTACACAGATGCAGACATTCGCTATTGCTCACCTGCAGGCAAAGGTCCGAACACCATTGTGCACAAGATATTCCCCC
GATGTTCTCTCAATCTCTGGGGGAGAAACAAGCACCAAGCGCGTTATGTCCGACCTCACTGTTGCCCATGTCATCTTCTGTAGGAAGTCCTACTGCCTTTCCATGTGCCT
TGAGCAACCTCCATTTCTCGGGAGCGCCCTAAGTGCTGCAGTTTGCTGTTTTGCGTGGACGATACTAATCTTCTTCCGCTGCAAACGATCGGAGGAGGAAGTTTCAGAGT
GGAGAAGTCAACTGAAAGATTTTGATTTGGAAAACAGAACCAAGATAAAGGAGATTATGGCTTCCATGGAGGGTCTGGTGCCTATAACCAGGAATTTCCTGGCTTCGTAC
TACGATAAGTACCCATTTACGCCTCTCTCTGGCGACGTCTCTCGCCTTTCGACTGAGATGCTCACCATGGCCAACAGTTTGCTTGATGAACTCCCCCCTACTACAGAGGA
AAGCACCCTACTTGATGAAGCAACCCGTCATCCTCCTCATAAAATTGATGAGAATATGTGGAAGAATCGGGAAAATGTCGAGGAAATTCTGTTTCTGCTTGAAAAATCTA
ATTGGCCTCGAGAGGTTCAGCAGCAGTCTGCATCTGGTGAATCTGAACTTGCTATTATTCTGGGAAAGCTAGAAGAAAAATTCCGGAATAGCTTAAACACGTTGGTGGTT
TTTCAATCTAAAAATTCAGAGCATGTGTTCAACACAGTTATGACCTACATGCCTCAAGATTTTCGAGGAACAATAATTAGACAGCAAAGAGAGCGATCGGAGAGGAATAA
GCAAGCAGAGATTGATGCTTTGATTAATTCTGGAGGAAGTATACGTGATCGATATGCTCTTTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTT
CTGCAACAGGTGTCTACAAAACCCTTGTGAAATATTTGGTTGGAGTTCCAGAGGTATTGCTAGAATTCATTCGACAAATAAATGATGATGATGGGCCAATGGAAGAACAA
CGACAACGCTATGGACCACCTTTGTATAACCTTACAACAATGGTCCTCCTTATTCGACTCTTTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAAGTAAGTGAGCA
TCTAGCTATTTTGGAGCAAGCTGTTGATGTGTACACCTCTGAGCTTGAGAGGTTCATCACATTCATTCGCGAAGTCTTCAATAATTCTCCATTCTTTATTTCAGCGGATG
TGGCCTATGCAGCAGATGCGAGGAAAAGTGATAGCTACAAAGAGATTAGTGTTCCAGCTGGGAAGACTTATGAGGTTTCATTTAGTGTGGAGTCAATCAATTCATATATT
GCCTGGGATTTCTCATTGGTTCAAGGCAAGATGAATATGGATATTGGATTCAGTTTGGAATATGAAAGTCCTGGAGGGGAAAAGACTTTGATACTGCCTCACCAACGTTA
CGAGTCTGATCAAGTGAGTTTATGGTTTTGCGCTATAGAGTCGATTGCATACCTCCCATGGCAGAGCCAGTGCAACCGGTTGCAGAAGTTTGAGTGCCTCCAGCAGCCAC
TCCACTGTGGACGGCCTTCAAATATGAATGTGCCAAGGAAATCGAAACTGCAGAAGAGTGTGTTATGTTGGAATGATATTGATCGAGGTTTTAGCCAATTACCACGAACC
AGATTAGGAACTCCAGCAGCAGCAATCACTATGTCAGCTTCGCGAGTAATCTGTTCAGGATTCCTAGTAAATGCATGTACAATACTGACAGTAGCATGCAAGGACGTGGG
TAATCCAACAATATTACTTCTTCCAATCACCACGGCTTTCTTCCCCGAGATTTCTACACCGGATCTGATCAACAACTCAATGCAGCCTTTTGGAGTACAGGGCAAAGTAG
CAATTTTCGACTTGATTCCGGCTTCTTCACATGCAGCTATCTTGGAGAATCCGAAACAAACAACCATACATCACCTCGACCCTACATCAGCAAATTGTATGAAAGCATTC
GTGCTTCAGACGTCTCCTGACTTGAAATCTTTCCTCCATCGCCGCCGCCAGTGGTTGCCGTGGCCTCCACTATTCTTGATTTTTCTCACTGAGCTAAGCAAATCAAAGGA
TTTACTCTACAACTTGACAGTTCCAAAGAGGATGGAGCGAACAGAGTAA
Protein sequence
MTSEKFIAYLMEFPAKNSISCLLSNLYSLHLWLNIKGDPRITRNLHIFLHFWIKISRPVSIPEESNMTKFDSLTRELSCTRMKLSTQMQTFAIAHLQAKVRTPLCTRYSP
DVLSISGGETSTKRVMSDLTVAHVIFCRKSYCLSMCLEQPPFLGSALSAAVCCFAWTILIFFRCKRSEEEVSEWRSQLKDFDLENRTKIKEIMASMEGLVPITRNFLASY
YDKYPFTPLSGDVSRLSTEMLTMANSLLDELPPTTEESTLLDEATRHPPHKIDENMWKNRENVEEILFLLEKSNWPREVQQQSASGESELAIILGKLEEKFRNSLNTLVV
FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEIDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGVPEVLLEFIRQINDDDGPMEEQ
RQRYGPPLYNLTTMVLLIRLFISLSWRRFDARKVSEHLAILEQAVDVYTSELERFITFIREVFNNSPFFISADVAYAADARKSDSYKEISVPAGKTYEVSFSVESINSYI
AWDFSLVQGKMNMDIGFSLEYESPGGEKTLILPHQRYESDQVSLWFCAIESIAYLPWQSQCNRLQKFECLQQPLHCGRPSNMNVPRKSKLQKSVLCWNDIDRGFSQLPRT
RLGTPAAAITMSASRVICSGFLVNACTILTVACKDVGNPTILLLPITTAFFPEISTPDLINNSMQPFGVQGKVAIFDLIPASSHAAILENPKQTTIHHLDPTSANCMKAF
VLQTSPDLKSFLHRRRQWLPWPPLFLIFLTELSKSKDLLYNLTVPKRMERTE
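
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame. The function name `translate` is hypothetical, and no external library is used.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Hypothetical helper. Whitespace and line breaks are removed first, so a
    sequence wrapped over several lines can be pasted in directly. Codons
    containing characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First six and last six codons of the CDS listed above.
print(translate("ATGACAAGTGAAAAGTTC"))
print(translate("ATGGAGCGAACAGAGTAA"))
```

The two fragments translate to MTSEKF and MERTE, which agree with the start and end of the protein sequence listed above.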