CuGenDBv2

Spg013266 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg013266
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: GOLD domain-containing protein
Genome location: scaffold1:16993686..17006061
RNA-Seq Expression: Spg013266
Synteny: Spg013266
Gene Ontology terms: NA
InterPro domains: IPR036598 - GOLD domain superfamily
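
The genome location in the record above uses a `scaffold:start..end` string. A minimal sketch of how such a string might be split and the span length computed is given here. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates
    (an assumption for this sketch, not something the record confirms)."""
    return end - start + 1


seqid, start, end = parse_location("scaffold1:16993686..17006061")
print(seqid, start, end, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, one base shorter.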


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus] | e value: 3.0e-193 | % identity: 76.68
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD VSRLSTEMLALANSLLDELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFL EK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATGESELANI+G LEEK +N L+ LV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLREHL ILEQA+DVYTSE+ERFL FIREVFNNAPFFISADVACAA+
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R SDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGG K LILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo] | e value: 1.0e-193 | % identity: 76.89
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPF PLSD VSRLSTEMLALANSLLDELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFLLEK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATG+SELANI+G LEEK +N L+VLV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLREHL ILEQA+DVYTSE+ERFL FIREVFNNAPFFISADVACAA+
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R SDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

XP_022924994.1 uncharacterized protein LOC111432376 isoform X1 [Cucurbita moschata] | e value: 1.1e-192 | % identity: 76.68
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFLLEK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATGESELANILG LEEKLKNTL VLV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLR+HLAILEQA+DVY SELERFL+FIREVFNNAPFFI ADV     
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R  DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

XP_023517640.1 uncharacterized protein LOC111781338 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.9e-192 | % identity: 76.47
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEMLALAN L+DELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFLLEK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATGESELANILG LEEKLKNTL VLV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLR+HLAILEQA+DVY SELERFL+FIREVFNNAPFFI ADV     
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R  DSYKEISVPAGKTYEVSL+VESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

XP_038881964.1 uncharacterized protein LOC120073287 isoform X1 [Benincasa hispida] | e value: 1.2e-191 | % identity: 76.68
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPL D VSRLSTEMLALANSLLDELPPTSEES LLDEAN HPPHKIDENMWKNRENVEEILFLLEK RWPQEVQ E
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATGESELANI+G LEEK++NTL+ LV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLW QQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA K REHL ILEQA+DVYTSELERFL FIREVFNNAPFFISADVACAAD
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R SDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
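
In BLAST-style pairwise output, the unlabeled middle line of each block summarizes the match: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts these for one short fragment taken from the start of the first alignment above. The helper name `midline_stats` is hypothetical, and the result for a fragment is not expected to reproduce the page's reported % identity, which covers the whole alignment.

```python
def midline_stats(query, midline):
    """Count identities and positives from a BLAST-style midline.

    Returns (identical, positives, aligned_length). The midline is padded
    to the query length because trailing blanks are often stripped.
    """
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    conservative = midline.count("+")                  # '+' = conservative substitution
    return identical, identical + conservative, len(query)


# First 24 columns of the first query/midline pair in the GenBank hits.
query = "MASMEGLVPITRHFLASYYEKYPF"
midline = "MASMEGLVPITRHFLASYY+KYPF"

identical, positives, length = midline_stats(query, midline)
print(f"identities {identical}/{length} ({100 * identical / length:.2f}%), "
      f"positives {positives}/{length}")
```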

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L040 GOLD domain-containing protein | e value: 1.4e-193 | % identity: 76.68
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPFTPLSD VSRLSTEMLALANSLLDELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFL EK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATGESELANI+G LEEK +N L+ LV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLREHL ILEQA+DVYTSE+ERFL FIREVFNNAPFFISADVACAA+
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R SDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGG K LILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X1 | e value: 4.9e-194 | % identity: 76.89
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPF PLSD VSRLSTEMLALANSLLDELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFLLEK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATG+SELANI+G LEEK +N L+VLV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLREHL ILEQA+DVYTSE+ERFL FIREVFNNAPFFISADVACAA+
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R SDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein | e value: 4.9e-194 | % identity: 76.89
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYY+KYPF PLSD VSRLSTEMLALANSLLDELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFLLEK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATG+SELANI+G LEEK +N L+VLV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRL ISLSWRRFDA KLREHL ILEQA+DVYTSE+ERFL FIREVFNNAPFFISADVACAA+
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R SDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGG KTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X1 | e value: 5.4e-193 | % identity: 76.68
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEMLALAN LLDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFLLEK RWPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATGESELANILG LEEKLKNTL VLV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLR+HLAILEQA+DVY SELERFL+FIREVFNNAPFFI ADV     
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R  DSYKEISVPAGKTYEVSLSVES+NSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

A0A6J1HTF5 uncharacterized protein LOC111466012 isoform X1 | e value: 1.9e-190 | % identity: 75.84
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MASMEGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEMLA AN LLDELPPT EESTL DEAN  PPHKIDENMWKNRENVEEILFLLEK  WPQEVQKE
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        SATGESELANILG LEEKLKNTL VLV FQSKNSEHVFNT                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD
        PEVLLEFI+KINDDDGPMEEQRQRYGPPLYKLTTMVRLI+LFISLSWRRFDARKLR+HLAILEQA+DVY SELERFL+FIREVFNNAPFFI ADV     
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAAD

Query:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
         R  DSYKEISVPAGKTYEVS+SVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ + C
Subjt:  GRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink). | e value: 1.5e-139 | % identity: 55.6
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDDVSRLS++M +L   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL   RWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+T                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD YALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA
        P+VLL+FIR+INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +VF N+PFFISAD A   
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA

Query:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ
          R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYE+DQ
Subjt:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | e value: 1.6e-133 | % identity: 51.17
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDDVSRLS++M +L   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL   RWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+T                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD YALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA
        P+VLL+FIR+INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +VF N+PFFISAD A   
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA

Query:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNM-----------------------------------------DIGFSVECESPGGG
          R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                         DIGFSVE  +  G 
Subjt:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNM-----------------------------------------DIGFSVECESPGGG

Query:  KTLILPHKRYESDQ
        KTLILP++RYE+DQ
Subjt:  KTLILPHKRYESDQ

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | e value: 1.5e-139 | % identity: 55.6
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDDVSRLS++M +L   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL   RWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+T                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                               VMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD YALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA
        P+VLL+FIR+INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +VF N+PFFISAD A   
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA

Query:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ
          R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYE+DQ
Subjt:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQ

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038). | e value: 3.5e-128 | % identity: 49.81
Query:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE
        MAS EGL+PITR FLASYY+KYPF+PLSDDVSRLS++M +L   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL   RWP ++++ 
Subjt:  MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKE

Query:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+T                                                            
Subjt:  SATGESELANILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVC

Query:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
                                      DFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIRD YALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV
Subjt:  SVNTLDCSETFFICCSRNGAFFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGV

Query:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA
        P+VLL+FIR+INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +VF N+PFFISAD A   
Subjt:  PEVLLEFIRKINDDDGPMEEQRQRYGPPLYKLTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAA

Query:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNM-----------------------------------------DIGFSVECESPGGG
          R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                         DIGFSVE  +  G 
Subjt:  DGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQGKMNM-----------------------------------------DIGFSVECESPGGG

Query:  KTLILPHKRYESDQ
        KTLILP++RYE+DQ
Subjt:  KTLILPHKRYESDQ


Sequences
CDS sequence
ATGGCTTCCATGGAGGGTCTGGTGCCTATAACCAGGCATTTCCTGGCTTCGTACTATGAGAAGTACCCATTTACGCCTCTATCTGACGACGTCTCTCGCCTTTCGACTGA
GATGCTCGCCTTGGCGAACAGTTTGCTTGATGAACTCCCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACGATCATCCTCCTCATAAAATTGACGAGAATA
TGTGGAAGAATCGGGAAAATGTGGAAGAAATTCTGTTTCTGCTTGAAAAACCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGCGAATCTGAACTTGCTAAT
ATTCTAGGAAATCTAGAAGAAAAATTGAAGAATACCTTAAATGTGTTGGTGACTTTCCAATCTAAAAATTCTGAGCACGTGTTCAATACAGGTTTATACAATATAGGCTG
GTGCTTTCCTTCCAGTGCTTCAGAAGCCCTACTTCAGCTTCTTGATGGTCAGCCTTTTAAGGGCCAAGTGCTCCGCTTCGCCATTTGTTTCCGTGTTTCTATCATCTTTC
GAGTATCAGATAGAGGACTGGCATCATTTGTTGTGGGGCTTTCAGTTTGTTCAGTCAACACTTTGGACTGTTCAGAGACATTCTTCATTTGTTGTTCCCGCAATGGTGCA
TTCTTTGTAGTTATGACCTACATGCCTCAAGATTTTCGAGGAACAATAATTCGACAACAAAGAGAGCGATCAGAGAGAAATAAGCAAGCAGAGGTTGATGCTTTGATTAA
TTCTGGAGGAAGTATACGTGATCGGTATGCGCTCTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTCG
TGAAATATTTGGTTGGAGTTCCAGAGGTATTGCTAGAATTCATTCGAAAAATAAATGATGATGATGGGCCAATGGAAGAACAACGACAACGCTATGGACCACCTTTGTAT
AAACTTACAACAATGGTTCGTCTTATTCGACTCTTTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAACTAAGGGAGCATCTTGCTATTTTGGAGCAAGCTATTGA
TGTGTACACCTCCGAGCTTGAGAGGTTCCTCATCTTCATTCGCGAGGTCTTCAACAATGCTCCATTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGGGAGGACAA
GTGATAGTTACAAAGAGATTAGTGTTCCAGCTGGGAAGACTTATGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTCTCGTTGGTTCAAGGC
AAGATGAATATGGATATTGGATTCAGTGTGGAGTGTGAAAGTCCTGGAGGGGGAAAGACTTTGATATTGCCACACAAACGTTATGAGTCTGATCAGGTGAGTTTATGTTA
A
mRNA sequence
ATGGCTTCCATGGAGGGTCTGGTGCCTATAACCAGGCATTTCCTGGCTTCGTACTATGAGAAGTACCCATTTACGCCTCTATCTGACGACGTCTCTCGCCTTTCGACTGA
GATGCTCGCCTTGGCGAACAGTTTGCTTGATGAACTCCCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACGATCATCCTCCTCATAAAATTGACGAGAATA
TGTGGAAGAATCGGGAAAATGTGGAAGAAATTCTGTTTCTGCTTGAAAAACCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGCGAATCTGAACTTGCTAAT
ATTCTAGGAAATCTAGAAGAAAAATTGAAGAATACCTTAAATGTGTTGGTGACTTTCCAATCTAAAAATTCTGAGCACGTGTTCAATACAGGTTTATACAATATAGGCTG
GTGCTTTCCTTCCAGTGCTTCAGAAGCCCTACTTCAGCTTCTTGATGGTCAGCCTTTTAAGGGCCAAGTGCTCCGCTTCGCCATTTGTTTCCGTGTTTCTATCATCTTTC
GAGTATCAGATAGAGGACTGGCATCATTTGTTGTGGGGCTTTCAGTTTGTTCAGTCAACACTTTGGACTGTTCAGAGACATTCTTCATTTGTTGTTCCCGCAATGGTGCA
TTCTTTGTAGTTATGACCTACATGCCTCAAGATTTTCGAGGAACAATAATTCGACAACAAAGAGAGCGATCAGAGAGAAATAAGCAAGCAGAGGTTGATGCTTTGATTAA
TTCTGGAGGAAGTATACGTGATCGGTATGCGCTCTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTCG
TGAAATATTTGGTTGGAGTTCCAGAGGTATTGCTAGAATTCATTCGAAAAATAAATGATGATGATGGGCCAATGGAAGAACAACGACAACGCTATGGACCACCTTTGTAT
AAACTTACAACAATGGTTCGTCTTATTCGACTCTTTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAACTAAGGGAGCATCTTGCTATTTTGGAGCAAGCTATTGA
TGTGTACACCTCCGAGCTTGAGAGGTTCCTCATCTTCATTCGCGAGGTCTTCAACAATGCTCCATTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGGGAGGACAA
GTGATAGTTACAAAGAGATTAGTGTTCCAGCTGGGAAGACTTATGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCCTGGGATTTCTCGTTGGTTCAAGGC
AAGATGAATATGGATATTGGATTCAGTGTGGAGTGTGAAAGTCCTGGAGGGGGAAAGACTTTGATATTGCCACACAAACGTTATGAGTCTGATCAGGTGAGTTTATGTTA
A
Protein sequence
MASMEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLALANSLLDELPPTSEESTLLDEANDHPPHKIDENMWKNRENVEEILFLLEKPRWPQEVQKESATGESELAN
ILGNLEEKLKNTLNVLVTFQSKNSEHVFNTGLYNIGWCFPSSASEALLQLLDGQPFKGQVLRFAICFRVSIIFRVSDRGLASFVVGLSVCSVNTLDCSETFFICCSRNGA
FFVVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRDRYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGVPEVLLEFIRKINDDDGPMEEQRQRYGPPLY
KLTTMVRLIRLFISLSWRRFDARKLREHLAILEQAIDVYTSELERFLIFIREVFNNAPFFISADVACAADGRTSDSYKEISVPAGKTYEVSLSVESINSYIAWDFSLVQG
KMNMDIGFSVECESPGGGKTLILPHKRYESDQVSLC
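
The CDS and protein sequences listed above are related by translation. The sketch below illustrates that relationship with the standard genetic code. It assumes the standard nuclear code applies to this gene and uses a hypothetical helper named `translate`; it is not the pipeline the database used. The two fragments are the first 18 and the last 15 nucleotides of the CDS as listed, with the last fragment joined across the final line break.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    When to_stop is True, translation ends at the first stop codon.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTTCCATGGAGGGT"))   # start of the CDS
print(translate("GTGAGTTTATGTTAA"))      # end of the CDS, including the stop codon
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence would be one way to check that the two records agree.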