; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016506 (gene) of Snake gourd v1 genome

Gene IDTan0016506
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGOLD domain-containing protein
Genome locationLG03:62078208..62092618
RNA-Seq ExpressionTan0016506
SyntenyTan0016506
Gene Ontology termsNA
InterPro domainsIPR009038 - GOLD domain
IPR036598 - GOLD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147520.1 uncharacterized protein LOC101218161 [Cucumis sativus]2.3e-23293.12Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYY+KYPFTPLSD VSRLSTEML+LANSL+DELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANI+GKLEEK +N L+ LVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KLREHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFISADVACAA+ RKSDSYKEI+VPAGKTYEVSLSVESINSYIAWDFSL+QGKMNMDIGFSVECESPGG K LILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_008463316.1 PREDICTED: uncharacterized protein LOC103501503 isoform X1 [Cucumis melo]7.8e-23393.35Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYY+KYPF PLSD VSRLSTEML+LANSL+DELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANI+GKLEEK +N L+VLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KLREHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFISADVACAA+ RKSDSYKEI+VPAGKTYEVSLSVESINSYIAWDFSL+QGKMNMDIGFSVECESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_022924994.1 uncharacterized protein LOC111432376 isoform X1 [Cucurbita moschata]1.9e-23193.12Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEML+LAN L+DELPPT EESTLLDEANH PPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRLFISLSWRRFDARKLR+HLAILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFI ADV      RK DSYKEI+VPAGKTYEVSLSVES+NSYIAWDFSL+QGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_023517640.1 uncharacterized protein LOC111781338 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-23093.12Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEML+LAN LIDELPPT EESTLLDEAN  PPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRLFISLSWRRFDARKLR+HLAILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFI ADV      RK DSYKEI+VPAGKTYEVSL+VESINSYIAWDFSL+QGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

XP_038881964.1 uncharacterized protein LOC120073287 isoform X1 [Benincasa hispida]1.6e-23093.1Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYY+KYPFTPL D VSRLSTEML+LANSL+DELPPTSEES LLDEAN HPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQ E
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANI+GKLEEK++NTL+ LVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIR+RYALLW QQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA K REHL ILEQAVDVYTSELERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFISADVACAAD RKSDSYKEI+VPAGKTYEVSLSVESINSYIAWDFSL+QGKMNMDIGFSVECESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVE
        KL+WDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVE

TrEMBL top hitse value%identityAlignment
A0A0A0L040 GOLD domain-containing protein1.1e-23293.12Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYY+KYPFTPLSD VSRLSTEML+LANSL+DELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFL EKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANI+GKLEEK +N L+ LVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KLREHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFISADVACAA+ RKSDSYKEI+VPAGKTYEVSLSVESINSYIAWDFSL+QGKMNMDIGFSVECESPGG K LILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A1S3CKI8 uncharacterized protein LOC103501503 isoform X13.8e-23393.35Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYY+KYPF PLSD VSRLSTEML+LANSL+DELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANI+GKLEEK +N L+VLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KLREHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFISADVACAA+ RKSDSYKEI+VPAGKTYEVSLSVESINSYIAWDFSL+QGKMNMDIGFSVECESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A5A7SM14 Emp24/gp25L/p24 family/GOLD family protein3.8e-23393.35Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYY+KYPF PLSD VSRLSTEML+LANSL+DELPPTSEESTLLDEAN HPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATG+SELANI+GKLEEK +N L+VLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRL ISLSWRRFDA KLREHL ILEQAVDVYTSE+ERFL FIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFISADVACAA+ RKSDSYKEI+VPAGKTYEVSLSVESINSYIAWDFSL+QGKMNMDIGFSVECESPGG KTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKVLRYKVDCIPPVVEP+QPA E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A6J1EAU1 uncharacterized protein LOC111432376 isoform X19.3e-23293.12Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEML+LAN L+DELPPT EESTLLDEANH PPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLIRLFISLSWRRFDARKLR+HLAILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFI ADV      RK DSYKEI+VPAGKTYEVSLSVES+NSYIAWDFSL+QGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

A0A6J1HTF5 uncharacterized protein LOC111466012 isoform X13.3e-22992.2Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MAS EGLVPITRHFLASYYEKYPFTPLSDD+SRLSTEML+ AN L+DELPPT EESTL DEANH PPHKIDENMWKNRENVEEILFLLEKS WPQEVQKE
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        SATGESELANILGKLEEKLKNTL VLV FQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDAL+NSGGSIR+RYALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV
        GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLY+LTTMVRLI+LFISLSWRRFDARKLR+HLAILEQAVDVY SELERFLVFIREV
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREV

Query:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY
        FNNAPFFI ADV      RK DSYKEI+VPAGKTYEVS+SVESINSYIAWDFSL+QGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTC+AGDY
Subjt:  FNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDY

Query:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED
        KLIWDNTYSTFFKKV+RYKVDCIPPVVEPLQ A E+
Subjt:  KLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G01010.1 CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 172 Blast hits to 172 proteins in 43 species: Archae - 0; Bacteria - 0; Metazoa - 95; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink).9.7e-17368.84Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASTEGL+PITR FLASYY+KYPF+PLSDDVSRLS++M SL   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGD
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGD

Query:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
        YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  YKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.2 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 85 Blast hits to 85 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 20; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink).1.0e-16662.85Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASTEGL+PITR FLASYY+KYPF+PLSDDVSRLS++M SL   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNM--------------------------------------

Query:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP

AT5G01010.3 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038); Has 76 Blast hits to 76 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 11; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink).3.9e-16668.26Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASTEGL+PITR FLASYY+KYPF+PLSDDVSRLS++M SL   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+TVMTYMPQDFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGD
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++MDIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGD

Query:  YKLIWDNTYSTFFKKVLRY
        YKL+WDN+YSTFFKKV RY
Subjt:  YKLIWDNTYSTFFKKVLRY

AT5G01010.4 EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: GOLD (InterPro:IPR009038).3.2e-16061.36Show/hide
Query:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE
        MASTEGL+PITR FLASYY+KYPF+PLSDDVSRLS++M SL   L  + PP+  E++L+DEAN  PPHKIDENMWKNRE +EEILFLL  SRWP ++++ 
Subjt:  MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKE

Query:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL
        S + ++E A+IL  L++   N    +++FQ+KNSE +F+T       DFRGT+IRQQ+ERSERNKQAEVDAL++SGGSIR+ YALLWKQQMERRRQLAQL
Subjt:  SATGESELANILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQL

Query:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE
        GSATGVYKTLVKYLVGVP+VLL+FI++INDDDGPMEEQR+RYGPPLY LT MV  IR+F++L W R+D  KL ++ + +L +A  VYTSE ERF+ FI +
Subjt:  GSATGVYKTLVKYLVGVPEVLLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKL-REHLAILEQAVDVYTSELERFLVFIRE

Query:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNM--------------------------------------
        VF N+PFFISAD A     R ++ YKEI V AG+TYE+SL VES NSYIAWDFSL+QGK++M                                      
Subjt:  VFNNAPFFISADVACAADGRKSDSYKEITVPAGKTYEVSLSVESINSYIAWDFSLIQGKMNM--------------------------------------

Query:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP
           DIGFSVE  +  G KTLILP++RYE+DQGNF T +AG+YKL+WDN+YSTFFKK LRYKVDCI PVVEP
Subjt:  ---DIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACGGAGGGTCTGGTGCCTATAACCAGGCACTTCCTGGCTTCGTACTATGAAAAGTACCCATTTACGCCCCTATCTGACGATGTCTCTCGCCTTTCGACTGA
GATGCTCTCCTTGGCGAACAGTTTGATTGATGAACTTCCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACCATCATCCTCCTCATAAAATTGACGAGAATA
TGTGGAAGAATCGGGAAAATGTGGAAGAGATTCTGTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGCGAATCTGAACTTGCTAAT
ATTCTAGGAAAGCTAGAAGAAAAATTGAAGAATACCTTAAATGTGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACAGTTATGACCTACATGCCTCA
AGATTTTCGAGGAACAATAATTCGTCAGCAAAGAGAGCGATCAGAGAGGAATAAGCAAGCAGAGGTTGATGCTTTGATTAATTCTGGAGGAAGCATACGTGAACGATATG
CTCTTTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTCGTGAAATATTTGGTTGGAGTTCCAGAGGTA
TTGCTAGAATTCATTCAAAAAATAAATGATGATGATGGGCCAATGGAAGAACAACGACAACGCTATGGACCACCTTTATATGAACTTACAACAATGGTCCGCCTTATTCG
ACTCTTTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAACTAAGGGAGCATCTTGCTATTTTGGAGCAAGCTGTTGATGTGTATACCTCTGAGCTTGAGAGGTTCC
TCGTCTTCATTCGCGAGGTCTTCAACAATGCTCCATTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGGGAGGAAAAGTGATAGCTACAAAGAGATTACTGTTCCA
GCTGGGAAGACTTATGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCTTGGGATTTCTCGTTGATTCAAGGCAAGATGAATATGGATATTGGATTCAGCGT
GGAGTGTGAAAGTCCTGGAGGGGGGAAGACTTTGATATTGCCACACAAACGTTATGAGTCTGATCAGGGAAACTTCTGCACTTGCGTCGCTGGGGACTACAAGCTGATTT
GGGACAATACATATTCAACCTTTTTTAAGAAGGTTTTGCGCTATAAGGTCGATTGCATACCACCCGTGGTAGAGCCATTGCAACCTGCTGTAGAAGATTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAGGAAGAAAAAGTAAACGACAGGTCGATAAGAAGATATCAAAATGGTCGATCGATGACGACGAATCGATCGAAGAACAGCGGCGTGGAGCCGTCTTTTTAGTTGC
TCTAGCTCTACGACTATTTTGACGATCAAAGGCTACCTGAAGAACACCTGAGAGAATTTCTTCAACAGTGGAAGTGTTCCAGTTTGCTAATTTGCTTTGACAATACGCAT
CTTCTTCCGCTGCAAACGATCGGAGGAGGAATTTTCAGAGTGGAGAAGATTCGGAGCACTTGAACCGAAAGATTTTGATTCGGAGAAAAGAGAGCCAAGATAAAGGCGAA
AATGGCTTCCACGGAGGGTCTGGTGCCTATAACCAGGCACTTCCTGGCTTCGTACTATGAAAAGTACCCATTTACGCCCCTATCTGACGATGTCTCTCGCCTTTCGACTG
AGATGCTCTCCTTGGCGAACAGTTTGATTGATGAACTTCCGCCTACTTCAGAGGAAAGCACCCTTCTTGATGAAGCAAACCATCATCCTCCTCATAAAATTGACGAGAAT
ATGTGGAAGAATCGGGAAAATGTGGAAGAGATTCTGTTTCTGCTTGAAAAATCTCGTTGGCCTCAAGAGGTTCAGAAGGAGTCTGCAACTGGCGAATCTGAACTTGCTAA
TATTCTAGGAAAGCTAGAAGAAAAATTGAAGAATACCTTAAATGTGTTGGTGGCTTTCCAATCTAAAAATTCTGAGCACGTGTTCAACACAGTTATGACCTACATGCCTC
AAGATTTTCGAGGAACAATAATTCGTCAGCAAAGAGAGCGATCAGAGAGGAATAAGCAAGCAGAGGTTGATGCTTTGATTAATTCTGGAGGAAGCATACGTGAACGATAT
GCTCTTTTATGGAAACAACAGATGGAAAGGAGGAGACAGTTAGCACAGCTGGGTTCTGCAACAGGTGTCTACAAAACCCTCGTGAAATATTTGGTTGGAGTTCCAGAGGT
ATTGCTAGAATTCATTCAAAAAATAAATGATGATGATGGGCCAATGGAAGAACAACGACAACGCTATGGACCACCTTTATATGAACTTACAACAATGGTCCGCCTTATTC
GACTCTTTATTTCATTATCATGGAGACGTTTTGATGCTAGGAAACTAAGGGAGCATCTTGCTATTTTGGAGCAAGCTGTTGATGTGTATACCTCTGAGCTTGAGAGGTTC
CTCGTCTTCATTCGCGAGGTCTTCAACAATGCTCCATTCTTTATTTCAGCAGATGTGGCCTGTGCAGCAGATGGGAGGAAAAGTGATAGCTACAAAGAGATTACTGTTCC
AGCTGGGAAGACTTATGAGGTTTCATTAAGTGTGGAGTCTATCAATTCATATATTGCTTGGGATTTCTCGTTGATTCAAGGCAAGATGAATATGGATATTGGATTCAGCG
TGGAGTGTGAAAGTCCTGGAGGGGGGAAGACTTTGATATTGCCACACAAACGTTATGAGTCTGATCAGGGAAACTTCTGCACTTGCGTCGCTGGGGACTACAAGCTGATT
TGGGACAATACATATTCAACCTTTTTTAAGAAGGTTTTGCGCTATAAGGTCGATTGCATACCACCCGTGGTAGAGCCATTGCAACCTGCTGTAGAAGATTAAAGTGGTGT
CTCCATATTTAGATTGTAACATACACTTACATTCATTATATTTTTAGTTTGTTCGCACGTGGGAAGTCCTATTCTTTGCAAGATTTGCTGGTAGATTAATCTTGTAAACA
ATGCAAATTAACCAAAAAAAAAAAAAAGCAATCTCTGTAATTATGGAAATGTCATTTTCTTAACATGATGCATTGAATCACCACTCGGCTTATGAAATTGCCTGTTGTTT
CTC
Protein sequenceShow/hide protein sequence
MASTEGLVPITRHFLASYYEKYPFTPLSDDVSRLSTEMLSLANSLIDELPPTSEESTLLDEANHHPPHKIDENMWKNRENVEEILFLLEKSRWPQEVQKESATGESELAN
ILGKLEEKLKNTLNVLVAFQSKNSEHVFNTVMTYMPQDFRGTIIRQQRERSERNKQAEVDALINSGGSIRERYALLWKQQMERRRQLAQLGSATGVYKTLVKYLVGVPEV
LLEFIQKINDDDGPMEEQRQRYGPPLYELTTMVRLIRLFISLSWRRFDARKLREHLAILEQAVDVYTSELERFLVFIREVFNNAPFFISADVACAADGRKSDSYKEITVP
AGKTYEVSLSVESINSYIAWDFSLIQGKMNMDIGFSVECESPGGGKTLILPHKRYESDQGNFCTCVAGDYKLIWDNTYSTFFKKVLRYKVDCIPPVVEPLQPAVED