CuGenDBv2

Clc09G03350 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G03350
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Plant protein of unknown function (DUF247)
Genome location: ClcChr09:2580461..2582885
RNA-Seq Expression: Clc09G03350
Synteny: Clc09G03350
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
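
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state), the span of the gene model could be computed as follows. `parse_location` and `Location` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its parts (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("ClcChr09:2580461..2582885")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the gene model spans 2,425 bp on ClcChr09.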


Homology
GenBank top hits (e value, % identity, alignment)
KAA0058728.1 putative UPF0481 protein [Cucumis melo var. makuwa] (e value: 1.5e-175; % identity: 67.22)
Query:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK
        MSFSSKSRLHS PA NSWGL+SGF EERWV Q+RQS+DEEELEEDIG PVCI  VPKSLM I+P+SYTPQEVAIGPYHHWRQELY MERYKIAAAK+AQK
Subjt:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK

Query:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ
        QLQ LKFH+LVEKLT +E+KTRA YHKYLNFNSETFAWMMA++ASFLLEVLRVYT  E + S        + +L  LVDYEGRKS HNAILRDIVMLENQ
Subjt:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ

Query:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC
        IPLFVLR ML+LQ SA+EPADQLLLSMLLGLYEDLSPFK+MEDLVELQVSVSECFHLLDFLY MITP+L D+LE +E+DQNQ+E A E +E  S FK  C
Subjt:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC

Query:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE
             LGSEIWKILSKLN+           SR L                            SL K EEE D EK GSSRK GK K PLLEEITIPSVSE
Subjt:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE

Query:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN
        LTKSGV FLP++GGVSA AFDSKAVIF LP I LDVNSE                           NGIIDSEEDVKLLKEKGIILNHL SDAEVA++WN
Subjt:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN

Query:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        GMSKSIKLTKVPFLDKV    +K  +    +    F  +++F S P
Subjt:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

XP_004136095.1 putative UPF0481 protein At3g02645 [Cucumis sativus] (e value: 2.7e-177; % identity: 66.48)
Query:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ
        MS SSKSRLHS PA + WGL+  +EE WV+Q+RQS+DEEELEEDIG P CICTVP+SLM I+P+SYTPQEVAIGPYHHWRQELYVMERYKIAAA+KAQKQ
Subjt:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ

Query:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI
        LQ LKFHNLVEKL  YERKTRA+YHKYLNFNSETFAWMMAI+ASFLLEVL+VYTIRE  +   +   ++ L    +VD EGR+S  N ILRDIVMLENQI
Subjt:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI

Query:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI
        PLFVLRKML+LQS ALE  +QLLLSMLLGL EDLSPF+M+E     QVSVSECFHLLDFLY MITP+LAD LEILENDQNQKES KEN E+ +AFK  C 
Subjt:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI

Query:  SSIKLGSEIWKILSKLNKA----------SRTL---------------------------SLIKGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELTK
        S  +LGSEIWKILSK NK           SR L                           SL KGEEENDLEKGSS KVGK+K PL EEI IPSVS+LTK
Subjt:  SSIKLGSEIWKILSKLNKA----------SRTL---------------------------SLIKGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELTK

Query:  SGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMS
        SGV F  +DGGVSA AFD KAVIFYLPTI+LDVNSE                           NGIIDSEEDVKLLKEKGIILNHL SDAEVA LWNGMS
Subjt:  SGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMS

Query:  KSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        KSIKLTKVPFLDKV    +K  +G   +    F  +++F S P
Subjt:  KSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

XP_008461155.1 PREDICTED: putative UPF0481 protein At3g02645 [Cucumis melo] (e value: 2.5e-175; % identity: 67.22)
Query:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK
        MSFSSKSRLHS PA NSWGL+SGF EERWV Q+RQS+DEEELEEDIG PVCI  VPKSLM I+P+SYTPQEVAIGPYHHWRQELY MERYKIAAAK+AQK
Subjt:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK

Query:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ
        QLQ LKFH+LVEKLT +E+KTRA YHKYLNFNSETFAWMMA++ASFLLEVLRVYT  E + S        + +L  LVDYEGRKS HNAILRDIVMLENQ
Subjt:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ

Query:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC
        IPLFVLR ML+LQ SA+EPADQLLLSMLLGLYEDLSPFK+MEDLVELQVSVSECFHLLDFLY MITP+L D+LE +E+DQNQ+E A E +E  S FK  C
Subjt:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC

Query:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE
             LGSEIWKILSKLN+           SR L                            SL K EEE D EK GSSRK G  K PLLEEITIPSVSE
Subjt:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE

Query:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN
        LTKSGV FLP++GGVSA AFDSKAVIF LPTI LDVNSE                           NGIIDSEEDVKLLKEKGIILNHL SDAEVA++WN
Subjt:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN

Query:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        GMSKSIKLTKVPFLDKV    +K  +    +    F  +++F S P
Subjt:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

XP_011659516.1 putative UPF0481 protein At3g02645 [Cucumis sativus] (e value: 1.1e-175; % identity: 67.03)
Query:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK
        M FSSKSRLHS PA NSWGL+S F EERWV Q+RQS+DEEELEED G PVCI  VPKSLM I+P+SY PQEVAIGPYHHWRQELY MERYKIAAAK+AQK
Subjt:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK

Query:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ
        QLQ LKFH+LVEKLT +E+KTRA YHKYLNFNSETFAWMMA++ASFLLEVLRVYT  E + S        + +L  LVDYEGRKS HNAILRDIVMLENQ
Subjt:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ

Query:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC
        IPLFVLRKML+LQ SA+EPADQLLLSMLLGLYE LSPFK+MEDLVELQVSVSECFHLLDFLY +ITP+LAD+LE +E+DQNQ+E A E +E  S FK  C
Subjt:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC

Query:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE
             LGSEIWKILSKLNK           SR L                            SL KGEEEND EK GSSRK GK++ PLLEEITIPSVSE
Subjt:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE

Query:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN
        LTKSGV FLP+ GGVSA AFDSKAVIF LPTI LDVNSE                           NGIIDSEEDVKLL+EKGIILNHL SDAEVA+LWN
Subjt:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN

Query:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        GMSKSIKLTKVPFLDKV    +K  +    +    F  +++F S P
Subjt:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

XP_038899461.1 LOW QUALITY PROTEIN: putative UPF0481 protein At3g02645 [Benincasa hispida] (e value: 2.3e-189; % identity: 72.23)
Query:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ
        MSFSSKSRLHS PA NSWGL+SGFEERWVSQ+RQS DEEE EEDIG P CICTVPKSLM  +P+SYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ
Subjt:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ

Query:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI
        LQ LKFH+LVEKLT YERKTRAYYHKYLNFNSETFAWMMAI+ SFLLEVLRVYT+ E  +   +   ++ L    +VDYEGR S HNAILRDI+MLENQI
Subjt:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI

Query:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI
        PLF+LRKML+LQSSALEPADQLLLSMLLGLYEDLSPFK+ ED VELQVSVSECFHL+DFLY MITP+L D LEILEN+QNQKE  KEN+ENA+AFK  C 
Subjt:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI

Query:  SSIKLGSEIWKILSKLNK----------ASRTLSLI----------------------------KGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELT
        S  +LGSEIWKILSK NK           SR L +I                            KGEEENDLEKGSSRKVGKVK PLLEEI IPSVSELT
Subjt:  SSIKLGSEIWKILSKLNK----------ASRTLSLI----------------------------KGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELT

Query:  KSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGM
        KSGVCF P+DGGVSA AF+SKA I YL TI+LDVNSE                           NGIIDSEEDV+LLKEKGIILNHLNSDAEVA+LWNGM
Subjt:  KSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGM

Query:  SKSIKLTKVPFLDKV
        SKSIKLTKVPFLDKV
Subjt:  SKSIKLTKVPFLDKV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K8N9 Uncharacterized protein (e value: 1.3e-177; % identity: 66.48)
Query:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ
        MS SSKSRLHS PA + WGL+  +EE WV+Q+RQS+DEEELEEDIG P CICTVP+SLM I+P+SYTPQEVAIGPYHHWRQELYVMERYKIAAA+KAQKQ
Subjt:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ

Query:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI
        LQ LKFHNLVEKL  YERKTRA+YHKYLNFNSETFAWMMAI+ASFLLEVL+VYTIRE  +   +   ++ L    +VD EGR+S  N ILRDIVMLENQI
Subjt:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI

Query:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI
        PLFVLRKML+LQS ALE  +QLLLSMLLGL EDLSPF+M+E     QVSVSECFHLLDFLY MITP+LAD LEILENDQNQKES KEN E+ +AFK  C 
Subjt:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI

Query:  SSIKLGSEIWKILSKLNKA----------SRTL---------------------------SLIKGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELTK
        S  +LGSEIWKILSK NK           SR L                           SL KGEEENDLEKGSS KVGK+K PL EEI IPSVS+LTK
Subjt:  SSIKLGSEIWKILSKLNKA----------SRTL---------------------------SLIKGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELTK

Query:  SGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMS
        SGV F  +DGGVSA AFD KAVIFYLPTI+LDVNSE                           NGIIDSEEDVKLLKEKGIILNHL SDAEVA LWNGMS
Subjt:  SGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMS

Query:  KSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        KSIKLTKVPFLDKV    +K  +G   +    F  +++F S P
Subjt:  KSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

A0A0A0KA21 Uncharacterized protein (e value: 4.6e-175; % identity: 69.77)
Query:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK
        M FSSKSRLHS PA NSWGL+S F EERWV Q+RQS+DEEELEED G PVCI  VPKSLM I+P+SY PQEVAIGPYHHWRQELY MERYKIAAAK+AQK
Subjt:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK

Query:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ
        QLQ LKFH+LVEKLT +E+KTRA YHKYLNFNSETFAWMMA++ASFLLEVLRVYT  E + S        + +L  LVDYEGRKS HNAILRDIVMLENQ
Subjt:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ

Query:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC
        IPLFVLRKML+LQ SA+EPADQLLLSMLLGLYE LSPFK+MEDLVELQVSVSECFHLLDFLY +ITP+LAD+LE +E+DQNQ+E A E +E  S FK  C
Subjt:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC

Query:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE
             LGSEIWKILSKLNK           SR L                            SL KGEEEND EK GSSRK GK++ PLLEEITIPSVSE
Subjt:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE

Query:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN
        LTKSGV FLP+ GGVSA AFDSKAVIF LPTI LDVNSE                           NGIIDSEEDVKLL+EKGIILNHL SDAEVA+LWN
Subjt:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN

Query:  GMSKSIKLTKVPFLDK
        GMSKSIKLTKVPFLDK
Subjt:  GMSKSIKLTKVPFLDK

A0A1S3CDL4 putative UPF0481 protein At3g02645 (e value: 1.2e-175; % identity: 67.22)
Query:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK
        MSFSSKSRLHS PA NSWGL+SGF EERWV Q+RQS+DEEELEEDIG PVCI  VPKSLM I+P+SYTPQEVAIGPYHHWRQELY MERYKIAAAK+AQK
Subjt:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK

Query:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ
        QLQ LKFH+LVEKLT +E+KTRA YHKYLNFNSETFAWMMA++ASFLLEVLRVYT  E + S        + +L  LVDYEGRKS HNAILRDIVMLENQ
Subjt:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ

Query:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC
        IPLFVLR ML+LQ SA+EPADQLLLSMLLGLYEDLSPFK+MEDLVELQVSVSECFHLLDFLY MITP+L D+LE +E+DQNQ+E A E +E  S FK  C
Subjt:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC

Query:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE
             LGSEIWKILSKLN+           SR L                            SL K EEE D EK GSSRK G  K PLLEEITIPSVSE
Subjt:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE

Query:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN
        LTKSGV FLP++GGVSA AFDSKAVIF LPTI LDVNSE                           NGIIDSEEDVKLLKEKGIILNHL SDAEVA++WN
Subjt:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN

Query:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        GMSKSIKLTKVPFLDKV    +K  +    +    F  +++F S P
Subjt:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

A0A5A7UU59 Putative UPF0481 protein (e value: 7.0e-176; % identity: 67.22)
Query:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK
        MSFSSKSRLHS PA NSWGL+SGF EERWV Q+RQS+DEEELEEDIG PVCI  VPKSLM I+P+SYTPQEVAIGPYHHWRQELY MERYKIAAAK+AQK
Subjt:  MSFSSKSRLHSPPARNSWGLSSGF-EERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQK

Query:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ
        QLQ LKFH+LVEKLT +E+KTRA YHKYLNFNSETFAWMMA++ASFLLEVLRVYT  E + S        + +L  LVDYEGRKS HNAILRDIVMLENQ
Subjt:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ

Query:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC
        IPLFVLR ML+LQ SA+EPADQLLLSMLLGLYEDLSPFK+MEDLVELQVSVSECFHLLDFLY MITP+L D+LE +E+DQNQ+E A E +E  S FK  C
Subjt:  IPLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLC

Query:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE
             LGSEIWKILSKLN+           SR L                            SL K EEE D EK GSSRK GK K PLLEEITIPSVSE
Subjt:  ISSIKLGSEIWKILSKLNK----------ASRTL----------------------------SLIKGEEENDLEK-GSSRKVGKVKPPLLEEITIPSVSE

Query:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN
        LTKSGV FLP++GGVSA AFDSKAVIF LP I LDVNSE                           NGIIDSEEDVKLLKEKGIILNHL SDAEVA++WN
Subjt:  LTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWN

Query:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        GMSKSIKLTKVPFLDKV    +K  +    +    F  +++F S P
Subjt:  GMSKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

A0A5A7UUK5 Putative UPF0481 protein (e value: 6.0e-175; % identity: 65.44)
Query:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ
        M  SSKSRLHS PA + WGL+  +EE WV+Q+RQSIDEEELEEDIG P CICTVPKSLM  +P+SYTPQEVAIGPYHHWRQELYVMERYKIAAAKK QKQ
Subjt:  MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQ

Query:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI
        LQ LKFHNLVEKL  YERK RAYYHKYLNFNSETF WMMAI+ASFLLEVL+VYTIRE  +   I   ++ L    +VD EGR+S  N ILRDIVMLENQI
Subjt:  LQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQI

Query:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI
        PLFVLRKMLKLQS ALE  DQLLLSMLLGLYEDLSPF+M+E     QVSVSECFHLLDFLY MITP+LA  LEILEND+NQKES KEN E+ +AFK  C 
Subjt:  PLFVLRKMLKLQSSALEPADQLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCI

Query:  SSIKLGSEIWKILSKLNKA--------------------------------------SRTLSLIKGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELT
        S  +LGS IWKILSK NK                                       S   SL KGEEEND+EKGSS KVGK+K PL EEI IPSVS+LT
Subjt:  SSIKLGSEIWKILSKLNKA--------------------------------------SRTLSLIKGEEENDLEKGSSRKVGKVKPPLLEEITIPSVSELT

Query:  KSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGM
        KSGV F  +DGGVSA AFD  AVIFYLPTI+LDVNSE                           NGIIDSEEDV+LLKEKGIILNHL SDAEVA LWNGM
Subjt:  KSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSE---------------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGM

Query:  SKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP
        SKSIKLTKVPFLDKV    +K  +G   +    F  +++F S P
Subjt:  SKSIKLTKVPFLDKVRTQNSKPLTG---IAFFAFASRHLFSSSP

SwissProt top hits (e value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645 (e value: 1.0e-83; % identity: 42.86)
Query:  EERWVSQVRQSIDEEELEEDI-GFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY
        E RWV  V++S+D E  E D+    V I  VPK+LM  +P+SYTP  V+IGPYH  + EL+ MERYK+  A+K + Q    +FH+LVEKL   E K RA 
Subjt:  EERWVSQVRQSIDEEELEEDI-GFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY

Query:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQSSALEPADQLL
        YHKY+ FN ET  W+MA+++SFL+E L++Y+ R+V T               L++  G    HN ILRDI+M+ENQIPLFVLRK L+ Q  + E AD LL
Subjt:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQSSALEPADQLL

Query:  LSMLLGLYEDLSPFKM-MEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENAS---------AFKLLCIS--SIKLGSEIWK
        LS+L GL +DLSP  +  +D   L+    EC H+LDFLY MI P + +  E LE D +++  A EN  N +          FK +  S  +  +    W+
Subjt:  LSMLLGLYEDLSPFKM-MEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENAS---------AFKLLCIS--SIKLGSEIWK

Query:  ILSKLN-----KASRTLSLIKGEEENDLEKGSSRKVGKV-KPPLLEEITIPSVSELTKSGVCFLP-VDGGVSAFAFDSKAVIFYLPTISLDVNSE-----
        I+S L      K S      + E E    +  S  +  + KPPL+EE+TIPSVS+L K+GV F P   G +S   FDS +  FYLP I+LD+N+E     
Subjt:  ILSKLN-----KASRTLSLIKGEEENDLEKGSSRKVGKV-KPPLLEEITIPSVSELTKSGVCFLP-VDGGVSAFAFDSKAVIFYLPTISLDVNSE-----

Query:  ----------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMSKSIKLTKVPFLDKVRTQNSKPLTG
                              NGIIDSEEDV+LL+E+G++++ L SD E A++WNGMSKS++LTKV FLDK     ++  TG
Subjt:  ----------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMSKSIKLTKVPFLDKVRTQNSKPLTG

Q9SD53 UPF0481 protein At3g47200 (e value: 5.5e-08; % identity: 27.75)
Query:  SSKSRLHSPPARNSWG--LSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYK--IAAAKKAQK
        SS S   SPP  +++   LSSG +E  +  + +S  +E          CI  VP+S + +NP++Y P+ V+IGPYH+  + L +++++K  +      + 
Subjt:  SSKSRLHSPPARNSWG--LSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYK--IAAAKKAQK

Query:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ
        + + ++ + LV+ +   E K R  Y + L        +MM ++  F   +L V+ I   N  L+  P      +P L+         ++I  D+++LENQ
Subjt:  QLQGLKFHNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQ

Query:  IPLFVLRKM
        +P FVL+ +
Subjt:  IPLFVLRKM

Arabidopsis top hits (e value, % identity, alignment)
AT3G02645.1 Plant protein of unknown function (DUF247) (e value: 7.4e-85; % identity: 42.86)
Query:  EERWVSQVRQSIDEEELEEDI-GFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY
        E RWV  V++S+D E  E D+    V I  VPK+LM  +P+SYTP  V+IGPYH  + EL+ MERYK+  A+K + Q    +FH+LVEKL   E K RA 
Subjt:  EERWVSQVRQSIDEEELEEDI-GFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY

Query:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQSSALEPADQLL
        YHKY+ FN ET  W+MA+++SFL+E L++Y+ R+V T               L++  G    HN ILRDI+M+ENQIPLFVLRK L+ Q  + E AD LL
Subjt:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQSSALEPADQLL

Query:  LSMLLGLYEDLSPFKM-MEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENAS---------AFKLLCIS--SIKLGSEIWK
        LS+L GL +DLSP  +  +D   L+    EC H+LDFLY MI P + +  E LE D +++  A EN  N +          FK +  S  +  +    W+
Subjt:  LSMLLGLYEDLSPFKM-MEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENAS---------AFKLLCIS--SIKLGSEIWK

Query:  ILSKLN-----KASRTLSLIKGEEENDLEKGSSRKVGKV-KPPLLEEITIPSVSELTKSGVCFLP-VDGGVSAFAFDSKAVIFYLPTISLDVNSE-----
        I+S L      K S      + E E    +  S  +  + KPPL+EE+TIPSVS+L K+GV F P   G +S   FDS +  FYLP I+LD+N+E     
Subjt:  ILSKLN-----KASRTLSLIKGEEENDLEKGSSRKVGKV-KPPLLEEITIPSVSELTKSGVCFLP-VDGGVSAFAFDSKAVIFYLPTISLDVNSE-----

Query:  ----------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMSKSIKLTKVPFLDKVRTQNSKPLTG
                              NGIIDSEEDV+LL+E+G++++ L SD E A++WNGMSKS++LTKV FLDK     ++  TG
Subjt:  ----------------------NGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMSKSIKLTKVPFLDKVRTQNSKPLTG

AT3G50120.1 Plant protein of unknown function (DUF247) (e value: 6.0e-18; % identity: 30.66)
Query:  WVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQL-QGLKFHNLVEKLTMYERKTRAYY
        WV  +   +++   ++D  +   +CI  VP  L + + +SY PQ V++GPYHH ++ L  M+R+K  A  +  K+  QG+K +  ++ +   E K RA Y
Subjt:  WVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQL-QGLKFHNLVEKLTMYERKTRAYY

Query:  HKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQSSALEPADQLLL
           L+ +S  F  M+ ++  F+LE+ R           A +  V          +  R S+H +I RD+VMLENQ+PLFVL ++L+LQ         L+ 
Subjt:  HKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQSSALEPADQLLL

Query:  SMLLGLYEDLSP
         + +  ++ L P
Subjt:  SMLLGLYEDLSP

AT3G50170.1 Plant protein of unknown function (DUF247) (e value: 1.0e-17; % identity: 30.69)
Query:  ERWVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY
        + WV  +R  +++ + ++D  I   +CI  VP  L + + +SY PQ V++GPYHH ++ L  MER+K  A  K  K+L+  +       +   E K RA 
Subjt:  ERWVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY

Query:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQ
        Y   ++ +   F  M+ ++  F+LE+ R           A +  V  +           + + ++I RD++MLENQ+PLFVL ++L+LQ
Subjt:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQ

AT3G50170.2 Plant protein of unknown function (DUF247) (e value: 1.0e-17; % identity: 30.69)
Query:  ERWVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY
        + WV  +R  +++ + ++D  I   +CI  VP  L + + +SY PQ V++GPYHH ++ L  MER+K  A  K  K+L+  +       +   E K RA 
Subjt:  ERWVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLVEKLTMYERKTRAY

Query:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQ
        Y   ++ +   F  M+ ++  F+LE+ R           A +  V  +           + + ++I RD++MLENQ+PLFVL ++L+LQ
Subjt:  YHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQ

AT3G50180.1 Plant protein of unknown function (DUF247) (e value: 9.5e-16; % identity: 32.23)
Query:  SPPARNSWGLSSGFEE-RWVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQL-QGLKF
        S P  N     SG +   WV  ++  +++    +D      +CI  VP  L   + +SY PQ V++GPYHH RQ+   ME +K  A     K+  QG++ 
Subjt:  SPPARNSWGLSSGFEE-RWVSQVRQSIDEEELEED--IGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQL-QGLKF

Query:  HNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEG-----RKSIHNAILRDIVMLENQIP
           ++ +   E K RA Y   +  +S  F  M+ ++  F+LE+L+               GVN   L    D+       R S+H +I RD++MLENQ+P
Subjt:  HNLVEKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEG-----RKSIHNAILRDIVMLENQIP

Query:  LFVLRKMLKLQ
        LFVL ++L+LQ
Subjt:  LFVLRKMLKLQ
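
The % identity reported for each hit can be related to the middle line of its alignment. The sketch below assumes the usual BLAST-style convention, which this page does not spell out: a letter on the midline marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. `midline_stats` is a hypothetical helper, and the figures it prints cover only the excerpt passed in, not the full alignment the database scored.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST-style midline.

    Assumes a letter marks an identical residue, '+' a conservative
    substitution, and a space a mismatch or gap. Trailing spaces count
    as columns, so pass the midline at its full aligned width.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 20 alignment columns of the XP_004136095.1 hit (midline only).
midline = "MS SSKSRLHS PA + WGL"
ident, pos, cols = midline_stats(midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.2f}%), {pos}/{cols} positive")
```

Summing the counts over every midline segment of a hit, rather than a single excerpt, is what would be compared against the reported percentage.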


Sequences
CDS sequence
ATGAGTTTCTCCAGCAAATCACGATTACATTCTCCTCCTGCTAGAAATTCTTGGGGATTATCCTCAGGCTTTGAGGAAAGATGGGTTAGTCAAGTTCGTCAATCCATAGA
TGAAGAAGAGCTCGAGGAAGACATAGGATTTCCAGTATGCATATGCACTGTCCCTAAGTCTCTTATGGACATTAATCCTGAATCCTATACTCCACAGGAAGTCGCAATTG
GTCCATACCATCACTGGCGCCAAGAGCTGTATGTGATGGAGAGGTACAAGATTGCTGCAGCCAAAAAAGCTCAAAAACAGCTCCAAGGCCTCAAGTTTCACAATCTTGTT
GAGAAATTGACCATGTACGAGCGAAAGACCCGGGCGTACTATCACAAATACCTCAATTTCAATAGTGAAACATTTGCTTGGATGATGGCCATCAATGCCTCCTTCCTGCT
GGAGGTCCTCCGAGTATATACCATCAGAGAAGTCAACACATCTCTGGCCATCCACCCAGGTGTAAATACCTTACAGTTGCCATGTTTGGTAGATTATGAGGGAAGGAAGT
CAATACATAATGCCATTCTGAGAGACATAGTAATGCTTGAGAATCAGATACCTTTATTTGTTTTGAGAAAAATGTTGAAACTTCAATCTTCTGCTCTGGAACCAGCAGAT
CAATTGTTGCTTTCTATGTTGCTGGGACTGTATGAAGATCTTTCTCCTTTCAAGATGATGGAAGATTTGGTGGAACTTCAAGTGTCTGTCTCGGAATGCTTTCATTTGCT
TGATTTTCTGTACATAATGATTACTCCAGAGTTGGCCGATTCATTGGAGATATTGGAAAATGATCAAAATCAAAAGGAATCGGCCAAAGAAAACATTGAAAACGCAAGCG
CCTTTAAGCTCTTATGCATTTCCTCGATTAAACTGGGAAGTGAAATTTGGAAGATTCTCTCAAAGTTAAACAAAGCCTCTCGAACACTTTCACTAATAAAGGGTGAAGAA
GAAAATGACCTAGAAAAGGGGAGTTCAAGGAAAGTTGGAAAAGTTAAGCCTCCTTTGTTGGAGGAAATAACAATTCCTTCAGTGTCCGAGCTGACAAAATCAGGTGTTTG
TTTCTTGCCCGTCGATGGAGGCGTCTCAGCCTTTGCCTTCGACTCAAAAGCAGTGATATTTTACCTTCCCACCATTAGTCTGGATGTGAACTCTGAAAATGGCATCATCG
ATTCTGAGGAGGACGTGAAATTGCTGAAGGAAAAAGGAATCATTCTGAACCATTTGAATAGCGATGCAGAAGTCGCCAAGCTCTGGAATGGGATGAGCAAATCCATCAAG
TTGACGAAAGTGCCATTCTTGGATAAGGTCAGAACCCAAAACTCAAAACCTCTAACCGGAATTGCATTTTTTGCCTTCGCTTCCCGCCATCTATTCTCTTCTTCTCCGAC
GTGTGTTCCTTCCGCCGGCACCAAAACAATGAATCCCCTTCCGGAGTTTGGTTTCAAGTTGAACAGCCCGGAGCCTCGTCCAATGCACGCTAATTTCGACCCTCCTACTG
TTCCCGAGGCGCTTGATTCTTACTGCAATGACGGATTAAGATGTTGCAAGAAAAAAGTTGAGGCTAACTATTTGAACTTTGATGTGGACGATTCTGTTATACAGAAATTT
GAGGGGAGAGACAAATAG
mRNA sequence
ATGAGTTTCTCCAGCAAATCACGATTACATTCTCCTCCTGCTAGAAATTCTTGGGGATTATCCTCAGGCTTTGAGGAAAGATGGGTTAGTCAAGTTCGTCAATCCATAGA
TGAAGAAGAGCTCGAGGAAGACATAGGATTTCCAGTATGCATATGCACTGTCCCTAAGTCTCTTATGGACATTAATCCTGAATCCTATACTCCACAGGAAGTCGCAATTG
GTCCATACCATCACTGGCGCCAAGAGCTGTATGTGATGGAGAGGTACAAGATTGCTGCAGCCAAAAAAGCTCAAAAACAGCTCCAAGGCCTCAAGTTTCACAATCTTGTT
GAGAAATTGACCATGTACGAGCGAAAGACCCGGGCGTACTATCACAAATACCTCAATTTCAATAGTGAAACATTTGCTTGGATGATGGCCATCAATGCCTCCTTCCTGCT
GGAGGTCCTCCGAGTATATACCATCAGAGAAGTCAACACATCTCTGGCCATCCACCCAGGTGTAAATACCTTACAGTTGCCATGTTTGGTAGATTATGAGGGAAGGAAGT
CAATACATAATGCCATTCTGAGAGACATAGTAATGCTTGAGAATCAGATACCTTTATTTGTTTTGAGAAAAATGTTGAAACTTCAATCTTCTGCTCTGGAACCAGCAGAT
CAATTGTTGCTTTCTATGTTGCTGGGACTGTATGAAGATCTTTCTCCTTTCAAGATGATGGAAGATTTGGTGGAACTTCAAGTGTCTGTCTCGGAATGCTTTCATTTGCT
TGATTTTCTGTACATAATGATTACTCCAGAGTTGGCCGATTCATTGGAGATATTGGAAAATGATCAAAATCAAAAGGAATCGGCCAAAGAAAACATTGAAAACGCAAGCG
CCTTTAAGCTCTTATGCATTTCCTCGATTAAACTGGGAAGTGAAATTTGGAAGATTCTCTCAAAGTTAAACAAAGCCTCTCGAACACTTTCACTAATAAAGGGTGAAGAA
GAAAATGACCTAGAAAAGGGGAGTTCAAGGAAAGTTGGAAAAGTTAAGCCTCCTTTGTTGGAGGAAATAACAATTCCTTCAGTGTCCGAGCTGACAAAATCAGGTGTTTG
TTTCTTGCCCGTCGATGGAGGCGTCTCAGCCTTTGCCTTCGACTCAAAAGCAGTGATATTTTACCTTCCCACCATTAGTCTGGATGTGAACTCTGAAAATGGCATCATCG
ATTCTGAGGAGGACGTGAAATTGCTGAAGGAAAAAGGAATCATTCTGAACCATTTGAATAGCGATGCAGAAGTCGCCAAGCTCTGGAATGGGATGAGCAAATCCATCAAG
TTGACGAAAGTGCCATTCTTGGATAAGGTCAGAACCCAAAACTCAAAACCTCTAACCGGAATTGCATTTTTTGCCTTCGCTTCCCGCCATCTATTCTCTTCTTCTCCGAC
GTGTGTTCCTTCCGCCGGCACCAAAACAATGAATCCCCTTCCGGAGTTTGGTTTCAAGTTGAACAGCCCGGAGCCTCGTCCAATGCACGCTAATTTCGACCCTCCTACTG
TTCCCGAGGCGCTTGATTCTTACTGCAATGACGGATTAAGATGTTGCAAGAAAAAAGTTGAGGCTAACTATTTGAACTTTGATGTGGACGATTCTGTTATACAGAAATTT
GAGGGGAGAGACAAATAG
Protein sequence
MSFSSKSRLHSPPARNSWGLSSGFEERWVSQVRQSIDEEELEEDIGFPVCICTVPKSLMDINPESYTPQEVAIGPYHHWRQELYVMERYKIAAAKKAQKQLQGLKFHNLV
EKLTMYERKTRAYYHKYLNFNSETFAWMMAINASFLLEVLRVYTIREVNTSLAIHPGVNTLQLPCLVDYEGRKSIHNAILRDIVMLENQIPLFVLRKMLKLQSSALEPAD
QLLLSMLLGLYEDLSPFKMMEDLVELQVSVSECFHLLDFLYIMITPELADSLEILENDQNQKESAKENIENASAFKLLCISSIKLGSEIWKILSKLNKASRTLSLIKGEE
ENDLEKGSSRKVGKVKPPLLEEITIPSVSELTKSGVCFLPVDGGVSAFAFDSKAVIFYLPTISLDVNSENGIIDSEEDVKLLKEKGIILNHLNSDAEVAKLWNGMSKSIK
LTKVPFLDKVRTQNSKPLTGIAFFAFASRHLFSSSPTCVPSAGTKTMNPLPEFGFKLNSPEPRPMHANFDPPTVPEALDSYCNDGLRCCKKKVEANYLNFDVDDSVIQKF
EGRDK
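
The protein sequence is the conceptual translation of the CDS listed above. As a minimal sketch, assuming the standard nuclear genetic code applies (the page does not name a translation table), the relationship can be checked with a small translator. `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code: codons enumerated in TCAG order map onto AMINO_ACIDS.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last six codons of the CDS shown above.
print(translate("ATGAGTTTCTCCAGCAAATCACGATTACAT"))
print(translate("GAGGGGAGAGACAAATAG"))
```

The two excerpts translate to the first ten residues (`MSFSSKSRLH`) and the last five residues (`EGRDK`) of the protein sequence; passing the whole CDS to `translate` extends the same check to the full sequence.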