; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023865 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023865
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF604)
Genome locationtig00001047:1009191..1018466
RNA-Seq ExpressionSgr023865
SyntenySgr023865
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR006740 - Protein of unknown function DUF604


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033336.1 hypothetical protein SDJN02_07391 [Cucurbita argyrosperma subsp. argyrosperma]5.1e-19275.81Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA
        +R  LLLCL+ASM LA+HSAF  +  R+F LP +TR  VSE +      ATNISHIVFGIGASVQTW DRSLY+ LWWDPNR+RGF WLD EP+K +L  
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA

Query:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
           +P+RVS +C G  YSC  AAVRMARIVVE FK+GL N RWFVMGDDDTVFF +NLVSVLA+YDHNQMYYIGGNSESVEQDE+H+YEMAFGGGGFAIS
Subjt:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAAQLVNV+DGCLQRY +FYGSDQR+WAC+SE+GVPLT +RGFHQFDIRG  YGILAAHP+ PLVSLHHLD VEPLFPN+TR+DSLKLLM+AYRVDS+
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RI+QQS CYDRRR+WS+SIAWGYTV+IY F+  A DL+ P QTFKTWRSWSNGPF+FN RPVS+DPC RPVVYFL  VQEV TRGT+TSYERF+AK+EKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        C+  DY RV+AVKEVTVSSMKMDT+LW KA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

XP_022133443.1 uncharacterized protein LOC111006016 [Momordica charantia]2.8e-20680.47Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGAE
        +R GLLLCLVASMSLA+HSAF+ T+SR+F LPD  R VVSE +   T AT+ISH+VFGIGASV+TWRDRSLY+ LWWDPNR+RGFVWLDEEP + QL   
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGAE

Query:  KSIPYRVSEKCRGAGYSC-SVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
          +PYRVSE+CRGAGYSC + AAVR+ARIVVES+K+GL N RWFVMGDDDTVFF  NLVSVL +YDHN+MYYIG NSESVEQD++H YEMAFGGGGFAIS
Subjt:  KSIPYRVSEKCRGAGYSC-SVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAA+LVN +DGCLQRY+ FYGSDQR+WACV+E GVPLT  RGFHQFD+RG+PYGILAAHPL PLVSLHHLDHVEPLFPN+TR+DSL  LM+AY++D N
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RILQQSFCYDRR++WSISIAWGYTVQI SF+E AKDLQTPL+TFKTWRSWSNGPFTFNTR VSSDPCRRPVVYFLNRVQEV TRGTQT YERF+AKEEKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        CETADYA V+AVKEVTVSSMKMD ELWKKA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

XP_022955345.1 uncharacterized protein LOC111457380 [Cucurbita moschata]2.1e-19376.28Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESED-AGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA
        +R  LLLCL+ASM LA+HSAF  +  R+F LP +TR  VSE +      ATNISHIVFGIGASVQTW DRSLY+ LWWDPNR+RGF WLD EP+K  L  
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESED-AGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA

Query:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
           +P+RVS +C GA YSC  AAVRMARIVVE FK+GL N RWFVMGDDDTVFF +NLVSVLA+YDHNQMYYIGGNSESVEQDE+H+YEMAFGGGGFAIS
Subjt:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAAQLVNV+DGCLQRY +FYGSDQR+WAC+SE+GVPLT +RGFHQFDIRG  YGILAAHP+ PLVSLHHLD VEPLFPN+TR+DSLKLLM+AYRVDS+
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RI+QQS CYDRRR+WS+SIAWGYTVQIY F+  A DL+ P QTFKTWRSWSNGPF+FN RPVS+DPC RPVVYFL  VQEV TRGT+TSYERF+AK+EKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        C+  DY RV+AVKEVTVSSMKMDT+LW KA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

XP_022991085.1 uncharacterized protein LOC111487786 [Cucurbita maxima]7.8e-19376.05Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA
        +R  LLLCLVASM LA+HSAF  +  R+F LP  TR  VSE +      ATNISHIVFGIGASVQTW DRSLY+ LWWDPNR+RGF WLD EP+K QL  
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA

Query:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
           +PYRVS +C G  YSC  AAVRMARIVVE FK+GL N RWFVMGDDDTVFF +NLVSVLA+YDHNQMYYIGGNSESVEQDE+H+YEMAFGGGGFAIS
Subjt:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAAQLVNV+DGCLQRY +FYGSDQR+WAC+SE+GVPLT +RGFHQFDIRG PYGILAAHP+ PLVSLHHLD VEPLFPN+TR+DSLKLLM+AYRVDS+
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RI+QQS CYDRRR+WS+S+AWGYTVQIY F+  A DL+ P QTFKTWRSWSN PF+FN RPVS+DPC RPVVYFL  VQEV TRGT+TSYERF+ K+EKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        C+  DY+RV+AVKE TVSSMKMDT+LW KA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

XP_023552436.1 uncharacterized protein LOC111810092 [Cucurbita pepo subsp. pepo]2.7e-19376.05Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA
        +R  LLLCL+ASM LA+HSAF  +  R+F LP  TR  VSE +      ATNISHIVFGIGASVQTW DRSLY+ LWWDPNR+RGF WLD EP+K QL  
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA

Query:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
           +P+RVS +C G  YSC  AAVRMARIVVE FK+GL N RWFVMGDDDTVFF +NLVSVLA+YDHNQMYYIGGNSESVEQDE+H+YEMAFGGGGFAIS
Subjt:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAAQLVNV+DGCLQRY +FYGSDQR+WAC+ E+GVPLT +RGFHQFDIRG PYGILAAHP+ PLVSLHHLD VEPLFPN+TR+DSLKLLM+AYRVDS+
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RI+QQS CYDRRR+WS+SIAWGYTVQIY F+  A DL+ P QTFKTWRSWSNGPF+FN RPVS+DPC RPVVYFL  VQEV TRGT+TSY+RF+AK+EKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        C+  DY RV+AVKEVTVSSMKMDT+LW KA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

TrEMBL top hitse value%identityAlignment
A0A1S3B3B2 uncharacterized protein LOC1034855211.1e-18773.17Show/hide
Query:  SDRVGTQSRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ
        SD +   +R GLLLCLVASMSL + S+F+ +QS QF LP  T+  V +     T  TNISHIVFGIGASVQTW+DR+LY+ LWW+ NR+RGF WLD +P 
Subjt:  SDRVGTQSRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ

Query:  KKQLGAEKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGG
        K        +PY+VS+ C G+GYSC  +AVR+ARIVVES+K+GL N RWFVMGDDDTVFF +NLV+VLA+YDHNQMYYIGGNSESVEQD++H+Y MAFGG
Subjt:  KKQLGAEKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGG

Query:  GGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQA
        GGFAISYPLAA+LV V+DGCL RYS+FYGSDQR+WAC++E+GVPLTT+RGFHQFDIRG PYGILAAHPL PLVSLHHLDHVEPLFPNQTRVDSL LLMQA
Subjt:  GGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQA

Query:  YRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFM
        YRVDS+RILQQ+ CYDRR+ WSISIAWGY VQIY FM  A D+Q P QTFKTWRS S+GPF FNTRPVSSDPC +PVVYFL +VQEV TRGT+T+YERF+
Subjt:  YRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFM

Query:  AKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKA
         K+ KVCE  DYARV+AVK+VTVSSMKMDT+LW KA
Subjt:  AKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKA

A0A5A7TJ31 Beta-1,3-N-acetylglucosaminyltransferase lunatic fringe, putative isoform 11.1e-18773.17Show/hide
Query:  SDRVGTQSRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ
        SD +   +R GLLLCLVASMSL + S+F+ +QS QF LP  T+  V +     T  TNISHIVFGIGASVQTW+DR+LY+ LWW+ NR+RGF WLD +P 
Subjt:  SDRVGTQSRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ

Query:  KKQLGAEKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGG
        K        +PY+VS+ C G+GYSC  +AVR+ARIVVES+K+GL N RWFVMGDDDTVFF +NLV+VLA+YDHNQMYYIGGNSESVEQD++H+Y MAFGG
Subjt:  KKQLGAEKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGG

Query:  GGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQA
        GGFAISYPLAA+LV V+DGCL RYS+FYGSDQR+WAC++E+GVPLTT+RGFHQFDIRG PYGILAAHPL PLVSLHHLDHVEPLFPNQTRVDSL LLMQA
Subjt:  GGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQA

Query:  YRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFM
        YRVDS+RILQQ+ CYDRR+ WSISIAWGY VQIY FM  A D+Q P QTFKTWRS S+GPF FNTRPVSSDPC +PVVYFL +VQEV TRGT+T+YERF+
Subjt:  YRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFM

Query:  AKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKA
         K+ KVCE  DYARV+AVK+VTVSSMKMDT+LW KA
Subjt:  AKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKA

A0A6J1BZ54 uncharacterized protein LOC1110060161.3e-20680.47Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGAE
        +R GLLLCLVASMSLA+HSAF+ T+SR+F LPD  R VVSE +   T AT+ISH+VFGIGASV+TWRDRSLY+ LWWDPNR+RGFVWLDEEP + QL   
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGAE

Query:  KSIPYRVSEKCRGAGYSC-SVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
          +PYRVSE+CRGAGYSC + AAVR+ARIVVES+K+GL N RWFVMGDDDTVFF  NLVSVL +YDHN+MYYIG NSESVEQD++H YEMAFGGGGFAIS
Subjt:  KSIPYRVSEKCRGAGYSC-SVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAA+LVN +DGCLQRY+ FYGSDQR+WACV+E GVPLT  RGFHQFD+RG+PYGILAAHPL PLVSLHHLDHVEPLFPN+TR+DSL  LM+AY++D N
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RILQQSFCYDRR++WSISIAWGYTVQI SF+E AKDLQTPL+TFKTWRSWSNGPFTFNTR VSSDPCRRPVVYFLNRVQEV TRGTQT YERF+AKEEKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        CETADYA V+AVKEVTVSSMKMD ELWKKA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

A0A6J1GUW4 uncharacterized protein LOC1114573801.0e-19376.28Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESED-AGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA
        +R  LLLCL+ASM LA+HSAF  +  R+F LP +TR  VSE +      ATNISHIVFGIGASVQTW DRSLY+ LWWDPNR+RGF WLD EP+K  L  
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESED-AGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA

Query:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
           +P+RVS +C GA YSC  AAVRMARIVVE FK+GL N RWFVMGDDDTVFF +NLVSVLA+YDHNQMYYIGGNSESVEQDE+H+YEMAFGGGGFAIS
Subjt:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAAQLVNV+DGCLQRY +FYGSDQR+WAC+SE+GVPLT +RGFHQFDIRG  YGILAAHP+ PLVSLHHLD VEPLFPN+TR+DSLKLLM+AYRVDS+
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RI+QQS CYDRRR+WS+SIAWGYTVQIY F+  A DL+ P QTFKTWRSWSNGPF+FN RPVS+DPC RPVVYFL  VQEV TRGT+TSYERF+AK+EKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        C+  DY RV+AVKEVTVSSMKMDT+LW KA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

A0A6J1JRW0 uncharacterized protein LOC1114877863.8e-19376.05Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA
        +R  LLLCLVASM LA+HSAF  +  R+F LP  TR  VSE +      ATNISHIVFGIGASVQTW DRSLY+ LWWDPNR+RGF WLD EP+K QL  
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVVSESE-DAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGA

Query:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS
           +PYRVS +C G  YSC  AAVRMARIVVE FK+GL N RWFVMGDDDTVFF +NLVSVLA+YDHNQMYYIGGNSESVEQDE+H+YEMAFGGGGFAIS
Subjt:  EKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAIS

Query:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN
        YPLAAQLVNV+DGCLQRY +FYGSDQR+WAC+SE+GVPLT +RGFHQFDIRG PYGILAAHP+ PLVSLHHLD VEPLFPN+TR+DSLKLLM+AYRVDS+
Subjt:  YPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSN

Query:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV
        RI+QQS CYDRRR+WS+S+AWGYTVQIY F+  A DL+ P QTFKTWRSWSN PF+FN RPVS+DPC RPVVYFL  VQEV TRGT+TSYERF+ K+EKV
Subjt:  RILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFMAKEEKV

Query:  CETADYARVIAVKEVTVSSMKMDTELWKKA
        C+  DY+RV+AVKE TVSSMKMDT+LW KA
Subjt:  CETADYARVIAVKEVTVSSMKMDTELWKKA

SwissProt top hitse value%identityAlignment
P79948 Beta-1,3-N-acetylglucosaminyltransferase lunatic fringe9.8e-0527.23Show/hide
Query:  SGTQSRQFSLPDHTR----AVVSESEDAGTGATNI--SHIVFGIGASVQTWRDR-SLYSQLWWDPNRSRGFVWLDEEPQKKQLGAEKSIPYRVSEKCRGA
        SGT S  F+     R     V + S+D+     +I  + +   +  + +  R R  L    W   N+ + F++ D E ++ Q    K     +S  C  A
Subjt:  SGTQSRQFSLPDHTR----AVVSESEDAGTGATNI--SHIVFGIGASVQTWRDR-SLYSQLWWDPNRSRGFVWLDEEPQKKQLGAEKSIPYRVSEKCRGA

Query:  GYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNS--------ESVEQDEIHA--YEMAFGGGGFAISYPLAA
         +S    + +MA   VE  K   S+ +WF   DDD       LV +L+RY H    YIG  S        E + +  +    +  A GG GF IS  LA 
Subjt:  GYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNS--------ESVEQDEIHA--YEMAFGGGGFAISYPLAA

Query:  QL
        ++
Subjt:  QL

Arabidopsis top hitse value%identityAlignment
AT2G37730.1 Protein of unknown function (DUF604)1.8e-13958.44Show/hide
Query:  ATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGAEKSIPYRVSEKCRGAGYSC---SVAAVRMARIVVESFKVGLSNARWFVM
        AT+ISHI FGIG S+QTWRDRS YS+LWW PN +RGF+WLDEEP         S PY+VS       Y+C   S +A+RMARI+ E+F++GL++ RWF+M
Subjt:  ATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGAEKSIPYRVSEKCRGAGYSC---SVAAVRMARIVVESFKVGLSNARWFVM

Query:  GDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFH
        GDDDTVFF DNL++VL +YDHNQMYYIGGNSESVEQD +H+Y MA+GGGG AISYPLA +LV +LDGC+ RY+  YGSDQ+I AC+SEIGVPLT + GFH
Subjt:  GDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFH

Query:  QFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKT
        Q DIRG+PYG+LAAHP+ PLV+LHHLD+V+P+FP  T++D+L+ L+ AY+ D +RI+Q SFC+D+ R W +S++WGYT+QIY  +  AK+L+TP  TFK+
Subjt:  QFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKT

Query:  WRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFM-AKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKA
        WR+ S+ PF+F+TRP+S DPC RP+VYFL+RV EV +  T T+Y + +   E   C + DY+R   V+ + VS+  +  +LWK A
Subjt:  WRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYERFM-AKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKA

AT3G11420.1 Protein of unknown function (DUF604)1.6e-13552.69Show/hide
Query:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFS------LPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ-
        +R  +L CL+ S+SL + + F    +R +S      L    +  ++    A  G TNISHI F I  + +TW DRS Y  LWW  N +RGFVWLDE  + 
Subjt:  SRVGLLLCLVASMSLAVHSAFSGTQSRQFS------LPDHTRAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ-

Query:  -KKQLGAEKSIPYRVSEK-CRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAF
         +       SIP RVS+       +S S AAVR+ARI+ +S+++ L N RWFVMGDDDTVFF +NLV VL++YDH QM+YIGGNSESVEQD +HAY+MAF
Subjt:  -KKQLGAEKSIPYRVSEK-CRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAF

Query:  GGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLM
        GGGGFA+S PLAA+L   +D CLQRY YFYGSDQRI +C+SEIGVP T +RGFHQ DIRG PYG LAAHPL PLVSLHHL +++P+FPN+  ++SL+ LM
Subjt:  GGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLM

Query:  QAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYER
        + Y +D NRILQQ  C+DR+R+WSISI+WGYT+QIY++   A +L TPLQTFKTWRS S+GPF FNTRP+  DPC RPV YF++  ++V   GT+T Y  
Subjt:  QAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRPVVYFLNRVQEVHTRGTQTSYER

Query:  FMAKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKAK-----FTLKASAGSARNKEAERKARK
           K    C   ++ R+  VK + V+SMK D E W KA        ++   G  + KE   + RK
Subjt:  FMAKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKAK-----FTLKASAGSARNKEAERKARK

AT4G11350.1 Protein of unknown function (DUF604)4.4e-9343.46Show/hide
Query:  RAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ-KKQLGAEKSIP-YRVSEKCRGAGYSCSV---AAVRMARIVV
        +AV +E E     AT+++H+VFGI AS + W+ R  Y ++W+ P + RG+VWLDEE + K + G ++S+P  R+S       Y+      +A+R++RIV 
Subjt:  RAVVSESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQ-KKQLGAEKSIP-YRVSEKCRGAGYSCSV---AAVRMARIVV

Query:  ESF----KVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQR
        E+          N RWFVMGDDDTVF  DNL+ VL +YDH QMYYIG  SES  Q+ I +Y MA+GGGGFAISYPLA  L  + D C+QRY   YGSD R
Subjt:  ESF----KVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQR

Query:  IWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQI
        + AC++E+GVPLT + GFHQ+D+ G+ +G+LAAHP+ P VS+HHLD VEP+FPN TRV ++K L    ++DS  +LQQS CYD+ + W+IS++WG+ VQ+
Subjt:  IWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQI

Query:  YSFMEPAKDLQTPLQTFKTWRSWSN-GPFTFNTRPVSSDPCRRPVVYFLNRVQ-EVHTRGTQTSYERFMAKEEKVCETADYARVIAVKEVTVSSMKMDTE
        +      ++++ P +TF  W   ++   + FNTRPVS + C++P V+ ++  + +     T + Y R     +  C   D A    +  + V   K D  
Subjt:  YSFMEPAKDLQTPLQTFKTWRSWSN-GPFTFNTRPVSSDPCRRPVVYFLNRVQ-EVHTRGTQTSYERFMAKEEKVCETADYARVIAVKEVTVSSMKMDTE

Query:  LWKKA
        LW ++
Subjt:  LWKKA

AT4G23490.1 Protein of unknown function (DUF604)9.5e-9642.19Show/hide
Query:  SGTQSRQFSLPDHTRAVVSESEDAGTG-ATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEE--------------PQKKQLGAEKSIPYR
        S   S   SL    R    E ED      T+++H+VFGI AS + W+ R  Y ++W+ P R RG+VWLD+E              P  K  G   S PY 
Subjt:  SGTQSRQFSLPDHTRAVVSESEDAGTG-ATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEE--------------PQKKQLGAEKSIPYR

Query:  VSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQL
          +  R        +A+R++RIV E+ ++G  N RWFVMGDDDTVF  DNL+ VL +YDH QMYYIG  SES  Q+   +Y MA+GGGGFAISYPLA  L
Subjt:  VSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQL

Query:  VNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSF
          + D C+QRY   YGSD R+ AC++E+GVPLT + GFHQ+D+ G+ +G+LAAHP+ P VS+HHLD VEP+FPN TRV +LK + +  ++DS  +LQQS 
Subjt:  VNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSF

Query:  CYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSN-GPFTFNTRPVSSDPCRRPVVYFLNRV---QEVHTRGTQTSYERFM--AKEEKVC
        CYD+ + W+IS++WGY VQI+  +   ++++ P +TF  W   ++   + FNTRPVS +PC++P V++++     Q+++T  ++ +  R    +   K+ 
Subjt:  CYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSN-GPFTFNTRPVSSDPCRRPVVYFLNRV---QEVHTRGTQTSYERFM--AKEEKVC

Query:  ETADYARVIAVKEVTVSSMKMDTELWKKA
          A+   ++  K       K D  LW+++
Subjt:  ETADYARVIAVKEVTVSSMKMDTELWKKA

AT5G41460.1 Protein of unknown function (DUF604)1.3e-9746.37Show/hide
Query:  TNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWL-------DEE-----PQKKQLGAEKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGL
        T   H+VFGI AS + W+ R  Y ++W+ PN+ R +VWL       DEE     P  K  G     PY+  +  R        +A+R++RIV E+ K+GL
Subjt:  TNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWL-------DEE-----PQKKQLGAEKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGL

Query:  SNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVP
         + RWFVMGDDDTVF A+NL+ VL +YDHNQMYYIG  SES  Q+   +Y MA+GGGGFAISYPLA  L  + D C++RY   YGSD R+ AC++E+GVP
Subjt:  SNARWFVMGDDDTVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVP

Query:  LTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQ
        LT + GFHQ+D+ G+ +G+LAAHP+ PLV+LHHLD VEP+FPN TRVD+LK L    ++DS  ++QQS CYD+RR+W++S++WG+ VQI+  +  A++++
Subjt:  LTTQRGFHQFDIRGHPYGILAAHPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQ

Query:  TPLQTFKTW-RSWSNGPFTFNTRPVSSDPCRRPVVYFL--NRVQEVHTRGTQTSYE-RFMAKEE---KVCETADYARVIAVKEVTVSSMKMDTELWKKA
         P +TF  W R      + FNTRPVS  PC++P V+++   RV  V T  T + YE   +A  E   K+   +D   VI  K       K D  LW ++
Subjt:  TPLQTFKTW-RSWSNGPFTFNTRPVSSDPCRRPVVYFL--NRVQEVHTRGTQTSYE-RFMAKEE---KVCETADYARVIAVKEVTVSSMKMDTELWKKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTTGTGGAGGGAGAAAAAGGGGCGTTTGCTTTTACGGGGCTGGAGTGGGATACCCATTTGGCAGTTAAAAGAGGTGGCATTTCTTTTGAAAGCAGGAATCTCCGC
CCACATAACACACCATTGGAAAATTGTGCTGGTATTCCGCTTCCCTCTTGCATTTCATCATGCAAGCAGAGAACATGATGATTCAGACCGGGTCGGAACCCAATCAAGGG
TTGGCCTCCTGCTCTGCCTCGTGGCGTCCATGTCACTCGCCGTCCACTCCGCTTTCTCCGGAACCCAGTCTCGGCAGTTCAGTCTCCCCGACCACACCCGAGCCGTCGTC
TCAGAGTCGGAGGACGCAGGGACAGGCGCCACCAACATATCGCACATCGTGTTCGGAATCGGCGCTTCGGTTCAAACGTGGCGAGATCGCAGCCTCTACAGCCAACTGTG
GTGGGACCCGAACCGGAGTCGTGGGTTCGTTTGGCTGGACGAAGAACCACAGAAGAAGCAACTCGGGGCGGAGAAGTCGATTCCGTATCGGGTGTCGGAGAAGTGCCGGG
GGGCGGGTTATTCGTGCTCGGTGGCGGCGGTTCGAATGGCCCGAATAGTAGTGGAGAGCTTCAAAGTGGGGCTATCGAACGCGAGATGGTTTGTGATGGGAGACGACGAT
ACGGTGTTCTTCGCTGACAACTTAGTTTCGGTGTTAGCGAGATACGATCACAATCAGATGTACTACATTGGTGGGAACTCGGAGAGCGTGGAGCAGGACGAGATCCATGC
GTACGAGATGGCGTTCGGTGGCGGTGGGTTTGCGATAAGCTACCCATTGGCAGCGCAACTGGTCAACGTATTGGACGGCTGTCTCCAACGCTACAGTTATTTCTACGGCT
CTGATCAGAGGATATGGGCATGTGTAAGCGAGATCGGGGTGCCACTCACCACGCAGCGCGGGTTCCACCAGTTTGATATTAGAGGTCACCCGTATGGGATTTTGGCAGCG
CATCCCTTGGTGCCACTTGTATCGCTTCATCACCTTGACCATGTGGAACCCTTGTTTCCAAACCAGACTAGGGTTGATTCATTGAAGTTACTTATGCAAGCATACCGAGT
CGACTCGAATCGGATTCTTCAGCAAAGTTTCTGCTATGACAGGAGGAGAAGATGGTCCATCTCCATTGCATGGGGCTACACCGTTCAGATATACTCTTTCATGGAGCCGG
CCAAGGACTTGCAGACTCCATTACAGACGTTCAAGACATGGAGGAGCTGGAGCAATGGACCTTTCACCTTCAATACACGACCCGTGAGTTCTGACCCGTGCAGGCGACCG
GTTGTTTATTTTTTGAACCGAGTTCAGGAGGTGCACACGAGGGGTACCCAAACCAGTTACGAAAGGTTCATGGCCAAGGAAGAGAAAGTATGTGAGACAGCTGATTACGC
TCGTGTCATAGCAGTGAAAGAAGTGACTGTCTCCTCAATGAAGATGGACACTGAGCTTTGGAAGAAGGCAAAATTTACCTTGAAAGCTAGCGCAGGATCCGCCAGGAACA
AAGAGGCGGAAAGGAAAGCGAGGAAGCTGGCTTTAGAGGCGGACTCTCCCACGGAAATTCAGTTCACACACACATTCGCATTCGGTATCTTGTCGGCACTCGCAGACATT
GACGTCTGCCCACTACCTCCATGGCTCTCCCATTACCTAAAGTCCAACTCTTTGGAAAGGGTATCAATGAAGCCAGCCGCCCCGCAATTTCTTTTCTTTGCCCATATTCC
ATATCCGGCCGGAAATCAGCTAGAGCTTCGAAACACGGACCCTCCCCTCTCTCTCGCACTTCCTGACTCAGAGTGCAGAAGCCCTCGCGTGAGCAACTGCTGCTTTGTTG
TACCATTCTATGCGACCATTCTTCGTAGTTGGAGAGAGAGAGAGAGAGAGCCGGATGCTATTTTCTTCGGCCTTTTCCATCTATATTTTGCCTCACTGCCTGTACCTGCT
GGAAACGAAAGTCCATTGGATAGCAAAAAGGGTTCTTTTTTTTCCTTTGTTCGGTTGGTGTTGGTCATTGATCTTATTCGGTGGATTCTGTTTTGCTTGTGTTTCTCTGC
TTCTCCTTTCTTGGCTATAATAACTCCGGCTGGTTGTCGCGGGTTTTCATGCAAAAAAGAATCGCACTTTGTTTTGGAGCAAGGAGTTTCAGAAGAATTGCAAGGCAGAG
AGGTTATTGAATATTGTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTTTGTGGAGGGAGAAAAAGGGGCGTTTGCTTTTACGGGGCTGGAGTGGGATACCCATTTGGCAGTTAAAAGAGGTGGCATTTCTTTTGAAAGCAGGAATCTCCGC
CCACATAACACACCATTGGAAAATTGTGCTGGTATTCCGCTTCCCTCTTGCATTTCATCATGCAAGCAGAGAACATGATGATTCAGACCGGGTCGGAACCCAATCAAGGG
TTGGCCTCCTGCTCTGCCTCGTGGCGTCCATGTCACTCGCCGTCCACTCCGCTTTCTCCGGAACCCAGTCTCGGCAGTTCAGTCTCCCCGACCACACCCGAGCCGTCGTC
TCAGAGTCGGAGGACGCAGGGACAGGCGCCACCAACATATCGCACATCGTGTTCGGAATCGGCGCTTCGGTTCAAACGTGGCGAGATCGCAGCCTCTACAGCCAACTGTG
GTGGGACCCGAACCGGAGTCGTGGGTTCGTTTGGCTGGACGAAGAACCACAGAAGAAGCAACTCGGGGCGGAGAAGTCGATTCCGTATCGGGTGTCGGAGAAGTGCCGGG
GGGCGGGTTATTCGTGCTCGGTGGCGGCGGTTCGAATGGCCCGAATAGTAGTGGAGAGCTTCAAAGTGGGGCTATCGAACGCGAGATGGTTTGTGATGGGAGACGACGAT
ACGGTGTTCTTCGCTGACAACTTAGTTTCGGTGTTAGCGAGATACGATCACAATCAGATGTACTACATTGGTGGGAACTCGGAGAGCGTGGAGCAGGACGAGATCCATGC
GTACGAGATGGCGTTCGGTGGCGGTGGGTTTGCGATAAGCTACCCATTGGCAGCGCAACTGGTCAACGTATTGGACGGCTGTCTCCAACGCTACAGTTATTTCTACGGCT
CTGATCAGAGGATATGGGCATGTGTAAGCGAGATCGGGGTGCCACTCACCACGCAGCGCGGGTTCCACCAGTTTGATATTAGAGGTCACCCGTATGGGATTTTGGCAGCG
CATCCCTTGGTGCCACTTGTATCGCTTCATCACCTTGACCATGTGGAACCCTTGTTTCCAAACCAGACTAGGGTTGATTCATTGAAGTTACTTATGCAAGCATACCGAGT
CGACTCGAATCGGATTCTTCAGCAAAGTTTCTGCTATGACAGGAGGAGAAGATGGTCCATCTCCATTGCATGGGGCTACACCGTTCAGATATACTCTTTCATGGAGCCGG
CCAAGGACTTGCAGACTCCATTACAGACGTTCAAGACATGGAGGAGCTGGAGCAATGGACCTTTCACCTTCAATACACGACCCGTGAGTTCTGACCCGTGCAGGCGACCG
GTTGTTTATTTTTTGAACCGAGTTCAGGAGGTGCACACGAGGGGTACCCAAACCAGTTACGAAAGGTTCATGGCCAAGGAAGAGAAAGTATGTGAGACAGCTGATTACGC
TCGTGTCATAGCAGTGAAAGAAGTGACTGTCTCCTCAATGAAGATGGACACTGAGCTTTGGAAGAAGGCAAAATTTACCTTGAAAGCTAGCGCAGGATCCGCCAGGAACA
AAGAGGCGGAAAGGAAAGCGAGGAAGCTGGCTTTAGAGGCGGACTCTCCCACGGAAATTCAGTTCACACACACATTCGCATTCGGTATCTTGTCGGCACTCGCAGACATT
GACGTCTGCCCACTACCTCCATGGCTCTCCCATTACCTAAAGTCCAACTCTTTGGAAAGGGTATCAATGAAGCCAGCCGCCCCGCAATTTCTTTTCTTTGCCCATATTCC
ATATCCGGCCGGAAATCAGCTAGAGCTTCGAAACACGGACCCTCCCCTCTCTCTCGCACTTCCTGACTCAGAGTGCAGAAGCCCTCGCGTGAGCAACTGCTGCTTTGTTG
TACCATTCTATGCGACCATTCTTCGTAGTTGGAGAGAGAGAGAGAGAGAGCCGGATGCTATTTTCTTCGGCCTTTTCCATCTATATTTTGCCTCACTGCCTGTACCTGCT
GGAAACGAAAGTCCATTGGATAGCAAAAAGGGTTCTTTTTTTTCCTTTGTTCGGTTGGTGTTGGTCATTGATCTTATTCGGTGGATTCTGTTTTGCTTGTGTTTCTCTGC
TTCTCCTTTCTTGGCTATAATAACTCCGGCTGGTTGTCGCGGGTTTTCATGCAAAAAAGAATCGCACTTTGTTTTGGAGCAAGGAGTTTCAGAAGAATTGCAAGGCAGAG
AGGTTATTGAATATTGTATGTGA
Protein sequenceShow/hide protein sequence
MRLWREKKGRLLLRGWSGIPIWQLKEVAFLLKAGISAHITHHWKIVLVFRFPLAFHHASREHDDSDRVGTQSRVGLLLCLVASMSLAVHSAFSGTQSRQFSLPDHTRAVV
SESEDAGTGATNISHIVFGIGASVQTWRDRSLYSQLWWDPNRSRGFVWLDEEPQKKQLGAEKSIPYRVSEKCRGAGYSCSVAAVRMARIVVESFKVGLSNARWFVMGDDD
TVFFADNLVSVLARYDHNQMYYIGGNSESVEQDEIHAYEMAFGGGGFAISYPLAAQLVNVLDGCLQRYSYFYGSDQRIWACVSEIGVPLTTQRGFHQFDIRGHPYGILAA
HPLVPLVSLHHLDHVEPLFPNQTRVDSLKLLMQAYRVDSNRILQQSFCYDRRRRWSISIAWGYTVQIYSFMEPAKDLQTPLQTFKTWRSWSNGPFTFNTRPVSSDPCRRP
VVYFLNRVQEVHTRGTQTSYERFMAKEEKVCETADYARVIAVKEVTVSSMKMDTELWKKAKFTLKASAGSARNKEAERKARKLALEADSPTEIQFTHTFAFGILSALADI
DVCPLPPWLSHYLKSNSLERVSMKPAAPQFLFFAHIPYPAGNQLELRNTDPPLSLALPDSECRSPRVSNCCFVVPFYATILRSWREREREPDAIFFGLFHLYFASLPVPA
GNESPLDSKKGSFFSFVRLVLVIDLIRWILFCLCFSASPFLAIITPAGCRGFSCKKESHFVLEQGVSEELQGREVIEYCM