; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017994 (gene) of Chayote v1 genome

Gene IDSed0017994
OrganismSechium edule (Chayote v1)
DescriptionXS domain-containing protein
Genome locationLG04:33794665..33805950
RNA-Seq ExpressionSed0017994
SyntenySed0017994
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574981.1 Protein SUPPRESSOR OF GENE SILENCING 3-like protein, partial [Cucurbita argyrosperma subsp. sororia]4.3e-26883.36Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGG+NT +SSQKPSS+SAAPSNRKSRWESST NNPP+DPK DSKSSKP    +NPSSKS IS     PK+ ADK V PTPA AP+PSPG+PLPFPDPSA
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSG--VPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDP
        LGPPPPPSYGFHML+RRT+VLADGSVRSYFALP+DY+DF PPAR +DLA RFLPMGSG   PG EY GFDHRFPPGGPMSP+EFRGVREEQFARARPQD 
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSG--VPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDP

Query:  WNSRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGN
        WNSRGT+ERGGPA+ S+KRKF DDNEKD KD       RQQQFLHNGNANGFLTG G  RGDFLAGTSDPYGRTEDMRFSKY R GGSY +EG+RLG GN
Subjt:  WNSRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGN

Query:  NVAPKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNS
        NVAPKY EVDQNALRKAFLHFVKTIYENANQKK+YLEDGK GRLQCLACARSS+DFPDMH LIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNS
Subjt:  NVAPKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNS

Query:  RGYQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDN
        RGY+FLSADEA ANQEDLIMWPP VIIHN  TGK KDGRMEG+GNKAMDSKIRDLGF GGKSKSLYGR+GHLG+TLIKFSGDQ GLNEAKRL+EFF KDN
Subjt:  RGYQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDN

Query:  HGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
        HGR  WA VRPAA+S+DDDKNPNLVKVDEK+GE+KRIFYG+LAT AD++KVDF+TRKKV IESCRDFK  R
Subjt:  HGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

KAG7013553.1 Protein SUPPRESSOR OF GENE SILENCING 3-like protein [Cucurbita argyrosperma subsp. argyrosperma]1.9e-26883.54Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGG+NT +SSQKPSSSSAAPSNRKSRWESST NNPP+DPK DSKSSKP    +NPSSKS IS     PK+ ADK V PTPA AP+PSPG+PLPFPDPSA
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSG--VPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDP
        LGPPPPPSYGFHML+RRT+VLADGSVRSYFALP+DY+DF PPAR +DLA RFLPMGSG   PG EY GFDHRFPPGGPMSP+EFRGVREEQFARARPQD 
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSG--VPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDP

Query:  WNSRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGN
        WNSRGT+ERGGPA+ S+KRKF DDNEKD KD       RQQQFLHNGNANGFLTG G  RGDFLAGTSDPYGRTEDMRFSKY R GGSY +EG+RLG GN
Subjt:  WNSRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGN

Query:  NVAPKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNS
        NVAPKY EVDQNALRKAFLHFVKTIYENANQKK+YLEDGK GRLQCLACARSS+DFPDMH LIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNS
Subjt:  NVAPKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNS

Query:  RGYQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDN
        RGY+FLSADEA ANQEDLIMWPP VIIHN  TGK KDGRMEG+GNKAMDSKIRDLGF GGKSKSLYGR+GHLG+TLIKFSGDQ GLNEAKRL+EFF KDN
Subjt:  RGYQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDN

Query:  HGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
        HGR  WA VRPAA+S+DDDKNPNLVKVDEK+GE+KRIFYG+LAT AD++KVDF+TRKKV IESCRDFK  R
Subjt:  HGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

XP_008447151.1 PREDICTED: uncharacterized protein LOC103489668 [Cucumis melo]2.5e-26884.4Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVISPKNPADKAVKPTPALAPIPSPGLPLPFPDPSALGPPP
        MAGG+NT KSSQKP SSSAAPS+RKSRWESS +NNPPSDPK DSKSSKP    H+PSSKS ISP +   K + PTPA AP+PSPG+PLPFPDPSALGPPP
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVISPKNPADKAVKPTPALAPIPSPGLPLPFPDPSALGPPP

Query:  PPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWNSRGTD
        PPSYGFHML+RRT+VLADGSVRSYFALP+DY +F PPAR +DLAARFLPMGS   G EY GFDHRFPPGGPMSP+E RG REEQF RARPQD WNSRGTD
Subjt:  PPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWNSRGTD

Query:  ERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNVAPKYL
        ERGGPAD SMKRKF DDNEKD KD       RQQQ LHNGNANGFLTGSG  RGDFLAGTSDPYGRTED RFSKY RVGGSYE+EG+RLG G NVAPKYL
Subjt:  ERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNVAPKYL

Query:  EVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRGYQFLS
        EVDQ+ALRKAFLHFVKTI ENANQKK+YLEDGKHGRLQCLACARSS+DFPDMH LIMHTYNS+SADSQVDHLGLHKALCVLMGWNYSKPPDNSRGY+FLS
Subjt:  EVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRGYQFLS

Query:  ADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAWA
        ADEAAANQEDLIMWPP VIIHN  TGK KDGRMEG+GNKAMDSKIRDLGF GGKSKSLYGR+GHLG+TLIKFSGDQ GLNEAKRLAEFF KDNHGR+AWA
Subjt:  ADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAWA

Query:  LVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
         VRPAAYS+DDDKNPNLVKVDEK+GEKKRIFYG+LAT AD+DKVDF+TRKKV IESCRDFKS R
Subjt:  LVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

XP_023549106.1 uncharacterized protein LOC111807566 [Cucurbita pepo subsp. pepo]1.4e-26682.95Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGG+NT +SSQKPSS+SAAPSNRKSRWESST NNPPSDPK DSKSSKP    +NPSSKS IS     PK+ ADK V PTPA AP+PSPG+PLPFPDPSA
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN
        LGPPPPPSYGFHML+RRT+VLADGSVRSYFALP+DY+DF PPAR +DLA RFLPMGSG PG EY GFDHRFPPGGPMSP+EFRGVREEQFARARP D WN
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN

Query:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV
        SRG +ERGGPA+ S+KRKF DDNEKD KD       RQQQFLHNGNANGFLTG G  RGDFLAGTSDPYGRTEDMRFSKY R GGSY +EG+RLG GNNV
Subjt:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV

Query:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
        APKY EVDQNALRKAFLHFVKTIYENANQKK+YLEDGK GRLQCLACARSS+DFPDMH LI+HTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
Subjt:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG

Query:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG
        Y+FLSA EA ANQEDLIMWPP VIIHN  TGK KDGRMEG+GNKAMDSKIRDLGF GGKSKSLYGR+GHLG+TLIKFSGD  GLNEAKRLAEFF KDNHG
Subjt:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG

Query:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
        R  WA VRPAA+S+DDDKNPNLVKVDEK+GE+ RIFYG+LAT AD++KVDF+TRKKV IESCRDFK  R
Subjt:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

XP_038874568.1 uncharacterized protein LOC120067168 [Benincasa hispida]1.7e-27285.19Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGG+NT KSSQKPSSSSAAPSNRKSRWESST NNPPSDPK DSKSSKP    HN SSKS IS     PK+P DKAV PTPA AP+PSPG+PLPFPD SA
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN
        LGPPPPPSYGFHML+RRT+VLADGSVRSYF+LP+DY DF PP+R +DLA RFLPMGSG PG EY GFDHRFPPGGPMSP+EFRGVREEQF RARPQD WN
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN

Query:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV
        SRGTDERGGPAD SMKRKF DDNEKD KD       RQQQFLHNGNANGFLTGSG  RGDFLAGTSDPYGRTED RFSKY RVGGSYE+EG+RLG GNNV
Subjt:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV

Query:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
        APKYLEVDQNALRKAFLHFVKTI ENANQKK+YLEDGKHGRLQCLACARSS+DFPDMH LIMHTYNS+SAD QVDHLGLHKALCVLMGWNYSKPPDNSRG
Subjt:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG

Query:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG
        Y+FLSADEAAANQEDLIMWPP VIIHN  TGK KDGRMEG+GNKAMDSK+RDLGF GGKSKSLYGR+GHLG+TLIKFSGDQ GLNEAKRLAEFF KDNHG
Subjt:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG

Query:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKS
        R AWA VRPAA+S+DDDKNPNLVKVDEK+GEKKRIFYG+LAT AD+DKVDF+TRKKV IES RDFKS
Subjt:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKS

TrEMBL top hitse value%identityAlignment
A0A1S3BHM0 uncharacterized protein LOC1034896681.2e-26884.4Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVISPKNPADKAVKPTPALAPIPSPGLPLPFPDPSALGPPP
        MAGG+NT KSSQKP SSSAAPS+RKSRWESS +NNPPSDPK DSKSSKP    H+PSSKS ISP +   K + PTPA AP+PSPG+PLPFPDPSALGPPP
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVISPKNPADKAVKPTPALAPIPSPGLPLPFPDPSALGPPP

Query:  PPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWNSRGTD
        PPSYGFHML+RRT+VLADGSVRSYFALP+DY +F PPAR +DLAARFLPMGS   G EY GFDHRFPPGGPMSP+E RG REEQF RARPQD WNSRGTD
Subjt:  PPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWNSRGTD

Query:  ERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNVAPKYL
        ERGGPAD SMKRKF DDNEKD KD       RQQQ LHNGNANGFLTGSG  RGDFLAGTSDPYGRTED RFSKY RVGGSYE+EG+RLG G NVAPKYL
Subjt:  ERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNVAPKYL

Query:  EVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRGYQFLS
        EVDQ+ALRKAFLHFVKTI ENANQKK+YLEDGKHGRLQCLACARSS+DFPDMH LIMHTYNS+SADSQVDHLGLHKALCVLMGWNYSKPPDNSRGY+FLS
Subjt:  EVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRGYQFLS

Query:  ADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAWA
        ADEAAANQEDLIMWPP VIIHN  TGK KDGRMEG+GNKAMDSKIRDLGF GGKSKSLYGR+GHLG+TLIKFSGDQ GLNEAKRLAEFF KDNHGR+AWA
Subjt:  ADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAWA

Query:  LVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
         VRPAAYS+DDDKNPNLVKVDEK+GEKKRIFYG+LAT AD+DKVDF+TRKKV IESCRDFKS R
Subjt:  LVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

A0A6J1CA20 uncharacterized protein LOC1110095865.3e-26482.07Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGGNNT K SQKPSSSSAAPSNRKSRWES++NNNPPSDPK DS++SKP    H PS K+ IS     P +P DK V PTPA APIPSP    PFPDPSA
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN
        LGP PPPSYGFHML+RRT+VLADGSVRSYFALP+DYQDF PPARP+DLA RFLPMGSG PGREY GFDHRFPPGGPMSP+EFRG REEQFARARPQD WN
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN

Query:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV
        SRG DERGG A+FSMKRKF DDNEKD KD       RQQQFLHNGNANGF +G G  RGDFL GTSDPY RTEDMRFSKY RVGG YE+EG+RLG G+NV
Subjt:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV

Query:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
        APKYLEVDQNAL+KAF+HFVKTI ENANQ+K+YL +GKHGRLQCLACARSS++FPDMH L+MHTYNSDSADSQVDHLGLHKALCVLMGWNYSK PDNSRG
Subjt:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG

Query:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG
        YQFLSADEAAANQ+DLIMWPP VIIHN  TGKGKDGRMEG+GNKAMDSKIRDLGF GGKSKSLYGR+GHLG+TLIKFSGDQ GLNEAKRLAEFF K  HG
Subjt:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG

Query:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
        R AWA VRPA +S DDDKNPNLVK DEKTGEKKRIFYG+LAT AD+DKVDF+TRKKVAIESCRDFKS R
Subjt:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

A0A6J1EWJ6 uncharacterized protein LOC1114367861.4e-26482.78Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGGNNT + SQKP SSS APSNRKSRWESST NNPPSDPK DSKSSKP    HNPSSK  IS     PK+PADKA  PTPA AP+ SPG+ LPF DPS 
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN
        LGPPPPPSYGFHMLDRRT+VLADGSVRSYFALP+DYQDF PPARP+DLAARFLPMGSG PGREY GFD RFPPGGPMSPNEFRGVREEQFAR+RPQD WN
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN

Query:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV
        SRGTDERGGPA+FSMKRKF DDNEKD KD       RQQQFLHN NANGF TGSG  RGDFL GTSDPYGRTED+RFSKY RVGGSYE+EG+R G G+NV
Subjt:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV

Query:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
        APKYLE+DQNALRKAFLHFVKTI ENA QKK+YLEDGKHGRLQCLAC+RSS+DFPDMH LIMHTYN DSAD  VDHLGLHKALCVLMGWNYSKPPD+SRG
Subjt:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG

Query:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG
        Y++LSADEAAANQEDLIMWPP VIIHN   GKGKDGRMEG+GNKAMDSKIR+LGF GGKSKSLYGREGHLG+TLIKFSG+Q GLNEAKRLAEFF KDNHG
Subjt:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG

Query:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
        R AWA +RPAA   DDDKNPNLVKVDEKTGE+ RIFYG+LAT AD+DKVDF+TRKKVAIES RD KS R
Subjt:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

A0A6J1KEJ0 uncharacterized protein LOC111494278 isoform X22.7e-20783.59Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGGNNT K SQKP SSS APSNRKSRWESST NNPPSDPK DSKSSKP    HNPSSK  IS     PK+PADKA  PTPA AP+ SPG+PLPF DPS 
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN
        LGPPPPPSYGFHMLDRRT+VLADGSVRSYFALP+DYQDF PPARP+DLAARFLPMGSG PGREY GFD RFPPGGPMSPNEFRGVREEQFAR+RPQ+ WN
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN

Query:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV
        SRGTDERGGPA+FSMKRKF DDNEKD KD       RQQQFLHN NANGF TGSG  RGDFLAGTSDPYGRTED+RFSKY RVGGSYE+EG+R G G+NV
Subjt:  SRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV

Query:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
        APKYLEVDQNALRKAFLHFVKTI ENA QKK+YLEDGKHGRLQCLAC+RSS+DFPDMH LIMHTYN DSAD  VDHLGLHKALCVLMGWNYSKPPD+SRG
Subjt:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG

Query:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIR
        Y++LSADEAAANQEDLIMWPP VIIHN   GKGKDGRMEG+GNKAMDSKIR
Subjt:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIR

A0A6J1L434 uncharacterized protein LOC1114989661.8e-26483.13Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA
        MAGG+NT +SSQKPSS+SAAPSNRKSRWESST NNPPS    DSKSSKP    +NPSSKS IS     PK+ ADK V PTPA AP+PSPG+PLPFPDPSA
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVIS-----PKNPADKAVKPTPALAPIPSPGLPLPFPDPSA

Query:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN
        LGPPPPPSYGFHML+RRT+VLADGSVRSYFALP+DY+DF PPAR +DLA RFLPMGSG PG EY GFDHRFPPGGPMSP+EFRGV+EEQFARARPQD WN
Subjt:  LGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWN

Query:  SRGTDERGGPADFSMKRKFIDDNEK---DEKD----RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV
        SRGT+ERGGPA+ S+KRKF DDNEK   DEKD    RQQQFLHNGNANGFLTG G  RGDFLAGTSDPYGRTEDMRFSKY R GGSY +EG+RLG GNNV
Subjt:  SRGTDERGGPADFSMKRKFIDDNEK---DEKD----RQQQFLHNGNANGFLTGSG-GRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNV

Query:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
        APKY EVDQNALRKAFLHFVKTIYENA QKK+YLEDGK GRLQCLACARSS+DFPDMH LIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG
Subjt:  APKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRG

Query:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG
        Y+FLSADEA ANQEDLIMWPP VIIHN  TGK KDGRMEG+GNKAMDSKIRDLGF GGKSKSLYGR+GHLG+TLIKFSGDQ GLNEAK+LAEFF KDNHG
Subjt:  YQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHG

Query:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR
        R  WA VRPAA+S+DDDKNPNLVKVDEK+GE+KRIFYG+LAT AD++KVDF+TRKKV IESCRDFK  R
Subjt:  RAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRDFKSPR

SwissProt top hitse value%identityAlignment
A1Y2B7 Protein SUPPRESSOR OF GENE SILENCING 3 homolog4.9e-0927.27Show/hide
Query:  LHKALCVLMGWNYSKPPDNSRGYQFLSADEAAANQEDL-------IMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLG
        LH+ L  L+    S+     RG   L A E     + L       I+WPP VI+ N +  K +D + +G+GN+ +     +  +   K++  YG  GH G
Subjt:  LHKALCVLMGWNYSKPPDNSRGYQFLSADEAAANQEDL-------IMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLG

Query:  STLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLD
         +++ F     G  EA+RL + F      R +W                +L KV    G K+++ YG LA   D++
Subjt:  STLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLD

Q9LHB1 Factor of DNA methylation 38.2e-0429.79Show/hide
Query:  QEDLIMWPPQVIIHNLYTGKGKDGR--MEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAW
        Q + ++WP + ++ N+ T   +DGR      G K  D  IR  GF   + ++++ R GH G+ +++F+ D  GL +A    + +  D HG+  W
Subjt:  QEDLIMWPPQVIIHNLYTGKGKDGR--MEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAW

Arabidopsis top hitse value%identityAlignment
AT3G12550.1 XH/XS domain-containing protein5.8e-0529.79Show/hide
Query:  QEDLIMWPPQVIIHNLYTGKGKDGR--MEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAW
        Q + ++WP + ++ N+ T   +DGR      G K  D  IR  GF   + ++++ R GH G+ +++F+ D  GL +A    + +  D HG+  W
Subjt:  QEDLIMWPPQVIIHNLYTGKGKDGR--MEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAW

AT3G12550.2 XH/XS domain-containing protein5.8e-0529.79Show/hide
Query:  QEDLIMWPPQVIIHNLYTGKGKDGR--MEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAW
        Q + ++WP + ++ N+ T   +DGR      G K  D  IR  GF   + ++++ R GH G+ +++F+ D  GL +A    + +  D HG+  W
Subjt:  QEDLIMWPPQVIIHNLYTGKGKDGR--MEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAW

AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)4.6e-12748.88Show/hide
Query:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNP-PSDPKPDSKSSKPHNHNHNPSSKSVISPKNPADK---AVKPTPALAPIPS-------------
        M G NN  KSS     +++ P+   S   SS N+ P  +    D   S  +N+N   S   +I  +  ADK      P+P LAPIP              
Subjt:  MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNP-PSDPKPDSKSSKPHNHNHNPSSKSVISPKNPADK---AVKPTPALAPIPS-------------

Query:  ---PGLPLPFPDPS-ALGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRG
           P  P PFPD S ALGPPP P+YGFHML+RRT+VLADGSVRSYFALP +Y DF P +R          +   V G        RFP   P  P EFR 
Subjt:  ---PGLPLPFPDPS-ALGPPPPPSYGFHMLDRRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRG

Query:  VREEQFARARPQDPWNSRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSGGRGDFLAGTSDPYGRTEDMRFSKYSRVGG
                                   D  MKRK+  + E D +D       ++QQF+   N N           F+AGTS   G  ED+R +K+ RVG 
Subjt:  VREEQFARARPQDPWNSRGTDERGGPADFSMKRKFIDDNEKDEKD-------RQQQFLHNGNANGFLTGSGGRGDFLAGTSDPYGRTEDMRFSKYSRVGG

Query:  SYEHEGVRLGIGNNVAPKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCV
        S    G              +VDQ AL+K+FL FVK ++E+  +KK+YLE+G+ GRLQCL C RSSKD  D HSL+MHTY SD + S+V HLGLHKALCV
Subjt:  SYEHEGVRLGIGNNVAPKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQCLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCV

Query:  LMGWNYSKPPDNSRGYQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLN
        LMGWN+SK PDNS+ YQ L ADEAA NQ  LI+WPP VI+ N  TGKGK+GRMEG GNK MD++IR+LG  GGKSKSLYGREGHLG TL KF+GD  GL 
Subjt:  LMGWNYSKPPDNSRGYQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDLGFVGGKSKSLYGREGHLGSTLIKFSGDQPGLN

Query:  EAKRLAEFFAKDNHGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRD
        +A R+AE+F K N GR +W  V+P   SKDD+KNP LV+VD +TGEKKRIFYG+LATV DLDKVD ET+KK  IES R+
Subjt:  EAKRLAEFFAKDNHGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCRD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGCGGTAACAACACAGGCAAGTCGTCCCAGAAACCTTCTTCTTCTTCCGCCGCCCCTTCTAATCGGAAATCCCGCTGGGAGTCCTCCACCAACAATAACCCTCC
CTCCGATCCCAAACCCGATTCTAAATCTTCAAAACCTCATAATCATAATCATAATCCTTCCTCCAAGTCTGTGATTAGCCCCAAGAATCCCGCCGATAAGGCTGTGAAAC
CAACTCCGGCCTTGGCGCCGATCCCTTCACCTGGACTCCCACTTCCCTTCCCCGACCCGTCGGCTCTCGGCCCACCGCCTCCGCCGTCTTATGGATTCCACATGCTCGAC
CGCCGCACGGTTGTTCTTGCCGATGGCAGTGTACGGTCGTACTTCGCTCTCCCTATCGACTACCAAGATTTTATTCCGCCTGCAAGGCCCGTGGATCTTGCTGCTCGATT
TTTGCCCATGGGGTCCGGTGTCCCTGGCCGGGAATATGCTGGATTTGATCATCGGTTTCCCCCTGGTGGCCCTATGAGTCCGAATGAGTTTCGGGGTGTTCGGGAAGAAC
AATTTGCGCGTGCTAGGCCTCAAGATCCCTGGAATTCTCGAGGGACGGATGAACGGGGTGGCCCTGCAGATTTTTCGATGAAGAGGAAATTCATTGACGACAATGAGAAG
GACGAGAAAGATCGACAGCAGCAATTCTTGCATAATGGGAATGCTAATGGATTTCTTACTGGTTCCGGCGGACGGGGTGATTTTTTGGCGGGAACGAGTGATCCATATGG
TCGGACTGAGGATATGAGGTTCTCGAAGTACTCACGAGTTGGTGGAAGCTACGAACACGAAGGTGTAAGGCTAGGTATTGGTAACAATGTAGCTCCTAAATATCTGGAAG
TTGACCAGAACGCATTGAGAAAGGCATTCCTTCACTTTGTGAAAACCATCTATGAAAATGCCAATCAGAAGAAGCATTATTTGGAGGATGGGAAACATGGTCGTCTTCAA
TGTCTTGCTTGTGCAAGGTCCTCTAAAGATTTTCCTGATATGCATAGCCTGATTATGCACACATACAACTCTGATAGTGCTGACTCCCAAGTGGATCATTTGGGTTTACA
CAAAGCCCTTTGTGTATTAATGGGATGGAACTACTCAAAGCCTCCTGATAATTCAAGAGGGTACCAGTTTCTATCTGCTGATGAAGCAGCAGCCAATCAGGAGGATCTGA
TTATGTGGCCTCCTCAAGTTATCATCCATAATCTATATACTGGAAAGGGAAAAGATGGGCGCATGGAGGGTGTTGGAAACAAAGCAATGGATAGTAAAATTAGAGACCTT
GGATTTGTGGGTGGGAAGTCCAAATCTTTGTATGGAAGGGAAGGTCATTTAGGCTCAACTTTGATCAAGTTTTCGGGTGATCAGCCTGGCCTCAATGAAGCCAAGAGACT
GGCTGAATTTTTTGCGAAGGACAATCATGGACGTGCAGCTTGGGCTCTCGTGCGACCTGCAGCATATAGCAAGGATGATGATAAAAATCCAAACCTTGTGAAGGTGGATG
AAAAGACCGGAGAGAAGAAAAGAATTTTTTATGGCCATCTTGCAACTGTAGCTGATTTGGACAAAGTTGATTTCGAAACAAGGAAGAAGGTGGCCATAGAGAGCTGTAGG
GATTTCAAATCGCCCAGGTAG
mRNA sequenceShow/hide mRNA sequence
GTCCGGGAGGTGAATCAGAGAAGCAATCGTGGCGGTGAATAAAACCCCCATTTAACAATTCTTCAGCCCTAAGCCAGAATCGGCGTTTTTCCGATCTCTCCGTCGTAAAC
CCTAGGCGGCTCCGCCCGCTTCTACAGCAGGCATGGCCGGCGGTAACAACACAGGCAAGTCGTCCCAGAAACCTTCTTCTTCTTCCGCCGCCCCTTCTAATCGGAAATCC
CGCTGGGAGTCCTCCACCAACAATAACCCTCCCTCCGATCCCAAACCCGATTCTAAATCTTCAAAACCTCATAATCATAATCATAATCCTTCCTCCAAGTCTGTGATTAG
CCCCAAGAATCCCGCCGATAAGGCTGTGAAACCAACTCCGGCCTTGGCGCCGATCCCTTCACCTGGACTCCCACTTCCCTTCCCCGACCCGTCGGCTCTCGGCCCACCGC
CTCCGCCGTCTTATGGATTCCACATGCTCGACCGCCGCACGGTTGTTCTTGCCGATGGCAGTGTACGGTCGTACTTCGCTCTCCCTATCGACTACCAAGATTTTATTCCG
CCTGCAAGGCCCGTGGATCTTGCTGCTCGATTTTTGCCCATGGGGTCCGGTGTCCCTGGCCGGGAATATGCTGGATTTGATCATCGGTTTCCCCCTGGTGGCCCTATGAG
TCCGAATGAGTTTCGGGGTGTTCGGGAAGAACAATTTGCGCGTGCTAGGCCTCAAGATCCCTGGAATTCTCGAGGGACGGATGAACGGGGTGGCCCTGCAGATTTTTCGA
TGAAGAGGAAATTCATTGACGACAATGAGAAGGACGAGAAAGATCGACAGCAGCAATTCTTGCATAATGGGAATGCTAATGGATTTCTTACTGGTTCCGGCGGACGGGGT
GATTTTTTGGCGGGAACGAGTGATCCATATGGTCGGACTGAGGATATGAGGTTCTCGAAGTACTCACGAGTTGGTGGAAGCTACGAACACGAAGGTGTAAGGCTAGGTAT
TGGTAACAATGTAGCTCCTAAATATCTGGAAGTTGACCAGAACGCATTGAGAAAGGCATTCCTTCACTTTGTGAAAACCATCTATGAAAATGCCAATCAGAAGAAGCATT
ATTTGGAGGATGGGAAACATGGTCGTCTTCAATGTCTTGCTTGTGCAAGGTCCTCTAAAGATTTTCCTGATATGCATAGCCTGATTATGCACACATACAACTCTGATAGT
GCTGACTCCCAAGTGGATCATTTGGGTTTACACAAAGCCCTTTGTGTATTAATGGGATGGAACTACTCAAAGCCTCCTGATAATTCAAGAGGGTACCAGTTTCTATCTGC
TGATGAAGCAGCAGCCAATCAGGAGGATCTGATTATGTGGCCTCCTCAAGTTATCATCCATAATCTATATACTGGAAAGGGAAAAGATGGGCGCATGGAGGGTGTTGGAA
ACAAAGCAATGGATAGTAAAATTAGAGACCTTGGATTTGTGGGTGGGAAGTCCAAATCTTTGTATGGAAGGGAAGGTCATTTAGGCTCAACTTTGATCAAGTTTTCGGGT
GATCAGCCTGGCCTCAATGAAGCCAAGAGACTGGCTGAATTTTTTGCGAAGGACAATCATGGACGTGCAGCTTGGGCTCTCGTGCGACCTGCAGCATATAGCAAGGATGA
TGATAAAAATCCAAACCTTGTGAAGGTGGATGAAAAGACCGGAGAGAAGAAAAGAATTTTTTATGGCCATCTTGCAACTGTAGCTGATTTGGACAAAGTTGATTTCGAAA
CAAGGAAGAAGGTGGCCATAGAGAGCTGTAGGGATTTCAAATCGCCCAGGTAGACTATATCAATAAGTATTTATAGTTCAGCATGCGATCAATTTGCATCCTTGGTTGAG
CTGCAAATTTCAAATCCACTTCAATTTTAGCCCAATGGTACATCGATGAAGTCTTTACCAAGCAGCTTGTTAAATATTGTCCTCTGTTATAAAGAGGTGTGGGGGGTTGT
TTGGTCTACCAGTAAACATTACTTCAGGAATTTCTCTGGATTTGTTGCGTTTATTTATTTTACCTTAGGATGTGGAACATGGACCTGGTTGTCTGTCAACTGTCTTTAAT
CCTAATGATTGAAAGAAGCTTGAATTATTCTGTTTTTTGCTATTATCTTAACTCTCGAACCTGCTATCAGTTGGGAACTAGGTATAAGTAAATTGACTTTAATGAGTCAG
CTGTGTTAGCACACCTTGGAAGCTAGCTGATTCTGTGTAAACAAGCCTTCCATTGCTCAGTATTTTTCAGGCCAAGCATACAATTTTAAAGGTTTTATTTTTTATATTTT
TTTGATTCAACAACCTTCGTGAAAAAGGATTCAAACATGTAATCTGATTGTTACTAAGTTATATAAAATATTACTCTTGACTGTCTTGGTTACATATCTGAAGGAGGGAG
TTAATTGAGTAGGTGAGTTACATGGAAGCATGTATATTTGGACTATACGGTTGTGCAATTTAATTTTGTGCATAGTAATTCAATTTAGCATCTTTTGAGGGTGAT
Protein sequenceShow/hide protein sequence
MAGGNNTGKSSQKPSSSSAAPSNRKSRWESSTNNNPPSDPKPDSKSSKPHNHNHNPSSKSVISPKNPADKAVKPTPALAPIPSPGLPLPFPDPSALGPPPPPSYGFHMLD
RRTVVLADGSVRSYFALPIDYQDFIPPARPVDLAARFLPMGSGVPGREYAGFDHRFPPGGPMSPNEFRGVREEQFARARPQDPWNSRGTDERGGPADFSMKRKFIDDNEK
DEKDRQQQFLHNGNANGFLTGSGGRGDFLAGTSDPYGRTEDMRFSKYSRVGGSYEHEGVRLGIGNNVAPKYLEVDQNALRKAFLHFVKTIYENANQKKHYLEDGKHGRLQ
CLACARSSKDFPDMHSLIMHTYNSDSADSQVDHLGLHKALCVLMGWNYSKPPDNSRGYQFLSADEAAANQEDLIMWPPQVIIHNLYTGKGKDGRMEGVGNKAMDSKIRDL
GFVGGKSKSLYGREGHLGSTLIKFSGDQPGLNEAKRLAEFFAKDNHGRAAWALVRPAAYSKDDDKNPNLVKVDEKTGEKKRIFYGHLATVADLDKVDFETRKKVAIESCR
DFKSPR