CuGenDBv2

Clc02G23190 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G23190
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: XS domain-containing protein
Genome location: ClcChr02:35219017..35224070
RNA-Seq Expression: Clc02G23190
Synteny: Clc02G23190
Gene Ontology terms:
    GO:0031047 - gene silencing by RNA (biological process)
    GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
    IPR005380 - XS domain
    IPR038588 - XS domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] (e value: 0.0e+00; % identity: 78.89)
Query:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF
        M+ RE + DKRS+  SPS FGRRTSE RV E PHC+ HWFSRSSRE PMTN+L GSSIR+H NGSRL  +KDEHF KLSQFCENLQ ES +KKF+WENLF
Subjt:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF

Query:  ANNP-ANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHV
         NN  AN NSK+S+GLKH N  DG NRGIRVSGSHLGTSSK IL G NL+ FHMNIGA KDSNVKNNGDTSRS GI+D +HLSSSRK+DGP ++ N+VHV
Subjt:  ANNP-ANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHV

Query:  RDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD
        RDRPIFE  ENS+RGRRNETSS GIQASHL SSAPV ESK I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA  + PFSQ +VRIPQDD
Subjt:  RDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD

Query:  FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKW
        FY+ STRTSVVMD VVEGF  TESH EETTRPRD ++ F     IEGS M TAPFAMEQY +VLGSGTESS   EREAYISSEKLLL +EDGYRTN+GKW
Subjt:  FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKW

Query:  SNEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTS
        + EDG++GS VSKH  DL DMED RKL W+A HSTKPRVEG R +MHDP  GS +K NVFSRIQFL+H      VKDTD NL  R+    DEDTS    S
Subjt:  SNEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTS

Query:  SKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV
        SKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LVRERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Subjt:  SKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV

Query:  KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPV
        KVL+ENPARRKK  EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPV
Subjt:  KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPV

Query:  LIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIAT
        LIIHNSSIA+D LS+ +AISCEELE VIRGMGCGGKI+VVRG+PGN SIM+ TFGAMFSGLQEAERLHK FADKSHGRDE HKIN  HLIDS+ DLH AT
Subjt:  LIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIAT

Query:  GANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
        GANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIVNASL C
Subjt:  GANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus] (e value: 2.3e-92; % identity: 80.09)
Query:  SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVV
        SKSKEF DALSL QHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+R ILP VEVLALKEDLIIWP VLIIHNSSIA+D   E +AISCE+LE  
Subjt:  SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVV

Query:  IRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETK
        +R MGCGGK +VVRGK  N SIM+ TFGAMF GLQEAERLH  FADKSHGRDEFHKIN   L+DS+ D+H ATGANTLESV YGYLGL EDLDKLDFETK
Subjt:  IRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETK

Query:  KRSVVKSKKEIQAIVNASLDC
        KRSVV+SKKEIQAIV+ASL C
Subjt:  KRSVVKSKKEIQAIVNASLDC

XP_017982234.1 PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao] (e value: 5.6e-78; % identity: 52.57)
Query:  RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-S
        R+ +K+RLG P    + N + R    K  K L++  VN     VQ  D     V+   + PP EDSEE  Q I  AF+KFVK+LNENPA+R+K+RE G +
Subjt:  RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-S

Query:  GIIKCIVCGSKSKEFADALSLSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERL
        G +KC VCGSKS+EF + LSL  HA +  + GLRA HLGLHK+LC+LMGW+S AA NGLW ++ LP VE LA+KEDL+IWPP++I+HNSSIA  N   R+
Subjt:  GIIKCIVCGSKSKEFADALSLSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERL

Query:  AISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA
         +S EE+E  +R MG G G  +V RGKP N SIM   F   FSGL+EAERLHK +A+  HGR EF +IN S        L      + ++ VLYGYLG+A
Subjt:  AISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA

Query:  EDLDKLDFETKKRSVVKSKKEIQAIVNASLD
         DLDKLDFETK R++VKSKKEI A  +A L+
Subjt:  EDLDKLDFETKKRSVVKSKKEIQAIVNASLD

XP_022140332.1 uncharacterized protein LOC111011032 [Momordica charantia] (e value: 4.3e-78; % identity: 83.15)
Query:  MGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEA
        MGWSSE APNGLW++RILP VE  ALKEDLIIWPPVLIIHNSSIA DN SE++ ISCEELEVVIRGMG GGKI+VVRGKP N SIM+ TF AMFSGLQEA
Subjt:  MGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEA

Query:  ERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
        ERLHK FADKSHGRDEFH+INSSH IDSH DLH A GAN +ESVLYGYLGLAED +KLDFETKKRSVVKSKKEIQAIV+A+L C
Subjt:  ERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida] (e value: 0.0e+00; % identity: 83.59)
Query:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF
        M+ RETS DKRS+  SPSSFGRRTSE RV ENPHCH  WFSRSSRE P+TN LAGSSIR+H NGSRL EN DEHF KLSQ CENLQ ES SKKFRWENLF
Subjt:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF

Query:  ANNPANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHVR
        ANNPANANSKSS+GLKH N+CDG+NRGIRVSGSHLGTSS  IL G+NL+ FHMNIG  KDSNVKNNGD SRSFGIDD SHLSSSRKFDGP YET+DVHVR
Subjt:  ANNPANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHVR

Query:  DRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDDF
        DRPIFESAENS+RGRRN  SSHG+QAS+LQSSAPVTESK I QDEFHD LEYKRARRN+IE FDDSNQYFSVQP KRSDIDA LNS FSQQ+VRIPQDDF
Subjt:  DRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDDF

Query:  YQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWS
        YQ STRTSVVMD VVEGF  TESHLEETTRPRDRYD FKEPF+IEGSYM TAPF ME YG+ LGSG ESS+K EREAYISSEKLLL +EDGYRT YGKW 
Subjt:  YQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWS

Query:  NEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTSS
        +EDG++GSLVSKH  DLSDME SRKLRW+A +STK RVEGTRC MH+P S SSRK NVFSRIQFLSH  E  AVKDTDINL  R K WN+EDTSI LTSS
Subjt:  NEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTSS

Query:  KRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVK
        KR LPWVINHASP SK KRRDL+KRLGFPL DPSS+PLVR+R+ K NKRLRK  VNH CLDVQT DY+EEKVQSPTSR  LED EELNQLIKSAFLKFVK
Subjt:  KRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVK

Query:  VLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVL
        VL+ENPARRKKF EPG GIIKCIVCGSKSKEFADALSLSQHASQTL G RAEHLGL KALCWLMGWSSEAAP+G W+RRILPL EVLALKEDLIIWPPVL
Subjt:  VLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVL

Query:  IIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATG
        IIHNSSIA+D+ SER+AISCEELEVVIRGMGCGGKI+VVRGKPGN SIMI TF AMFSGLQEAERLHK FADKSHGRDEF KI SSHLIDSH DLH ATG
Subjt:  IIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATG

Query:  ANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
        ANTL++VLYGYLGL EDLDKLDFETKKRSVVKSKKEIQAIVNASL C
Subjt:  ANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KGN5 XS domain-containing protein (e value: 1.1e-92; % identity: 80.09)
Query:  SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVV
        SKSKEF DALSL QHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+R ILP VEVLALKEDLIIWP VLIIHNSSIA+D   E +AISCE+LE  
Subjt:  SKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVV

Query:  IRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETK
        +R MGCGGK +VVRGK  N SIM+ TFGAMF GLQEAERLH  FADKSHGRDEFHKIN   L+DS+ D+H ATGANTLESV YGYLGL EDLDKLDFETK
Subjt:  IRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETK

Query:  KRSVVKSKKEIQAIVNASLDC
        KRSVV+SKKEIQAIV+ASL C
Subjt:  KRSVVKSKKEIQAIVNASLDC

A0A1S3C894 uncharacterized protein LOC103497964 (e value: 0.0e+00; % identity: 78.89)
Query:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF
        M+ RE + DKRS+  SPS FGRRTSE RV E PHC+ HWFSRSSRE PMTN+L GSSIR+H NGSRL  +KDEHF KLSQFCENLQ ES +KKF+WENLF
Subjt:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF

Query:  ANNP-ANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHV
         NN  AN NSK+S+GLKH N  DG NRGIRVSGSHLGTSSK IL G NL+ FHMNIGA KDSNVKNNGDTSRS GI+D +HLSSSRK+DGP ++ N+VHV
Subjt:  ANNP-ANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHV

Query:  RDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD
        RDRPIFE  ENS+RGRRNETSS GIQASHL SSAPV ESK I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA  + PFSQ +VRIPQDD
Subjt:  RDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD

Query:  FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKW
        FY+ STRTSVVMD VVEGF  TESH EETTRPRD ++ F     IEGS M TAPFAMEQY +VLGSGTESS   EREAYISSEKLLL +EDGYRTN+GKW
Subjt:  FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKW

Query:  SNEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTS
        + EDG++GS VSKH  DL DMED RKL W+A HSTKPRVEG R +MHDP  GS +K NVFSRIQFL+H      VKDTD NL  R+    DEDTS    S
Subjt:  SNEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTS

Query:  SKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV
        SKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LVRERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Subjt:  SKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV

Query:  KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPV
        KVL+ENPARRKK  EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPV
Subjt:  KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPV

Query:  LIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIAT
        LIIHNSSIA+D LS+ +AISCEELE VIRGMGCGGKI+VVRG+PGN SIM+ TFGAMFSGLQEAERLHK FADKSHGRDE HKIN  HLIDS+ DLH AT
Subjt:  LIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIAT

Query:  GANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
        GANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIVNASL C
Subjt:  GANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC

A0A5A7SQC0 XS domain-containing protein (e value: 0.0e+00; % identity: 78.89)
Query:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF
        M+ RE + DKRS+  SPS FGRRTSE RV E PHC+ HWFSRSSRE PMTN+L GSSIR+H NGSRL  +KDEHF KLSQFCENLQ ES +KKF+WENLF
Subjt:  MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLF

Query:  ANNP-ANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHV
         NN  AN NSK+S+GLKH N  DG NRGIRVSGSHLGTSSK IL G NL+ FHMNIGA KDSNVKNNGDTSRS GI+D +HLSSSRK+DGP ++ N+VHV
Subjt:  ANNP-ANANSKSSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHV

Query:  RDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD
        RDRPIFE  ENS+RGRRNETSS GIQASHL SSAPV ESK I Q EFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKR+DIDA  + PFSQ +VRIPQDD
Subjt:  RDRPIFESAENSYRGRRNETSSHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDD

Query:  FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKW
        FY+ STRTSVVMD VVEGF  TESH EETTRPRD ++ F     IEGS M TAPFAMEQY +VLGSGTESS   EREAYISSEKLLL +EDGYRTN+GKW
Subjt:  FYQASTRTSVVMDPVVEGF--TESHLEETTRPRDRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKW

Query:  SNEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTS
        + EDG++GS VSKH  DL DMED RKL W+A HSTKPRVEG R +MHDP  GS +K NVFSRIQFL+H      VKDTD NL  R+    DEDTS    S
Subjt:  SNEDGLSGSLVSKH--DLSDMEDSRKLRWEAPHSTKPRVEGTRCRMHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTS

Query:  SKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV
        SKR LPWV+NH SPRSK KRR+LKKRLG PLGDP+SN LVRERE K NKRLRKT V+H CLDVQTGDYLEEKVQSPTSRPPLED EELNQLIKSAFLKFV
Subjt:  SKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFV

Query:  KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPV
        KVL+ENPARRKK  EPG GII CIVCGSKSKEF DALSLSQHAS+TL G RAEHLGLHKALCWLMGWSSE APNGLW+RRILPL EVLALKEDLIIWPPV
Subjt:  KVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPV

Query:  LIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIAT
        LIIHNSSIA+D LS+ +AISCEELE VIRGMGCGGKI+VVRG+PGN SIM+ TFGAMFSGLQEAERLHK FADKSHGRDE HKIN  HLIDS+ DLH AT
Subjt:  LIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIAT

Query:  GANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
        GANTLESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIVNASL C
Subjt:  GANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC

A0A6J0ZXA5 uncharacterized protein LOC110412979 (e value: 7.9e-78; % identity: 52.57)
Query:  RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-S
        R+ +K+RLG P    + N + R +  K  K L K  VN     VQ  D     V+   + PP EDS+E  Q I+ AF+++VK+LNENPA+R+K+ E G +
Subjt:  RRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTKVNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPG-S

Query:  GIIKCIVCGSKSKEFADALSLSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERL
        G +KC VCGSKS+EF + LSL  HA +  + GLR  HLGLHKALC+LMGW+S AA NGLW ++ LP VE LA+KEDL+IWPPV+I+HNSSIA  N   R+
Subjt:  GIIKCIVCGSKSKEFADALSLSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERL

Query:  AISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA
         +S EE+E  +R MG G G  +V RGKP N SIM   F   FSGL+EAERLHK +A+  HGR EF +IN S        L      + +E VLYGYLG+A
Subjt:  AISCEELEVVIRGMGCG-GKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLA

Query:  EDLDKLDFETKKRSVVKSKKEIQAIVNASLD
         DLDKLDFETK R++VKSKKEI A  +A LD
Subjt:  EDLDKLDFETKKRSVVKSKKEIQAIVNASLD

A0A6J1CGJ5 uncharacterized protein LOC111011032 (e value: 2.1e-78; % identity: 83.15)
Query:  MGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEA
        MGWSSE APNGLW++RILP VE  ALKEDLIIWPPVLIIHNSSIA DN SE++ ISCEELEVVIRGMG GGKI+VVRGKP N SIM+ TF AMFSGLQEA
Subjt:  MGWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEA

Query:  ERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC
        ERLHK FADKSHGRDEFH+INSSH IDSH DLH A GAN +ESVLYGYLGLAED +KLDFETKKRSVVKSKKEIQAIV+A+L C
Subjt:  ERLHKRFADKSHGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380) (e value: 4.2e-31; % identity: 34.25)
Query:  IKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALSLSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLA
        +K +FL FVK + E+P  +K + E G  G ++C+VCG  SK+  D  SL  H         R  HLGLHKALC LMGW+   AP+     + LP  E   
Subjt:  IKSAFLKFVKVLNENPARRKKFREPG-SGIIKCIVCGSKSKEFADALSLSQHA-SQTLGGLRAEHLGLHKALCWLMGWSSEAAPNGLWIRRILPLVEVLA

Query:  LKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMG-CGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKIN--S
         +  LIIWPP +I+ N+S              + ++  IR +G  GGK + + G+ G+  I +  F    SGL++A R+ + F   + GR  + ++   +
Subjt:  LKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMG-CGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKSHGRDEFHKIN--S

Query:  SHLIDSHNDLHIATGANTLES--VLYGYLGLAEDLDKLDFETKKRSVVKSKKEI
            D  N   +     T E   + YGYL    DLDK+D ETKK++ ++S +E+
Subjt:  SHLIDSHNDLHIATGANTLES--VLYGYLGLAEDLDKLDFETKKRSVVKSKKEI


Sequences
CDS sequence
ATGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTT
GCACTGGTTTTCCCGTTCTTCACGGGAAGGACCGATGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAAC
ATTTCTGTAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCATCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAA
TCGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAA
TTTGCAGATATTCCATATGAACATTGGGGCCATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCAT
CTAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCT
TCACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAG
GAACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTC
GTATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCAGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGA
GACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAG
TTCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAA
GTGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGA
ATGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTT
AATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTA
AGCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAA
GTCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAAT
AAAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGCA
AGTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATG
GGATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTGCCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTAT
CATTCATAACAGTTCTATTGCAGTTGATAACCTGTCCGAACGGTTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGAATGGGTTGTGGAGGGAAGATCGAAG
TGGTACGTGGTAAACCTGGAAACCATAGTATTATGATAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGGTTTGCAGATAAGAGT
CATGGTAGGGACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTA
CTTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTT
AG
mRNA sequence
CTCGTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCAATGTCTGCCATTACCATTACCATTACCAGTATTCCAACCCAACCACAAAAGCTTCATCCAACTTAACACTCGATT
TTCCTCTCACAGTCCCACTCCTTCTTTTCCCCCGTAGCATTGCATTCTCCTTTGATCGTTTCATCTTCTCTGGTTATGTTTTTCTTTTTCGTTTCAACTCAATCTTGCTT
GCTTGTTTTTTTTGTTTTTTGTTTTTATTTTCACTGTTTTCCTTCTAGGTTTTTTGTTCTGTTTTCTTCTAGGGTTCTTAGTTTTTGTTCTGGAGAGTGAAAAAACGCAA
TGAGCTGTAGAGAAACGAGTGGAGATAAGAGGTCTCGGTCTCCTTCTCCGTCGTCGTTTGGACGGAGAACTTCGGAACTTCGGGTTGCAGAAAATCCACATTGTCATTTG
CACTGGTTTTCCCGTTCTTCACGGGAAGGACCGATGACGAATGACCTTGCGGGTTCTTCTATCAGAAACCATGACAATGGAAGTCGTCTTTGTGAAAATAAAGACGAACA
TTTCTGTAAACTCTCTCAGTTTTGCGAGAATTTACAATGGGAATCGGCATCGAAAAAGTTTCGGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAATTCGAAAT
CGAGTATAGGGTTGAAACATGGAAATATGTGTGATGGTCATAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGGAAATTTTAGTTGGTAATAAT
TTGCAGATATTCCATATGAACATTGGGGCCATTAAAGATAGTAACGTAAAGAACAATGGGGATACTTCCAGAAGCTTTGGAATCGATGACTATAGCCATTTGTCTTCATC
TAGAAAGTTTGATGGGCCCTCTTACGAGACCAATGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAGAAAATTCCTACAGAGGAAGACGAAACGAAACTTCTT
CACATGGGATACAAGCGTCTCATCTACAGTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTTGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGAAGG
AACCATATTGAGCACTTTGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGGAGTGACATTGATGCTGCTCTCAACAGTCCTTTCTCTCAGCAATTGGTTCG
TATCCCGCAAGATGATTTCTATCAAGCTTCTACTCGGACCAGTGTTGTAATGGATCCAGTTGTTGAAGGATTCACTGAAAGCCATTTGGAAGAGACCACCCGACCAAGAG
ACCGTTATGATCTTTTCAAAGAACCATTCATCATTGAAGGTTCTTATATGGACACTGCCCCTTTTGCGATGGAACAGTATGGCAAAGTTTTGGGTTCAGGAACTGAAAGT
TCGCTGAAGAGTGAAAGAGAAGCATATATAAGCAGCGAGAAATTACTCTTGCCTAAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCGAATGAGGATGGATTAAG
TGGATCATTAGTATCAAAACATGATTTGAGCGACATGGAAGACAGTAGAAAGCTGAGATGGGAAGCCCCACATTCAACAAAGCCGAGGGTTGAAGGAACAAGATGTAGAA
TGCATGATCCTAGGTCTGGTTCATCTAGAAAATCAAATGTGTTTAGCAGAATCCAGTTTTTAAGCCATAGAGTTGAAAAGAGTGCTGTTAAAGATACTGACATCAATTTA
ATTGGTAGAGACAAGCGATGGAATGACGAGGATACTTCTATATCCTTGACATCCTCTAAACGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCCGCGTTCAAAGCCTAA
GCGTAGAGACCTAAAGAAGCGTTTGGGTTTCCCCTTAGGGGATCCCAGTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAACAAACAAGCGTCTGAGGAAGACGAAAG
TCAATCATAGGTGCCTTGATGTTCAAACAGGTGATTACTTGGAAGAGAAGGTGCAAAGTCCAACCAGTAGGCCACCACTTGAAGATTCAGAGGAGTTGAACCAGCTAATA
AAGAGCGCCTTTCTCAAGTTTGTCAAAGTTCTGAATGAGAATCCAGCCAGACGAAAGAAGTTCAGAGAGCCGGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCAGCAA
GTCCAAGGAGTTTGCAGATGCACTAAGCTTATCACAACATGCCTCCCAGACGTTGGGAGGATTGAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTCATGG
GATGGAGCAGTGAAGCAGCGCCAAACGGTCTATGGATTCGAAGGATATTGCCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATC
ATTCATAACAGTTCTATTGCAGTTGATAACCTGTCCGAACGGTTAGCCATAAGTTGTGAGGAGCTGGAGGTTGTCATTAGAGGAATGGGTTGTGGAGGGAAGATCGAAGT
GGTACGTGGTAAACCTGGAAACCATAGTATTATGATAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGGTTTGCAGATAAGAGTC
ATGGTAGGGACGAGTTCCATAAAATCAATTCGAGTCATCTCATCGACAGCCACAATGATCTGCATATAGCAACAGGAGCAAACACATTGGAGAGTGTACTGTATGGTTAC
TTAGGCCTCGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGCGATCTGTGGTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCGTCCCTTGACTGTTA
GTGTTTACCATGTGCGAGAGGTTTGTATAGATAGATAGTCAGTCTCACACATTGGGTTCGGTTAAGGTACCTGAACCCAAGACTGTCATCTTAGTTTTGTATCTTTTGGG
TCAACTAAACTAGTTCTTTCTTTGTCTCTTTTGTGGCTGGGCAAATTTTTGGTGGGATGCAAGTTTGATGTGTTGTGAGGAGCTTGGTAGCAGAGAGAGCTCTCTTGGCT
TAAGTTAGGGAAGGAATTTCCATTGGCTTTGCTACCCTTGAACTGATTTCATATGCTAATTGTAATTAGCTCAAAATTAGTAGCCCATGTTTATAGTTGCATTGGTTAAT
TCTATTTCCTTCTGTGTCAACTCCACTATTACTATAGTACTCTTCATGTTTCTTTCTTGGTTATCATTGCACACTTTATACTGAATAGTCGTTT
Protein sequence
MSCRETSGDKRSRSPSPSSFGRRTSELRVAENPHCHLHWFSRSSREGPMTNDLAGSSIRNHDNGSRLCENKDEHFCKLSQFCENLQWESASKKFRWENLFANNPANANSK
SSIGLKHGNMCDGHNRGIRVSGSHLGTSSKEILVGNNLQIFHMNIGAIKDSNVKNNGDTSRSFGIDDYSHLSSSRKFDGPSYETNDVHVRDRPIFESAENSYRGRRNETS
SHGIQASHLQSSAPVTESKSILQDEFHDLLEYKRARRNHIEHFDDSNQYFSVQPCKRSDIDAALNSPFSQQLVRIPQDDFYQASTRTSVVMDPVVEGFTESHLEETTRPR
DRYDLFKEPFIIEGSYMDTAPFAMEQYGKVLGSGTESSLKSEREAYISSEKLLLPKEDGYRTNYGKWSNEDGLSGSLVSKHDLSDMEDSRKLRWEAPHSTKPRVEGTRCR
MHDPRSGSSRKSNVFSRIQFLSHRVEKSAVKDTDINLIGRDKRWNDEDTSISLTSSKRSLPWVINHASPRSKPKRRDLKKRLGFPLGDPSSNPLVREREGKTNKRLRKTK
VNHRCLDVQTGDYLEEKVQSPTSRPPLEDSEELNQLIKSAFLKFVKVLNENPARRKKFREPGSGIIKCIVCGSKSKEFADALSLSQHASQTLGGLRAEHLGLHKALCWLM
GWSSEAAPNGLWIRRILPLVEVLALKEDLIIWPPVLIIHNSSIAVDNLSERLAISCEELEVVIRGMGCGGKIEVVRGKPGNHSIMIATFGAMFSGLQEAERLHKRFADKS
HGRDEFHKINSSHLIDSHNDLHIATGANTLESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVNASLDC