; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017996 (gene) of Chayote v1 genome

Gene IDSed0017996
OrganismSechium edule (Chayote v1)
DescriptionXS domain-containing protein
Genome locationLG05:20223370..20228056
RNA-Seq ExpressionSed0017996
SyntenySed0017996
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo]3.4e-24058.63Show/hide
Query:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPYYQT----SPCENFRKLSRFCQNLQRESPSTKFRWENLF-
        MN RE   D+RS+SP     R SEPRVE    C+ HW   SS E P+ TN L   S+ D Y  +       E+FRKLS+FC+NLQ ESP+ KF+WENLF 
Subjt:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPYYQT----SPCENFRKLSRFCQNLQRESPSTKFRWENLF-

Query:  ----ANATSKSRLGFDGCDG--GGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSDL
            AN  SK+ +G    +G  G N  IRVSGS L  G+SS  +L G +LR F +N GATK+ NV NNG  S S+GI D   H SSSRK+DG +++ +++
Subjt:  ----ANATSKSRLGFDGCDG--GGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSDL

Query:  HVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD
        HVRDR                  S GI  SHL SSA V++SK ISQ  FH                                 L+EYKRA+ NHIEHFDD
Subjt:  HVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD

Query:  HSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGEV
         +QYF VQPCKR+DI    +  F+Q MVRIPQD+FY+DS RTS+VMD V+EGF+DT+S+     EET  T P D       +E S     PFAMEQ  EV
Subjt:  HSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGEV

Query:  LFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNVF
        L SG  S    ER+AY SSE LLL EEDG R N G  KW+ EDG+NGS VSK++ DL DMED RK  WK A  STK RV+G R   H PG GS +KPNVF
Subjt:  LFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNVF

Query:  SRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQEQERKRNKHTRKINVNHGF
        SRIQFL+H D    V D D NLNCRN    +  TS    SSKR L WVVN  S  SK K R+LKKRL  P      N   +E+ERKRNK  RK NV+HG 
Subjt:  SRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQEQERKRNKHTRKINVNHGF

Query:  LDVQAQAGDCFQEKTQSLTSR-PVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEHLGLH
        LDV  Q GD  +EK QS TSR P+EDPE LNQ+IKSAFLKFVKVLSEN ARRKK TEP  GII CIVCGS SKEF DA SLSQHA +TLE S+AEHLGLH
Subjt:  LDVQAQAGDCFQEKTQSLTSR-PVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEHLGLH

Query:  KALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVATFDAM
        KALCWLMGWSSE  APNG+WVRR LPL E +ALKEDLIIWPPVLIIHNSSIAID  S+  AISCEELE VIR     GK+KVVRG+PGNQSIMV TF AM
Subjt:  KALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVATFDAM

Query:  FTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC
        F+GLQE ERLH SF DK+HGRDE  KIN RH ID S+ +LH A G N  E VLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIV+ASLQC
Subjt:  FTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC

XP_010651723.1 PREDICTED: uncharacterized protein LOC104879702 [Vitis vinifera]1.6e-6931.55Show/hide
Query:  EPSDLHVRDRSHGIGTSHLQSSARVSDSKRISQDVFHG-------------YCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD
        E S+ +  D SH     H+   A+   S  +S+D FHG               E    P       +      P + EL  +  V++   +    +H   
Subjt:  EPSDLHVRDRSHGIGTSHLQSSARVSDSKRISQDVFHG-------------YCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD

Query:  HSQYFKVQPC------KRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAM
                PC      +R D+G   +  F  +M  + +D  +QD +R S++ D V++  +DT+S     +++++  + ++  ++  I +        + +
Subjt:  HSQYFKVQPC------KRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAM

Query:  EQDGEVLFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNG-------------SLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDG
          DGEVL S             Q SE+  + +E G         W  E+ L               SL  +        E+C KR+          R   
Subjt:  EQDGEVLFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNG-------------SLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDG

Query:  TRCGGHYPGSGSSRKPNVFSRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQ
         + G H P S  S      ++I  LS++ E  +  DI          W+ R   ++ S S R L+ + N+ S+ +K   +D KKRL     ++ I+    
Subjt:  TRCGGHYPGSGSSRKPNVFSRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQ

Query:  EQERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFGDAQSL
         + +      R ++ +HG L ++   G   + K  +  +   E  E   Q + S FLKFVK+L+EN A+R+ +TE  ++  +KC +CGSNSKEF +   L
Subjt:  EQERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFGDAQSL

Query:  SQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR------GK
          H   + +   + +HLGL KALC LMGW+S+   PN  WV + LP AE++ALKEDLIIWPPV+I+HNSSI   +  ER  ++ + L  ++R      GK
Subjt:  SQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR------GK

Query:  MKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKK
         ++ RGKP NQSIMV  F+A F+GLQ+ E+LH  + +  HGR EF +IN  +    S      A  ++ E VLYGYLG+A DLDKLDFE K+R VVKSKK
Subjt:  MKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKK

Query:  EIQAIVDASL
        EI AI D  L
Subjt:  EIQAIVDASL

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus]3.2e-7371.75Show/hide
Query:  SNSKEFGDAQSLSQHAFQTLEASKAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEV
        S SKEF DA SL QHA +TLE S+AEHLGLHKALCWLMGWSSE  APNG+WVR  LP  E +ALKEDLIIWP VLIIHNSSIAID   E  AISCE+LE 
Subjt:  SNSKEFGDAQSLSQHAFQTLEASKAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEV

Query:  VIR-----GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFE
         +R     GK KVVRGK  NQSIMV TF AMF GLQE ERLH +F DK+HGRDEF KIN R  +D S+ ++H A G N  E V YGYLGL EDLDKLDFE
Subjt:  VIR-----GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFE

Query:  TKKRSVVKSKKEIQAIVDASLQC
        TKKRSVV+SKKEIQAIV ASLQC
Subjt:  TKKRSVVKSKKEIQAIVDASLQC

XP_031386944.1 uncharacterized protein LOC116200259 [Punica granatum]2.5e-7039.5Show/hide
Query:  GIGSP-LKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRW----------KEAAQSTKSRVDGTRCGGHYPGSG
        G+G+  +   R+ Y   E   + E DG     GD  W   D      +S +E  L D E   + R            E     K   D  R G ++ G  
Subjt:  GIGSP-LKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRW----------KEAAQSTKSRVDGTRCGGHYPGSG

Query:  SSRKPNVFSRIQFLSHEDEKSSVNDIDINLNC-RNKMWNNR--GTSISLSSSKRPLAWVVNQASEYSKT------KH-----RDLKKRLREPSSSIQINE
             NV S    L +ED+      I+ N NC   ++ N R  GT  S S+ +R +      + + SKT      KH     RD+KKRL  P  + +   
Subjt:  SSRKPNVFSRIQFLSHEDEKSSVNDIDINLNC-RNKMWNNR--GTSISLSSSKRPLAWVVNQASEYSKT------KH-----RDLKKRLREPSSSIQINE

Query:  WEQE--QERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFG
        W +    + K N      +     L+  A   D       +  + P E+ E   Q++  AFLKFVK+L+EN+A+R+K+ +  +SGI+KC +CG NSKEF 
Subjt:  WEQE--QERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFG

Query:  DAQSLSQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR---
        D  SL+ HAF T     +AEHLGLHKALC LMGW+S A  PNG WVR+ LP AEA AL++DLIIWPP++IIHNSSIA     +R  +  EE++ +++   
Subjt:  DAQSLSQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR---

Query:  ---GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNE--GVLYGYLGLAEDLDKLDFETKKR
            K KV+RGKP NQSIMVA+F   F+GL E E LH  F +K  GR EF +I+      G    +H  G+   +   VLYGYLG+AEDLDKL+FE KKR
Subjt:  ---GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNE--GVLYGYLGLAEDLDKLDFETKKR

Query:  SVVKSKKEIQAIVDASLQC
         VV+S++EIQ IVD  L+C
Subjt:  SVVKSKKEIQAIVDASLQC

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida]2.0e-26962.05Show/hide
Query:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPY-----YQTSPCENFRKLSRFCQNLQRESPSTKFRWENLF
        MN+RET+ D+RS+SP     R SEPRVE N  CH  W   SS E PV TNGL   S+ D Y     Y+ +  E+FRKLS+ C+NLQRESPS KFRWENLF
Subjt:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPY-----YQTSPCENFRKLSRFCQNLQRESPSTKFRWENLF

Query:  ----ANATSKSRLGF---DGCDGGGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSD
            ANA SKS +G    + CDG   G IRVSGS L  G+SS ++L G +LR F +N G TK+ NV NNG IS S GI D   H SSSRKFDG +YE SD
Subjt:  ----ANATSKSRLGF---DGCDGGGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSD

Query:  LHVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFD
        +HVRDR                  SHG+  S+LQSSA V++SK ISQD FH +                                +EYKRA+ N+IE FD
Subjt:  LHVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFD

Query:  DHSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGE
        D +QYF VQP KRSDI    NS+F+QQMVRIPQD+FYQDS RTS+VMD V+EGF+DT+S++    E T+P + YD +++ F++E SY G  PF ME  GE
Subjt:  DHSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGE

Query:  VLFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNV
         L SG  S +K ER+AY SSE LLLAEEDG R  +G  KW +EDG+NGSLVSK++ DL DME  RK RWK A  STK RV+GTRC  H PGS SSRKPNV
Subjt:  VLFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNV

Query:  FSRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKR----LREPSSSIQINEWEQEQERKRNKHTRKIN
        FSRIQFLSH DE  +V D DINLNCR+K WN   TSI L+SSKRPL WV+N AS +SK K RDL+KR    LR+PSSS  +    ++++RK+NK  RK N
Subjt:  FSRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKR----LREPSSSIQINEWEQEQERKRNKHTRKIN

Query:  VNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEH
        VNH  LDVQ    D  +EK QS TSR +ED E LNQ+IKSAFLKFVKVLSEN ARRKKFTEP  GIIKCIVCGS SKEF DA SLSQHA QTLE S+AEH
Subjt:  VNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEH

Query:  LGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVAT
        LGL KALCWLMGWSSE AAP+G WVRR LPL E +ALKEDLIIWPPVLIIHNSSIAID+ SER AISCEELEVVIR     GK+KVVRGKPGNQSIM+ T
Subjt:  LGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVAT

Query:  FDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC
        FDAMF+GLQE ERLH SF DK+HGRDEF+KI S H ID SH +LH A G N  + VLYGYLGL EDLDKLDFETKKRSVVKSKKEIQAIV+ASL C
Subjt:  FDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC

TrEMBL top hitse value%identityAlignment
A0A0A0KGN5 XS domain-containing protein1.5e-7371.75Show/hide
Query:  SNSKEFGDAQSLSQHAFQTLEASKAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEV
        S SKEF DA SL QHA +TLE S+AEHLGLHKALCWLMGWSSE  APNG+WVR  LP  E +ALKEDLIIWP VLIIHNSSIAID   E  AISCE+LE 
Subjt:  SNSKEFGDAQSLSQHAFQTLEASKAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEV

Query:  VIR-----GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFE
         +R     GK KVVRGK  NQSIMV TF AMF GLQE ERLH +F DK+HGRDEF KIN R  +D S+ ++H A G N  E V YGYLGL EDLDKLDFE
Subjt:  VIR-----GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFE

Query:  TKKRSVVKSKKEIQAIVDASLQC
        TKKRSVV+SKKEIQAIV ASLQC
Subjt:  TKKRSVVKSKKEIQAIVDASLQC

A0A1S3C894 uncharacterized protein LOC1034979641.6e-24058.63Show/hide
Query:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPYYQT----SPCENFRKLSRFCQNLQRESPSTKFRWENLF-
        MN RE   D+RS+SP     R SEPRVE    C+ HW   SS E P+ TN L   S+ D Y  +       E+FRKLS+FC+NLQ ESP+ KF+WENLF 
Subjt:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPYYQT----SPCENFRKLSRFCQNLQRESPSTKFRWENLF-

Query:  ----ANATSKSRLGFDGCDG--GGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSDL
            AN  SK+ +G    +G  G N  IRVSGS L  G+SS  +L G +LR F +N GATK+ NV NNG  S S+GI D   H SSSRK+DG +++ +++
Subjt:  ----ANATSKSRLGFDGCDG--GGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSDL

Query:  HVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD
        HVRDR                  S GI  SHL SSA V++SK ISQ  FH                                 L+EYKRA+ NHIEHFDD
Subjt:  HVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD

Query:  HSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGEV
         +QYF VQPCKR+DI    +  F+Q MVRIPQD+FY+DS RTS+VMD V+EGF+DT+S+     EET  T P D       +E S     PFAMEQ  EV
Subjt:  HSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGEV

Query:  LFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNVF
        L SG  S    ER+AY SSE LLL EEDG R N G  KW+ EDG+NGS VSK++ DL DMED RK  WK A  STK RV+G R   H PG GS +KPNVF
Subjt:  LFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNVF

Query:  SRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQEQERKRNKHTRKINVNHGF
        SRIQFL+H D    V D D NLNCRN    +  TS    SSKR L WVVN  S  SK K R+LKKRL  P      N   +E+ERKRNK  RK NV+HG 
Subjt:  SRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQEQERKRNKHTRKINVNHGF

Query:  LDVQAQAGDCFQEKTQSLTSR-PVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEHLGLH
        LDV  Q GD  +EK QS TSR P+EDPE LNQ+IKSAFLKFVKVLSEN ARRKK TEP  GII CIVCGS SKEF DA SLSQHA +TLE S+AEHLGLH
Subjt:  LDVQAQAGDCFQEKTQSLTSR-PVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEHLGLH

Query:  KALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVATFDAM
        KALCWLMGWSSE  APNG+WVRR LPL E +ALKEDLIIWPPVLIIHNSSIAID  S+  AISCEELE VIR     GK+KVVRG+PGNQSIMV TF AM
Subjt:  KALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVATFDAM

Query:  FTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC
        F+GLQE ERLH SF DK+HGRDE  KIN RH ID S+ +LH A G N  E VLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIV+ASLQC
Subjt:  FTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC

A0A218WNW1 uncharacterized protein LOC1162002591.2e-7039.5Show/hide
Query:  GIGSP-LKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRW----------KEAAQSTKSRVDGTRCGGHYPGSG
        G+G+  +   R+ Y   E   + E DG     GD  W   D      +S +E  L D E   + R            E     K   D  R G ++ G  
Subjt:  GIGSP-LKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRW----------KEAAQSTKSRVDGTRCGGHYPGSG

Query:  SSRKPNVFSRIQFLSHEDEKSSVNDIDINLNC-RNKMWNNR--GTSISLSSSKRPLAWVVNQASEYSKT------KH-----RDLKKRLREPSSSIQINE
             NV S    L +ED+      I+ N NC   ++ N R  GT  S S+ +R +      + + SKT      KH     RD+KKRL  P  + +   
Subjt:  SSRKPNVFSRIQFLSHEDEKSSVNDIDINLNC-RNKMWNNR--GTSISLSSSKRPLAWVVNQASEYSKT------KH-----RDLKKRLREPSSSIQINE

Query:  WEQE--QERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFG
        W +    + K N      +     L+  A   D       +  + P E+ E   Q++  AFLKFVK+L+EN+A+R+K+ +  +SGI+KC +CG NSKEF 
Subjt:  WEQE--QERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFG

Query:  DAQSLSQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR---
        D  SL+ HAF T     +AEHLGLHKALC LMGW+S A  PNG WVR+ LP AEA AL++DLIIWPP++IIHNSSIA     +R  +  EE++ +++   
Subjt:  DAQSLSQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR---

Query:  ---GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNE--GVLYGYLGLAEDLDKLDFETKKR
            K KV+RGKP NQSIMVA+F   F+GL E E LH  F +K  GR EF +I+      G    +H  G+   +   VLYGYLG+AEDLDKL+FE KKR
Subjt:  ---GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNE--GVLYGYLGLAEDLDKLDFETKKR

Query:  SVVKSKKEIQAIVDASLQC
         VV+S++EIQ IVD  L+C
Subjt:  SVVKSKKEIQAIVDASLQC

A0A5A7SQC0 XS domain-containing protein1.6e-24058.63Show/hide
Query:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPYYQT----SPCENFRKLSRFCQNLQRESPSTKFRWENLF-
        MN RE   D+RS+SP     R SEPRVE    C+ HW   SS E P+ TN L   S+ D Y  +       E+FRKLS+FC+NLQ ESP+ KF+WENLF 
Subjt:  MNWRETTGDQRSESP----WRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPYYQT----SPCENFRKLSRFCQNLQRESPSTKFRWENLF-

Query:  ----ANATSKSRLGFDGCDG--GGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSDL
            AN  SK+ +G    +G  G N  IRVSGS L  G+SS  +L G +LR F +N GATK+ NV NNG  S S+GI D   H SSSRK+DG +++ +++
Subjt:  ----ANATSKSRLGFDGCDG--GGNGPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNV-NNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSDL

Query:  HVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD
        HVRDR                  S GI  SHL SSA V++SK ISQ  FH                                 L+EYKRA+ NHIEHFDD
Subjt:  HVRDR------------------SHGIGTSHLQSSARVSDSKRISQDVFHGYCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD

Query:  HSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGEV
         +QYF VQPCKR+DI    +  F+Q MVRIPQD+FY+DS RTS+VMD V+EGF+DT+S+     EET  T P D       +E S     PFAMEQ  EV
Subjt:  HSQYFKVQPCKRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAMEQDGEV

Query:  LFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNVF
        L SG  S    ER+AY SSE LLL EEDG R N G  KW+ EDG+NGS VSK++ DL DMED RK  WK A  STK RV+G R   H PG GS +KPNVF
Subjt:  LFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDGTRCGGHYPGSGSSRKPNVF

Query:  SRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQEQERKRNKHTRKINVNHGF
        SRIQFL+H D    V D D NLNCRN    +  TS    SSKR L WVVN  S  SK K R+LKKRL  P      N   +E+ERKRNK  RK NV+HG 
Subjt:  SRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQEQERKRNKHTRKINVNHGF

Query:  LDVQAQAGDCFQEKTQSLTSR-PVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEHLGLH
        LDV  Q GD  +EK QS TSR P+EDPE LNQ+IKSAFLKFVKVLSEN ARRKK TEP  GII CIVCGS SKEF DA SLSQHA +TLE S+AEHLGLH
Subjt:  LDVQAQAGDCFQEKTQSLTSR-PVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEHLGLH

Query:  KALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVATFDAM
        KALCWLMGWSSE  APNG+WVRR LPL E +ALKEDLIIWPPVLIIHNSSIAID  S+  AISCEELE VIR     GK+KVVRG+PGNQSIMV TF AM
Subjt:  KALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR-----GKMKVVRGKPGNQSIMVATFDAM

Query:  FTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC
        F+GLQE ERLH SF DK+HGRDE  KIN RH ID S+ +LH A G N  E VLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIV+ASLQC
Subjt:  FTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNA-GENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC

D7SN41 XS domain-containing protein7.9e-7031.55Show/hide
Query:  EPSDLHVRDRSHGIGTSHLQSSARVSDSKRISQDVFHG-------------YCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD
        E S+ +  D SH     H+   A+   S  +S+D FHG               E    P       +      P + EL  +  V++   +    +H   
Subjt:  EPSDLHVRDRSHGIGTSHLQSSARVSDSKRISQDVFHG-------------YCERRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDD

Query:  HSQYFKVQPC------KRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAM
                PC      +R D+G   +  F  +M  + +D  +QD +R S++ D V++  +DT+S     +++++  + ++  ++  I +        + +
Subjt:  HSQYFKVQPC------KRSDIG---NSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEETQPTNPYDFYEKKFIMECSYKGNNPFAM

Query:  EQDGEVLFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNG-------------SLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDG
          DGEVL S             Q SE+  + +E G         W  E+ L               SL  +        E+C KR+          R   
Subjt:  EQDGEVLFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNG-------------SLVSKYENDLRDMEDCRKRRWKEAAQSTKSRVDG

Query:  TRCGGHYPGSGSSRKPNVFSRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQ
         + G H P S  S      ++I  LS++ E  +  DI          W+ R   ++ S S R L+ + N+ S+ +K   +D KKRL     ++ I+    
Subjt:  TRCGGHYPGSGSSRKPNVFSRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQ

Query:  EQERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFGDAQSL
         + +      R ++ +HG L ++   G   + K  +  +   E  E   Q + S FLKFVK+L+EN A+R+ +TE  ++  +KC +CGSNSKEF +   L
Subjt:  EQERKRNKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEP-RSGIIKCIVCGSNSKEFGDAQSL

Query:  SQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR------GK
          H   + +   + +HLGL KALC LMGW+S+   PN  WV + LP AE++ALKEDLIIWPPV+I+HNSSI   +  ER  ++ + L  ++R      GK
Subjt:  SQHAFQTLEAS-KAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR------GK

Query:  MKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKK
         ++ RGKP NQSIMV  F+A F+GLQ+ E+LH  + +  HGR EF +IN  +    S      A  ++ E VLYGYLG+A DLDKLDFE K+R VVKSKK
Subjt:  MKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKK

Query:  EIQAIVDASL
        EI AI D  L
Subjt:  EIQAIVDASL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)7.9e-3033.33Show/hide
Query:  IKSAFLKFVKVLSENSARRKKFTE-PRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLE-ASKAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAI
        +K +FL FVK + E+   +K + E  R G ++C+VCG +SK+  D  SL  H + + + +S+  HLGLHKALC LMGW+  + AP+     + LP  EA 
Subjt:  IKSAFLKFVKVLSENSARRKKFTE-PRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLE-ASKAEHLGLHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAI

Query:  ALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR------GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSR
          +  LIIWPP +I+ N+S         E    + ++  IR      GK K + G+ G+  I +  F    +GL++  R+   F+  N GR  + ++   
Subjt:  ALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIR------GKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHISFQDKNHGRDEFRKINSR

Query:  HTIDGSHNNLHNAGENRNEG-------VLYGYLGLAEDLDKLDFETKKRSVVKSKKEI
          +  S ++  N G    +G       + YGYL    DLDK+D ETKK++ ++S +E+
Subjt:  HTIDGSHNNLHNAGENRNEG-------VLYGYLGLAEDLDKLDFETKKRSVVKSKKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTGGAGAGAAACCACTGGGGATCAAAGGTCCGAGTCTCCCTGGAGAATCTCGGAACCTAGGGTTGAACGAAACAGGCCTTGTCATCCCCACTGGTTACCCGGTTC
TTCACCGGAAGAACCGGTGACGACCAATGGCCTTGTGGCTTTTTCTGTTACAGACCCCTACTACCAAACTTCTCCTTGTGAAAATTTTCGTAAACTATCTCGGTTCTGCC
AGAATTTACAGCGGGAATCGCCGTCCACAAAGTTCCGCTGGGAAAATTTGTTTGCCAATGCGACTTCGAAATCGCGTCTGGGGTTTGATGGATGCGACGGTGGTGGTAAT
GGACCAATTAGGGTTTCCGGCTCCCAGTTGTGTGCGGGATCGTCGTCTACGGATGTTTTGGAGGGTTTTCATTTGCGCCCTTTTCAAATCAACACTGGGGCAACTAAAAA
CAGAAACGTAAACAATGGCGTTATTTCCACAAGTCTTGGAATCCGTGACTATGGCTGCCATTTTTCTTCATCTAGGAAGTTTGATGGGTCCGTTTATGAACCCAGTGATC
TTCATGTCCGGGACCGTTCACATGGGATTGGAACGTCGCATCTTCAATCCAGTGCACGTGTTTCTGATTCAAAACGCATTTCTCAAGATGTTTTTCATGGTTATTGTGAG
CGTCGTCTACCTCCAAACGCTCTGGATTCCACCTGGAAGAAGGGAAACCATACAGAACCTGTTGAAACTGAACTTGATACGGAAGGTTTAGTGGAGTATAAACGAGCTCA
GGGGAACCATATCGAGCACTTTGATGATCATAGTCAGTATTTCAAAGTCCAGCCGTGTAAGAGGAGTGACATTGGCAACAGTTCTTTCACTCAGCAGATGGTTCGTATCC
CACAAGATGAGTTCTATCAAGATTCTATGCGGACCAGCATTGTAATGGATCCAGTTATTGAGGGATTTGAAGACACTCAAAGCTATGTTGTGGGTGAAATGGAAGAGACC
CAGCCAACCAACCCTTATGATTTTTACGAAAAAAAGTTTATCATGGAATGTTCGTATAAGGGTAACAACCCTTTTGCGATGGAACAGGATGGCGAAGTTTTGTTTTCTGG
AATTGGAAGTCCACTAAAAGTTGAAAGAGATGCATATCAAAGTAGCGAGACGTTGCTCCTGGCTGAAGAAGATGGTATCAGGATAAACCATGGGGACTGGAAATGGTCGA
ATGAGGACGGATTAAATGGATCATTAGTATCAAAATATGAAAATGATTTGAGGGACATGGAAGACTGTAGAAAGCGGAGATGGAAAGAAGCTGCACAGTCGACAAAATCG
AGAGTCGACGGGACAAGATGTGGAGGGCATTATCCTGGGTCGGGATCATCTAGAAAACCTAATGTGTTCAGCAGAATCCAATTTTTAAGTCATGAAGATGAAAAGAGTTC
TGTTAATGATATTGACATCAATTTGAACTGTAGAAACAAGATGTGGAACAACAGGGGTACTTCCATTTCCTTGTCCTCCTCTAAACGGCCGTTGGCTTGGGTAGTAAACC
AAGCCTCCGAATATTCAAAGACTAAACATAGAGATCTAAAGAAACGTTTGAGGGAACCGAGTTCAAGCATTCAAATTAATGAATGGGAACAAGAACAAGAACGTAAAAGA
AACAAGCATACGAGAAAGATAAATGTCAATCATGGGTTCCTTGATGTCCAAGCCCAAGCAGGTGATTGCTTTCAAGAGAAGACGCAAAGTCTAACTAGTAGGCCAGTTGA
AGATCCTGAGGTTTTGAACCAGATAATAAAGAGTGCTTTTCTCAAGTTTGTCAAAGTTTTGAGTGAGAATTCAGCCAGACGTAAGAAGTTCACTGAACCAAGGTCTGGTA
TTATAAAGTGCATTGTCTGCGGCAGCAACTCCAAGGAGTTTGGAGATGCGCAAAGCTTATCACAACATGCCTTCCAGACGCTGGAAGCATCCAAGGCAGAGCACTTGGGT
CTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCAGCGCCAAATGGTGTATGGGTTCGAAGGAGATTGCCTCTTGCAGAGGCAATTGCTTTAAAGGA
GGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAATTGATAATACGTCTGAACGGGAAGCCATAAGTTGTGAAGAGCTTGAGGTTGTTATTA
GAGGGAAGATGAAAGTGGTACGTGGTAAGCCTGGCAACCAGAGTATTATGGTAGCAACTTTCGATGCCATGTTTACTGGGTTGCAAGAGACAGAAAGACTACACATAAGT
TTTCAAGACAAAAACCATGGTAGAGACGAGTTCCGCAAAATCAATTCCAGGCATACCATTGATGGAAGCCACAACAATCTGCATAACGCAGGAGAAAACAGGAACGAAGG
CGTACTTTATGGCTACTTGGGGCTTGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGAGATCTGTGGTGAAAAGCAAGAAAGAGATCCAGGCCATTGTGGATG
CATCTCTTCAATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTGGAGAGAAACCACTGGGGATCAAAGGTCCGAGTCTCCCTGGAGAATCTCGGAACCTAGGGTTGAACGAAACAGGCCTTGTCATCCCCACTGGTTACCCGGTTC
TTCACCGGAAGAACCGGTGACGACCAATGGCCTTGTGGCTTTTTCTGTTACAGACCCCTACTACCAAACTTCTCCTTGTGAAAATTTTCGTAAACTATCTCGGTTCTGCC
AGAATTTACAGCGGGAATCGCCGTCCACAAAGTTCCGCTGGGAAAATTTGTTTGCCAATGCGACTTCGAAATCGCGTCTGGGGTTTGATGGATGCGACGGTGGTGGTAAT
GGACCAATTAGGGTTTCCGGCTCCCAGTTGTGTGCGGGATCGTCGTCTACGGATGTTTTGGAGGGTTTTCATTTGCGCCCTTTTCAAATCAACACTGGGGCAACTAAAAA
CAGAAACGTAAACAATGGCGTTATTTCCACAAGTCTTGGAATCCGTGACTATGGCTGCCATTTTTCTTCATCTAGGAAGTTTGATGGGTCCGTTTATGAACCCAGTGATC
TTCATGTCCGGGACCGTTCACATGGGATTGGAACGTCGCATCTTCAATCCAGTGCACGTGTTTCTGATTCAAAACGCATTTCTCAAGATGTTTTTCATGGTTATTGTGAG
CGTCGTCTACCTCCAAACGCTCTGGATTCCACCTGGAAGAAGGGAAACCATACAGAACCTGTTGAAACTGAACTTGATACGGAAGGTTTAGTGGAGTATAAACGAGCTCA
GGGGAACCATATCGAGCACTTTGATGATCATAGTCAGTATTTCAAAGTCCAGCCGTGTAAGAGGAGTGACATTGGCAACAGTTCTTTCACTCAGCAGATGGTTCGTATCC
CACAAGATGAGTTCTATCAAGATTCTATGCGGACCAGCATTGTAATGGATCCAGTTATTGAGGGATTTGAAGACACTCAAAGCTATGTTGTGGGTGAAATGGAAGAGACC
CAGCCAACCAACCCTTATGATTTTTACGAAAAAAAGTTTATCATGGAATGTTCGTATAAGGGTAACAACCCTTTTGCGATGGAACAGGATGGCGAAGTTTTGTTTTCTGG
AATTGGAAGTCCACTAAAAGTTGAAAGAGATGCATATCAAAGTAGCGAGACGTTGCTCCTGGCTGAAGAAGATGGTATCAGGATAAACCATGGGGACTGGAAATGGTCGA
ATGAGGACGGATTAAATGGATCATTAGTATCAAAATATGAAAATGATTTGAGGGACATGGAAGACTGTAGAAAGCGGAGATGGAAAGAAGCTGCACAGTCGACAAAATCG
AGAGTCGACGGGACAAGATGTGGAGGGCATTATCCTGGGTCGGGATCATCTAGAAAACCTAATGTGTTCAGCAGAATCCAATTTTTAAGTCATGAAGATGAAAAGAGTTC
TGTTAATGATATTGACATCAATTTGAACTGTAGAAACAAGATGTGGAACAACAGGGGTACTTCCATTTCCTTGTCCTCCTCTAAACGGCCGTTGGCTTGGGTAGTAAACC
AAGCCTCCGAATATTCAAAGACTAAACATAGAGATCTAAAGAAACGTTTGAGGGAACCGAGTTCAAGCATTCAAATTAATGAATGGGAACAAGAACAAGAACGTAAAAGA
AACAAGCATACGAGAAAGATAAATGTCAATCATGGGTTCCTTGATGTCCAAGCCCAAGCAGGTGATTGCTTTCAAGAGAAGACGCAAAGTCTAACTAGTAGGCCAGTTGA
AGATCCTGAGGTTTTGAACCAGATAATAAAGAGTGCTTTTCTCAAGTTTGTCAAAGTTTTGAGTGAGAATTCAGCCAGACGTAAGAAGTTCACTGAACCAAGGTCTGGTA
TTATAAAGTGCATTGTCTGCGGCAGCAACTCCAAGGAGTTTGGAGATGCGCAAAGCTTATCACAACATGCCTTCCAGACGCTGGAAGCATCCAAGGCAGAGCACTTGGGT
CTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGCAGCAGCGCCAAATGGTGTATGGGTTCGAAGGAGATTGCCTCTTGCAGAGGCAATTGCTTTAAAGGA
GGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAATTGATAATACGTCTGAACGGGAAGCCATAAGTTGTGAAGAGCTTGAGGTTGTTATTA
GAGGGAAGATGAAAGTGGTACGTGGTAAGCCTGGCAACCAGAGTATTATGGTAGCAACTTTCGATGCCATGTTTACTGGGTTGCAAGAGACAGAAAGACTACACATAAGT
TTTCAAGACAAAAACCATGGTAGAGACGAGTTCCGCAAAATCAATTCCAGGCATACCATTGATGGAAGCCACAACAATCTGCATAACGCAGGAGAAAACAGGAACGAAGG
CGTACTTTATGGCTACTTGGGGCTTGCAGAGGACTTGGATAAACTTGACTTCGAGACCAAGAAGAGATCTGTGGTGAAAAGCAAGAAAGAGATCCAGGCCATTGTGGATG
CATCTCTTCAATGTTAG
Protein sequenceShow/hide protein sequence
MNWRETTGDQRSESPWRISEPRVERNRPCHPHWLPGSSPEEPVTTNGLVAFSVTDPYYQTSPCENFRKLSRFCQNLQRESPSTKFRWENLFANATSKSRLGFDGCDGGGN
GPIRVSGSQLCAGSSSTDVLEGFHLRPFQINTGATKNRNVNNGVISTSLGIRDYGCHFSSSRKFDGSVYEPSDLHVRDRSHGIGTSHLQSSARVSDSKRISQDVFHGYCE
RRLPPNALDSTWKKGNHTEPVETELDTEGLVEYKRAQGNHIEHFDDHSQYFKVQPCKRSDIGNSSFTQQMVRIPQDEFYQDSMRTSIVMDPVIEGFEDTQSYVVGEMEET
QPTNPYDFYEKKFIMECSYKGNNPFAMEQDGEVLFSGIGSPLKVERDAYQSSETLLLAEEDGIRINHGDWKWSNEDGLNGSLVSKYENDLRDMEDCRKRRWKEAAQSTKS
RVDGTRCGGHYPGSGSSRKPNVFSRIQFLSHEDEKSSVNDIDINLNCRNKMWNNRGTSISLSSSKRPLAWVVNQASEYSKTKHRDLKKRLREPSSSIQINEWEQEQERKR
NKHTRKINVNHGFLDVQAQAGDCFQEKTQSLTSRPVEDPEVLNQIIKSAFLKFVKVLSENSARRKKFTEPRSGIIKCIVCGSNSKEFGDAQSLSQHAFQTLEASKAEHLG
LHKALCWLMGWSSEAAAPNGVWVRRRLPLAEAIALKEDLIIWPPVLIIHNSSIAIDNTSEREAISCEELEVVIRGKMKVVRGKPGNQSIMVATFDAMFTGLQETERLHIS
FQDKNHGRDEFRKINSRHTIDGSHNNLHNAGENRNEGVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLQC