; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0214 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0214
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionXS domain-containing protein
Genome locationMC02:2037030..2041667
RNA-Seq ExpressionMC02g0214
SyntenyMC02g0214
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo]0.066.82Show/hide
Query:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE
        M+ RE ++D   RS+SPSL  RR SEPRVEE  HC+SHWFS S++E P+TN   LPG S+RDH+N +RLY ++DEHFRKLSQFCE+L+  ESPAKKF WE
Subjt:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE

Query:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSE-ANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDV
        +LF  N  AN +SK+S+GLKHVNG DGDN+G+RV GSHL   S S    +LRTFH NI AT DSNV + G+ SRS G+NDC+HLSSSRK+DGP+++  +V
Subjt:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSE-ANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDV

Query:  YIQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI
        +++D   +E   NSH  RG++  TSS G Q SH HSSA V ESKGISQ EFH  LEYKRAR  HIEHFDD N+YF  QPCKR+DI A  +   SQ MVRI
Subjt:  YIQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI

Query:  PQDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGY
        PQDDFY+D TRTSV++D VVEGF+DTES+     E  RP D+  F      IEGS     PFAMEQ  EVLGSGT S    E+EAY  SEKLLL E DGY
Subjt:  PQDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGY

Query:  NTNYGKWSGDDGLNGS-LSRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRN
         TN+GKW+ +DG+NGS +S++KQDLG  +MED RKL WKA HSTK RV+G     +R  MH  G  S KK NVFSRI FL +GD K T    D NL  RN
Subjt:  NTNYGKWSGDDGLNGS-LSRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRN

Query:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRER--KRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPP-EDPE
            DEDTS S   SKR LPW++N  S R K KR++LKKRLG+ L DP+ N LVRER  KRNKRL  TN+ H CLD Q  D  E+K QS T+RPP EDPE
Subjt:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRER--KRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPP-EDPE

Query:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE
        ELNQLIKSAF KF+KVL+EN ARRKK TEPG GII CIVCGSKSKEF DALSLSQHA  +L GSRAEHLGLHKALCWLMGWSSE APNGLWV+RILP  E
Subjt:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE

Query:  AFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINS
          ALKEDLIIWPPVLIIHNSSIA D  S+ V ISCEELE VIRGMG GGKIKVVRG+P NQSIMVVTF AMFSGLQEAERLHK+FADKSHGRDE H+IN 
Subjt:  AFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINS

Query:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         H IDS+ DLHKA GAN +ESVLYGYLGLAED  KLDFETKKRSVVKSKKEIQAIV+A+LQC
Subjt:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus]2.53e-10477.38Show/hide
Query:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVV
        SKSKEF DALSL QHA  +L GSRAEHLGLHKALCWLMGWSSE+APNGLWV+ ILP VE  ALKEDLIIWP VLIIHNSSIA D   E V ISCE+LE  
Subjt:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVV

Query:  IRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK
        +R MG GGK KVVRGK  NQSIMVVTF AMF GLQEAERLH NFADKSHGRDEFH+IN    +DS+ D+HKA GAN +ESV YGYLGL ED +KLDFETK
Subjt:  IRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK

Query:  KRSVVKSKKEIQAIVDATLQC
        KRSVV+SKKEIQAIV A+LQC
Subjt:  KRSVVKSKKEIQAIVDATLQC

XP_017982234.1 PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao]1.63e-9553.94Show/hide
Query:  SKRKDLKKRLG--VSLRDPSLNPLVRERKRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPPEDPEELNQLIKSAFFKFIKVLNENLARRKKFTEPG-
        S RK +K+RLG    + +P+  P V ER + ++L+  N++      QA D      +     PPED EE  Q I  AF KF+K+LNEN A+R+K+ E G 
Subjt:  SKRKDLKKRLG--VSLRDPSLNPLVRERKRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPPEDPEELNQLIKSAFFKFIKVLNENLARRKKFTEPG-

Query:  SGIIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQ
        +G +KC VCGSKS+EF + LSL  HAF S +VG RA HLGLHK+LC+LMGW+S  A NGLW Q+ LP VEA A+KEDL+IWPP++I+HNSSIAT N+  +
Subjt:  SGIIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQ

Query:  VTISCEELEVVIRGMGSG-GKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLA
        + +S EE+E  +R MG G G  KV RGKPANQSIM V F   FSGL+EAERLHK +A+  HGR EF +IN S      G+  KA  +K++ VLYGYLG+A
Subjt:  VTISCEELEVVIRGMGSG-GKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLA

Query:  EDFEKLDFETKKRSVVKSKKEIQAIVDATL
         D +KLDFETK R++VKSKKEI A  DA L
Subjt:  EDFEKLDFETKKRSVVKSKKEIQAIVDATL

XP_022140332.1 uncharacterized protein LOC111011032 [Momordica charantia]5.18e-118100Show/hide
Query:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEA
        MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEA
Subjt:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEA

Query:  ERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
        ERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
Subjt:  ERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida]0.069.53Show/hide
Query:  MSWRERSKDDRSRSRSPSLRRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWES
        M++RE S D RS+S S S  RR SEPRVEEN HCHS WFS S++E PVTNG  L G S+RDH+N +RLYEN DEHFRKLSQ CE+L+R ESP+KKF WE+
Subjt:  MSWRERSKDDRSRSRSPSLRRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWES

Query:  LFAKNPANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS--EANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDVY
        LFA NPANA+SKSS+GLKH N CDG N+G+RV GSHL   S++    ++LRTFH NI  T DSNV + G+ SRSFG++DCSHLSSSRKFDGP+YET+DV+
Subjt:  LFAKNPANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS--EANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDVY

Query:  IQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRIP
        ++D   +ESA NSH  RG++   SSHG Q S+  SSA VTESKGISQDEFH FLEYKRAR  +IE FDD N+YF  QP KRSDI A LNS+ SQQMVRIP
Subjt:  IQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRIP

Query:  QDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGYN
        QDDFYQD TRTSV++D VVEGF+DTES++    E  RP D Y  FKEP +IEGSY G  PF ME   E LGSG  S +K E+EAY  SEKLLLAE DGY 
Subjt:  QDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGYN

Query:  TNYGKWSGDDGLNGSL-SRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRNE
        T YGKW  +DG+NGSL S++KQDL   +ME SRKLRWKA++STK RV+G     +RC MH  GS SS+K NVFSRI FL +GDE   VK  DINL  R++
Subjt:  TNYGKWSGDDGLNGSL-SRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRNE

Query:  LWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRERKR--NKRLINTNISHECLDFQASDCFEDKTQSSTNRPPEDPEEL
         WN+EDTS+ LTSSKR LPW+IN  S   K KR+DL+KRLG  LRDPS +PLVR+RKR  NKRL   N++H CLD Q  D  E+K QS T+R  ED EEL
Subjt:  LWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRERKR--NKRLINTNISHECLDFQASDCFEDKTQSSTNRPPEDPEEL

Query:  NQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAF
        NQLIKSAF KF+KVL+EN ARRKKFTEPG GIIKCIVCGSKSKEFADALSLSQHA  +L GSRAEHLGL KALCWLMGWSSE AP+G WV+RILP  E  
Subjt:  NQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAF

Query:  ALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSH
        ALKEDLIIWPPVLIIHNSSIA D+ SE+V ISCEELEVVIRGMG GGKIKVVRGKP NQSIM+VTF AMFSGLQEAERLHK+FADKSHGRDEF +I SSH
Subjt:  ALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSH

Query:  RIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         IDSH DLHKA GAN +++VLYGYLGL ED +KLDFETKKRSVVKSKKEIQAIV+A+L C
Subjt:  RIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

TrEMBL top hitse value%identityAlignment
A0A0A0KGN5 XS domain-containing protein1.23e-10477.38Show/hide
Query:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVV
        SKSKEF DALSL QHA  +L GSRAEHLGLHKALCWLMGWSSE+APNGLWV+ ILP VE  ALKEDLIIWP VLIIHNSSIA D   E V ISCE+LE  
Subjt:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVV

Query:  IRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK
        +R MG GGK KVVRGK  NQSIMVVTF AMF GLQEAERLH NFADKSHGRDEFH+IN    +DS+ D+HKA GAN +ESV YGYLGL ED +KLDFETK
Subjt:  IRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK

Query:  KRSVVKSKKEIQAIVDATLQC
        KRSVV+SKKEIQAIV A+LQC
Subjt:  KRSVVKSKKEIQAIVDATLQC

A0A1S3C894 uncharacterized protein LOC1034979640.066.82Show/hide
Query:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE
        M+ RE ++D   RS+SPSL  RR SEPRVEE  HC+SHWFS S++E P+TN   LPG S+RDH+N +RLY ++DEHFRKLSQFCE+L+  ESPAKKF WE
Subjt:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE

Query:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSE-ANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDV
        +LF  N  AN +SK+S+GLKHVNG DGDN+G+RV GSHL   S S    +LRTFH NI AT DSNV + G+ SRS G+NDC+HLSSSRK+DGP+++  +V
Subjt:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSE-ANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDV

Query:  YIQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI
        +++D   +E   NSH  RG++  TSS G Q SH HSSA V ESKGISQ EFH  LEYKRAR  HIEHFDD N+YF  QPCKR+DI A  +   SQ MVRI
Subjt:  YIQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI

Query:  PQDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGY
        PQDDFY+D TRTSV++D VVEGF+DTES+     E  RP D+  F      IEGS     PFAMEQ  EVLGSGT S    E+EAY  SEKLLL E DGY
Subjt:  PQDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGY

Query:  NTNYGKWSGDDGLNGS-LSRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRN
         TN+GKW+ +DG+NGS +S++KQDLG  +MED RKL WKA HSTK RV+G     +R  MH  G  S KK NVFSRI FL +GD K T    D NL  RN
Subjt:  NTNYGKWSGDDGLNGS-LSRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRN

Query:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRER--KRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPP-EDPE
            DEDTS S   SKR LPW++N  S R K KR++LKKRLG+ L DP+ N LVRER  KRNKRL  TN+ H CLD Q  D  E+K QS T+RPP EDPE
Subjt:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRER--KRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPP-EDPE

Query:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE
        ELNQLIKSAF KF+KVL+EN ARRKK TEPG GII CIVCGSKSKEF DALSLSQHA  +L GSRAEHLGLHKALCWLMGWSSE APNGLWV+RILP  E
Subjt:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE

Query:  AFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINS
          ALKEDLIIWPPVLIIHNSSIA D  S+ V ISCEELE VIRGMG GGKIKVVRG+P NQSIMVVTF AMFSGLQEAERLHK+FADKSHGRDE H+IN 
Subjt:  AFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINS

Query:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         H IDS+ DLHKA GAN +ESVLYGYLGLAED  KLDFETKKRSVVKSKKEIQAIV+A+LQC
Subjt:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

A0A5A7SQC0 XS domain-containing protein0.066.82Show/hide
Query:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE
        M+ RE ++D   RS+SPSL  RR SEPRVEE  HC+SHWFS S++E P+TN   LPG S+RDH+N +RLY ++DEHFRKLSQFCE+L+  ESPAKKF WE
Subjt:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE

Query:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSE-ANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDV
        +LF  N  AN +SK+S+GLKHVNG DGDN+G+RV GSHL   S S    +LRTFH NI AT DSNV + G+ SRS G+NDC+HLSSSRK+DGP+++  +V
Subjt:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSE-ANDLRTFHTNIRATNDSNVMD-GNASRSFGVNDCSHLSSSRKFDGPVYETTDV

Query:  YIQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI
        +++D   +E   NSH  RG++  TSS G Q SH HSSA V ESKGISQ EFH  LEYKRAR  HIEHFDD N+YF  QPCKR+DI A  +   SQ MVRI
Subjt:  YIQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI

Query:  PQDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGY
        PQDDFY+D TRTSV++D VVEGF+DTES+     E  RP D+  F      IEGS     PFAMEQ  EVLGSGT S    E+EAY  SEKLLL E DGY
Subjt:  PQDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAE-DGY

Query:  NTNYGKWSGDDGLNGS-LSRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRN
         TN+GKW+ +DG+NGS +S++KQDLG  +MED RKL WKA HSTK RV+G     +R  MH  G  S KK NVFSRI FL +GD K T    D NL  RN
Subjt:  NTNYGKWSGDDGLNGS-LSRNKQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRN

Query:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRER--KRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPP-EDPE
            DEDTS S   SKR LPW++N  S R K KR++LKKRLG+ L DP+ N LVRER  KRNKRL  TN+ H CLD Q  D  E+K QS T+RPP EDPE
Subjt:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRER--KRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPP-EDPE

Query:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE
        ELNQLIKSAF KF+KVL+EN ARRKK TEPG GII CIVCGSKSKEF DALSLSQHA  +L GSRAEHLGLHKALCWLMGWSSE APNGLWV+RILP  E
Subjt:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE

Query:  AFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINS
          ALKEDLIIWPPVLIIHNSSIA D  S+ V ISCEELE VIRGMG GGKIKVVRG+P NQSIMVVTF AMFSGLQEAERLHK+FADKSHGRDE H+IN 
Subjt:  AFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINS

Query:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         H IDS+ DLHKA GAN +ESVLYGYLGLAED  KLDFETKKRSVVKSKKEIQAIV+A+LQC
Subjt:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

A0A6J1CGJ5 uncharacterized protein LOC1110110322.51e-118100Show/hide
Query:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEA
        MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEA
Subjt:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEA

Query:  ERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
        ERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
Subjt:  ERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

A0A6P5XUQ8 uncharacterized protein LOC1112861698.92e-9153.64Show/hide
Query:  SKRKDLKKRLG--VSLRDPSLNPLVRERKRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPPEDPEELNQLIKSAFFKFIKVLNENLARRKKFTEPGS
        S RK +K+RLG    + +P++ P + ER + ++L+  N++      QA D      +     PPED EE  QLI +AF KF+KVLNEN A+R+K+TE G+
Subjt:  SKRKDLKKRLG--VSLRDPSLNPLVRERKRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPPEDPEELNQLIKSAFFKFIKVLNENLARRKKFTEPGS

Query:  G-IIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQ
           +KC VCGS SK+F +  SL  HAF S +VG R +HLGLHKALC LMGWSS  A N LWVQ+ LP  EA A+ EDL++WPPV+I+HNSSIA  N  +Q
Subjt:  G-IIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQ

Query:  VTISCEELEVVIRGMGSG-GKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLA
        + +S EELE  +R MG G G  KV RGKPANQSIM V F   FSGLQEAERLH  +A+  HGR EF  +  S      G+  K   +K + VLYGYLG+A
Subjt:  VTISCEELEVVIRGMGSG-GKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLA

Query:  EDFEKLDFETKKRSVVKSKKEIQAIVDATL
         D +KLDFETK RSVVKSKKEI AI DA L
Subjt:  EDFEKLDFETKKRSVVKSKKEIQAIVDATL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)1.0e-2930Show/hide
Query:  LVRERKRNKRLINTNISHECLDFQASDCFED-----KTQSSTNRPPEDPEELNQL-IKSAFFKFIKVLNENLARRKKFTEPG-SGIIKCIVCGSKSKEFA
        ++R+R++  +  N N  H  +   + D  ED       +  ++R      +++Q+ +K +F  F+K + E+   +K + E G  G ++C+VCG  SK+  
Subjt:  LVRERKRNKRLINTNISHECLDFQASDCFED-----KTQSSTNRPPEDPEELNQL-IKSAFFKFIKVLNENLARRKKFTEPG-SGIIKCIVCGSKSKEFA

Query:  DALSLSQHAF-NSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMG-
        D  SL  H + +    SR  HLGLHKALC LMGW+   AP+     + LP  EA   +  LIIWPP +I+ N+S              + ++  IR +G 
Subjt:  DALSLSQHAF-NSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMG-

Query:  SGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEIN----SSHRIDSHGDLHKAG-ANKMESVLYGYLGLAEDFEKLDFETKK
        +GGK K + G+  +  I +  F    SGL++A R+ + F   + GR  +  +     S     + G +   G   + + + YGYL    D +K+D ETKK
Subjt:  SGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAERLHKNFADKSHGRDEFHEIN----SSHRIDSHGDLHKAG-ANKMESVLYGYLGLAEDFEKLDFETKK

Query:  RSVVKSKKEI
        ++ ++S +E+
Subjt:  RSVVKSKKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTGGAGAGAAAGGAGTAAAGATGATAGGTCTCGGTCTCGGTCTCCGTCGCTTCGACGAAGAAATTCAGAACCTCGGGTTGAGGAAAACCGGCACTGTCATTCTCA
CTGGTTTTCGGGCTCTGCACAAGAAGGACCGGTGACGAATGGCCCTGCGCTTCCGGGTTATTCTGTGAGAGACCATTTTAATGAAACTCGTCTTTATGAGAATAGAGACG
AACATTTTCGTAAACTCTCTCAGTTTTGCGAGAGTTTGGAGCGGAGGGAATCGCCGGCGAAAAAGTTTGGGTGGGAAAGTTTGTTCGCCAAAAATCCCGCCAATGCGAGT
TCGAAATCGAGTTTGGGGTTGAAACATGTAAACGGATGTGATGGTGATAATCAAGGACTTAGGGTTTACGGTTCTCATTTGATTCCGGAATCGTCGTCAGAAGCTAATGA
TTTACGCACATTCCATACGAACATTAGAGCAACTAATGATAGTAATGTAATGGATGGGAATGCTTCCAGAAGTTTTGGAGTCAATGACTGTAGTCATTTGTCTTCATCTA
GAAAGTTTGATGGGCCCGTATACGAGACCACTGATGTTTATATTCAGGACCATTCACCGTATGAATCAGCAAGAAATTCCCACTCCCACAGAGGAAAACAAAAGGGAACT
TCCTCACATGGGACACAAGGGTCACATCCGCACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGGTTTCTGGAGTATAAACGTGCTCG
TGGGGAACATATCGAGCACTTCGATGATTGCAATAAGTATTTTAAAGCTCAACCATGCAAGAGGAGTGACATCGGTGCTGCTCTCAACAGTTCTTTGTCTCAGCAGATGG
TCCGTATCCCACAAGACGATTTCTATCAAGACTGTACTCGGACCAGTGTTATAGTGGATCCAGTTGTCGAGGGATTTGAAGACACTGAAAGCTATGTCATGGGTGATATG
GAAGAGAACCGGCCAAGCGACAACTATGGTTTTTTCAAAGAACCACACATCATTGAAGGTTCTTATAGGGGAAACGGTCCTTTTGCCATGGAACAGGATGATGAAGTTTT
GGGTTCTGGAACCGGGAGTCTGCTGAAGTGTGAAAAAGAAGCATATACAGGCAGTGAGAAGTTGCTCTTGGCAGAAGATGGTTATAATACAAATTATGGGAAATGGTCGG
GTGATGATGGATTAAATGGATCCTTATCAAGAAATAAACAAGATTTGGGTGGCATGGAAATGGAAGACAGTAGGAAGCTGAGATGGAAAGCCTCGCATTCAACAAAACGA
AGGGTCAAGGGGAAATGCTTTGTATCTTCAAGATGCGGAATGCATTATCGTGGGTCCGATTCATCTAAAAAACGTAACGTGTTTAGCAGAATCCATTTTTTAGGTAATGG
AGATGAAAAGAGTACTGTTAAACACATTGATATCAATTTAAAACGTAGAAACGAGTTGTGGAATGATGAGGATACTTCCATGTCCTTAACCTCCTCCAAACGGCTGTTGC
CTTGGATAATAAACCGTGGCTCTCAGCGTCTGAAGTCTAAACGCAAAGACCTTAAGAAACGTTTGGGTGTCTCCTTGAGGGATCCCAGTTTAAATCCTCTAGTTAGAGAA
CGTAAAAGAAATAAGCGTCTGATAAACACAAATATCAGTCATGAGTGCCTTGATTTTCAAGCAAGTGATTGCTTTGAAGACAAGACGCAAAGTTCAACCAATAGGCCACC
TGAAGATCCTGAGGAGTTGAACCAGCTAATAAAGAGTGCCTTTTTCAAGTTTATCAAAGTTCTGAATGAGAACCTAGCCCGACGAAAGAAGTTCACAGAGCCAGGGTCTG
GTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCCAAGGAGTTTGCGGATGCACTAAGCTTATCACAACATGCCTTCAATTCGCTGGTAGGATCGAGGGCGGAACACTTG
GGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGTAGCGCCAAATGGTCTATGGGTTCAAAGGATATTGCCCCATGTAGAAGCCTTTGCTTTGAAGGA
GGATCTCATTATATGGCCTCCTGTTCTTATCATTCATAACAGTTCTATTGCAACTGATAATACGTCTGAACAGGTAACCATAAGTTGTGAAGAGCTCGAGGTTGTTATTA
GAGGAATGGGTTCCGGAGGGAAGATCAAAGTGGTTCGTGGTAAACCTGCAAATCAGAGCATTATGGTAGTAACTTTCTGTGCAATGTTTTCTGGATTGCAAGAAGCAGAA
AGACTACACAAAAACTTTGCCGATAAAAGTCATGGGAGGGATGAGTTCCATGAAATCAATTCGAGTCATCGCATTGACAGCCATGGGGATTTGCATAAAGCAGGAGCAAA
CAAGATGGAAAGCGTTCTTTATGGCTACTTAGGCCTCGCAGAGGACTTCGAAAAACTTGACTTTGAGACCAAGAAGAGGTCCGTGGTGAAAAGCAAGAAAGAAATCCAGG
CCATTGTGGATGCAACTCTTCAATGT
mRNA sequenceShow/hide mRNA sequence
ATGAGCTGGAGAGAAAGGAGTAAAGATGATAGGTCTCGGTCTCGGTCTCCGTCGCTTCGACGAAGAAATTCAGAACCTCGGGTTGAGGAAAACCGGCACTGTCATTCTCA
CTGGTTTTCGGGCTCTGCACAAGAAGGACCGGTGACGAATGGCCCTGCGCTTCCGGGTTATTCTGTGAGAGACCATTTTAATGAAACTCGTCTTTATGAGAATAGAGACG
AACATTTTCGTAAACTCTCTCAGTTTTGCGAGAGTTTGGAGCGGAGGGAATCGCCGGCGAAAAAGTTTGGGTGGGAAAGTTTGTTCGCCAAAAATCCCGCCAATGCGAGT
TCGAAATCGAGTTTGGGGTTGAAACATGTAAACGGATGTGATGGTGATAATCAAGGACTTAGGGTTTACGGTTCTCATTTGATTCCGGAATCGTCGTCAGAAGCTAATGA
TTTACGCACATTCCATACGAACATTAGAGCAACTAATGATAGTAATGTAATGGATGGGAATGCTTCCAGAAGTTTTGGAGTCAATGACTGTAGTCATTTGTCTTCATCTA
GAAAGTTTGATGGGCCCGTATACGAGACCACTGATGTTTATATTCAGGACCATTCACCGTATGAATCAGCAAGAAATTCCCACTCCCACAGAGGAAAACAAAAGGGAACT
TCCTCACATGGGACACAAGGGTCACATCCGCACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGGTTTCTGGAGTATAAACGTGCTCG
TGGGGAACATATCGAGCACTTCGATGATTGCAATAAGTATTTTAAAGCTCAACCATGCAAGAGGAGTGACATCGGTGCTGCTCTCAACAGTTCTTTGTCTCAGCAGATGG
TCCGTATCCCACAAGACGATTTCTATCAAGACTGTACTCGGACCAGTGTTATAGTGGATCCAGTTGTCGAGGGATTTGAAGACACTGAAAGCTATGTCATGGGTGATATG
GAAGAGAACCGGCCAAGCGACAACTATGGTTTTTTCAAAGAACCACACATCATTGAAGGTTCTTATAGGGGAAACGGTCCTTTTGCCATGGAACAGGATGATGAAGTTTT
GGGTTCTGGAACCGGGAGTCTGCTGAAGTGTGAAAAAGAAGCATATACAGGCAGTGAGAAGTTGCTCTTGGCAGAAGATGGTTATAATACAAATTATGGGAAATGGTCGG
GTGATGATGGATTAAATGGATCCTTATCAAGAAATAAACAAGATTTGGGTGGCATGGAAATGGAAGACAGTAGGAAGCTGAGATGGAAAGCCTCGCATTCAACAAAACGA
AGGGTCAAGGGGAAATGCTTTGTATCTTCAAGATGCGGAATGCATTATCGTGGGTCCGATTCATCTAAAAAACGTAACGTGTTTAGCAGAATCCATTTTTTAGGTAATGG
AGATGAAAAGAGTACTGTTAAACACATTGATATCAATTTAAAACGTAGAAACGAGTTGTGGAATGATGAGGATACTTCCATGTCCTTAACCTCCTCCAAACGGCTGTTGC
CTTGGATAATAAACCGTGGCTCTCAGCGTCTGAAGTCTAAACGCAAAGACCTTAAGAAACGTTTGGGTGTCTCCTTGAGGGATCCCAGTTTAAATCCTCTAGTTAGAGAA
CGTAAAAGAAATAAGCGTCTGATAAACACAAATATCAGTCATGAGTGCCTTGATTTTCAAGCAAGTGATTGCTTTGAAGACAAGACGCAAAGTTCAACCAATAGGCCACC
TGAAGATCCTGAGGAGTTGAACCAGCTAATAAAGAGTGCCTTTTTCAAGTTTATCAAAGTTCTGAATGAGAACCTAGCCCGACGAAAGAAGTTCACAGAGCCAGGGTCTG
GTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCCAAGGAGTTTGCGGATGCACTAAGCTTATCACAACATGCCTTCAATTCGCTGGTAGGATCGAGGGCGGAACACTTG
GGTCTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGTAGCGCCAAATGGTCTATGGGTTCAAAGGATATTGCCCCATGTAGAAGCCTTTGCTTTGAAGGA
GGATCTCATTATATGGCCTCCTGTTCTTATCATTCATAACAGTTCTATTGCAACTGATAATACGTCTGAACAGGTAACCATAAGTTGTGAAGAGCTCGAGGTTGTTATTA
GAGGAATGGGTTCCGGAGGGAAGATCAAAGTGGTTCGTGGTAAACCTGCAAATCAGAGCATTATGGTAGTAACTTTCTGTGCAATGTTTTCTGGATTGCAAGAAGCAGAA
AGACTACACAAAAACTTTGCCGATAAAAGTCATGGGAGGGATGAGTTCCATGAAATCAATTCGAGTCATCGCATTGACAGCCATGGGGATTTGCATAAAGCAGGAGCAAA
CAAGATGGAAAGCGTTCTTTATGGCTACTTAGGCCTCGCAGAGGACTTCGAAAAACTTGACTTTGAGACCAAGAAGAGGTCCGTGGTGAAAAGCAAGAAAGAAATCCAGG
CCATTGTGGATGCAACTCTTCAATGT
Protein sequenceShow/hide protein sequence
MSWRERSKDDRSRSRSPSLRRRNSEPRVEENRHCHSHWFSGSAQEGPVTNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWESLFAKNPANAS
SKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSEANDLRTFHTNIRATNDSNVMDGNASRSFGVNDCSHLSSSRKFDGPVYETTDVYIQDHSPYESARNSHSHRGKQKGT
SSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRIPQDDFYQDCTRTSVIVDPVVEGFEDTESYVMGDM
EENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAEDGYNTNYGKWSGDDGLNGSLSRNKQDLGGMEMEDSRKLRWKASHSTKR
RVKGKCFVSSRCGMHYRGSDSSKKRNVFSRIHFLGNGDEKSTVKHIDINLKRRNELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRE
RKRNKRLINTNISHECLDFQASDCFEDKTQSSTNRPPEDPEELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHL
GLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSEQVTISCEELEVVIRGMGSGGKIKVVRGKPANQSIMVVTFCAMFSGLQEAE
RLHKNFADKSHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC