; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001290 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001290
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionXS domain-containing protein
Genome locationChr09:15746641..15750926
RNA-Seq ExpressionHG10001290
SyntenyHG10001290
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo]0.0e+0078.77Show/hide
Query:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN
        MN RE + DKRSQSPS FGRRTSEPRVEE PHC+SH  SRSSRERP+TN L GSSIRDHYNGSRLY ++DEHFRKLSQFCENLQ ESP+KKF WENLF N
Subjt:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN

Query:  NP-ANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRD
        N  AN  SK+S+GLKH +  DG NRGIRVSGSHLGTSSK+IL G NL+TF+M IGATKDS VKNN DTSRS   +DC+HLSS RK+DGPL++ N+VHVRD
Subjt:  NP-ANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRD

Query:  RSSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFY
        R  FE  ENSHRGRRN TSS GIQ SHLHSSAPV ESK ISQ EFHDLLEYKRARRNHIEH DDSNQYFSVQPCKR+DIDA  + PFSQ MVR PQDDFY
Subjt:  RSSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFY

Query:  QDSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSH
        +DSTRTSVVMDSVV GF+D E+H EETTRPRD ++ F     IEGS M TAPFAMEQY EVLGSGT S + GERE Y SSEKLLL EEDGYRTN+GKW+ 
Subjt:  QDSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSH

Query:  EDDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSS
        E DG+NG  VSKHKQDL DM+D RKL WKA  STKPRVEG+R +MHDPG G  +KPNVFSRIQFL+HGD    VKD D NLNCRN W  DEDTS    SS
Subjt:  EDDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSS

Query:  KWLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRR-PLEDPEELNQLIKSAFLKFVK
        K  LPWVVNH SPRSK KRR+LKKRLG  L DP SN LVRERE KRNKRLRKTN +H C DVQTGDYL EKVQSPT R PLEDPEELNQLIKSAFLKFVK
Subjt:  KWLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRR-PLEDPEELNQLIKSAFLKFVK

Query:  VLSENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVL
        VLSENPARRK  TEPG GII CIVCGSKSKEF +ALSLSQHAS++LEGSRAEHLGLHKALCWLMGWSSE APN LWVRRILPL EVLALKEDLIIWPPVL
Subjt:  VLSENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVL

Query:  IIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAA
        IIHNSSIAID LS+ VAISCEELE VIRGMGCG KIK+VRG+P + SIM+ TFGAMFSGLQEAERLHKSFADKSHGRDE HKIN  HL DS+ D LHKA 
Subjt:  IIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAA

Query:  GANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC
        GANTL+S+LYGYLGLAEDL KLDFETKKRSV+KSKKEIQAIVNASL C
Subjt:  GANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus]6.6e-8778.38Show/hide
Query:  SKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVV
        SKSKEF +ALSL QHAS++LEGSRAEHLGLHKALCWLMGWSSE APN LWVR ILP VEVLALKEDLIIWP VLIIHNSSIAID   E VAISCE+LE  
Subjt:  SKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVV

Query:  IRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFET
        +R MGCG K K+VRGK  + SIM+ TFGAMF GLQEAERLH +FADKSHGRDEFHKIN   L DS+ D +HKA GANTL+S+ YGYLGL EDLDKLDFET
Subjt:  IRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFET

Query:  KKRSVLKSKKEIQAIVNASLHC
        KKRSV++SKKEIQAIV+ASL C
Subjt:  KKRSVLKSKKEIQAIVNASLHC

XP_021279328.1 uncharacterized protein LOC110412979 [Herrania umbratica]2.3e-7150.91Show/hide
Query:  RRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDY-LEKVQSPTRRPLEDPEELNQLIKSAFLKFVKVLSENPARRKTFTEPG-SG
        R+ +K+RLG        N + R +  K  K L K N N     VQ  D  L  V+     P ED +E  Q I+ AF+++VK+L+ENPA+R+ +TE G +G
Subjt:  RRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDY-LEKVQSPTRRPLEDPEELNQLIKSAFLKFVKVLSENPARRKTFTEPG-SG

Query:  IIKCIVCGSKSKEFANALSLSQHASQS-LEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVA
         +KC VCGSKS+EF N LSL  HA  S + G R  HLGLHKALC+LMGW+S AA N LW ++ LP VE LA+KEDL+IWPPV+I+HNSSIA  N   R+ 
Subjt:  IIKCIVCGSKSKEFANALSLSQHASQS-LEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVA

Query:  ISCEELEVVIRGMGCGAKI-KMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLA
        +S EE+E  +R MG G  I K+ RGKP + SIM   F   FSGL+EAERLHK +A+  HGR EF +IN S   ++ K  L K      ++ +LYGYLG+A
Subjt:  ISCEELEVVIRGMGCGAKI-KMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLA

Query:  EDLDKLDFETKKRSVLKSKKEIQAIVNASL
         DLDKLDFETK R+++KSKKEI A  +A L
Subjt:  EDLDKLDFETKKRSVLKSKKEIQAIVNASL

XP_022140332.1 uncharacterized protein LOC111011032 [Momordica charantia]4.6e-7280.54Show/hide
Query:  MGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEA
        MGWSSE APN LWV+RILP VE  ALKEDLIIWPPVLIIHNSSIA DN SE+V ISCEELEVVIRGMG G KIK+VRGKP + SIM+ TF AMFSGLQEA
Subjt:  MGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEA

Query:  ERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC
        ERLHK+FADKSHGRDEFH+INSSH  DSH D LHK AGAN ++S+LYGYLGLAED +KLDFETKKRSV+KSKKEIQAIV+A+L C
Subjt:  ERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida]0.0e+0083.92Show/hide
Query:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN
        MN RETS DKRSQSPS FGRRTSEPRVEENPHCHS   SRSSRE PVTNGL GSSIRDHYNGSRLYEN DEHFRKLSQ CENLQ+ESPSKKF WENLFAN
Subjt:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN

Query:  NPANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRDR
        NPANA SKSS+GLKH ++CDG NRGIRVSGSHLGTSS NIL G+NL+TF+M IG TKDS VKNN D SRSF  DDCSHLSS RKFDGPLYET+DVHVRDR
Subjt:  NPANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRDR

Query:  SSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFYQ
          FESAENSHRGRRN  SSHG+Q S+L SSAPVTESK ISQDEFHD LEYKRARRN+IE  DDSNQYFSVQP KRSDIDA LN  FSQQMVR PQDDFYQ
Subjt:  SSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFYQ

Query:  DSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSHE
        DSTRTSVVMDSVV GF+D E+HLEETTRPRD YD FK+PF+IEGSYMGTAPF ME YGE LGSG  S  KGERE Y SSEKLLLAEEDGYRT YGKW HE
Subjt:  DSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSHE

Query:  DDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSSK
         DG+NG LVSKHKQDLSDM+ SRKLRWKAT+STK RVEG+RC MH+PGS  SRKPNVFSRIQFLSHGDENI VKD DINLNCR+KWWN+EDTSI LTSSK
Subjt:  DDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSSK

Query:  WLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRRPLEDPEELNQLIKSAFLKFVKVL
          LPWV+NHASP SK KRRDL+KRLGF LRDP+S+PLVR+R+ K+NKRLRK N NH C DVQT DY+ EKVQSPT R LED EELNQLIKSAFLKFVKVL
Subjt:  WLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRRPLEDPEELNQLIKSAFLKFVKVL

Query:  SENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLII
        SENPARRK FTEPG GIIKCIVCGSKSKEFA+ALSLSQHASQ+LEGSRAEHLGL KALCWLMGWSSEAAP+  WVRRILPL EVLALKEDLIIWPPVLII
Subjt:  SENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLII

Query:  HNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGA
        HNSSIAID+ SERVAISCEELEVVIRGMGCG KIK+VRGKP + SIMI TF AMFSGLQEAERLHKSFADKSHGRDEF KI SSHL DSHKD LHKA GA
Subjt:  HNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGA

Query:  NTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC
        NTLD++LYGYLGL EDLDKLDFETKKRSV+KSKKEIQAIVNASLHC
Subjt:  NTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC

TrEMBL top hitse value%identityAlignment
A0A0A0KGN5 XS domain-containing protein3.2e-8778.38Show/hide
Query:  SKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVV
        SKSKEF +ALSL QHAS++LEGSRAEHLGLHKALCWLMGWSSE APN LWVR ILP VEVLALKEDLIIWP VLIIHNSSIAID   E VAISCE+LE  
Subjt:  SKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVV

Query:  IRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFET
        +R MGCG K K+VRGK  + SIM+ TFGAMF GLQEAERLH +FADKSHGRDEFHKIN   L DS+ D +HKA GANTL+S+ YGYLGL EDLDKLDFET
Subjt:  IRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFET

Query:  KKRSVLKSKKEIQAIVNASLHC
        KKRSV++SKKEIQAIV+ASL C
Subjt:  KKRSVLKSKKEIQAIVNASLHC

A0A1S3C894 uncharacterized protein LOC1034979640.0e+0078.77Show/hide
Query:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN
        MN RE + DKRSQSPS FGRRTSEPRVEE PHC+SH  SRSSRERP+TN L GSSIRDHYNGSRLY ++DEHFRKLSQFCENLQ ESP+KKF WENLF N
Subjt:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN

Query:  NP-ANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRD
        N  AN  SK+S+GLKH +  DG NRGIRVSGSHLGTSSK+IL G NL+TF+M IGATKDS VKNN DTSRS   +DC+HLSS RK+DGPL++ N+VHVRD
Subjt:  NP-ANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRD

Query:  RSSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFY
        R  FE  ENSHRGRRN TSS GIQ SHLHSSAPV ESK ISQ EFHDLLEYKRARRNHIEH DDSNQYFSVQPCKR+DIDA  + PFSQ MVR PQDDFY
Subjt:  RSSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFY

Query:  QDSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSH
        +DSTRTSVVMDSVV GF+D E+H EETTRPRD ++ F     IEGS M TAPFAMEQY EVLGSGT S + GERE Y SSEKLLL EEDGYRTN+GKW+ 
Subjt:  QDSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSH

Query:  EDDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSS
        E DG+NG  VSKHKQDL DM+D RKL WKA  STKPRVEG+R +MHDPG G  +KPNVFSRIQFL+HGD    VKD D NLNCRN W  DEDTS    SS
Subjt:  EDDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSS

Query:  KWLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRR-PLEDPEELNQLIKSAFLKFVK
        K  LPWVVNH SPRSK KRR+LKKRLG  L DP SN LVRERE KRNKRLRKTN +H C DVQTGDYL EKVQSPT R PLEDPEELNQLIKSAFLKFVK
Subjt:  KWLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRR-PLEDPEELNQLIKSAFLKFVK

Query:  VLSENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVL
        VLSENPARRK  TEPG GII CIVCGSKSKEF +ALSLSQHAS++LEGSRAEHLGLHKALCWLMGWSSE APN LWVRRILPL EVLALKEDLIIWPPVL
Subjt:  VLSENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVL

Query:  IIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAA
        IIHNSSIAID LS+ VAISCEELE VIRGMGCG KIK+VRG+P + SIM+ TFGAMFSGLQEAERLHKSFADKSHGRDE HKIN  HL DS+ D LHKA 
Subjt:  IIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAA

Query:  GANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC
        GANTL+S+LYGYLGLAEDL KLDFETKKRSV+KSKKEIQAIVNASL C
Subjt:  GANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC

A0A5A7SQC0 XS domain-containing protein0.0e+0078.77Show/hide
Query:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN
        MN RE + DKRSQSPS FGRRTSEPRVEE PHC+SH  SRSSRERP+TN L GSSIRDHYNGSRLY ++DEHFRKLSQFCENLQ ESP+KKF WENLF N
Subjt:  MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFAN

Query:  NP-ANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRD
        N  AN  SK+S+GLKH +  DG NRGIRVSGSHLGTSSK+IL G NL+TF+M IGATKDS VKNN DTSRS   +DC+HLSS RK+DGPL++ N+VHVRD
Subjt:  NP-ANAKSKSSIGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRD

Query:  RSSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFY
        R  FE  ENSHRGRRN TSS GIQ SHLHSSAPV ESK ISQ EFHDLLEYKRARRNHIEH DDSNQYFSVQPCKR+DIDA  + PFSQ MVR PQDDFY
Subjt:  RSSFESAENSHRGRRNGTSSHGIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFY

Query:  QDSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSH
        +DSTRTSVVMDSVV GF+D E+H EETTRPRD ++ F     IEGS M TAPFAMEQY EVLGSGT S + GERE Y SSEKLLL EEDGYRTN+GKW+ 
Subjt:  QDSTRTSVVMDSVVSGFEDNETHLEETTRPRDLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSH

Query:  EDDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSS
        E DG+NG  VSKHKQDL DM+D RKL WKA  STKPRVEG+R +MHDPG G  +KPNVFSRIQFL+HGD    VKD D NLNCRN W  DEDTS    SS
Subjt:  EDDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGSRCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSS

Query:  KWLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRR-PLEDPEELNQLIKSAFLKFVK
        K  LPWVVNH SPRSK KRR+LKKRLG  L DP SN LVRERE KRNKRLRKTN +H C DVQTGDYL EKVQSPT R PLEDPEELNQLIKSAFLKFVK
Subjt:  KWLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDYL-EKVQSPTRR-PLEDPEELNQLIKSAFLKFVK

Query:  VLSENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVL
        VLSENPARRK  TEPG GII CIVCGSKSKEF +ALSLSQHAS++LEGSRAEHLGLHKALCWLMGWSSE APN LWVRRILPL EVLALKEDLIIWPPVL
Subjt:  VLSENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVL

Query:  IIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAA
        IIHNSSIAID LS+ VAISCEELE VIRGMGCG KIK+VRG+P + SIM+ TFGAMFSGLQEAERLHKSFADKSHGRDE HKIN  HL DS+ D LHKA 
Subjt:  IIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAA

Query:  GANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC
        GANTL+S+LYGYLGLAEDL KLDFETKKRSV+KSKKEIQAIVNASL C
Subjt:  GANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC

A0A6J0ZXA5 uncharacterized protein LOC1104129791.1e-7150.91Show/hide
Query:  RRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDY-LEKVQSPTRRPLEDPEELNQLIKSAFLKFVKVLSENPARRKTFTEPG-SG
        R+ +K+RLG        N + R +  K  K L K N N     VQ  D  L  V+     P ED +E  Q I+ AF+++VK+L+ENPA+R+ +TE G +G
Subjt:  RRDLKKRLGFSLRDPTSNPLVREREGKRNKRLRKTNANHRCPDVQTGDY-LEKVQSPTRRPLEDPEELNQLIKSAFLKFVKVLSENPARRKTFTEPG-SG

Query:  IIKCIVCGSKSKEFANALSLSQHASQS-LEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVA
         +KC VCGSKS+EF N LSL  HA  S + G R  HLGLHKALC+LMGW+S AA N LW ++ LP VE LA+KEDL+IWPPV+I+HNSSIA  N   R+ 
Subjt:  IIKCIVCGSKSKEFANALSLSQHASQS-LEGSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVA

Query:  ISCEELEVVIRGMGCGAKI-KMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLA
        +S EE+E  +R MG G  I K+ RGKP + SIM   F   FSGL+EAERLHK +A+  HGR EF +IN S   ++ K  L K      ++ +LYGYLG+A
Subjt:  ISCEELEVVIRGMGCGAKI-KMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLA

Query:  EDLDKLDFETKKRSVLKSKKEIQAIVNASL
         DLDKLDFETK R+++KSKKEI A  +A L
Subjt:  EDLDKLDFETKKRSVLKSKKEIQAIVNASL

A0A6J1CGJ5 uncharacterized protein LOC1110110322.2e-7280.54Show/hide
Query:  MGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEA
        MGWSSE APN LWV+RILP VE  ALKEDLIIWPPVLIIHNSSIA DN SE+V ISCEELEVVIRGMG G KIK+VRGKP + SIM+ TF AMFSGLQEA
Subjt:  MGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKP-DLSIMIATFGAMFSGLQEA

Query:  ERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC
        ERLHK+FADKSHGRDEFH+INSSH  DSH D LHK AGAN ++S+LYGYLGLAED +KLDFETKKRSV+KSKKEIQAIV+A+L C
Subjt:  ERLHKSFADKSHGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)6.1e-3034.25Show/hide
Query:  IKSAFLKFVKVLSENPARRKTFTEPG-SGIIKCIVCGSKSKEFANALSLSQHASQSLE-GSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLA
        +K +FL FVK + E+P  +K + E G  G ++C+VCG  SK+  +  SL  H   S +  SR  HLGLHKALC LMGW+   AP++    + LP  E   
Subjt:  IKSAFLKFVKVLSENPARRKTFTEPG-SGIIKCIVCGSKSKEFANALSLSQHASQSLE-GSRAEHLGLHKALCWLMGWSSEAAPNDLWVRRILPLVEVLA

Query:  LKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMG-CGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKI---N
         +  LIIWPP +I+ N+S              + ++  IR +G  G K K + G+   L I +  F    SGL++A R+ + F   + GR  + ++    
Subjt:  LKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMG-CGAKIKMVRGKP-DLSIMIATFGAMFSGLQEAERLHKSFADKSHGRDEFHKI---N

Query:  SSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEI
         S  ++ +  L+           I YGYL    DLDK+D ETKK++ ++S +E+
Subjt:  SSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTGTAGAGAAACGAGTGGGGATAAGAGGTCTCAGTCTCCATCGTTTTTTGGACGGAGAACTTCGGAACCTCGGGTTGAAGAAAATCCACATTGTCATTCGCACTC
GCTTTCCCGTTCTTCACGGGAACGACCGGTGACGAATGGCCTTACGGGTTCTTCTATCAGAGACCATTACAATGGAAGTCGTCTTTATGAAAATGAAGACGAACATTTTC
GTAAACTCTCTCAGTTTTGCGAAAATTTACAACAAGAATCGCCGTCGAAAAAGTTTCTGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAAATCGAAATCGAGT
ATAGGGTTGAAACATGGAAGTATGTGTGATGGTCGTAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGAATATTTTAGATGGTAATAATTTGCA
GACATTCTACATGACCATTGGGGCAACTAAAGATAGTATCGTCAAGAACAATGCGGATACTTCCAGAAGCTTTGAATTCGATGACTGTAGCCATTTGTCTTCATTTAGAA
AGTTTGATGGGCCCTTGTATGAGACCAATGATGTTCATGTTCGCGACCGTTCGAGCTTTGAATCAGCAGAAAATTCCCATAGAGGAAGACGAAACGGAACTTCTTCACAT
GGGATACAAGTGTCCCATCTTCACTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTCGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGGAGGAACCA
TATTGAGCACTTAGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGAAGTGACATTGATGCTGCGCTAAACGGTCCTTTCTCTCAGCAAATGGTTCGTAATC
CACAAGATGATTTCTATCAAGATTCTACGCGGACCAGTGTTGTAATGGATTCTGTTGTTTCAGGATTCGAAGACAATGAAACCCATTTGGAAGAGACCACCCGGCCAAGA
GACCTTTATGATCTTTTCAAACAACCATTCATCATTGAAGGTTCTTATATGGGCACTGCTCCTTTTGCGATGGAACAGTATGGCGAAGTTTTGGGTTCTGGAACTGGAAG
TCCGCGGAAGGGTGAAAGAGAAGTATATAGAAGTAGCGAGAAATTACTCTTGGCTGAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCACATGAGGATGATGGAT
TAAATGGATTATTAGTATCAAAACATAAACAAGATTTGAGCGACATGAAAGACAGTAGAAAGCTGAGATGGAAAGCCACAGATTCAACAAAGCCGAGGGTTGAAGGATCA
AGATGCAGAATGCATGATCCTGGGTCTGGTCCATCTAGAAAACCAAATGTGTTTAGCAGAATCCAGTTTTTAAGTCATGGAGATGAAAACATTGTTGTTAAAGATATTGA
CATCAATTTGAACTGTAGAAACAAGTGGTGGAATGACGAGGATACCTCTATATCCTTGACATCCTCTAAATGGTTGTTGCCTTGGGTAGTAAACCATGCCTCTCCGCGTT
CAAAGCCTAAACGTAGAGACCTAAAGAAACGTTTGGGTTTCTCCTTAAGGGATCCCACTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAAGAAACAAGCGTCTGAGG
AAGACAAATGCCAATCATAGGTGCCCTGATGTTCAAACAGGTGACTACTTGGAGAAGGTCCAAAGTCCAACCAGAAGGCCACTTGAAGATCCAGAGGAGTTGAACCAGCT
AATAAAGAGTGCCTTTCTAAAGTTTGTCAAAGTTCTGAGTGAGAATCCAGCCAGACGAAAGACGTTCACAGAGCCAGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCA
GCAAGTCCAAGGAGTTTGCAAATGCACTAAGTTTATCACAACATGCCTCTCAGTCGCTAGAAGGATCTAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTC
ATGGGATGGAGTAGTGAAGCAGCGCCAAACGATCTATGGGTTCGAAGGATATTGCCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCT
TATCATTCATAACAGTTCTATTGCAATTGATAACCTGTCTGAACGGGTAGCCATAAGTTGTGAGGAGCTTGAGGTTGTTATTAGAGGAATGGGTTGTGGAGCGAAGATCA
AAATGGTACGTGGTAAACCTGATTTGAGTATCATGATAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGCTTTGCAGATAAGAGT
CATGGTAGGGACGAGTTCCATAAAATCAATTCGAGTCATCTCAACGACAGCCACAAGGATCTGTTGCATAAAGCAGCAGGAGCTAACACATTGGATAGCATACTGTATGG
TTACCTAGGCCTTGCAGAGGACTTGGATAAACTCGACTTCGAGACCAAGAAGAGATCTGTGCTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCATCTCTTCACT
GTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTGTAGAGAAACGAGTGGGGATAAGAGGTCTCAGTCTCCATCGTTTTTTGGACGGAGAACTTCGGAACCTCGGGTTGAAGAAAATCCACATTGTCATTCGCACTC
GCTTTCCCGTTCTTCACGGGAACGACCGGTGACGAATGGCCTTACGGGTTCTTCTATCAGAGACCATTACAATGGAAGTCGTCTTTATGAAAATGAAGACGAACATTTTC
GTAAACTCTCTCAGTTTTGCGAAAATTTACAACAAGAATCGCCGTCGAAAAAGTTTCTGTGGGAAAATTTGTTTGCCAATAATCCCGCCAATGCGAAATCGAAATCGAGT
ATAGGGTTGAAACATGGAAGTATGTGTGATGGTCGTAATCGAGGAATTAGGGTTTCTGGTTCACATTTGGGTACGTCGTCCAAGAATATTTTAGATGGTAATAATTTGCA
GACATTCTACATGACCATTGGGGCAACTAAAGATAGTATCGTCAAGAACAATGCGGATACTTCCAGAAGCTTTGAATTCGATGACTGTAGCCATTTGTCTTCATTTAGAA
AGTTTGATGGGCCCTTGTATGAGACCAATGATGTTCATGTTCGCGACCGTTCGAGCTTTGAATCAGCAGAAAATTCCCATAGAGGAAGACGAAACGGAACTTCTTCACAT
GGGATACAAGTGTCCCATCTTCACTCCAGTGCACCTGTTACTGAATCTAAGAGCATTTCGCAAGATGAATTTCATGATTTACTGGAGTATAAACGAGCTCGGAGGAACCA
TATTGAGCACTTAGACGATAGCAATCAGTATTTCTCAGTTCAGCCATGTAAGAGAAGTGACATTGATGCTGCGCTAAACGGTCCTTTCTCTCAGCAAATGGTTCGTAATC
CACAAGATGATTTCTATCAAGATTCTACGCGGACCAGTGTTGTAATGGATTCTGTTGTTTCAGGATTCGAAGACAATGAAACCCATTTGGAAGAGACCACCCGGCCAAGA
GACCTTTATGATCTTTTCAAACAACCATTCATCATTGAAGGTTCTTATATGGGCACTGCTCCTTTTGCGATGGAACAGTATGGCGAAGTTTTGGGTTCTGGAACTGGAAG
TCCGCGGAAGGGTGAAAGAGAAGTATATAGAAGTAGCGAGAAATTACTCTTGGCTGAAGAAGATGGTTATAGGACAAATTATGGGAAATGGTCACATGAGGATGATGGAT
TAAATGGATTATTAGTATCAAAACATAAACAAGATTTGAGCGACATGAAAGACAGTAGAAAGCTGAGATGGAAAGCCACAGATTCAACAAAGCCGAGGGTTGAAGGATCA
AGATGCAGAATGCATGATCCTGGGTCTGGTCCATCTAGAAAACCAAATGTGTTTAGCAGAATCCAGTTTTTAAGTCATGGAGATGAAAACATTGTTGTTAAAGATATTGA
CATCAATTTGAACTGTAGAAACAAGTGGTGGAATGACGAGGATACCTCTATATCCTTGACATCCTCTAAATGGTTGTTGCCTTGGGTAGTAAACCATGCCTCTCCGCGTT
CAAAGCCTAAACGTAGAGACCTAAAGAAACGTTTGGGTTTCTCCTTAAGGGATCCCACTTCAAACCCTTTAGTAAGAGAACGAGAAGGTAAAAGAAACAAGCGTCTGAGG
AAGACAAATGCCAATCATAGGTGCCCTGATGTTCAAACAGGTGACTACTTGGAGAAGGTCCAAAGTCCAACCAGAAGGCCACTTGAAGATCCAGAGGAGTTGAACCAGCT
AATAAAGAGTGCCTTTCTAAAGTTTGTCAAAGTTCTGAGTGAGAATCCAGCCAGACGAAAGACGTTCACAGAGCCAGGGTCTGGTATTATAAAGTGCATTGTCTGTGGCA
GCAAGTCCAAGGAGTTTGCAAATGCACTAAGTTTATCACAACATGCCTCTCAGTCGCTAGAAGGATCTAGGGCAGAACACTTGGGTCTTCACAAAGCACTTTGTTGGCTC
ATGGGATGGAGTAGTGAAGCAGCGCCAAACGATCTATGGGTTCGAAGGATATTGCCTCTTGTAGAAGTACTTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCT
TATCATTCATAACAGTTCTATTGCAATTGATAACCTGTCTGAACGGGTAGCCATAAGTTGTGAGGAGCTTGAGGTTGTTATTAGAGGAATGGGTTGTGGAGCGAAGATCA
AAATGGTACGTGGTAAACCTGATTTGAGTATCATGATAGCAACTTTTGGTGCAATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGCTTTGCAGATAAGAGT
CATGGTAGGGACGAGTTCCATAAAATCAATTCGAGTCATCTCAACGACAGCCACAAGGATCTGTTGCATAAAGCAGCAGGAGCTAACACATTGGATAGCATACTGTATGG
TTACCTAGGCCTTGCAGAGGACTTGGATAAACTCGACTTCGAGACCAAGAAGAGATCTGTGCTGAAAAGCAAGAAAGAAATCCAAGCCATTGTGAATGCATCTCTTCACT
GTTAG
Protein sequenceShow/hide protein sequence
MNCRETSGDKRSQSPSFFGRRTSEPRVEENPHCHSHSLSRSSRERPVTNGLTGSSIRDHYNGSRLYENEDEHFRKLSQFCENLQQESPSKKFLWENLFANNPANAKSKSS
IGLKHGSMCDGRNRGIRVSGSHLGTSSKNILDGNNLQTFYMTIGATKDSIVKNNADTSRSFEFDDCSHLSSFRKFDGPLYETNDVHVRDRSSFESAENSHRGRRNGTSSH
GIQVSHLHSSAPVTESKSISQDEFHDLLEYKRARRNHIEHLDDSNQYFSVQPCKRSDIDAALNGPFSQQMVRNPQDDFYQDSTRTSVVMDSVVSGFEDNETHLEETTRPR
DLYDLFKQPFIIEGSYMGTAPFAMEQYGEVLGSGTGSPRKGEREVYRSSEKLLLAEEDGYRTNYGKWSHEDDGLNGLLVSKHKQDLSDMKDSRKLRWKATDSTKPRVEGS
RCRMHDPGSGPSRKPNVFSRIQFLSHGDENIVVKDIDINLNCRNKWWNDEDTSISLTSSKWLLPWVVNHASPRSKPKRRDLKKRLGFSLRDPTSNPLVREREGKRNKRLR
KTNANHRCPDVQTGDYLEKVQSPTRRPLEDPEELNQLIKSAFLKFVKVLSENPARRKTFTEPGSGIIKCIVCGSKSKEFANALSLSQHASQSLEGSRAEHLGLHKALCWL
MGWSSEAAPNDLWVRRILPLVEVLALKEDLIIWPPVLIIHNSSIAIDNLSERVAISCEELEVVIRGMGCGAKIKMVRGKPDLSIMIATFGAMFSGLQEAERLHKSFADKS
HGRDEFHKINSSHLNDSHKDLLHKAAGANTLDSILYGYLGLAEDLDKLDFETKKRSVLKSKKEIQAIVNASLHC