; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005152 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005152
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionXS domain-containing protein
Genome locationscaffold176:3071893..3076524
RNA-Seq ExpressionMS005152
SyntenyMS005152
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo]2.7e-29066.59Show/hide
Query:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE
        M+ RE ++D   RS+SPSL  RR SEPRVEE  HC+SHWFS S+RE P+ N   LPG S+RDH+N +RLY ++DEHFRKLSQFCE+L+  ESPAKKF WE
Subjt:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE

Query:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS-EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDV
        +LF  N  AN +SK+S+GLKHVNG DGDN+G+RV GSHL   S S    +LRTFH NI AT DSNV  +G+ SRS G+NDC+HLSSSRK+DGP+++  +V
Subjt:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS-EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDV

Query:  YVQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI
        +V+D   +E   N  SHRG++  TSS G Q SH HSSA V ESKGISQ EFH  LEYKRAR  HIEHFDD N+YF  QPCKR+DI A  +   SQ MVRI
Subjt:  YVQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI

Query:  PQDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGY
        PQDDFY+DSTRTSV++D VVEGF+DTES+     E  RP D+  F      IEGS     PFAMEQ  EVLGSGT S    E+EAY  SEKLLL  EDGY
Subjt:  PQDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGY

Query:  NTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDS--KRNVFSRIHFLGNGDEKSTVKHIDINLNRRN
         TN+GKW+ +DG+NGS VS+++QDLG  +MED RKL WKA HSTK RV+G     +R  MH  G  S  K NVFSRI FL +GD K T    D NLN RN
Subjt:  NTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDS--KRNVFSRIHFLGNGDEKSTVKHIDINLNRRN

Query:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLV--RERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPP-EDPE
            DEDTS    SSKR LPW++N  S R K KR++LKKRLG+ L DP+ N LV  RERKRNKRL  TN+ H CLD Q  D  E+K Q  TSRPP EDPE
Subjt:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLV--RERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPP-EDPE

Query:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE
        ELNQLIKSAF KF+KVL+EN ARRKK TEPG GII CIVCGSKSKEF DALSLSQHA  +L GSRAEHLGLHKALCWLMGWSSE APNGLWV+RILP  E
Subjt:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE

Query:  AFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINS
          ALKEDLIIWPPVLIIHNSSIA D  S+ V ISCEELE VIR     GKIKVVRG+P NQSIMVVTFGAMFSGLQEAERLHK+FADK HGRDE H+IN 
Subjt:  AFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINS

Query:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         H IDS+ DLHKA GAN +ESVLYGYLGLAED  KLDFETKKRSVVKSKKEIQAIV+A+LQC
Subjt:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus]1.2e-8376.02Show/hide
Query:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVV
        SKSKEF DALSL QHA  +L GSRAEHLGLHKALCWLMGWSSE+APNGLWV+ ILP VE  ALKEDLIIWP VLIIHNSSIA D   E V ISCE+LE  
Subjt:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVV

Query:  IR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK
        +R     GK KVVRGK  NQSIMVVTFGAMF GLQEAERLH NFADK HGRDEFH+IN    +DS+ D+HKA GAN +ESV YGYLGL ED +KLDFETK
Subjt:  IR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK

Query:  KRSVVKSKKEIQAIVDATLQC
        KRSVV+SKKEIQAIV A+LQC
Subjt:  KRSVVKSKKEIQAIVDATLQC

XP_017982234.1 PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao]1.3e-8252.68Show/hide
Query:  GSQRLKSKRKDLKKRLG--VSLRDPSLNPLVRERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPPEDPEELNQLIKSAFFKFIKVLNENLARRKK
        G     S RK +K+RLG    + +P+  P V ER + ++L+  N++      QA D      +   + PPED EE  Q I  AF KF+K+LNEN A+R+K
Subjt:  GSQRLKSKRKDLKKRLG--VSLRDPSLNPLVRERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPPEDPEELNQLIKSAFFKFIKVLNENLARRKK

Query:  FTEPG-SGIIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIAT
        + E G +G +KC VCGSKS+EF + LSL  HAF S +VG RA HLGLHK+LC+LMGW+S  A NGLW Q+ LP VEA A+KEDL+IWPP++I+HNSSIAT
Subjt:  FTEPG-SGIIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIAT

Query:  DNTSERVTISCEELEVVIR------GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLY
         N+  R+ +S EE+E  +R      G  KV RGKPANQSIM V F   FSGL+EAERLHK +A+  HGR EF +IN      S G+  KA  +K++ VLY
Subjt:  DNTSERVTISCEELEVVIR------GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLY

Query:  GYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATL
        GYLG+A D +KLDFETK R++VKSKKEI A  DA L
Subjt:  GYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATL

XP_022140332.1 uncharacterized protein LOC111011032 [Momordica charantia]1.9e-8995.63Show/hide
Query:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEA
        MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSE+VTISCEELEVVIR     GKIKVVRGKPANQSIMVVTF AMFSGLQEA
Subjt:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEA

Query:  ERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
        ERLHKNFADK HGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
Subjt:  ERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida]0.0e+0069.3Show/hide
Query:  MSWRERSKDDRSRSRSPSLRRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWES
        M++RE S D RS+S S S  RR SEPRVEEN HCHS WFS S+RE PV NG  L G S+RDH+N +RLYEN DEHFRKLSQ CE+L+ RESP+KKF WE+
Subjt:  MSWRERSKDDRSRSRSPSLRRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWES

Query:  LFAKNPANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS--EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDVY
        LFA NPANA+SKSS+GLKH N CDG N+G+RV GSHL   S++    ++LRTFH NI  T DSNV  +G+ SRSFG++DCSHLSSSRKFDGP+YET+DV+
Subjt:  LFAKNPANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS--EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDVY

Query:  VQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRIP
        V+D   +ESA N  SHRG++   SSHG Q S+  SSA VTESKGISQDEFH FLEYKRAR  +IE FDD N+YF  QP KRSDI A LNS+ SQQMVRIP
Subjt:  VQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRIP

Query:  QDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGYN
        QDDFYQDSTRTSV++D VVEGF+DTES++    E  RP D Y  FKEP +IEGSY G  PF ME   E LGSG  S +K E+EAY  SEKLLLA EDGY 
Subjt:  QDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGYN

Query:  TNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSKR--NVFSRIHFLGNGDEKSTVKHIDINLNRRNE
        T YGKW  +DG+NGSLVS+++QDL   +ME SRKLRWKA++STK RV+G     +RC MH  GS S R  NVFSRI FL +GDE   VK  DINLN R++
Subjt:  TNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDSKR--NVFSRIHFLGNGDEKSTVKHIDINLNRRNE

Query:  LWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRERKR--NKRLINTNISHECLDFQASDCFEDKTQISTSRPPEDPEEL
         WN+EDTS+ LTSSKR LPW+IN  S   K KR+DL+KRLG  LRDPS +PLVR+RKR  NKRL   N++H CLD Q  D  E+K Q  TSR  ED EEL
Subjt:  LWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRERKR--NKRLINTNISHECLDFQASDCFEDKTQISTSRPPEDPEEL

Query:  NQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAF
        NQLIKSAF KF+KVL+EN ARRKKFTEPG GIIKCIVCGSKSKEFADALSLSQHA  +L GSRAEHLGL KALCWLMGWSSE AP+G WV+RILP  E  
Subjt:  NQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAF

Query:  ALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSH
        ALKEDLIIWPPVLIIHNSSIA D+ SERV ISCEELEVVIR     GKIKVVRGKP NQSIM+VTF AMFSGLQEAERLHK+FADK HGRDEF +I SSH
Subjt:  ALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSH

Query:  RIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         IDSH DLHKA GAN +++VLYGYLGL ED +KLDFETKKRSVVKSKKEIQAIV+A+L C
Subjt:  RIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

TrEMBL top hitse value%identityAlignment
A0A0A0KGN5 XS domain-containing protein5.7e-8476.02Show/hide
Query:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVV
        SKSKEF DALSL QHA  +L GSRAEHLGLHKALCWLMGWSSE+APNGLWV+ ILP VE  ALKEDLIIWP VLIIHNSSIA D   E V ISCE+LE  
Subjt:  SKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVV

Query:  IR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK
        +R     GK KVVRGK  NQSIMVVTFGAMF GLQEAERLH NFADK HGRDEFH+IN    +DS+ D+HKA GAN +ESV YGYLGL ED +KLDFETK
Subjt:  IR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETK

Query:  KRSVVKSKKEIQAIVDATLQC
        KRSVV+SKKEIQAIV A+LQC
Subjt:  KRSVVKSKKEIQAIVDATLQC

A0A1S3C894 uncharacterized protein LOC1034979641.3e-29066.59Show/hide
Query:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE
        M+ RE ++D   RS+SPSL  RR SEPRVEE  HC+SHWFS S+RE P+ N   LPG S+RDH+N +RLY ++DEHFRKLSQFCE+L+  ESPAKKF WE
Subjt:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE

Query:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS-EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDV
        +LF  N  AN +SK+S+GLKHVNG DGDN+G+RV GSHL   S S    +LRTFH NI AT DSNV  +G+ SRS G+NDC+HLSSSRK+DGP+++  +V
Subjt:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS-EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDV

Query:  YVQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI
        +V+D   +E   N  SHRG++  TSS G Q SH HSSA V ESKGISQ EFH  LEYKRAR  HIEHFDD N+YF  QPCKR+DI A  +   SQ MVRI
Subjt:  YVQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI

Query:  PQDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGY
        PQDDFY+DSTRTSV++D VVEGF+DTES+     E  RP D+  F      IEGS     PFAMEQ  EVLGSGT S    E+EAY  SEKLLL  EDGY
Subjt:  PQDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGY

Query:  NTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDS--KRNVFSRIHFLGNGDEKSTVKHIDINLNRRN
         TN+GKW+ +DG+NGS VS+++QDLG  +MED RKL WKA HSTK RV+G     +R  MH  G  S  K NVFSRI FL +GD K T    D NLN RN
Subjt:  NTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDS--KRNVFSRIHFLGNGDEKSTVKHIDINLNRRN

Query:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLV--RERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPP-EDPE
            DEDTS    SSKR LPW++N  S R K KR++LKKRLG+ L DP+ N LV  RERKRNKRL  TN+ H CLD Q  D  E+K Q  TSRPP EDPE
Subjt:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLV--RERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPP-EDPE

Query:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE
        ELNQLIKSAF KF+KVL+EN ARRKK TEPG GII CIVCGSKSKEF DALSLSQHA  +L GSRAEHLGLHKALCWLMGWSSE APNGLWV+RILP  E
Subjt:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE

Query:  AFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINS
          ALKEDLIIWPPVLIIHNSSIA D  S+ V ISCEELE VIR     GKIKVVRG+P NQSIMVVTFGAMFSGLQEAERLHK+FADK HGRDE H+IN 
Subjt:  AFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINS

Query:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         H IDS+ DLHKA GAN +ESVLYGYLGLAED  KLDFETKKRSVVKSKKEIQAIV+A+LQC
Subjt:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

A0A218WNW1 uncharacterized protein LOC1162002594.1e-8239.33Show/hide
Query:  IIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAEDGYNTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHS---TKRRV
        +++  +  +G   ME  DE  G GT  ++   +E        ++  DG+    G W   D      +S +E+ L   E     +L  +   S   +++ V
Subjt:  IIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAEDGYNTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHS---TKRRV

Query:  KGKC-FVSSRCGMHYRGSDSKRNVFSRIHF----------------LGNGDEKSTVKHIDINLNRRNELWND----EDTSMSLTSSKRLLPWIINRGSQR
        K KC F   R G ++ G   +    S +++                L NG  + T    D + N+R  +  D    +DTS ++  + R          + 
Subjt:  KGKC-FVSSRCGMHYRGSDSKRNVFSRIHF----------------LGNGDEKSTVKHIDINLNRRNELWND----EDTSMSLTSSKRLLPWIINRGSQR

Query:  LKSKRKDLKKRLGVSLRDPSLNPLVRERKRNKRLINTNISH-----ECLDFQASDCFEDKTQISTSR--PPEDPEELNQLIKSAFFKFIKVLNENLARRK
        +    +D+KKRLG    +    P V+      +L N  ISH     + L+  A D       + T++  PPE+ EE  QL+  AF KF+K+LNEN A+R+
Subjt:  LKSKRKDLKKRLGVSLRDPSLNPLVRERKRNKRLINTNISH-----ECLDFQASDCFEDKTQISTSR--PPEDPEELNQLIKSAFFKFIKVLNENLARRK

Query:  KFTEPG-SGIIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIA
        K+ + G SGI+KC +CG  SKEF D LSL+ HAF++   G RAEHLGLHKALC LMGW+S + PNG WV+++LP  EA AL++DLIIWPP++IIHNSSIA
Subjt:  KFTEPG-SGIIKCIVCGSKSKEFADALSLSQHAFNS-LVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIA

Query:  TDNTSERVTISCEELEVVIR------GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVL
             +RV +  EE++ +++       K KV+RGKPANQSIMV +F   FSGL EAE LHK+F +K  GR EF +I S  R   HG   KA  +K+ SVL
Subjt:  TDNTSERVTISCEELEVVIR------GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVL

Query:  YGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
        YGYLG+AED +KL+FE KKR VV+S++EIQ IVD  L+C
Subjt:  YGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

A0A5A7SQC0 XS domain-containing protein1.3e-29066.59Show/hide
Query:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE
        M+ RE ++D   RS+SPSL  RR SEPRVEE  HC+SHWFS S+RE P+ N   LPG S+RDH+N +RLY ++DEHFRKLSQFCE+L+  ESPAKKF WE
Subjt:  MSWRERSKDDRSRSRSPSL-RRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWE

Query:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS-EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDV
        +LF  N  AN +SK+S+GLKHVNG DGDN+G+RV GSHL   S S    +LRTFH NI AT DSNV  +G+ SRS G+NDC+HLSSSRK+DGP+++  +V
Subjt:  SLFAKNP-ANASSKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSS-EANDLRTFHTNIRATNDSNVM-DGNASRSFGVNDCSHLSSSRKFDGPVYETTDV

Query:  YVQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI
        +V+D   +E   N  SHRG++  TSS G Q SH HSSA V ESKGISQ EFH  LEYKRAR  HIEHFDD N+YF  QPCKR+DI A  +   SQ MVRI
Subjt:  YVQDHSPYESARNSHSHRGKQKGTSSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRI

Query:  PQDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGY
        PQDDFY+DSTRTSV++D VVEGF+DTES+     E  RP D+  F      IEGS     PFAMEQ  EVLGSGT S    E+EAY  SEKLLL  EDGY
Subjt:  PQDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDMEENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLA-EDGY

Query:  NTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDS--KRNVFSRIHFLGNGDEKSTVKHIDINLNRRN
         TN+GKW+ +DG+NGS VS+++QDLG  +MED RKL WKA HSTK RV+G     +R  MH  G  S  K NVFSRI FL +GD K T    D NLN RN
Subjt:  NTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTKRRVKGKCFVSSRCGMHYRGSDS--KRNVFSRIHFLGNGDEKSTVKHIDINLNRRN

Query:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLV--RERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPP-EDPE
            DEDTS    SSKR LPW++N  S R K KR++LKKRLG+ L DP+ N LV  RERKRNKRL  TN+ H CLD Q  D  E+K Q  TSRPP EDPE
Subjt:  ELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLV--RERKRNKRLINTNISHECLDFQASDCFEDKTQISTSRPP-EDPE

Query:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE
        ELNQLIKSAF KF+KVL+EN ARRKK TEPG GII CIVCGSKSKEF DALSLSQHA  +L GSRAEHLGLHKALCWLMGWSSE APNGLWV+RILP  E
Subjt:  ELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVE

Query:  AFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINS
          ALKEDLIIWPPVLIIHNSSIA D  S+ V ISCEELE VIR     GKIKVVRG+P NQSIMVVTFGAMFSGLQEAERLHK+FADK HGRDE H+IN 
Subjt:  AFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEINS

Query:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
         H IDS+ DLHKA GAN +ESVLYGYLGLAED  KLDFETKKRSVVKSKKEIQAIV+A+LQC
Subjt:  SHRIDSHGDLHKA-GANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

A0A6J1CGJ5 uncharacterized protein LOC1110110329.0e-9095.63Show/hide
Query:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEA
        MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSE+VTISCEELEVVIR     GKIKVVRGKPANQSIMVVTF AMFSGLQEA
Subjt:  MGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR-----GKIKVVRGKPANQSIMVVTFGAMFSGLQEA

Query:  ERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
        ERLHKNFADK HGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC
Subjt:  ERLHKNFADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)4.4e-2829.68Show/hide
Query:  LVRERKRNKRLINTNISHECLDFQASDCFED-----KTQISTSRPPEDPEELNQL-IKSAFFKFIKVLNENLARRKKFTEPG-SGIIKCIVCGSKSKEFA
        ++R+R++  +  N N  H  +   + D  ED       ++ +SR      +++Q+ +K +F  F+K + E+   +K + E G  G ++C+VCG  SK+  
Subjt:  LVRERKRNKRLINTNISHECLDFQASDCFED-----KTQISTSRPPEDPEELNQL-IKSAFFKFIKVLNENLARRKKFTEPG-SGIIKCIVCGSKSKEFA

Query:  DALSLSQHAF-NSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR----
        D  SL  H + +    SR  HLGLHKALC LMGW+   AP+     + LP  EA   +  LIIWPP +I+ N+S              + ++  IR    
Subjt:  DALSLSQHAF-NSLVGSRAEHLGLHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIR----

Query:  --GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEIN----SSHRIDSHGDLHKAG-ANKMESVLYGYLGLAEDFEKLDFETKK
          GK K + G+  +  I +  F    SGL++A R+ + F     GR  +  +     S     + G +   G   + + + YGYL    D +K+D ETKK
Subjt:  --GKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNFADKIHGRDEFHEIN----SSHRIDSHGDLHKAG-ANKMESVLYGYLGLAEDFEKLDFETKK

Query:  RSVVKSKKEI
        ++ ++S +E+
Subjt:  RSVVKSKKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTGGAGAGAAAGGAGTAAAGATGATAGGTCTCGGTCTCGGTCTCCGTCGCTTCGACGGAGAAATTCAGAACCTCGGGTTGAGGAAAACCGGCACTGTCATTCTCA
CTGGTTTTCGGGCTCTGCACGAGAAGGACCGGTGATGAATGGCCCTGCGCTTCCGGGTTATTCTGTGAGAGACCATTTTAATGAAACTCGTCTTTATGAGAATAGAGACG
AACATTTTCGTAAACTCTCTCAGTTTTGCGAGAGTTTGGAGCGGAGGGAATCGCCGGCGAAAAAGTTTGGGTGGGAAAGTTTGTTCGCCAAAAATCCCGCCAATGCGAGT
TCGAAATCGAGTTTGGGATTGAAACATGTAAACGGATGTGATGGTGATAATCAAGGACTTAGGGTTTACGGTTCTCATTTGATTCCGGAATCGTCGTCAGAAGCTAATGA
TTTACGCACATTCCATACGAACATTAGGGCAACTAATGATAGTAATGTAATGGATGGGAATGCTTCTAGAAGTTTTGGAGTCAATGACTGTAGTCATTTGTCTTCATCTA
GAAAGTTTGATGGGCCCGTATACGAGACCACTGATGTTTATGTTCAGGACCATTCACCGTATGAATCAGCAAGAAATTCCCACTCCCACAGAGGAAAACAAAAGGGAACT
TCCTCACATGGGACACAAGGGTCACATCCGCACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGGTTTCTGGAGTATAAACGTGCTCG
TGGGGAACATATCGAGCACTTCGACGATTGCAATAAGTATTTTAAAGCTCAACCATGCAAGAGGAGTGACATCGGTGCTGCTCTCAACAGTTCTTTGTCTCAGCAGATGG
TCCGTATCCCACAAGACGATTTCTATCAAGATTCTACTCGGACCAGTGTTATAGTGGATCCAGTTGTCGAGGGATTTGAAGACACTGAAAGCTATGTCATGGGTGATATG
GAAGAGAACCGGCCAAGCGACAACTATGGTTTTTTCAAAGAACCACACATCATTGAAGGTTCTTATAGGGGAAACGGTCCTTTTGCCATGGAACAGGATGATGAAGTTTT
GGGTTCTGGAACCGGGAGTCTGCTGAAGTGTGAAAAAGAAGCATATACAGGCAGTGAGAAGTTGCTCTTGGCAGAAGATGGTTATAATACAAATTATGGGAAATGGTCGG
GTGATGATGGATTAAATGGATCCTTAGTTTCAAGAAATGAACAAGATTTGGGTGGCATGGAAATGGAAGACAGTAGGAAGCTGAGATGGAAAGCCTCGCATTCGACAAAA
CGAAGGGTCAAGGGGAAATGCTTTGTATCTTCAAGATGCGGAATGCATTATCGTGGGTCCGATTCAAAACGTAATGTGTTTAGCAGAATCCATTTTTTAGGTAATGGAGA
TGAAAAGAGTACTGTTAAACACATTGATATCAATTTAAACCGTAGAAACGAGTTGTGGAATGATGAGGATACTTCCATGTCCTTAACCTCCTCCAAACGGCTGTTGCCTT
GGATAATAAACCGTGGCTCTCAGCGTCTGAAGTCTAAACGCAAAGACCTTAAGAAACGTTTGGGTGTCTCCTTGAGGGATCCCAGTTTAAATCCTCTAGTTAGAGAACGT
AAAAGAAATAAGCGTCTGATAAACACAAATATCAGTCATGAGTGCCTTGATTTTCAAGCAAGTGATTGCTTTGAAGACAAGACGCAAATTTCAACCAGTAGGCCACCTGA
AGATCCTGAGGAGTTGAACCAGCTAATAAAGAGTGCCTTTTTCAAGTTTATCAAAGTTCTGAATGAGAACCTAGCCCGACGAAAGAAGTTCACAGAGCCAGGGTCTGGTA
TTATAAAGTGCATTGTCTGCGGCAGCAAGTCCAAGGAGTTTGCGGATGCACTAAGCTTATCACAACATGCCTTCAATTCGCTGGTAGGATCGAGGGCGGAACACTTGGGT
CTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGTAGCGCCAAATGGTCTATGGGTTCAAAGGATATTGCCCCATGTAGAAGCCTTTGCTTTGAAGGAGGA
TCTCATTATATGGCCTCCTGTTCTTATCATTCATAACAGTTCTATTGCAACTGATAATACGTCTGAACGGGTAACCATAAGTTGTGAAGAGCTCGAGGTTGTTATTAGAG
GGAAGATCAAAGTGGTTCGTGGTAAACCTGCAAATCAGAGCATTATGGTAGTAACTTTCGGTGCAATGTTTTCTGGATTGCAAGAAGCAGAAAGACTACACAAAAACTTT
GCCGATAAAATTCATGGGAGGGATGAGTTCCATGAAATCAATTCGAGTCATCGCATCGACAGCCATGGGGATTTGCATAAAGCAGGAGCAAACAAGATGGAAAGCGTTCT
TTATGGCTACTTAGGCCTCGCAGAGGACTTCGAAAAACTTGACTTTGAGACCAAGAAGAGGTCCGTGGTGAAAAGCAAGAAAGAAATCCAGGCCATTGTGGATGCAACTC
TTCAATGT
mRNA sequenceShow/hide mRNA sequence
ATGAGCTGGAGAGAAAGGAGTAAAGATGATAGGTCTCGGTCTCGGTCTCCGTCGCTTCGACGGAGAAATTCAGAACCTCGGGTTGAGGAAAACCGGCACTGTCATTCTCA
CTGGTTTTCGGGCTCTGCACGAGAAGGACCGGTGATGAATGGCCCTGCGCTTCCGGGTTATTCTGTGAGAGACCATTTTAATGAAACTCGTCTTTATGAGAATAGAGACG
AACATTTTCGTAAACTCTCTCAGTTTTGCGAGAGTTTGGAGCGGAGGGAATCGCCGGCGAAAAAGTTTGGGTGGGAAAGTTTGTTCGCCAAAAATCCCGCCAATGCGAGT
TCGAAATCGAGTTTGGGATTGAAACATGTAAACGGATGTGATGGTGATAATCAAGGACTTAGGGTTTACGGTTCTCATTTGATTCCGGAATCGTCGTCAGAAGCTAATGA
TTTACGCACATTCCATACGAACATTAGGGCAACTAATGATAGTAATGTAATGGATGGGAATGCTTCTAGAAGTTTTGGAGTCAATGACTGTAGTCATTTGTCTTCATCTA
GAAAGTTTGATGGGCCCGTATACGAGACCACTGATGTTTATGTTCAGGACCATTCACCGTATGAATCAGCAAGAAATTCCCACTCCCACAGAGGAAAACAAAAGGGAACT
TCCTCACATGGGACACAAGGGTCACATCCGCACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGGTTTCTGGAGTATAAACGTGCTCG
TGGGGAACATATCGAGCACTTCGACGATTGCAATAAGTATTTTAAAGCTCAACCATGCAAGAGGAGTGACATCGGTGCTGCTCTCAACAGTTCTTTGTCTCAGCAGATGG
TCCGTATCCCACAAGACGATTTCTATCAAGATTCTACTCGGACCAGTGTTATAGTGGATCCAGTTGTCGAGGGATTTGAAGACACTGAAAGCTATGTCATGGGTGATATG
GAAGAGAACCGGCCAAGCGACAACTATGGTTTTTTCAAAGAACCACACATCATTGAAGGTTCTTATAGGGGAAACGGTCCTTTTGCCATGGAACAGGATGATGAAGTTTT
GGGTTCTGGAACCGGGAGTCTGCTGAAGTGTGAAAAAGAAGCATATACAGGCAGTGAGAAGTTGCTCTTGGCAGAAGATGGTTATAATACAAATTATGGGAAATGGTCGG
GTGATGATGGATTAAATGGATCCTTAGTTTCAAGAAATGAACAAGATTTGGGTGGCATGGAAATGGAAGACAGTAGGAAGCTGAGATGGAAAGCCTCGCATTCGACAAAA
CGAAGGGTCAAGGGGAAATGCTTTGTATCTTCAAGATGCGGAATGCATTATCGTGGGTCCGATTCAAAACGTAATGTGTTTAGCAGAATCCATTTTTTAGGTAATGGAGA
TGAAAAGAGTACTGTTAAACACATTGATATCAATTTAAACCGTAGAAACGAGTTGTGGAATGATGAGGATACTTCCATGTCCTTAACCTCCTCCAAACGGCTGTTGCCTT
GGATAATAAACCGTGGCTCTCAGCGTCTGAAGTCTAAACGCAAAGACCTTAAGAAACGTTTGGGTGTCTCCTTGAGGGATCCCAGTTTAAATCCTCTAGTTAGAGAACGT
AAAAGAAATAAGCGTCTGATAAACACAAATATCAGTCATGAGTGCCTTGATTTTCAAGCAAGTGATTGCTTTGAAGACAAGACGCAAATTTCAACCAGTAGGCCACCTGA
AGATCCTGAGGAGTTGAACCAGCTAATAAAGAGTGCCTTTTTCAAGTTTATCAAAGTTCTGAATGAGAACCTAGCCCGACGAAAGAAGTTCACAGAGCCAGGGTCTGGTA
TTATAAAGTGCATTGTCTGCGGCAGCAAGTCCAAGGAGTTTGCGGATGCACTAAGCTTATCACAACATGCCTTCAATTCGCTGGTAGGATCGAGGGCGGAACACTTGGGT
CTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAAGTAGCGCCAAATGGTCTATGGGTTCAAAGGATATTGCCCCATGTAGAAGCCTTTGCTTTGAAGGAGGA
TCTCATTATATGGCCTCCTGTTCTTATCATTCATAACAGTTCTATTGCAACTGATAATACGTCTGAACGGGTAACCATAAGTTGTGAAGAGCTCGAGGTTGTTATTAGAG
GGAAGATCAAAGTGGTTCGTGGTAAACCTGCAAATCAGAGCATTATGGTAGTAACTTTCGGTGCAATGTTTTCTGGATTGCAAGAAGCAGAAAGACTACACAAAAACTTT
GCCGATAAAATTCATGGGAGGGATGAGTTCCATGAAATCAATTCGAGTCATCGCATCGACAGCCATGGGGATTTGCATAAAGCAGGAGCAAACAAGATGGAAAGCGTTCT
TTATGGCTACTTAGGCCTCGCAGAGGACTTCGAAAAACTTGACTTTGAGACCAAGAAGAGGTCCGTGGTGAAAAGCAAGAAAGAAATCCAGGCCATTGTGGATGCAACTC
TTCAATGT
Protein sequenceShow/hide protein sequence
MSWRERSKDDRSRSRSPSLRRRNSEPRVEENRHCHSHWFSGSAREGPVMNGPALPGYSVRDHFNETRLYENRDEHFRKLSQFCESLERRESPAKKFGWESLFAKNPANAS
SKSSLGLKHVNGCDGDNQGLRVYGSHLIPESSSEANDLRTFHTNIRATNDSNVMDGNASRSFGVNDCSHLSSSRKFDGPVYETTDVYVQDHSPYESARNSHSHRGKQKGT
SSHGTQGSHPHSSARVTESKGISQDEFHGFLEYKRARGEHIEHFDDCNKYFKAQPCKRSDIGAALNSSLSQQMVRIPQDDFYQDSTRTSVIVDPVVEGFEDTESYVMGDM
EENRPSDNYGFFKEPHIIEGSYRGNGPFAMEQDDEVLGSGTGSLLKCEKEAYTGSEKLLLAEDGYNTNYGKWSGDDGLNGSLVSRNEQDLGGMEMEDSRKLRWKASHSTK
RRVKGKCFVSSRCGMHYRGSDSKRNVFSRIHFLGNGDEKSTVKHIDINLNRRNELWNDEDTSMSLTSSKRLLPWIINRGSQRLKSKRKDLKKRLGVSLRDPSLNPLVRER
KRNKRLINTNISHECLDFQASDCFEDKTQISTSRPPEDPEELNQLIKSAFFKFIKVLNENLARRKKFTEPGSGIIKCIVCGSKSKEFADALSLSQHAFNSLVGSRAEHLG
LHKALCWLMGWSSEVAPNGLWVQRILPHVEAFALKEDLIIWPPVLIIHNSSIATDNTSERVTISCEELEVVIRGKIKVVRGKPANQSIMVVTFGAMFSGLQEAERLHKNF
ADKIHGRDEFHEINSSHRIDSHGDLHKAGANKMESVLYGYLGLAEDFEKLDFETKKRSVVKSKKEIQAIVDATLQC