CuGenDBv2

Tan0016902 (gene) of Snake gourd v1 genome

Gene ID: Tan0016902
Organism: Trichosanthes anguina (Snake gourd v1)
Description: XS domain-containing protein
Genome location: LG05:1172721..1177202
RNA-Seq Expression: Tan0016902
Synteny: Tan0016902
Gene Ontology terms:
GO:0031047 - gene silencing by RNA (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
IPR005380 - XS domain
IPR038588 - XS domain superfamily
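
The genome location above follows a scaffold:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the location can be split into its parts and the genomic span computed as below. The function names parse_location and span_length are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

scaffold, start, end = parse_location("LG05:1172721..1177202")
print(scaffold, start, end, span_length(start, end))
# prints: LG05 1172721 1177202 4482
```

If the coordinates were instead 0-based and half-open, the span would be end - start, one base shorter.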


Homology
GenBank top hits (e value, % identity, alignment)
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo] | e value: 1.3e-306 | % identity: 67.31
Query:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK
        MN RE + D+RSQSPS   R+TSEPRVEE  HC+S WFS SSRE P+TN L GSS+RDHYN SRLY +KDEHFR LS+FCENLQ ESP+KKF+WENLF  
Subjt:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK

Query:  NP-ANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSD
        N  AN NSK+S+G KHVNG  DGDNRGIRVSGSHLGT  SS ++L  G NLRTFHM I ATKD+N+ NNGDTSRS GI DC+ HLSSSRK+DGP+++ ++
Subjt:  NP-ANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSD

Query:  VHGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYF
        VH RD PI E   N+HR RR+ TSS GI+ASH HSSA VA SK ISQ EFH                          L EYK+AR  NHIE+FDD NQYF
Subjt:  VHGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYF

Query:  KVQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSG
         VQPCKR+DI A  + PFSQ MVRIPQDDFY+DSTRTSVVMD VVE F+DT S+     E TRPRD + F +       GS M  APFAMEQ  EVLGSG
Subjt:  KVQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSG

Query:  TGSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLS
        T S    ER+ Y+S +KLLL EE+GY TN+GKW   DG+NGS VS+H+QDL  MED RK  WKA HSTKPRV+G + + H P   S +KPNVFSRIQFL+
Subjt:  TGSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLS

Query:  HGNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQA
        HG+    V+D D NLN RN   +DE TS    SSKRQLPW++NH +  SK KRRNLKKRLG+ L +P+SN LV+E ERKRNKRLR+TNV+H CL+  VQ 
Subjt:  HGNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQA

Query:  GDCFEEKTQSPTSR-PLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLM
        GD  EEK QSPTSR PLEDPEELNQLIKSAFLKFVKVL EN ARRKK TEP  GII CIVCGS S EF DALSLSQHA ++LE SR+EHLGLHKALCWLM
Subjt:  GDCFEEKTQSPTSR-PLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLM

Query:  GWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAE
        GWSS  APNG+WV+RILP+ EV+ALKEDLIIWPPVLIIHNSSI ID  S+ V ISCEELE VIR     GK+K+VRG+PGNQSIMV TF AM SGLQEAE
Subjt:  GWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAE

Query:  RLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC
        RLHKSFADK+HGRDE  KIN  H ID S+ DLHKA GAN +ESVLYGY+GLAEDL KLDFETKKR+VVKSKKEIQ IV+AS  C
Subjt:  RLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus] | e value: 3.5e-75 | % identity: 70.72
Query:  SNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVV
        S S EF DALSL QHA ++LE SR+EHLGLHKALCWLMGWSS  APNG+WV+ ILP VEV+ALKEDLIIWP VLIIHNSSI ID   E V ISCE+LE  
Subjt:  SNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVV

Query:  IR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFET
        +R     GK K+VRGK  NQSIMV TF AM  GLQEAERLH +FADK+HGRDEF KIN    +D S+ D+HKA GAN +ESV YGY+GL EDLDKLDFET
Subjt:  IR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFET

Query:  KKRTVVKSKKEIQTIVDASRHC
        KKR+VV+SKKEIQ IV AS  C
Subjt:  KKRTVVKSKKEIQTIVDASRHC

XP_017982234.1 PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao] | e value: 1.1e-71 | % identity: 48.82
Query:  HAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQAGDCFEEKTQSPTSRPLEDPEELNQLIKSAFLKFVKVLCENLAR
        H      S R+++K+RLG      + N + +  ER + ++L Q NVN       VQA D      +   + P ED EE  Q I  AF+KFVK+L EN A+
Subjt:  HAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQAGDCFEEKTQSPTSRPLEDPEELNQLIKSAFLKFVKVLCENLAR

Query:  RKKFTEP-RSGIIKCIVCGSNSMEFADALSLSQHAFQS-LEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSS
        R+K+ E   +G +KC VCGS S EF + LSL  HAF S +   R+ HLGLHK+LC+LMGW+S AA NG+W Q+ LP VE +A+KEDL+IWPP++I+HNSS
Subjt:  RKKFTEP-RSGIIKCIVCGSNSMEFADALSLSQHAFQS-LEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSS

Query:  ITIDNTSERVTISCEELEVVIR------GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRIE
        I   N+  R+ +S EE+E  +R      G  K+ RGKP NQSIM   F    SGL+EAERLHK +A+  HGR EFQ+IN S    G  K   KA  ++++
Subjt:  ITIDNTSERVTISCEELEVVIR------GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRIE

Query:  SVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDA
         VLYGY+G+A DLDKLDFETK R +VKSKKEI    DA
Subjt:  SVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDA

XP_022140332.1 uncharacterized protein LOC111011032 [Momordica charantia] | e value: 3.6e-72 | % identity: 79.89
Query:  MGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEA
        MGWSS  APNG+WVQRILP VE  ALKEDLIIWPPVLIIHNSSI  DNTSE+VTISCEELEVVIR     GK+K+VRGKP NQSIMV TF AM SGLQEA
Subjt:  MGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEA

Query:  ERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC
        ERLHK+FADK+HGRDEF +INSSH ID SH DLHKAGAN++ESVLYGY+GLAED +KLDFETKKR+VVKSKKEIQ IVDA+  C
Subjt:  ERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida] | e value: 0.0e+00 | % identity: 72.9
Query:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK
        MN+RETS D+RSQSPSS  R+TSEPRVEEN HCHS WFS SSRE PVTNGLAGSS+RDHYN SRLYEN DEHFR LS+ CENLQRESPSKKFRWENLFA 
Subjt:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK

Query:  NPANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSDV
        NPAN NSKSS+G KH N C DG NRGIRVSGSHLGT  SSNN+L  G+NLRTFHM I  TKD+N+ NNGD SRSFGI DCS HLSSSRKFDGP+YETSDV
Subjt:  NPANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSDV

Query:  HGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYFK
        H RD PI ESA N+HR RR+  SSHG++AS+  SSA V  SK ISQDEFH F                          EYK+AR  N+IE FDD NQYF 
Subjt:  HGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYFK

Query:  VQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSGT
        VQP KRSDI A LNS FSQQMVRIPQDDFYQDSTRTSVVMD VVE F+DT S+     E TRPRD +D FK PF+IE GSYMG APF ME  GE LGSG 
Subjt:  VQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSGT

Query:  GSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLSH
         S +K ER+ Y+S +KLLLAEE+GY T YGKWLH DG+NGSLVS+H+QDLS ME SRK RWKA +STK RV+GT+C  H P S SSRKPNVFSRIQFLSH
Subjt:  GSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLSH

Query:  GNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQAG
        G+E  AV+D D+NLN R+K W +E TSI LTSSKR LPW+INHA+ HSK KRR+L+KRLG  L++PSS+ LV++ +RK+NKRLR+ NVNH CL+VQ    
Subjt:  GNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQAG

Query:  DCFEEKTQSPTSRPLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLMGW
        D  EEK QSPTSR LED EELNQLIKSAFLKFVKVL EN ARRKKFTEP  GIIKCIVCGS S EFADALSLSQHA Q+LE SR+EHLGL KALCWLMGW
Subjt:  DCFEEKTQSPTSRPLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLMGW

Query:  SSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERL
        SS AAP+G WV+RILP+ EV+ALKEDLIIWPPVLIIHNSSI ID+ SERV ISCEELEVVIR     GK+K+VRGKPGNQSIM+ TFDAM SGLQEAERL
Subjt:  SSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERL

Query:  HKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC
        HKSFADK+HGRDEFQKI SSH ID SHKDLHKA GAN +++VLYGY+GL EDLDKLDFETKKR+VVKSKKEIQ IV+AS HC
Subjt:  HKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KGN5 XS domain-containing protein | e value: 1.7e-75 | % identity: 70.72
Query:  SNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVV
        S S EF DALSL QHA ++LE SR+EHLGLHKALCWLMGWSS  APNG+WV+ ILP VEV+ALKEDLIIWP VLIIHNSSI ID   E V ISCE+LE  
Subjt:  SNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVV

Query:  IR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFET
        +R     GK K+VRGK  NQSIMV TF AM  GLQEAERLH +FADK+HGRDEF KIN    +D S+ D+HKA GAN +ESV YGY+GL EDLDKLDFET
Subjt:  IR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFET

Query:  KKRTVVKSKKEIQTIVDASRHC
        KKR+VV+SKKEIQ IV AS  C
Subjt:  KKRTVVKSKKEIQTIVDASRHC

A0A1S3C894 uncharacterized protein LOC103497964 | e value: 6.1e-307 | % identity: 67.31
Query:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK
        MN RE + D+RSQSPS   R+TSEPRVEE  HC+S WFS SSRE P+TN L GSS+RDHYN SRLY +KDEHFR LS+FCENLQ ESP+KKF+WENLF  
Subjt:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK

Query:  NP-ANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSD
        N  AN NSK+S+G KHVNG  DGDNRGIRVSGSHLGT  SS ++L  G NLRTFHM I ATKD+N+ NNGDTSRS GI DC+ HLSSSRK+DGP+++ ++
Subjt:  NP-ANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSD

Query:  VHGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYF
        VH RD PI E   N+HR RR+ TSS GI+ASH HSSA VA SK ISQ EFH                          L EYK+AR  NHIE+FDD NQYF
Subjt:  VHGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYF

Query:  KVQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSG
         VQPCKR+DI A  + PFSQ MVRIPQDDFY+DSTRTSVVMD VVE F+DT S+     E TRPRD + F +       GS M  APFAMEQ  EVLGSG
Subjt:  KVQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSG

Query:  TGSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLS
        T S    ER+ Y+S +KLLL EE+GY TN+GKW   DG+NGS VS+H+QDL  MED RK  WKA HSTKPRV+G + + H P   S +KPNVFSRIQFL+
Subjt:  TGSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLS

Query:  HGNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQA
        HG+    V+D D NLN RN   +DE TS    SSKRQLPW++NH +  SK KRRNLKKRLG+ L +P+SN LV+E ERKRNKRLR+TNV+H CL+  VQ 
Subjt:  HGNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQA

Query:  GDCFEEKTQSPTSR-PLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLM
        GD  EEK QSPTSR PLEDPEELNQLIKSAFLKFVKVL EN ARRKK TEP  GII CIVCGS S EF DALSLSQHA ++LE SR+EHLGLHKALCWLM
Subjt:  GDCFEEKTQSPTSR-PLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLM

Query:  GWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAE
        GWSS  APNG+WV+RILP+ EV+ALKEDLIIWPPVLIIHNSSI ID  S+ V ISCEELE VIR     GK+K+VRG+PGNQSIMV TF AM SGLQEAE
Subjt:  GWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAE

Query:  RLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC
        RLHKSFADK+HGRDE  KIN  H ID S+ DLHKA GAN +ESVLYGY+GLAEDL KLDFETKKR+VVKSKKEIQ IV+AS  C
Subjt:  RLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC

A0A5A7SQC0 XS domain-containing protein | e value: 6.1e-307 | % identity: 67.31
Query:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK
        MN RE + D+RSQSPS   R+TSEPRVEE  HC+S WFS SSRE P+TN L GSS+RDHYN SRLY +KDEHFR LS+FCENLQ ESP+KKF+WENLF  
Subjt:  MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAK

Query:  NP-ANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSD
        N  AN NSK+S+G KHVNG  DGDNRGIRVSGSHLGT  SS ++L  G NLRTFHM I ATKD+N+ NNGDTSRS GI DC+ HLSSSRK+DGP+++ ++
Subjt:  NP-ANVNSKSSLGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNI-NNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSD

Query:  VHGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYF
        VH RD PI E   N+HR RR+ TSS GI+ASH HSSA VA SK ISQ EFH                          L EYK+AR  NHIE+FDD NQYF
Subjt:  VHGRDCPILESARNTHRERRDGTSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYF

Query:  KVQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSG
         VQPCKR+DI A  + PFSQ MVRIPQDDFY+DSTRTSVVMD VVE F+DT S+     E TRPRD + F +       GS M  APFAMEQ  EVLGSG
Subjt:  KVQPCKRSDIGAALNSPFSQQMVRIPQDDFYQDSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSG

Query:  TGSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLS
        T S    ER+ Y+S +KLLL EE+GY TN+GKW   DG+NGS VS+H+QDL  MED RK  WKA HSTKPRV+G + + H P   S +KPNVFSRIQFL+
Subjt:  TGSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGSLVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLS

Query:  HGNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQA
        HG+    V+D D NLN RN   +DE TS    SSKRQLPW++NH +  SK KRRNLKKRLG+ L +P+SN LV+E ERKRNKRLR+TNV+H CL+  VQ 
Subjt:  HGNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQA

Query:  GDCFEEKTQSPTSR-PLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLM
        GD  EEK QSPTSR PLEDPEELNQLIKSAFLKFVKVL EN ARRKK TEP  GII CIVCGS S EF DALSLSQHA ++LE SR+EHLGLHKALCWLM
Subjt:  GDCFEEKTQSPTSR-PLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGSNSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLM

Query:  GWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAE
        GWSS  APNG+WV+RILP+ EV+ALKEDLIIWPPVLIIHNSSI ID  S+ V ISCEELE VIR     GK+K+VRG+PGNQSIMV TF AM SGLQEAE
Subjt:  GWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEAE

Query:  RLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC
        RLHKSFADK+HGRDE  KIN  H ID S+ DLHKA GAN +ESVLYGY+GLAEDL KLDFETKKR+VVKSKKEIQ IV+AS  C
Subjt:  RLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKA-GANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC

A0A6J0ZXA5 uncharacterized protein LOC110412979 | e value: 1.5e-71 | % identity: 48.38
Query:  NHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQAGDCFEEKTQSPTSRPLEDPEELNQLIKSAFLKFVKVLCENLA
        +H      S R+ +K+RLG      + N + +    K  K L++ NVN    +  VQA D      +   + P ED +E  Q I+ AF+++VK+L EN A
Subjt:  NHAAKHSKSKRRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQAGDCFEEKTQSPTSRPLEDPEELNQLIKSAFLKFVKVLCENLA

Query:  RRKKFTEP-RSGIIKCIVCGSNSMEFADALSLSQHAFQS-LEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNS
        +R+K+TE   +G +KC VCGS S EF + LSL  HAF S +   R  HLGLHKALC+LMGW+S AA NG+W Q+ LP VE +A+KEDL+IWPPV+I+HNS
Subjt:  RRKKFTEP-RSGIIKCIVCGSNSMEFADALSLSQHAFQS-LEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNS

Query:  SITIDNTSERVTISCEELEVVI------RGKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRI
        SI   N+  R+ +S EE+E  +      RG  K+ RGKP NQSIM   F    SGL+EAERLHK +A+  HGR EFQ+IN S    G  K   K   +++
Subjt:  SITIDNTSERVTISCEELEVVI------RGKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRI

Query:  ESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDA
        E VLYGY+G+A DLDKLDFETK R +VKSKKEI    DA
Subjt:  ESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDA

A0A6J1CGJ5 uncharacterized protein LOC111011032 | e value: 1.8e-72 | % identity: 79.89
Query:  MGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEA
        MGWSS  APNG+WVQRILP VE  ALKEDLIIWPPVLIIHNSSI  DNTSE+VTISCEELEVVIR     GK+K+VRGKP NQSIMV TF AM SGLQEA
Subjt:  MGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR-----GKMKMVRGKPGNQSIMVATFDAMLSGLQEA

Query:  ERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC
        ERLHK+FADK+HGRDEF +INSSH ID SH DLHKAGAN++ESVLYGY+GLAED +KLDFETKKR+VVKSKKEIQ IVDA+  C
Subjt:  ERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380) | e value: 1.7e-30 | % identity: 33.85
Query:  IKSAFLKFVKVLCENLARRKKFTE-PRSGIIKCIVCGSNSMEFADALSLSQHAFQSLE-ASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIA
        +K +FL FVK + E+   +K + E  R G ++C+VCG +S +  D  SL  H + S + +SR  HLGLHKALC LMGW+   AP+     + LP  E   
Subjt:  IKSAFLKFVKVLCENLARRKKFTE-PRSGIIKCIVCGSNSMEFADALSLSQHAFQSLE-ASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIA

Query:  LKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR------GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSH
         +  LIIWPP +I+ N+S              + ++  IR      GK K + G+ G+  I +  F    SGL++A R+ + F   N GR  + ++    
Subjt:  LKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIR------GKMKMVRGKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSH

Query:  PIDGSHKDLHKAGANRIES-------VLYGYIGLAEDLDKLDFETKKRTVVKSKKEI
        P+  S  D    G   ++        + YGY+    DLDK+D ETKK+T ++S +E+
Subjt:  PIDGSHKDLHKAGANRIES-------VLYGYIGLAEDLDKLDFETKKRTVVKSKKEI

AT5G23570.1 XS domain-containing protein / XS zinc finger domain-containing protein-related | e value: 9.1e-05 | % identity: 25.17
Query:  KEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIRGKMKMVR-----GKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPI
        K+  I+WPP++II N+ +  D+  + + +  +EL +    K + +R     G  G++ + V  F++  +G  EAERLH+  A+    R  +         
Subjt:  KEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIRGKMKMVR-----GKPGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPI

Query:  DGSHKDLHKAGANRIESVLYGYIGLAEDLDKLDFETKKRTVVK
         G  + +   G  +    LYG++   +DLD  +  ++ +T +K
Subjt:  DGSHKDLHKAGANRIESVLYGYIGLAEDLDKLDFETKKRTVVK


Sequences
CDS sequence
ATGAACTGGAGAGAAACGAGTGGGGACAGAAGGTCTCAGTCTCCGTCGTCGCTTCGACGGAAAACTTCGGAACCTCGGGTTGAAGAAAATCAGCATTGCCATTCTCAGTG
GTTTTCGGGTTCTTCACGGGAAGGACCGGTGACGAATGGCCTTGCGGGTTCTTCTGTAAGAGACCATTATAACGAAAGTCGTCTTTATGAAAATAAGGACGAACATTTTC
GTAATCTCTCTCGGTTCTGCGAGAATTTGCAGCGGGAATCGCCGTCGAAAAAGTTTCGGTGGGAAAACTTGTTTGCCAAAAATCCCGCCAATGTGAATTCGAAATCGAGT
CTGGGTTTCAAACATGTAAATGGATGTGGTGATGGAGATAATCGAGGAATTAGGGTTTCCGGTTCACATTTGGGTACGGGATCGTCGTCTAACAATGTTTTAGACGAGGG
TAATAATTTGCGCACATTTCATATGATCATTGAGGCAACTAAAGACACTAACATAAACAATGGGGATACTTCCAGAAGTTTTGGAATCGGTGACTGTAGCCGCCATTTGT
CTTCATCTAGGAAGTTTGATGGGCCCGTATACGAGACCAGTGATGTTCATGGTCGCGACTGTCCGATCCTTGAATCAGCAAGAAATACCCACAGAGAAAGACGAGACGGA
ACTTCTTCACATGGGATTGAAGCGTCTCATCCGCACTCCAGTGCATGTGTTGCTGCATCTAAACGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGTCGTTCCCC
TTGGAGGAAAGAAAAGCATAGAGAACGAGTTGAAACTGAACTAAATATGGAAGGTTTACAGGAGTATAAACAAGCTCGGGGAGGAAACCATATTGAGTACTTTGATGATC
GTAATCAGTATTTCAAAGTCCAGCCATGTAAGAGGAGTGACATTGGTGCTGCGCTCAACAGTCCTTTCTCTCAGCAGATGGTTCGTATCCCACAAGATGATTTCTATCAA
GATTCTACGCGGACCAGTGTTGTAATGGATCCAGTCGTCGAGGCATTTGAAGACACTGGAAGCTATGGTGTGGGTGCAATGGAAGAGACCCGGCCAAGGGACCCTCATGA
TTTTTTCAAAGGACCCTTCATCATTGAAGGTGGTTCTTATATGGGCAACGCCCCTTTTGCGATGGAACAGGATGGTGAAGTTTTGGGTTCTGGAACTGGAAGTCCATTGA
AGCTTGAAAGAAAAACATATCTAAGTGGCCAGAAGTTGCTCTTGGCTGAAGAAGAGGGTTATACGACAAATTATGGGAAATGGTTGCATGGGGATGGATTAAATGGATCA
TTAGTATCAGAACATGAACAAGATTTGAGCTATATGGAAGACAGTAGAAAGTCGAGATGGAAAGCTGCACATTCAACAAAACCGAGGGTCAAAGGAACAAAATGCGAAGG
ACATTATCCTGTATCTGATTCATCTAGAAAACCTAATGTGTTTAGCAGAATCCAATTCTTAAGTCATGGAAATGAAAAGAGTGCTGTTGAAGATATTGATATGAATTTGA
ACCGTAGAAACAAGCGGTGGATTGACGAGGGTACTTCCATTTCCTTGACCTCCTCTAAACGGCAGTTGCCTTGGATAATAAACCATGCCGCCAAACATTCAAAGTCTAAA
CGTAGAAATCTAAAGAAACGTTTGGGTATCTCCTTGAAGGAACCCAGTTCAAACATTCTAGTTAAAGAAGGAGAACGTAAAAGAAACAAGCGTCTGAGACAGACAAATGT
CAATCACAGGTGCCTTAATGTTCAAGTTCAAGCAGGTGATTGCTTTGAAGAGAAGACGCAAAGTCCAACCAGTAGGCCACTTGAAGATCCCGAGGAGTTGAACCAGCTGA
TAAAGAGTGCCTTTCTCAAGTTTGTTAAAGTTCTGTGTGAGAATCTAGCCAGACGAAAGAAGTTCACAGAGCCACGGTCTGGTATTATAAAGTGCATTGTCTGCGGCAGC
AACTCCATGGAGTTTGCAGATGCGCTAAGCTTATCACAACATGCCTTCCAGTCGCTGGAAGCATCCCGGTCAGAGCACTTGGGTCTTCACAAGGCACTTTGTTGGCTCAT
GGGATGGAGCAGTGGAGCAGCACCCAACGGTATATGGGTCCAAAGGATATTGCCTGTTGTAGAAGTAATTGCTTTGAAGGAGGATCTCATTATATGGCCTCCTGTTCTTA
TCATTCATAACAGTTCTATTACAATTGATAATACGTCTGAACGGGTAACCATAAGTTGTGAAGAGCTTGAGGTTGTCATTAGAGGGAAGATGAAAATGGTACGCGGTAAA
CCTGGAAACCAGAGTATTATGGTAGCAACTTTCGATGCCATGTTATCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGTTTTGCAGATAAGAATCATGGTAGGGATGA
GTTCCAGAAAATCAATTCCAGTCATCCCATTGATGGAAGCCACAAGGATCTGCATAAAGCTGGAGCAAACAGGATAGAAAGCGTACTTTATGGCTACATAGGCCTCGCAG
AGGACTTGGATAAACTTGACTTTGAGACCAAGAAGAGGACTGTGGTGAAAAGCAAGAAAGAAATCCAGACTATTGTGGATGCATCTCGTCACTGTTAG
mRNA sequence
ATGAACTGGAGAGAAACGAGTGGGGACAGAAGGTCTCAGTCTCCGTCGTCGCTTCGACGGAAAACTTCGGAACCTCGGGTTGAAGAAAATCAGCATTGCCATTCTCAGTG
GTTTTCGGGTTCTTCACGGGAAGGACCGGTGACGAATGGCCTTGCGGGTTCTTCTGTAAGAGACCATTATAACGAAAGTCGTCTTTATGAAAATAAGGACGAACATTTTC
GTAATCTCTCTCGGTTCTGCGAGAATTTGCAGCGGGAATCGCCGTCGAAAAAGTTTCGGTGGGAAAACTTGTTTGCCAAAAATCCCGCCAATGTGAATTCGAAATCGAGT
CTGGGTTTCAAACATGTAAATGGATGTGGTGATGGAGATAATCGAGGAATTAGGGTTTCCGGTTCACATTTGGGTACGGGATCGTCGTCTAACAATGTTTTAGACGAGGG
TAATAATTTGCGCACATTTCATATGATCATTGAGGCAACTAAAGACACTAACATAAACAATGGGGATACTTCCAGAAGTTTTGGAATCGGTGACTGTAGCCGCCATTTGT
CTTCATCTAGGAAGTTTGATGGGCCCGTATACGAGACCAGTGATGTTCATGGTCGCGACTGTCCGATCCTTGAATCAGCAAGAAATACCCACAGAGAAAGACGAGACGGA
ACTTCTTCACATGGGATTGAAGCGTCTCATCCGCACTCCAGTGCATGTGTTGCTGCATCTAAACGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGTCGTTCCCC
TTGGAGGAAAGAAAAGCATAGAGAACGAGTTGAAACTGAACTAAATATGGAAGGTTTACAGGAGTATAAACAAGCTCGGGGAGGAAACCATATTGAGTACTTTGATGATC
GTAATCAGTATTTCAAAGTCCAGCCATGTAAGAGGAGTGACATTGGTGCTGCGCTCAACAGTCCTTTCTCTCAGCAGATGGTTCGTATCCCACAAGATGATTTCTATCAA
GATTCTACGCGGACCAGTGTTGTAATGGATCCAGTCGTCGAGGCATTTGAAGACACTGGAAGCTATGGTGTGGGTGCAATGGAAGAGACCCGGCCAAGGGACCCTCATGA
TTTTTTCAAAGGACCCTTCATCATTGAAGGTGGTTCTTATATGGGCAACGCCCCTTTTGCGATGGAACAGGATGGTGAAGTTTTGGGTTCTGGAACTGGAAGTCCATTGA
AGCTTGAAAGAAAAACATATCTAAGTGGCCAGAAGTTGCTCTTGGCTGAAGAAGAGGGTTATACGACAAATTATGGGAAATGGTTGCATGGGGATGGATTAAATGGATCA
TTAGTATCAGAACATGAACAAGATTTGAGCTATATGGAAGACAGTAGAAAGTCGAGATGGAAAGCTGCACATTCAACAAAACCGAGGGTCAAAGGAACAAAATGCGAAGG
ACATTATCCTGTATCTGATTCATCTAGAAAACCTAATGTGTTTAGCAGAATCCAATTCTTAAGTCATGGAAATGAAAAGAGTGCTGTTGAAGATATTGATATGAATTTGA
ACCGTAGAAACAAGCGGTGGATTGACGAGGGTACTTCCATTTCCTTGACCTCCTCTAAACGGCAGTTGCCTTGGATAATAAACCATGCCGCCAAACATTCAAAGTCTAAA
CGTAGAAATCTAAAGAAACGTTTGGGTATCTCCTTGAAGGAACCCAGTTCAAACATTCTAGTTAAAGAAGGAGAACGTAAAAGAAACAAGCGTCTGAGACAGACAAATGT
CAATCACAGGTGCCTTAATGTTCAAGTTCAAGCAGGTGATTGCTTTGAAGAGAAGACGCAAAGTCCAACCAGTAGGCCACTTGAAGATCCCGAGGAGTTGAACCAGCTGA
TAAAGAGTGCCTTTCTCAAGTTTGTTAAAGTTCTGTGTGAGAATCTAGCCAGACGAAAGAAGTTCACAGAGCCACGGTCTGGTATTATAAAGTGCATTGTCTGCGGCAGC
AACTCCATGGAGTTTGCAGATGCGCTAAGCTTATCACAACATGCCTTCCAGTCGCTGGAAGCATCCCGGTCAGAGCACTTGGGTCTTCACAAGGCACTTTGTTGGCTCAT
GGGATGGAGCAGTGGAGCAGCACCCAACGGTATATGGGTCCAAAGGATATTGCCTGTTGTAGAAGTAATTGCTTTGAAGGAGGATCTCATTATATGGCCTCCTGTTCTTA
TCATTCATAACAGTTCTATTACAATTGATAATACGTCTGAACGGGTAACCATAAGTTGTGAAGAGCTTGAGGTTGTCATTAGAGGGAAGATGAAAATGGTACGCGGTAAA
CCTGGAAACCAGAGTATTATGGTAGCAACTTTCGATGCCATGTTATCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGTTTTGCAGATAAGAATCATGGTAGGGATGA
GTTCCAGAAAATCAATTCCAGTCATCCCATTGATGGAAGCCACAAGGATCTGCATAAAGCTGGAGCAAACAGGATAGAAAGCGTACTTTATGGCTACATAGGCCTCGCAG
AGGACTTGGATAAACTTGACTTTGAGACCAAGAAGAGGACTGTGGTGAAAAGCAAGAAAGAAATCCAGACTATTGTGGATGCATCTCGTCACTGTTAG
Protein sequence
MNWRETSGDRRSQSPSSLRRKTSEPRVEENQHCHSQWFSGSSREGPVTNGLAGSSVRDHYNESRLYENKDEHFRNLSRFCENLQRESPSKKFRWENLFAKNPANVNSKSS
LGFKHVNGCGDGDNRGIRVSGSHLGTGSSSNNVLDEGNNLRTFHMIIEATKDTNINNGDTSRSFGIGDCSRHLSSSRKFDGPVYETSDVHGRDCPILESARNTHRERRDG
TSSHGIEASHPHSSACVAASKRISQDEFHGFYEGRSPWRKEKHRERVETELNMEGLQEYKQARGGNHIEYFDDRNQYFKVQPCKRSDIGAALNSPFSQQMVRIPQDDFYQ
DSTRTSVVMDPVVEAFEDTGSYGVGAMEETRPRDPHDFFKGPFIIEGGSYMGNAPFAMEQDGEVLGSGTGSPLKLERKTYLSGQKLLLAEEEGYTTNYGKWLHGDGLNGS
LVSEHEQDLSYMEDSRKSRWKAAHSTKPRVKGTKCEGHYPVSDSSRKPNVFSRIQFLSHGNEKSAVEDIDMNLNRRNKRWIDEGTSISLTSSKRQLPWIINHAAKHSKSK
RRNLKKRLGISLKEPSSNILVKEGERKRNKRLRQTNVNHRCLNVQVQAGDCFEEKTQSPTSRPLEDPEELNQLIKSAFLKFVKVLCENLARRKKFTEPRSGIIKCIVCGS
NSMEFADALSLSQHAFQSLEASRSEHLGLHKALCWLMGWSSGAAPNGIWVQRILPVVEVIALKEDLIIWPPVLIIHNSSITIDNTSERVTISCEELEVVIRGKMKMVRGK
PGNQSIMVATFDAMLSGLQEAERLHKSFADKNHGRDEFQKINSSHPIDGSHKDLHKAGANRIESVLYGYIGLAEDLDKLDFETKKRTVVKSKKEIQTIVDASRHC
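
The protein sequence is expected to correspond to a translation of the CDS listed above. As a minimal sketch, assuming the standard genetic code (the record does not say which translation table was used), the snippet below translates the first ten codons and the last six codons of the CDS and prints the resulting residues, which can be compared with the start and end of the protein sequence. The helper name translate is hypothetical; the two fragments are copied from the CDS listing.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGAACTGGAGAGAAACGAGTGGGGACAGA"  # first 30 bases of the CDS
cds_end = "GCATCTCGTCACTGTTAG"                # last 18 bases of the CDS

print(translate(cds_start))  # prints: MNWRETSGDR
print(translate(cds_end))    # prints: ASRHC*
```

Both fragments agree with the protein listing, which begins MNWRETSGDR and ends ASRHC; the trailing * is the TAG stop codon, which is not shown in the protein sequence.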