; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001432 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001432
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionXS domain-containing protein
Genome locationscaffold8:46016795..46021686
RNA-Seq ExpressionSpg001432
SyntenySpg001432
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo]0.0e+0069.01Show/hide
Query:  MNWRERSGDDRSQFPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK
        MN RE + D RSQ PS F RRTS PRVEE  HC+SHW S SSRE P+TN LPGSS+RDHYN SRLY +KDEHFRKLSQFCENLQ ESP KKF+WENLF  
Subjt:  MNWRERSGDDRSQFPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK

Query:  NP-ANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVH
        N  AN NSK+++G KHVNG  DG++RGIRVS SHL  G+SSK I  GN LR+FH  IGATKDSNV N+GDTS+S GI DC+HLSS RK+DGP++  ++VH
Subjt:  NP-ANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVH

Query:  VRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQ
        VRDRPIFE   NSHR R N TSS GI+ASH HSSA V ESKGISQ EFH                          LLEYK+AR  HIEHFDD NQYF VQ
Subjt:  VRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQ

Query:  PCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGS
        PCKR+D  A     FSQ MVRIPQD FY+DST TSVV DSVVEGF+DTE +     E  RP D   F +       GS M  APFAMEQ  EVLGSGT S
Subjt:  PCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGS

Query:  PRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGD
         + G+ EAY+S+    L EEDGYRT++GKW+ E G++GS VSKH+QDL DMED RK  WKA HSTKPRV+G R   H  G  S +K NVFSRIQFL+HGD
Subjt:  PRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGD

Query:  EKSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYF
            +KD   NLN R+    DEDTS    SS+R LPWV+NH S R K KRR+LKKRLG+ + DP+SN LV RERERK+NKRLRKTNV+H CLDVQ GDY 
Subjt:  EKSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYF

Query:  EEKTPSPTSRLP--EDEDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS
        EEK  SPTSR P  + E+LNQLIKSAF KFVKVLSENPARRKK TEPG GII CIVCGSKS EF +ALSLS+HA R+LEGSRAEHLGLHKALCWLMGWSS
Subjt:  EEKTPSPTSRLP--EDEDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS

Query:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH
        E  APNGLWV+RILPL EV+ALKEDLIIWPPVLIIHNSSIAID  S+ V ISCEELE VI+GMGCGGKIKVVRG+PGNQSIMV TFGAMFSGLQEAERLH
Subjt:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH

Query:  KSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC
        KSFADK+HGRDE  KIN  +LI DS+  LHKA  ANT+ESVLYGYLGLAEDL KLDFETKKR+VVKSKKEIQAIV+ASL C
Subjt:  KSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus]1.9e-8476.68Show/hide
Query:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEV
        SKS EF +ALSL +HA R+LEGSRAEHLGLHKALCWLMGWSSE  APNGLWV+ ILP VEV+ALKEDLIIWP VLIIHNSSIAID   E V ISCE+LE 
Subjt:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEV

Query:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFE
         ++ MGCGGK KVVRGK  NQSIMV TFGAMF GLQEAERLH +FADK+HGRDEF KIN   L+ DS+  +HKA  ANT+ESV YGYLGL EDLDKLDFE
Subjt:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFE

Query:  TKKRAVVKSKKEIQAIVDASLHC
        TKKR+VV+SKKEIQAIV ASL C
Subjt:  TKKRAVVKSKKEIQAIVDASLHC

XP_017982234.1 PREDICTED: uncharacterized protein LOC18590378 [Theobroma cacao]4.0e-7953.59Show/hide
Query:  RRDLKKRLG--ISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFEEKTPSPTSRLPED-EDLNQLIKSAFFKFVKVLSENPARRKKFTEPG
        R+ +K+RLG    V +P+  P V    ER K ++L + NVN     VQA D          +  PED E+  Q I  AF KFVK+L+ENPA+R+K+ E G
Subjt:  RRDLKKRLG--ISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFEEKTPSPTSRLPED-EDLNQLIKSAFFKFVKVLSENPARRKKFTEPG

Query:  -LGIIKCIVCGSKSMEFANALSLSEHAFRS-LEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTS
          G +KC VCGSKS EF N LSL  HAF S + G RA HLGLHK+LC+LMGW+S  AA NGLW Q+ LP VE +A+KEDL+IWPP++I+HNSSIA  N+ 
Subjt:  -LGIIKCIVCGSKSMEFANALSLSEHAFRS-LEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTS

Query:  ERVTISCEELEVVIKGMGCG-GKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYL
         R+ +S EE+E  ++ MG G G  KV RGKP NQSIM   F   FSGL+EAERLHK +A+  HGR EFQ+IN S+          KA  + ++ VLYGYL
Subjt:  ERVTISCEELEVVIKGMGCG-GKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYL

Query:  GLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLH
        G+A DLDKLDFETK RA+VKSKKEI A  DA L+
Subjt:  GLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLH

XP_034687202.1 uncharacterized protein LOC117915679 [Vitis riparia]3.4e-7836.09Show/hide
Query:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPVE-------TEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQPC------KRSDNGAAL
        H    A+   S  +S+D+FH  Y+   T     + +E   EP         + F      E +  R +    F          PC      +R D G   
Subjt:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPVE-------TEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQPC------KRSDNGAAL

Query:  KSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGSPRKGKIEAYLS
           F  +M  + +D  +QD    SV+ DSVV+  +DTE      ++++R  D ++  +E   I          + +  DGEVLGS      +G       
Subjt:  KSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGSPRKGKIEAYLS

Query:  TLAEEDGY--RTSYGKWSHE---GGLDGSLVSKHEQDLS-DMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAIKDIG
        ++ +E GY        W +E     LD    S     LS  +E+            + R    + G H   S  S      ++I  LS+ +E  A +DIG
Subjt:  TLAEEDGY--RTSYGKWSHE---GGLDGSLVSKHEQDLS-DMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAIKDIG

Query:  INLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPS----SNPLVIRERERKKNKRLRKT-NVNHRCLDVQAGDYFEEKT
                 W+     ++   S R L  + N  SQ  K   +D+KKR   SV  P     S P+V   R+ K +K L+++ + +H  L ++ G   + K 
Subjt:  INLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPS----SNPLVIRERERKKNKRLRKT-NVNHRCLDVQAGDYFEEKT

Query:  PSPTSRLPE-DEDLNQLIKSAFFKFVKVLSENPARRKKFTEPG-LGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPA
         +  + LPE  E+  Q + S F KFVK+L+ENPA+R+ +TE G    +KC +CGS S EF N + L  H   S + G R +HLGL KALC LMGW+SE  
Subjt:  PSPTSRLPE-DEDLNQLIKSAFFKFVKVLSENPARRKKFTEPG-LGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPA

Query:  APNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKS
         PN  W  ++LP  E +ALKEDLIIWPPV+I+HNSSI   +  ER+ ++ + L  +++ MG  GGK K+ RGKP NQSIMV  F A FSGLQ+AE+LH  
Subjt:  APNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKS

Query:  FADKNHGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASL
        +A+  HGR EFQ+IN +N    S +   KA+A+ +E VLYGYLG+A DLDKLDFE K+R+VVKSKKEI AI D  L
Subjt:  FADKNHGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASL

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida]0.0e+0074.63Show/hide
Query:  MNWRERSGDDRSQFP-SFRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK
        MN+RE S D RSQ P SF RRTS PRVEEN HCHS W S SSRE PVTNGL GSS+RDHYN SRLYEN DEHFRKLSQ CENLQRESP KKFRWENLFA 
Subjt:  MNWRERSGDDRSQFP-SFRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK

Query:  NPANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVHV
        NPAN NSKS++G KH N C DG +RGIRVS SHL  G+SS +I  G++LR+FH  IG TKDSNV N+GD S+SFGI DCSHLSS RKFDGP+Y TSDVHV
Subjt:  NPANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVHV

Query:  RDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQP
        RDRPIFESA NSHR R N  SSHG++AS+  SSA VTESKGISQDEFH F                         LEYK+AR  +IE FDD NQYF VQP
Subjt:  RDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQP

Query:  CKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGSP
         KRSD  A L S+FSQQMVRIPQD FYQDST TSVV DSVVEGF+DTE ++    E  RP D YD FKEPF IE GSYMG APF ME  GE LGSG  S 
Subjt:  CKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGSP

Query:  RKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDE
         KG+ EAY+S+    LAEEDGYRT YGKW HE G++GSLVSKH+QDLSDME SRK RWKA +STK RV+GTRC  H  GS SSRK NVFSRIQFLSHGDE
Subjt:  RKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDE

Query:  KSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFE
          A+KD  INLN RSK WN+EDTSI L SS+RPLPWVINHAS   K KRRDL+KRLG  ++DPSS+PLV R+R+RKKNKRLRK NVNH CLDVQ  DY E
Subjt:  KSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFE

Query:  EKTPSPTSRLPED-EDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEP
        EK  SPTSRL ED E+LNQLIKSAF KFVKVLSENPARRKKFTEPG GIIKCIVCGSKS EFA+ALSLS+HA ++LEGSRAEHLGL KALCWLMGWSSE 
Subjt:  EKTPSPTSRLPED-EDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEP

Query:  AAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKS
        AAP+G WV+RILPL EV+ALKEDLIIWPPVLIIHNSSIAID+ SERV ISCEELEVVI+GMGCGGKIKVVRGKPGNQSIM+ TF AMFSGLQEAERLHKS
Subjt:  AAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKS

Query:  FADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC
        FADK+HGRDEFQKI SS+LI DSHK LHKA  ANT+++VLYGYLGL EDLDKLDFETKKR+VVKSKKEIQAIV+ASLHC
Subjt:  FADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC

TrEMBL top hitse value%identityAlignment
A0A0A0KGN5 XS domain-containing protein9.0e-8576.68Show/hide
Query:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEV
        SKS EF +ALSL +HA R+LEGSRAEHLGLHKALCWLMGWSSE  APNGLWV+ ILP VEV+ALKEDLIIWP VLIIHNSSIAID   E V ISCE+LE 
Subjt:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEV

Query:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFE
         ++ MGCGGK KVVRGK  NQSIMV TFGAMF GLQEAERLH +FADK+HGRDEF KIN   L+ DS+  +HKA  ANT+ESV YGYLGL EDLDKLDFE
Subjt:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFE

Query:  TKKRAVVKSKKEIQAIVDASLHC
        TKKR+VV+SKKEIQAIV ASL C
Subjt:  TKKRAVVKSKKEIQAIVDASLHC

A0A1S3C894 uncharacterized protein LOC1034979640.0e+0069.01Show/hide
Query:  MNWRERSGDDRSQFPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK
        MN RE + D RSQ PS F RRTS PRVEE  HC+SHW S SSRE P+TN LPGSS+RDHYN SRLY +KDEHFRKLSQFCENLQ ESP KKF+WENLF  
Subjt:  MNWRERSGDDRSQFPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK

Query:  NP-ANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVH
        N  AN NSK+++G KHVNG  DG++RGIRVS SHL  G+SSK I  GN LR+FH  IGATKDSNV N+GDTS+S GI DC+HLSS RK+DGP++  ++VH
Subjt:  NP-ANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVH

Query:  VRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQ
        VRDRPIFE   NSHR R N TSS GI+ASH HSSA V ESKGISQ EFH                          LLEYK+AR  HIEHFDD NQYF VQ
Subjt:  VRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQ

Query:  PCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGS
        PCKR+D  A     FSQ MVRIPQD FY+DST TSVV DSVVEGF+DTE +     E  RP D   F +       GS M  APFAMEQ  EVLGSGT S
Subjt:  PCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGS

Query:  PRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGD
         + G+ EAY+S+    L EEDGYRT++GKW+ E G++GS VSKH+QDL DMED RK  WKA HSTKPRV+G R   H  G  S +K NVFSRIQFL+HGD
Subjt:  PRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGD

Query:  EKSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYF
            +KD   NLN R+    DEDTS    SS+R LPWV+NH S R K KRR+LKKRLG+ + DP+SN LV RERERK+NKRLRKTNV+H CLDVQ GDY 
Subjt:  EKSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYF

Query:  EEKTPSPTSRLP--EDEDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS
        EEK  SPTSR P  + E+LNQLIKSAF KFVKVLSENPARRKK TEPG GII CIVCGSKS EF +ALSLS+HA R+LEGSRAEHLGLHKALCWLMGWSS
Subjt:  EEKTPSPTSRLP--EDEDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS

Query:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH
        E  APNGLWV+RILPL EV+ALKEDLIIWPPVLIIHNSSIAID  S+ V ISCEELE VI+GMGCGGKIKVVRG+PGNQSIMV TFGAMFSGLQEAERLH
Subjt:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH

Query:  KSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC
        KSFADK+HGRDE  KIN  +LI DS+  LHKA  ANT+ESVLYGYLGLAEDL KLDFETKKR+VVKSKKEIQAIV+ASL C
Subjt:  KSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC

A0A5A7SQC0 XS domain-containing protein0.0e+0069.01Show/hide
Query:  MNWRERSGDDRSQFPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK
        MN RE + D RSQ PS F RRTS PRVEE  HC+SHW S SSRE P+TN LPGSS+RDHYN SRLY +KDEHFRKLSQFCENLQ ESP KKF+WENLF  
Subjt:  MNWRERSGDDRSQFPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAK

Query:  NP-ANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVH
        N  AN NSK+++G KHVNG  DG++RGIRVS SHL  G+SSK I  GN LR+FH  IGATKDSNV N+GDTS+S GI DC+HLSS RK+DGP++  ++VH
Subjt:  NP-ANENSKSNLGFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVH

Query:  VRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQ
        VRDRPIFE   NSHR R N TSS GI+ASH HSSA V ESKGISQ EFH                          LLEYK+AR  HIEHFDD NQYF VQ
Subjt:  VRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQ

Query:  PCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGS
        PCKR+D  A     FSQ MVRIPQD FY+DST TSVV DSVVEGF+DTE +     E  RP D   F +       GS M  APFAMEQ  EVLGSGT S
Subjt:  PCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGS

Query:  PRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGD
         + G+ EAY+S+    L EEDGYRT++GKW+ E G++GS VSKH+QDL DMED RK  WKA HSTKPRV+G R   H  G  S +K NVFSRIQFL+HGD
Subjt:  PRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKHEQDLSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGD

Query:  EKSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYF
            +KD   NLN R+    DEDTS    SS+R LPWV+NH S R K KRR+LKKRLG+ + DP+SN LV RERERK+NKRLRKTNV+H CLDVQ GDY 
Subjt:  EKSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYF

Query:  EEKTPSPTSRLP--EDEDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS
        EEK  SPTSR P  + E+LNQLIKSAF KFVKVLSENPARRKK TEPG GII CIVCGSKS EF +ALSLS+HA R+LEGSRAEHLGLHKALCWLMGWSS
Subjt:  EEKTPSPTSRLP--EDEDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS

Query:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH
        E  APNGLWV+RILPL EV+ALKEDLIIWPPVLIIHNSSIAID  S+ V ISCEELE VI+GMGCGGKIKVVRG+PGNQSIMV TFGAMFSGLQEAERLH
Subjt:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH

Query:  KSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC
        KSFADK+HGRDE  KIN  +LI DS+  LHKA  ANT+ESVLYGYLGLAEDL KLDFETKKR+VVKSKKEIQAIV+ASL C
Subjt:  KSFADKNHGRDEFQKINSSNLIDDSHKGLHKAR-ANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC

A0A6J0ZXA5 uncharacterized protein LOC1104129792.1e-7853.17Show/hide
Query:  RRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFEEKTPSPTSRLPED-EDLNQLIKSAFFKFVKVLSENPARRKKFTEPG-L
        R+ +K+RLG        NP  +   +R K ++L K NVN     VQA D          +  PED ++  Q I+ AF ++VK+L+ENPA+R+K+TE G  
Subjt:  RRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFEEKTPSPTSRLPED-EDLNQLIKSAFFKFVKVLSENPARRKKFTEPG-L

Query:  GIIKCIVCGSKSMEFANALSLSEHAFRS-LEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSER
        G +KC VCGSKS EF N LSL  HAF S + G R  HLGLHKALC+LMGW+S  AA NGLW Q+ LP VE +A+KEDL+IWPPV+I+HNSSIA  N+  R
Subjt:  GIIKCIVCGSKSMEFANALSLSEHAFRS-LEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSER

Query:  VTISCEELEVVIKGMGCG-GKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYLGL
        + +S EE+E  ++ MG G G  KV RGKP NQSIM   F   FSGL+EAERLHK +A+  HGR EFQ+IN S           K   + +E VLYGYLG+
Subjt:  VTISCEELEVVIKGMGCG-GKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYLGL

Query:  AEDLDKLDFETKKRAVVKSKKEIQAIVDASL
        A DLDKLDFETK RA+VKSKKEI A  DA L
Subjt:  AEDLDKLDFETKKRAVVKSKKEIQAIVDASL

D7SN41 XS domain-containing protein6.2e-7835.47Show/hide
Query:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPV-------ETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQPC------KRSDNGAAL
        H    A+   S  +S+D+FHG Y+   T     + +E   EP         + F      E +  R +    F          PC      +R D G   
Subjt:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPV-------ETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQPC------KRSDNGAAL

Query:  KSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGSPRKGKIEAYLS
           F  +M  + +D  +QD    SV+ DSVV+  +DTE      ++++R  D ++  +E   I          + +  DGEVLGS      +G       
Subjt:  KSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGSPRKGKIEAYLS

Query:  TLAEEDGY--RTSYGKWSHE---GGLDGSLVSKHEQDLS-DMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAIKDIG
        ++ +E GY        W +E     LD    S     LS  +E+            + R    + G H   S  S      ++I  LS+  E  A +DIG
Subjt:  TLAEEDGY--RTSYGKWSHE---GGLDGSLVSKHEQDLS-DMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAIKDIG

Query:  INLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFEEKTPSPTS
                 W+     ++ + S R L  + N  SQ  K   +D KKRL    ++   +  V+R+ +  K  + R  + +H  L ++ G   + K  +  +
Subjt:  INLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRLGISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFEEKTPSPTS

Query:  RLPE-DEDLNQLIKSAFFKFVKVLSENPARRKKFTEPG-LGIIKCIVCGSKSMEFANALSLSEHAFRSLEGS-RAEHLGLHKALCWLMGWSSEPAAPNGL
         LPE  E+  Q + S F KFVK+L+ENPA+R+ +TE G    +KC +CGS S EF N + L  H   S +G  R +HLGL KALC LMGW+S+   PN  
Subjt:  RLPE-DEDLNQLIKSAFFKFVKVLSENPARRKKFTEPG-LGIIKCIVCGSKSMEFANALSLSEHAFRSLEGS-RAEHLGLHKALCWLMGWSSEPAAPNGL

Query:  WVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKN
        WV ++LP  E +ALKEDLIIWPPV+I+HNSSI   +  ER+ ++ + L  +++ MG  GGK ++ RGKP NQSIMV  F A FSGLQ+AE+LH  +A+  
Subjt:  WVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKN

Query:  HGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASL
        HGR EF +IN +N    S +   KA+A+ +E VLYGYLG+A DLDKLDFE K+R VVKSKKEI AI D  L
Subjt:  HGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)3.9e-3234.9Show/hide
Query:  IKSAFFKFVKVLSENPARRKKFTEPG-LGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVI
        +K +F  FVK + E+P  +K + E G  G ++C+VCG  S +  +  SL  H + S +  SR  HLGLHKALC LMGW+   A  N    Q  LP  E  
Subjt:  IKSAFFKFVKVLSENPARRKKFTEPG-LGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVI

Query:  ALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMG-CGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKIN--
          +  LIIWPP +I+ N+S              + ++  I+ +G  GGK K + G+ G+  I +  F    SGL++A R+ + F   N GR  + ++   
Subjt:  ALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMG-CGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKIN--

Query:  SSNLIDDSHKGLHKARANTMES--VLYGYLGLAEDLDKLDFETKKRAVVKSKKEI
        + +  D+ + GL +    T E   + YGYL    DLDK+D ETKK+  ++S +E+
Subjt:  SSNLIDDSHKGLHKARANTMES--VLYGYLGLAEDLDKLDFETKKRAVVKSKKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTGGAGAGAAAGGAGTGGGGATGATAGGTCTCAGTTTCCGTCGTTTCGACGGAGAACTTCGGGACCTCGGGTTGAAGAAAACCGGCATTGTCATTCGCACTGGAT
TTCGGGTTCTTCACGGGAAGGACCGGTGACGAATGGCCTTCCGGGTTCTTCTGTTAGAGACCATTACAACGAAAGTCGTCTTTATGAAAATAAGGACGAACATTTTCGTA
AACTGTCTCAGTTTTGTGAGAATTTGCAGCGGGAGTCGCCGTTGAAAAAGTTTCGCTGGGAAAACTTGTTTGCCAAAAATCCCGCCAATGAGAATTCGAAATCGAATCTG
GGGTTCAAACATGTAAATGGATGTGATGATGGTGAAAGTCGAGGAATTAGGGTTTCCAGTTCACATTTGGGTATGGGATCGTCGTCTAAGGATATTTCAGAGGGTAACGA
TTTGCGCTCATTTCATACTTACATTGGTGCAACTAAAGACAGTAACGTAAACGATGGGGATACTTCCAAAAGTTTTGGAATCGGTGACTGTAGCCATTTGTCTTCACCAA
GAAAGTTTGATGGGCCCGTATACGGGACCAGTGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAAGAAATTCCCACAGAGAAAGACTTAACGGAACATCTTCA
CATGGGATAGAAGCGTCTCATCCACACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGCCGTTCCACTTGGAAGAA
AGAAAACCCTAGAGAACCTGTTGAAACTGAATTTAATATGGAAGGTTTACTGGAGTATAAACAAGCTCGGGGAATCCATATCGAGCACTTTGATGATCGTAATCAGTATT
TCAAAGTCCAGCCATGTAAGAGGAGTGACAATGGTGCTGCTCTCAAGAGTTCTTTCTCTCAGCAGATGGTTCGCATCCCTCAAGACGCTTTCTATCAAGATTCTACGCTG
ACCAGTGTTGTCACGGATTCAGTTGTTGAGGGATTTGAAGACACTGAAAGGTATGTCGTGGGTGATATGGAAGAGAACCGGCCAATCGACCCTTATGATTTTTTCAAAGA
ACCCTTCACCATTGAAGGTGGTTCTTATATGGGCAACGCCCCTTTTGCGATGGAACAGGACGGTGAAGTTTTGGGTTCTGGAACTGGAAGTCCACGGAAGGGTAAAATAG
AAGCATATTTAAGTACCTTGGCTGAAGAAGATGGTTATAGGACAAGTTATGGGAAATGGTCGCATGAAGGTGGATTAGATGGATCACTAGTATCGAAACACGAACAAGAT
TTGAGCGACATGGAAGACAGTAGAAAGCGGAGATGGAAAGCTGCACATTCAACAAAACCGAGGGTCAAAGGAACAAGATGCGGAGCGCATTATTCTGGGTCTGATTCATC
TAGAAAATCTAATGTGTTTAGCAGAATCCAATTCTTAAGTCATGGAGATGAAAAGAGTGCTATTAAAGATATTGGCATCAATTTAAACGGTAGAAGCAAGCGGTGGAATG
ACGAGGATACTTCCATATCCTTGGCCTCCTCCAGACGGCCGTTGCCTTGGGTAATAAACCATGCCTCTCAGCGTCCAAAGTTTAAACGTAGAGACCTAAAGAAACGTTTG
GGTATCTCCGTGAAGGATCCCAGTTCAAACCCTCTAGTCATTAGAGAACGAGAACGCAAGAAAAACAAGCGTCTGAGAAAGACAAATGTCAATCATAGGTGCCTTGATGT
TCAAGCAGGTGATTACTTTGAAGAGAAGACGCCAAGTCCAACCAGCAGGCTACCTGAAGATGAGGATTTGAACCAGCTTATAAAGAGTGCATTCTTCAAGTTTGTCAAAG
TTCTGAGTGAGAATCCAGCCAGACGAAAGAAGTTCACAGAGCCAGGGTTGGGTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCGATGGAGTTTGCAAATGCCCTAAGC
TTATCAGAACATGCCTTCCGGTCGCTGGAAGGTTCGAGGGCAGAACACTTGGGACTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAACCAGCAGCGCCAAA
CGGTCTATGGGTTCAAAGGATATTGCCTCTTGTAGAAGTAATTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAATTG
ATAATACGTCTGAACGGGTAACCATAAGTTGTGAAGAGCTTGAGGTTGTTATTAAAGGAATGGGTTGTGGAGGGAAGATTAAAGTGGTGCGTGGTAAACCTGGAAACCAG
AGTATTATGGTAGCAACTTTCGGAGCCATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGCTTTGCAGATAAGAATCATGGTAGGGATGAGTTCCAGAAAAT
CAATTCGAGTAATCTCATTGATGACAGCCACAAGGGTCTGCATAAAGCAAGAGCAAACACGATGGAAAGCGTACTTTATGGCTACTTAGGTCTCGCAGAGGACTTGGATA
AACTTGACTTTGAGACCAAGAAGAGGGCTGTGGTGAAGAGCAAGAAAGAAATCCAGGCCATTGTGGATGCATCTCTTCACTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTGGAGAGAAAGGAGTGGGGATGATAGGTCTCAGTTTCCGTCGTTTCGACGGAGAACTTCGGGACCTCGGGTTGAAGAAAACCGGCATTGTCATTCGCACTGGAT
TTCGGGTTCTTCACGGGAAGGACCGGTGACGAATGGCCTTCCGGGTTCTTCTGTTAGAGACCATTACAACGAAAGTCGTCTTTATGAAAATAAGGACGAACATTTTCGTA
AACTGTCTCAGTTTTGTGAGAATTTGCAGCGGGAGTCGCCGTTGAAAAAGTTTCGCTGGGAAAACTTGTTTGCCAAAAATCCCGCCAATGAGAATTCGAAATCGAATCTG
GGGTTCAAACATGTAAATGGATGTGATGATGGTGAAAGTCGAGGAATTAGGGTTTCCAGTTCACATTTGGGTATGGGATCGTCGTCTAAGGATATTTCAGAGGGTAACGA
TTTGCGCTCATTTCATACTTACATTGGTGCAACTAAAGACAGTAACGTAAACGATGGGGATACTTCCAAAAGTTTTGGAATCGGTGACTGTAGCCATTTGTCTTCACCAA
GAAAGTTTGATGGGCCCGTATACGGGACCAGTGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAAGAAATTCCCACAGAGAAAGACTTAACGGAACATCTTCA
CATGGGATAGAAGCGTCTCATCCACACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGCCGTTCCACTTGGAAGAA
AGAAAACCCTAGAGAACCTGTTGAAACTGAATTTAATATGGAAGGTTTACTGGAGTATAAACAAGCTCGGGGAATCCATATCGAGCACTTTGATGATCGTAATCAGTATT
TCAAAGTCCAGCCATGTAAGAGGAGTGACAATGGTGCTGCTCTCAAGAGTTCTTTCTCTCAGCAGATGGTTCGCATCCCTCAAGACGCTTTCTATCAAGATTCTACGCTG
ACCAGTGTTGTCACGGATTCAGTTGTTGAGGGATTTGAAGACACTGAAAGGTATGTCGTGGGTGATATGGAAGAGAACCGGCCAATCGACCCTTATGATTTTTTCAAAGA
ACCCTTCACCATTGAAGGTGGTTCTTATATGGGCAACGCCCCTTTTGCGATGGAACAGGACGGTGAAGTTTTGGGTTCTGGAACTGGAAGTCCACGGAAGGGTAAAATAG
AAGCATATTTAAGTACCTTGGCTGAAGAAGATGGTTATAGGACAAGTTATGGGAAATGGTCGCATGAAGGTGGATTAGATGGATCACTAGTATCGAAACACGAACAAGAT
TTGAGCGACATGGAAGACAGTAGAAAGCGGAGATGGAAAGCTGCACATTCAACAAAACCGAGGGTCAAAGGAACAAGATGCGGAGCGCATTATTCTGGGTCTGATTCATC
TAGAAAATCTAATGTGTTTAGCAGAATCCAATTCTTAAGTCATGGAGATGAAAAGAGTGCTATTAAAGATATTGGCATCAATTTAAACGGTAGAAGCAAGCGGTGGAATG
ACGAGGATACTTCCATATCCTTGGCCTCCTCCAGACGGCCGTTGCCTTGGGTAATAAACCATGCCTCTCAGCGTCCAAAGTTTAAACGTAGAGACCTAAAGAAACGTTTG
GGTATCTCCGTGAAGGATCCCAGTTCAAACCCTCTAGTCATTAGAGAACGAGAACGCAAGAAAAACAAGCGTCTGAGAAAGACAAATGTCAATCATAGGTGCCTTGATGT
TCAAGCAGGTGATTACTTTGAAGAGAAGACGCCAAGTCCAACCAGCAGGCTACCTGAAGATGAGGATTTGAACCAGCTTATAAAGAGTGCATTCTTCAAGTTTGTCAAAG
TTCTGAGTGAGAATCCAGCCAGACGAAAGAAGTTCACAGAGCCAGGGTTGGGTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCGATGGAGTTTGCAAATGCCCTAAGC
TTATCAGAACATGCCTTCCGGTCGCTGGAAGGTTCGAGGGCAGAACACTTGGGACTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAACCAGCAGCGCCAAA
CGGTCTATGGGTTCAAAGGATATTGCCTCTTGTAGAAGTAATTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTGCAATTG
ATAATACGTCTGAACGGGTAACCATAAGTTGTGAAGAGCTTGAGGTTGTTATTAAAGGAATGGGTTGTGGAGGGAAGATTAAAGTGGTGCGTGGTAAACCTGGAAACCAG
AGTATTATGGTAGCAACTTTCGGAGCCATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGCTTTGCAGATAAGAATCATGGTAGGGATGAGTTCCAGAAAAT
CAATTCGAGTAATCTCATTGATGACAGCCACAAGGGTCTGCATAAAGCAAGAGCAAACACGATGGAAAGCGTACTTTATGGCTACTTAGGTCTCGCAGAGGACTTGGATA
AACTTGACTTTGAGACCAAGAAGAGGGCTGTGGTGAAGAGCAAGAAAGAAATCCAGGCCATTGTGGATGCATCTCTTCACTGTTAG
Protein sequenceShow/hide protein sequence
MNWRERSGDDRSQFPSFRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAKNPANENSKSNL
GFKHVNGCDDGESRGIRVSSSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNVNDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVHVRDRPIFESARNSHRERLNGTSS
HGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETEFNMEGLLEYKQARGIHIEHFDDRNQYFKVQPCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTL
TSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYMGNAPFAMEQDGEVLGSGTGSPRKGKIEAYLSTLAEEDGYRTSYGKWSHEGGLDGSLVSKHEQD
LSDMEDSRKRRWKAAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAIKDIGINLNGRSKRWNDEDTSISLASSRRPLPWVINHASQRPKFKRRDLKKRL
GISVKDPSSNPLVIRERERKKNKRLRKTNVNHRCLDVQAGDYFEEKTPSPTSRLPEDEDLNQLIKSAFFKFVKVLSENPARRKKFTEPGLGIIKCIVCGSKSMEFANALS
LSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIAIDNTSERVTISCEELEVVIKGMGCGGKIKVVRGKPGNQ
SIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKGLHKARANTMESVLYGYLGLAEDLDKLDFETKKRAVVKSKKEIQAIVDASLHC