; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010167 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010167
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionXS domain-containing protein
Genome locationchr9:45122771..45127178
RNA-Seq ExpressionLag0010167
SyntenyLag0010167
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005380 - XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458617.1 PREDICTED: uncharacterized protein LOC103497964 [Cucumis melo]5.6e-30768.29Show/hide
Query:  MNWRERSGDDRSQSPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF
        MN RE + D RSQSPS F RRTS PRVEE  HC+SHW S SSRE P+TN LPGSS+  RDHYN SRLY +KDEHFRKLSQFCENLQ ESP KKF+WENLF
Subjt:  MNWRERSGDDRSQSPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF

Query:  AKNP-ANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSD
          N  AN NSK+++G KHVNG  D D+RGIRVSGSHL  G+SSK I  GN LR+FH  IGATKDSNV N+GDTS+S GI DC+HLSS RK+DGP++  ++
Subjt:  AKNP-ANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSD

Query:  VHVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVK
        VHVRDRPIFE   NSHR R N TSS GI+ASH HSSA V ESKGISQ EFH                          LLEYK+AR  HIEHFDD NQY  
Subjt:  VHVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVK

Query:  VQQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGT
        VQ CKR+D  A     FSQ MVRIPQD FY+DST TSVV DSVVEGF+DTE +     E  RP D   F +       GS +  APFAMEQ  EVLG GT
Subjt:  VQQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGT

Query:  GSPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSH
         S + G+ EAY+S+    L EEDGYRT++GKW+ E G++GS VSK++QDL DMED RK  WK  HSTKPRV+G R   H  G  S +K NVFSRIQFL+H
Subjt:  GSPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSH

Query:  GDEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGD
        GD    VKD   NL  R+    DEDTS S   S+R LPWV+NH S R K KRR++KKRLG+ + D +SN LV RERERK+NKRLRKT+V+H CLDVQ GD
Subjt:  GDEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGD

Query:  CFEEKTPSPTSRLP--EDEELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGW
          EEK  SPTSR P  + EELNQLIKSAF KFVKVLSENPARRKK TEPG GII CIVCGSKS EF +ALSLS+HA R+LEGSRAEHLGLHKALCWLMGW
Subjt:  CFEEKTPSPTSRLP--EDEELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGW

Query:  SSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAER
        SSE  APNGLWV+RILPL EV+ALKEDLIIWPPVLIIHNSSI ID  S+ V ISCEELE VI+GMGCGGKIKVVRG+PGNQSIMV TFGAMFSGLQEAER
Subjt:  SSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAER

Query:  LHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC
        LHKSFADK+HGRDE  KIN  +LI DS+ DLHKA GANT+ESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIV+ASL C
Subjt:  LHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC

XP_010651723.1 PREDICTED: uncharacterized protein LOC104879702 [Vitis vinifera]3.6e-8036.08Show/hide
Query:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPV--ETERSMEG--------LLEYKQARGIHIEHFDDRNQYVKVQQCKRSDNGAALKSS
        H    A+   S  +S+D+FHG Y+   T     + +E   EP     +  M G        L   +Q + I  E     +   ++ + +R D G      
Subjt:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPV--ETERSMEG--------LLEYKQARGIHIEHFDDRNQYVKVQQCKRSDNGAALKSS

Query:  FSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTGSPRKGKIEAYLSTLA
        F  +M  + +D  +QD    SV+ DSVV+  +DTE      ++++R  D ++  +E   I          + +  DGEVLG       +G       ++ 
Subjt:  FSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTGSPRKGKIEAYLSTLA

Query:  EEDGY--RTSYGKWSHE---GGLDGSLVSKYEQDLS-DMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAVKDIGINL
        +E GY        W +E     LD    S     LS  +E+       +    + R    + G H   S  S      ++I  LS+  E  A +DIG   
Subjt:  EEDGY--RTSYGKWSHE---GGLDGSLVSKYEQDLS-DMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAVKDIGINL

Query:  KGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDCFEEKTPSPTSRLP
              WS     ++ + S R+L  + N  SQ  K   +D KKRL    +++  +  V+R+ +  K  + R  D +H  L ++ G   + K  +  + LP
Subjt:  KGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDCFEEKTPSPTSRLP

Query:  E-DEELNQLIKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLEGS-RAEHLGLHKALCWLMGWSSEPAAPNGLWVQ
        E  EE  Q + S F KFVK+L+ENPA+R+ +TE G +  +KC +CGS S EF N + L  H   S +G  R +HLGL KALC LMGW+S+   PN  WV 
Subjt:  E-DEELNQLIKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLEGS-RAEHLGLHKALCWLMGWSSEPAAPNGLWVQ

Query:  RILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGR
        ++LP  E +ALKEDLIIWPPV+I+HNSSIG  +  ER+ ++ + L  +++ MG  GGK ++ RGKP NQSIMV  F A FSGLQ+AE+LH  +A+  HGR
Subjt:  RILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGR

Query:  DEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASL
         EF +IN +N    S ++  KA A+ +E VLYGYLG+A DLDKLDFE K+R VVKSKKEI AI D  L
Subjt:  DEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASL

XP_011657058.1 uncharacterized protein LOC105435801 [Cucumis sativus]5.8e-8677.58Show/hide
Query:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEV
        SKS EF +ALSL +HA R+LEGSRAEHLGLHKALCWLMGWSSE  APNGLWV+ ILP VEV+ALKEDLIIWP VLIIHNSSI ID   E V ISCE+LE 
Subjt:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEV

Query:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFE
         ++ MGCGGK KVVRGK  NQSIMV TFGAMF GLQEAERLH +FADK+HGRDEF KIN   L+ DS+ D+HKA GANT+ESV YGYLGL EDLDKLDFE
Subjt:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFE

Query:  TKKRSVVKSKKEIQAIVDASLHC
        TKKRSVV+SKKEIQAIV ASL C
Subjt:  TKKRSVVKSKKEIQAIVDASLHC

XP_034687202.1 uncharacterized protein LOC117915679 [Vitis riparia]5.6e-8136.38Show/hide
Query:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPVE--TERSMEG--------LLEYKQARGIHIEHFDDRNQYVKVQQCKRSDNGAALKSS
        H    A+   S  +S+D+FH  Y+   T     + +E   EP     +  M G        L   +Q + I  E     +   ++ + +R D G      
Subjt:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPVE--TERSMEG--------LLEYKQARGIHIEHFDDRNQYVKVQQCKRSDNGAALKSS

Query:  FSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTGSPRKGKIEAYLSTLA
        F  +M  + +D  +QD    SV+ DSVV+  +DTE      ++++R  D ++  +E   I          + +  DGEVLG       +G       ++ 
Subjt:  FSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTGSPRKGKIEAYLSTLA

Query:  EEDGY--RTSYGKWSHE---GGLDGSLVSKYEQDLS-DMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAVKDIGINL
        +E GY        W +E     LD    S     LS  +E+       +    + R    + G H   S  S      ++I  LS+ +E  A +DIG   
Subjt:  EEDGY--RTSYGKWSHE---GGLDGSLVSKYEQDLS-DMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAVKDIGINL

Query:  KGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDCFEEKTPSPTSRLP
              WS     ++   S R+L  + N  SQ  K   +DIKKR     +++  +  V+R+ +  K  + R  D +H  L ++ G   + K  +  + LP
Subjt:  KGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDCFEEKTPSPTSRLP

Query:  E-DEELNQLIKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQ
        E  EE  Q + S F KFVK+L+ENPA+R+ +TE G +  +KC +CGS S EF N + L  H   S + G R +HLGL KALC LMGW+SE   PN  W  
Subjt:  E-DEELNQLIKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQ

Query:  RILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGR
        ++LP  E +ALKEDLIIWPPV+I+HNSSIG  +  ER+ ++ + L  +++ MG  GGK K+ RGKP NQSIMV  F A FSGLQ+AE+LH  +A+  HGR
Subjt:  RILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGR

Query:  DEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASL
         EFQ+IN +N    S ++  KA A+ +E VLYGYLG+A DLDKLDFE K+RSVVKSKKEI AI D  L
Subjt:  DEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASL

XP_038900433.1 uncharacterized protein LOC120087658 [Benincasa hispida]0.0e+0073.44Show/hide
Query:  MNWRERSGDDRSQSP-SFRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF
        MN+RE S D RSQSP SF RRTS PRVEEN HCHS W S SSRE PVTNGL GSS+  RDHYN SRLYEN DEHFRKLSQ CENLQRESP KKFRWENLF
Subjt:  MNWRERSGDDRSQSP-SFRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF

Query:  AKNPANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDV
        A NPAN NSKS++G KH N C D  +RGIRVSGSHL  G+SS +I  G++LR+FH  IG TKDSNV N+GD S+SFGI DCSHLSS RKFDGP+Y TSDV
Subjt:  AKNPANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDV

Query:  HVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVKV
        HVRDRPIFESA NSHR R N  SSHG++AS+  SSA VTESKGISQDEFH F                         LEYK+AR  +IE FDD NQY  V
Subjt:  HVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVKV

Query:  QQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTG
        Q  KRSD  A L S+FSQQMVRIPQD FYQDST TSVV DSVVEGF+DTE ++    E  RP D YD FKEPF IE GSY+G APF ME  GE LG G  
Subjt:  QQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTG

Query:  SPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHG
        S  KG+ EAY+S+    LAEEDGYRT YGKW HE G++GSLVSK++QDLSDME SRK RWK  +STK RV+GTRC  H  GS SSRK NVFSRIQFLSHG
Subjt:  SPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHG

Query:  DEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDC
        DE  AVKD  INL  RSK W++EDTSI L +S+R LPWVINHAS   K KRRD++KRLG  ++D SS+PLV R+R+RKKNKRLRK +VNH CLDVQ  D 
Subjt:  DEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDC

Query:  FEEKTPSPTSRLPED-EELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS
         EEK  SPTSRL ED EELNQLIKSAF KFVKVLSENPARRKKFTEPG GIIKCIVCGSKS EFA+ALSLS+HA ++LEGSRAEHLGL KALCWLMGWSS
Subjt:  FEEKTPSPTSRLPED-EELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSS

Query:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH
        E AAP+G WV+RILPL EV+ALKEDLIIWPPVLIIHNSSI ID+ SERV ISCEELEVVI+GMGCGGKIKVVRGKPGNQSIM+ TF AMFSGLQEAERLH
Subjt:  EPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLH

Query:  KSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC
        KSFADK+HGRDEFQKI SS+LI DSHKDLHKA GANT+++VLYGYLGL EDLDKLDFETKKRSVVKSKKEIQAIV+ASLHC
Subjt:  KSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC

TrEMBL top hitse value%identityAlignment
A0A0A0KGN5 XS domain-containing protein2.8e-8677.58Show/hide
Query:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEV
        SKS EF +ALSL +HA R+LEGSRAEHLGLHKALCWLMGWSSE  APNGLWV+ ILP VEV+ALKEDLIIWP VLIIHNSSI ID   E V ISCE+LE 
Subjt:  SKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEV

Query:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFE
         ++ MGCGGK KVVRGK  NQSIMV TFGAMF GLQEAERLH +FADK+HGRDEF KIN   L+ DS+ D+HKA GANT+ESV YGYLGL EDLDKLDFE
Subjt:  VIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFE

Query:  TKKRSVVKSKKEIQAIVDASLHC
        TKKRSVV+SKKEIQAIV ASL C
Subjt:  TKKRSVVKSKKEIQAIVDASLHC

A0A1S3C894 uncharacterized protein LOC1034979642.7e-30768.29Show/hide
Query:  MNWRERSGDDRSQSPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF
        MN RE + D RSQSPS F RRTS PRVEE  HC+SHW S SSRE P+TN LPGSS+  RDHYN SRLY +KDEHFRKLSQFCENLQ ESP KKF+WENLF
Subjt:  MNWRERSGDDRSQSPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF

Query:  AKNP-ANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSD
          N  AN NSK+++G KHVNG  D D+RGIRVSGSHL  G+SSK I  GN LR+FH  IGATKDSNV N+GDTS+S GI DC+HLSS RK+DGP++  ++
Subjt:  AKNP-ANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSD

Query:  VHVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVK
        VHVRDRPIFE   NSHR R N TSS GI+ASH HSSA V ESKGISQ EFH                          LLEYK+AR  HIEHFDD NQY  
Subjt:  VHVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVK

Query:  VQQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGT
        VQ CKR+D  A     FSQ MVRIPQD FY+DST TSVV DSVVEGF+DTE +     E  RP D   F +       GS +  APFAMEQ  EVLG GT
Subjt:  VQQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGT

Query:  GSPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSH
         S + G+ EAY+S+    L EEDGYRT++GKW+ E G++GS VSK++QDL DMED RK  WK  HSTKPRV+G R   H  G  S +K NVFSRIQFL+H
Subjt:  GSPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSH

Query:  GDEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGD
        GD    VKD   NL  R+    DEDTS S   S+R LPWV+NH S R K KRR++KKRLG+ + D +SN LV RERERK+NKRLRKT+V+H CLDVQ GD
Subjt:  GDEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGD

Query:  CFEEKTPSPTSRLP--EDEELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGW
          EEK  SPTSR P  + EELNQLIKSAF KFVKVLSENPARRKK TEPG GII CIVCGSKS EF +ALSLS+HA R+LEGSRAEHLGLHKALCWLMGW
Subjt:  CFEEKTPSPTSRLP--EDEELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGW

Query:  SSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAER
        SSE  APNGLWV+RILPL EV+ALKEDLIIWPPVLIIHNSSI ID  S+ V ISCEELE VI+GMGCGGKIKVVRG+PGNQSIMV TFGAMFSGLQEAER
Subjt:  SSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAER

Query:  LHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC
        LHKSFADK+HGRDE  KIN  +LI DS+ DLHKA GANT+ESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIV+ASL C
Subjt:  LHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC

A0A5A7SQC0 XS domain-containing protein2.7e-30768.29Show/hide
Query:  MNWRERSGDDRSQSPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF
        MN RE + D RSQSPS F RRTS PRVEE  HC+SHW S SSRE P+TN LPGSS+  RDHYN SRLY +KDEHFRKLSQFCENLQ ESP KKF+WENLF
Subjt:  MNWRERSGDDRSQSPS-FRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLF

Query:  AKNP-ANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSD
          N  AN NSK+++G KHVNG  D D+RGIRVSGSHL  G+SSK I  GN LR+FH  IGATKDSNV N+GDTS+S GI DC+HLSS RK+DGP++  ++
Subjt:  AKNP-ANENSKSNLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNV-NDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSD

Query:  VHVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVK
        VHVRDRPIFE   NSHR R N TSS GI+ASH HSSA V ESKGISQ EFH                          LLEYK+AR  HIEHFDD NQY  
Subjt:  VHVRDRPIFESARNSHRERLNGTSSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVK

Query:  VQQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGT
        VQ CKR+D  A     FSQ MVRIPQD FY+DST TSVV DSVVEGF+DTE +     E  RP D   F +       GS +  APFAMEQ  EVLG GT
Subjt:  VQQCKRSDNGAALKSSFSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGT

Query:  GSPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSH
         S + G+ EAY+S+    L EEDGYRT++GKW+ E G++GS VSK++QDL DMED RK  WK  HSTKPRV+G R   H  G  S +K NVFSRIQFL+H
Subjt:  GSPRKGKIEAYLST----LAEEDGYRTSYGKWSHEGGLDGSLVSKYEQDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSH

Query:  GDEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGD
        GD    VKD   NL  R+    DEDTS S   S+R LPWV+NH S R K KRR++KKRLG+ + D +SN LV RERERK+NKRLRKT+V+H CLDVQ GD
Subjt:  GDEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGD

Query:  CFEEKTPSPTSRLP--EDEELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGW
          EEK  SPTSR P  + EELNQLIKSAF KFVKVLSENPARRKK TEPG GII CIVCGSKS EF +ALSLS+HA R+LEGSRAEHLGLHKALCWLMGW
Subjt:  CFEEKTPSPTSRLP--EDEELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANALSLSEHAFRSLEGSRAEHLGLHKALCWLMGW

Query:  SSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAER
        SSE  APNGLWV+RILPL EV+ALKEDLIIWPPVLIIHNSSI ID  S+ V ISCEELE VI+GMGCGGKIKVVRG+PGNQSIMV TFGAMFSGLQEAER
Subjt:  SSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAER

Query:  LHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC
        LHKSFADK+HGRDE  KIN  +LI DS+ DLHKA GANT+ESVLYGYLGLAEDL KLDFETKKRSVVKSKKEIQAIV+ASL C
Subjt:  LHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKA-GANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC

A0A6J1CGJ5 uncharacterized protein LOC1110110329.6e-7985.41Show/hide
Query:  MGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQE
        MGWSSE  APNGLWVQRILP VE  ALKEDLIIWPPVLIIHNSSI  DNTSE+V ISCEELEVVI+GMG GGKIKVVRGKP NQSIMV TF AMFSGLQE
Subjt:  MGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPGNQSIMVATFGAMFSGLQE

Query:  AERLHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC
        AERLHK+FADK+HGRDEF +INSS+ I DSH DLHKAGAN MESVLYGYLGLAED +KLDFETKKRSVVKSKKEIQAIVDA+L C
Subjt:  AERLHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC

D7SN41 XS domain-containing protein1.8e-8036.08Show/hide
Query:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPV--ETERSMEG--------LLEYKQARGIHIEHFDDRNQYVKVQQCKRSDNGAALKSS
        H    A+   S  +S+D+FHG Y+   T     + +E   EP     +  M G        L   +Q + I  E     +   ++ + +R D G      
Subjt:  HPHSSARVTESKGISQDEFHGFYEGRST-----WKKENPREPV--ETERSMEG--------LLEYKQARGIHIEHFDDRNQYVKVQQCKRSDNGAALKSS

Query:  FSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTGSPRKGKIEAYLSTLA
        F  +M  + +D  +QD    SV+ DSVV+  +DTE      ++++R  D ++  +E   I          + +  DGEVLG       +G       ++ 
Subjt:  FSQQMVRIPQDAFYQDSTLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTGSPRKGKIEAYLSTLA

Query:  EEDGY--RTSYGKWSHE---GGLDGSLVSKYEQDLS-DMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAVKDIGINL
        +E GY        W +E     LD    S     LS  +E+       +    + R    + G H   S  S      ++I  LS+  E  A +DIG   
Subjt:  EEDGY--RTSYGKWSHE---GGLDGSLVSKYEQDLS-DMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAVKDIGINL

Query:  KGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDCFEEKTPSPTSRLP
              WS     ++ + S R+L  + N  SQ  K   +D KKRL    +++  +  V+R+ +  K  + R  D +H  L ++ G   + K  +  + LP
Subjt:  KGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKKRLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDCFEEKTPSPTSRLP

Query:  E-DEELNQLIKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLEGS-RAEHLGLHKALCWLMGWSSEPAAPNGLWVQ
        E  EE  Q + S F KFVK+L+ENPA+R+ +TE G +  +KC +CGS S EF N + L  H   S +G  R +HLGL KALC LMGW+S+   PN  WV 
Subjt:  E-DEELNQLIKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLEGS-RAEHLGLHKALCWLMGWSSEPAAPNGLWVQ

Query:  RILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGR
        ++LP  E +ALKEDLIIWPPV+I+HNSSIG  +  ER+ ++ + L  +++ MG  GGK ++ RGKP NQSIMV  F A FSGLQ+AE+LH  +A+  HGR
Subjt:  RILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGC-GGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGR

Query:  DEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASL
         EF +IN +N    S ++  KA A+ +E VLYGYLG+A DLDKLDFE K+R VVKSKKEI AI D  L
Subjt:  DEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22430.1 CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380)1.0e-3234.9Show/hide
Query:  IKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVI
        +K +F  FVK + E+P  +K + E G  G ++C+VCG  S +  +  SL  H + S +  SR  HLGLHKALC LMGW+   A  N    Q  LP  E  
Subjt:  IKSAFFKFVKVLSENPARRKKFTEPG-SGIIKCIVCGSKSMEFANALSLSEHAFRSLE-GSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVI

Query:  ALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMG-CGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKIN--
          +  LIIWPP +I+ N+S G            + ++  I+ +G  GGK K + G+ G+  I +  F    SGL++A R+ + F   N GR  + ++   
Subjt:  ALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMG-CGGKIKVVRGKPGNQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKIN--

Query:  SSNLIDDSHKDLHKAGANTMES--VLYGYLGLAEDLDKLDFETKKRSVVKSKKEI
        + +  D+ +  L +    T E   + YGYL    DLDK+D ETKK++ ++S +E+
Subjt:  SSNLIDDSHKDLHKAGANTMES--VLYGYLGLAEDLDKLDFETKKRSVVKSKKEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTGGAGAGAAAGGAGTGGGGATGATAGGTCTCAGTCTCCGTCGTTTCGACGGAGAACTTCGGGACCTCGGGTTGAAGAAAACCGGCATTGCCATTCGCACTGGAT
TTCGGGTTCTTCACGGGAAGGACCGGTGACGAATGGCCTTCCGGGTTCTTCTGTTAGAAATAGAGACCATTACAACGAAAGTCGTCTTTATGAAAATAAGGACGAACATT
TTCGTAAACTGTCTCAGTTTTGTGAGAATTTGCAGCGGGAGTCGCCGTTGAAAAAGTTTCGCTGGGAAAACTTGTTTGCCAAAAATCCCGCCAATGAGAATTCGAAGTCG
AATCTGGGGTTCAAACATGTAAATGGATGTGATGATGATGATAGTCGAGGAATTAGGGTTTCCGGTTCACATTTGGGTATGGGATCGTCGTCCAAGGATATTTCAGAGGG
TAACGATTTGCGCTCATTTCATACTTACATTGGTGCAACTAAAGACAGTAACGTAAACGATGGGGATACTTCCAAAAGTTTTGGAATCGGTGACTGTAGCCATTTGTCTT
CACCAAGAAAGTTTGATGGGCCCGTATACGGGACCAGTGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAAGAAATTCCCACAGAGAAAGACTTAACGGAACA
TCTTCACATGGGATAGAAGCGTCTCATCCACACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGCCGTTCCACTTG
GAAGAAAGAAAACCCTAGAGAACCTGTTGAAACTGAACGAAGTATGGAAGGTTTACTGGAGTATAAACAAGCTCGGGGAATCCATATCGAGCACTTTGATGATCGTAATC
AGTATGTCAAAGTCCAGCAATGTAAGAGGAGTGACAATGGTGCTGCTCTCAAGAGTTCTTTCTCTCAGCAGATGGTTCGCATCCCTCAAGACGCTTTCTATCAAGATTCT
ACGCTGACCAGTGTTGTCACGGATTCAGTTGTTGAGGGATTTGAAGACACTGAAAGGTATGTCGTGGGTGATATGGAAGAGAACCGGCCAATCGACCCTTATGATTTTTT
CAAAGAACCCTTCACCATTGAAGGCGGTTCTTATATTGGCAACGCTCCTTTTGCGATGGAACAGGACGGTGAAGTTTTGGGTCCTGGAACTGGAAGTCCACGGAAGGGTA
AAATAGAAGCATATTTAAGTACCTTGGCTGAAGAAGATGGTTATAGGACAAGTTATGGGAAATGGTCGCATGAAGGTGGATTAGATGGATCATTAGTATCAAAATACGAA
CAAGATTTGAGCGACATGGAAGACAGTAGAAAGCGGAGATGGAAAGATGCACATTCAACCAAACCGAGGGTCAAAGGAACAAGATGCGGAGCGCATTATTCTGGGTCTGA
TTCATCTAGAAAATCTAATGTGTTTAGCAGAATCCAATTCTTAAGTCATGGGGATGAAAAGAGTGCTGTTAAAGATATTGGCATCAATTTAAAAGGTAGAAGCAAGCGGT
GGAGTGACGAGGATACTTCCATATCCTTGGCTGCCTCCAGAAGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCAGCGTCCAAAGTTTAAACGTAGAGACATAAAGAAA
CGTTTGGGTATCTCCGTGAAGGATCTCAGTTCAAACCCTCTAGTCATTAGAGAACGAGAACGCAAGAAAAACAAGCGTCTGAGAAAGACAGATGTCAATCATAGGTGCCT
TGATGTTCAAGCAGGTGATTGCTTTGAAGAGAAGACGCCAAGTCCAACCAGCAGGCTACCTGAAGATGAGGAGTTGAACCAGCTTATAAAGAGTGCCTTCTTCAAGTTTG
TCAAAGTTCTGAGTGAGAATCCAGCCAGACGAAAGAAGTTCACAGAGCCAGGGTCGGGTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCGATGGAGTTTGCAAATGCC
CTAAGCTTATCAGAACATGCCTTCCGGTCGCTGGAAGGTTCGAGGGCAGAACACTTGGGACTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAACCAGCAGC
GCCAAACGGTCTATGGGTTCAAAGGATATTGCCTCTTGTAGAAGTAATTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTG
GAATTGATAATACGTCTGAACGGGTAAACATAAGTTGTGAAGAGCTTGAGGTTGTTATTAAAGGAATGGGTTGTGGAGGGAAGATTAAAGTGGTGCGTGGTAAACCTGGA
AATCAGAGTATTATGGTAGCAACTTTCGGTGCCATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGCTTTGCAGATAAGAATCATGGTAGGGATGAGTTCCA
GAAAATCAACTCGAGTAATCTCATTGATGACAGCCACAAGGATCTGCATAAAGCAGGAGCCAACACGATGGAAAGCGTACTTTATGGCTACTTAGGCCTTGCAGAGGACT
TGGATAAACTTGACTTTGAGACCAAGAAGAGGTCTGTGGTGAAAAGCAAGAAAGAAATCCAGGCCATTGTGGATGCATCTCTTCACTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTGGAGAGAAAGGAGTGGGGATGATAGGTCTCAGTCTCCGTCGTTTCGACGGAGAACTTCGGGACCTCGGGTTGAAGAAAACCGGCATTGCCATTCGCACTGGAT
TTCGGGTTCTTCACGGGAAGGACCGGTGACGAATGGCCTTCCGGGTTCTTCTGTTAGAAATAGAGACCATTACAACGAAAGTCGTCTTTATGAAAATAAGGACGAACATT
TTCGTAAACTGTCTCAGTTTTGTGAGAATTTGCAGCGGGAGTCGCCGTTGAAAAAGTTTCGCTGGGAAAACTTGTTTGCCAAAAATCCCGCCAATGAGAATTCGAAGTCG
AATCTGGGGTTCAAACATGTAAATGGATGTGATGATGATGATAGTCGAGGAATTAGGGTTTCCGGTTCACATTTGGGTATGGGATCGTCGTCCAAGGATATTTCAGAGGG
TAACGATTTGCGCTCATTTCATACTTACATTGGTGCAACTAAAGACAGTAACGTAAACGATGGGGATACTTCCAAAAGTTTTGGAATCGGTGACTGTAGCCATTTGTCTT
CACCAAGAAAGTTTGATGGGCCCGTATACGGGACCAGTGATGTTCATGTTCGGGACCGTCCGATCTTTGAATCAGCAAGAAATTCCCACAGAGAAAGACTTAACGGAACA
TCTTCACATGGGATAGAAGCGTCTCATCCACACTCCAGTGCACGTGTTACTGAATCTAAAGGCATTTCGCAAGATGAATTTCATGGTTTTTATGAGGGCCGTTCCACTTG
GAAGAAAGAAAACCCTAGAGAACCTGTTGAAACTGAACGAAGTATGGAAGGTTTACTGGAGTATAAACAAGCTCGGGGAATCCATATCGAGCACTTTGATGATCGTAATC
AGTATGTCAAAGTCCAGCAATGTAAGAGGAGTGACAATGGTGCTGCTCTCAAGAGTTCTTTCTCTCAGCAGATGGTTCGCATCCCTCAAGACGCTTTCTATCAAGATTCT
ACGCTGACCAGTGTTGTCACGGATTCAGTTGTTGAGGGATTTGAAGACACTGAAAGGTATGTCGTGGGTGATATGGAAGAGAACCGGCCAATCGACCCTTATGATTTTTT
CAAAGAACCCTTCACCATTGAAGGCGGTTCTTATATTGGCAACGCTCCTTTTGCGATGGAACAGGACGGTGAAGTTTTGGGTCCTGGAACTGGAAGTCCACGGAAGGGTA
AAATAGAAGCATATTTAAGTACCTTGGCTGAAGAAGATGGTTATAGGACAAGTTATGGGAAATGGTCGCATGAAGGTGGATTAGATGGATCATTAGTATCAAAATACGAA
CAAGATTTGAGCGACATGGAAGACAGTAGAAAGCGGAGATGGAAAGATGCACATTCAACCAAACCGAGGGTCAAAGGAACAAGATGCGGAGCGCATTATTCTGGGTCTGA
TTCATCTAGAAAATCTAATGTGTTTAGCAGAATCCAATTCTTAAGTCATGGGGATGAAAAGAGTGCTGTTAAAGATATTGGCATCAATTTAAAAGGTAGAAGCAAGCGGT
GGAGTGACGAGGATACTTCCATATCCTTGGCTGCCTCCAGAAGGTCGTTGCCTTGGGTAATAAACCATGCCTCTCAGCGTCCAAAGTTTAAACGTAGAGACATAAAGAAA
CGTTTGGGTATCTCCGTGAAGGATCTCAGTTCAAACCCTCTAGTCATTAGAGAACGAGAACGCAAGAAAAACAAGCGTCTGAGAAAGACAGATGTCAATCATAGGTGCCT
TGATGTTCAAGCAGGTGATTGCTTTGAAGAGAAGACGCCAAGTCCAACCAGCAGGCTACCTGAAGATGAGGAGTTGAACCAGCTTATAAAGAGTGCCTTCTTCAAGTTTG
TCAAAGTTCTGAGTGAGAATCCAGCCAGACGAAAGAAGTTCACAGAGCCAGGGTCGGGTATTATAAAGTGCATTGTCTGCGGCAGCAAGTCGATGGAGTTTGCAAATGCC
CTAAGCTTATCAGAACATGCCTTCCGGTCGCTGGAAGGTTCGAGGGCAGAACACTTGGGACTTCACAAAGCACTTTGTTGGCTCATGGGATGGAGCAGTGAACCAGCAGC
GCCAAACGGTCTATGGGTTCAAAGGATATTGCCTCTTGTAGAAGTAATTGCTTTGAAGGAGGATCTCATTATATGGCCCCCTGTTCTTATCATTCATAACAGTTCTATTG
GAATTGATAATACGTCTGAACGGGTAAACATAAGTTGTGAAGAGCTTGAGGTTGTTATTAAAGGAATGGGTTGTGGAGGGAAGATTAAAGTGGTGCGTGGTAAACCTGGA
AATCAGAGTATTATGGTAGCAACTTTCGGTGCCATGTTTTCTGGGTTGCAAGAAGCAGAAAGACTACACAAAAGCTTTGCAGATAAGAATCATGGTAGGGATGAGTTCCA
GAAAATCAACTCGAGTAATCTCATTGATGACAGCCACAAGGATCTGCATAAAGCAGGAGCCAACACGATGGAAAGCGTACTTTATGGCTACTTAGGCCTTGCAGAGGACT
TGGATAAACTTGACTTTGAGACCAAGAAGAGGTCTGTGGTGAAAAGCAAGAAAGAAATCCAGGCCATTGTGGATGCATCTCTTCACTGTTAG
Protein sequenceShow/hide protein sequence
MNWRERSGDDRSQSPSFRRRTSGPRVEENRHCHSHWISGSSREGPVTNGLPGSSVRNRDHYNESRLYENKDEHFRKLSQFCENLQRESPLKKFRWENLFAKNPANENSKS
NLGFKHVNGCDDDDSRGIRVSGSHLGMGSSSKDISEGNDLRSFHTYIGATKDSNVNDGDTSKSFGIGDCSHLSSPRKFDGPVYGTSDVHVRDRPIFESARNSHRERLNGT
SSHGIEASHPHSSARVTESKGISQDEFHGFYEGRSTWKKENPREPVETERSMEGLLEYKQARGIHIEHFDDRNQYVKVQQCKRSDNGAALKSSFSQQMVRIPQDAFYQDS
TLTSVVTDSVVEGFEDTERYVVGDMEENRPIDPYDFFKEPFTIEGGSYIGNAPFAMEQDGEVLGPGTGSPRKGKIEAYLSTLAEEDGYRTSYGKWSHEGGLDGSLVSKYE
QDLSDMEDSRKRRWKDAHSTKPRVKGTRCGAHYSGSDSSRKSNVFSRIQFLSHGDEKSAVKDIGINLKGRSKRWSDEDTSISLAASRRSLPWVINHASQRPKFKRRDIKK
RLGISVKDLSSNPLVIRERERKKNKRLRKTDVNHRCLDVQAGDCFEEKTPSPTSRLPEDEELNQLIKSAFFKFVKVLSENPARRKKFTEPGSGIIKCIVCGSKSMEFANA
LSLSEHAFRSLEGSRAEHLGLHKALCWLMGWSSEPAAPNGLWVQRILPLVEVIALKEDLIIWPPVLIIHNSSIGIDNTSERVNISCEELEVVIKGMGCGGKIKVVRGKPG
NQSIMVATFGAMFSGLQEAERLHKSFADKNHGRDEFQKINSSNLIDDSHKDLHKAGANTMESVLYGYLGLAEDLDKLDFETKKRSVVKSKKEIQAIVDASLHC