; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G17300 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G17300
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein SQS1
Genome locationChr1:13014419..13023986
RNA-Seq ExpressionCSPI01G17300
SyntenyCSPI01G17300
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain
IPR001374 - R3H domain
IPR034082 - Protein SQS1, R3H domain
IPR036867 - R3H domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050035.1 Zinc finger protein, putative isoform 1 [Cucumis melo var. makuwa]0.0e+0090.81Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY
        MAGGRRRTNHAK+SD FRKNKTNSARRRS+ PSNLFVDGGFL DW  Q SPPISAR          EGNSRAKGKSGSKS TLDRKKIASSSGTKQS+GY
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY

Query:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA
        AIGYEYPSAPNQEDL SESR LQNDAE  LDDSQPFIL+NSKSNQIVAYVDENP LMADNLEFTYDYGTSFVLG+SSHRGLGF DDDE VTKQNTDDDSA
Subjt:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA

Query:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES
        TQ+EEQG LCT SLS  KETGTDE+VDGRVGVETANEMVAEASPSNK S+DM SP+NSGFLSIGGVRLYTQDVSDE SDDDGESSDG+SEYSEPLESDES
Subjt:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES

Query:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL
        SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGS+NILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKT SRNKRSVVSRDHWSALAL
Subjt:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL

Query:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL
        DDMLVKD RSTS RKQKNA QFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQI+L+LEHMVLNKEDMYAFQPMHPRDCSQVRRL
Subjt:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL

Query:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG
        AAIYRLH+GCQGSGKKRFVTVTRTQHTGMPSASDQVRLE+LIGARDEDNDFSVAEGSNIKSRG +RSREKK TK+ GL+TLEFDQSVS+KSRSKGSAGKG
Subjt:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG

Query:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
        SSQKMTGKKYADQPVSFVS GVMQPE ME + V+D ND DKGKDIVA SEMIEM TSNVNN DISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
Subjt:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ

Query:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG
        GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQAS +SSARTGS ALEK+K+IGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGI+NPLLPVRRPKARG
Subjt:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG

Query:  LGAKS
        LGAKS
Subjt:  LGAKS

XP_008443896.1 PREDICTED: uncharacterized protein LOC103487380 isoform X1 [Cucumis melo]0.0e+0088.33Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY
        MAGGRRRTNHAK+SD FRKNKTNSARRRS+ PSNLFVDGGFL DW  Q SPPISAR          EGNSRAKGKSGSKS TLDRKKIASSSGTKQSNGY
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY

Query:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA
        AIGYEYPSAPNQEDL SESR LQNDAE  LDDSQPFILLNSKSNQIVAYVDENP LMADNLEFTYDYGTSFVLG+SSHRGLGF DDDE VTKQNTDDDSA
Subjt:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA

Query:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES
        TQ+EEQG LCT SLS  KETGTDE+VDGRVGVETANEMVAEASPSNK S+DM SP+NSGFLSIGGVRLYTQDVSDE SDDDGESSDG+SEYSEPLESDES
Subjt:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES

Query:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL
        SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGS+NILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKT SRNKRSVVSRDHWSALAL
Subjt:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL

Query:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL
        DDMLVKD RSTS RKQKNA QFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQI+L+LEHMVLNKEDMYAFQPMHPRDCSQ+   
Subjt:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL

Query:  AAIYRLHNGCQGSG---------KKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKS
          I R  +  Q +          KKRFVTVTRTQHTGMPSASDQVRLE+LIGARDEDNDFSVAEGSNIKSRG +RSREKK TKV GL+TLEFDQSVS+KS
Subjt:  AAIYRLHNGCQGSG---------KKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKS

Query:  RSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVE
        RSKGSAGKGSSQKMTGKKYADQPVSFVS GVMQPE ME + V+D ND DKGKDIVA SEMIEM TSNVNN DISRDSIGSFELHTKGFGSKMMAKMGFVE
Subjt:  RSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVE

Query:  GGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLL
        GGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQAS +SSART S ALEK+K+IGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGI+NPLL
Subjt:  GGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLL

Query:  PVRRPKARGLGAKS
        PVRRPKARGLGAKS
Subjt:  PVRRPKARGLGAKS

XP_008443898.1 PREDICTED: uncharacterized protein LOC103487380 isoform X2 [Cucumis melo]0.0e+0091.06Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY
        MAGGRRRTNHAK+SD FRKNKTNSARRRS+ PSNLFVDGGFL DW  Q SPPISAR          EGNSRAKGKSGSKS TLDRKKIASSSGTKQSNGY
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY

Query:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA
        AIGYEYPSAPNQEDL SESR LQNDAE  LDDSQPFILLNSKSNQIVAYVDENP LMADNLEFTYDYGTSFVLG+SSHRGLGF DDDE VTKQNTDDDSA
Subjt:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA

Query:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES
        TQ+EEQG LCT SLS  KETGTDE+VDGRVGVETANEMVAEASPSNK S+DM SP+NSGFLSIGGVRLYTQDVSDE SDDDGESSDG+SEYSEPLESDES
Subjt:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES

Query:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL
        SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGS+NILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKT SRNKRSVVSRDHWSALAL
Subjt:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL

Query:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL
        DDMLVKD RSTS RKQKNA QFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQI+L+LEHMVLNKEDMYAFQPMHPRDCSQVRRL
Subjt:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL

Query:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG
        AAIYRLH+GCQGSGKKRFVTVTRTQHTGMPSASDQVRLE+LIGARDEDNDFSVAEGSNIKSRG +RSREKK TKV GL+TLEFDQSVS+KSRSKGSAGKG
Subjt:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG

Query:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
        SSQKMTGKKYADQPVSFVS GVMQPE ME + V+D ND DKGKDIVA SEMIEM TSNVNN DISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
Subjt:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ

Query:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG
        GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQAS +SSART S ALEK+K+IGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGI+NPLLPVRRPKARG
Subjt:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG

Query:  LGAKS
        LGAKS
Subjt:  LGAKS

XP_011655819.1 uncharacterized protein LOC101215133 [Cucumis sativus]0.0e+0099.25Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDWQCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGYAI
        MAGGRRRTNHAKASDSFRKNKTNSARRRS+TPSNLFVDGGFLFDWQCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGYAI
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDWQCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGYAI

Query:  GYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSATQ
        GYEYPSAPNQEDLHSESRVLQNDAER  DDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSATQ
Subjt:  GYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSATQ

Query:  MEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDESSE
        +EEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDESSE
Subjt:  MEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDESSE

Query:  GDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALALDD
         DSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALALDD
Subjt:  GDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALALDD

Query:  MLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRLAA
        MLVK+TRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRLAA
Subjt:  MLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRLAA

Query:  IYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSS
        IYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSS
Subjt:  IYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSS

Query:  QKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGM
        QKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGM
Subjt:  QKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGM

Query:  THPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLG
        THPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLG
Subjt:  THPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLG

Query:  AKS
        AKS
Subjt:  AKS

XP_038876900.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120069254 [Benincasa hispida]0.0e+0082.92Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPS---NLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQS
        MAGGRRRTNHAKASDSF+KNK+NS RRRS+  S   NLFVDGGFL DW  Q SPP SAR          EGNS AKGKSGSKS  LDRK+IASSSGTKQ+
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPS---NLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQS

Query:  NGYAIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDD
        +G AIGYEYP APNQE  HSESR LQ DAE  LD+SQPFILLNS +NQIVAYVDENPPL ADNLEFTYDYGTSFVLG+SSHRGLGFHD+DE V  QNTDD
Subjt:  NGYAIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDD

Query:  DSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLES
        DSATQ+E  G LC  SLSS KETG  ERV+ RVGVETANEM+AEASPSNKY     SPRNSGFLSIGGVRLYTQDVSDEG DDDGESSDGSS YSEPLES
Subjt:  DSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLES

Query:  DESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKR-SVVSRDHWS
        DESSE DSSVEMSCSGSDIDDEVA+DYLEG+GGSENIL SKWL+KQEL+ES DDSS+SSLDDTL+KLGGIALQEASKEYG++KTPSR K+ SVVSRDHWS
Subjt:  DESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKR-SVVSRDHWS

Query:  ALALDDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQ
        ALALDD+LVKDTRSTSARKQKN   FA SWPPKA   K+ RKYPGEKKKYRKE IAAKRRERMLNRGVDLM+INLKLEHMVLN+EDM+AFQPMHPRDCSQ
Subjt:  ALALDDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQ

Query:  VRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGS
        VRRLAAIYRLH G QGSGKKRFVTVTRTQHTGMPSASDQVRLE+LIGARDED+DF VAEGSNIKS+G NRSREKK TKV GL+TLE D+S SSKSRSKGS
Subjt:  VRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGS

Query:  AGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLG
        AGKG SQKMTGKKYADQPVSFVS GVMQPE M+EK  ++  D DK KDIVA SEM EMTTS+V NMDISRDSIG+FEL+TKGFGSKMMAKMG+VEGGGLG
Subjt:  AGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLG

Query:  KDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRP
        KDGQGM  PIEVMKRPKSLGLGIEFSEASTSAAGDNQAS +S+AR+GS AL K+KKIGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQGI+NPLLP+RRP
Subjt:  KDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRP

Query:  KARGLGAK
        KARGLGAK
Subjt:  KARGLGAK

TrEMBL top hitse value%identityAlignment
A0A0A0LTU7 Protein SQS10.0e+0098.4Show/hide
Query:  GNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGYAIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLM
        G+ R     EGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGYAIGYEYPSAPNQEDLHSESRVLQNDAER  DDSQPFILLNSKSNQIVAYVDENPPLM
Subjt:  GNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGYAIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLM

Query:  ADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRN
        ADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSATQ+EEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRN
Subjt:  ADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRN

Query:  SGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSL
        SGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDESSE DSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSL
Subjt:  SGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSL

Query:  DDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALALDDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRE
        DDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALALDDMLVK+TRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRE
Subjt:  DDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALALDDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRE

Query:  RMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGS
        RMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGS
Subjt:  RMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGS

Query:  NIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTS
        NIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTS
Subjt:  NIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTS

Query:  NVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFE
        NVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFE
Subjt:  NVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFE

Query:  VHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLGAKS
        VHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLGAKS
Subjt:  VHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLGAKS

A0A1S3B932 uncharacterized protein LOC103487380 isoform X10.0e+0088.33Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY
        MAGGRRRTNHAK+SD FRKNKTNSARRRS+ PSNLFVDGGFL DW  Q SPPISAR          EGNSRAKGKSGSKS TLDRKKIASSSGTKQSNGY
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY

Query:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA
        AIGYEYPSAPNQEDL SESR LQNDAE  LDDSQPFILLNSKSNQIVAYVDENP LMADNLEFTYDYGTSFVLG+SSHRGLGF DDDE VTKQNTDDDSA
Subjt:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA

Query:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES
        TQ+EEQG LCT SLS  KETGTDE+VDGRVGVETANEMVAEASPSNK S+DM SP+NSGFLSIGGVRLYTQDVSDE SDDDGESSDG+SEYSEPLESDES
Subjt:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES

Query:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL
        SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGS+NILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKT SRNKRSVVSRDHWSALAL
Subjt:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL

Query:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL
        DDMLVKD RSTS RKQKNA QFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQI+L+LEHMVLNKEDMYAFQPMHPRDCSQ+   
Subjt:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL

Query:  AAIYRLHNGCQGSG---------KKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKS
          I R  +  Q +          KKRFVTVTRTQHTGMPSASDQVRLE+LIGARDEDNDFSVAEGSNIKSRG +RSREKK TKV GL+TLEFDQSVS+KS
Subjt:  AAIYRLHNGCQGSG---------KKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKS

Query:  RSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVE
        RSKGSAGKGSSQKMTGKKYADQPVSFVS GVMQPE ME + V+D ND DKGKDIVA SEMIEM TSNVNN DISRDSIGSFELHTKGFGSKMMAKMGFVE
Subjt:  RSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVE

Query:  GGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLL
        GGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQAS +SSART S ALEK+K+IGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGI+NPLL
Subjt:  GGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLL

Query:  PVRRPKARGLGAKS
        PVRRPKARGLGAKS
Subjt:  PVRRPKARGLGAKS

A0A1S3B9V9 Protein SQS10.0e+0091.06Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY
        MAGGRRRTNHAK+SD FRKNKTNSARRRS+ PSNLFVDGGFL DW  Q SPPISAR          EGNSRAKGKSGSKS TLDRKKIASSSGTKQSNGY
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY

Query:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA
        AIGYEYPSAPNQEDL SESR LQNDAE  LDDSQPFILLNSKSNQIVAYVDENP LMADNLEFTYDYGTSFVLG+SSHRGLGF DDDE VTKQNTDDDSA
Subjt:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA

Query:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES
        TQ+EEQG LCT SLS  KETGTDE+VDGRVGVETANEMVAEASPSNK S+DM SP+NSGFLSIGGVRLYTQDVSDE SDDDGESSDG+SEYSEPLESDES
Subjt:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES

Query:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL
        SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGS+NILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKT SRNKRSVVSRDHWSALAL
Subjt:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL

Query:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL
        DDMLVKD RSTS RKQKNA QFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQI+L+LEHMVLNKEDMYAFQPMHPRDCSQVRRL
Subjt:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL

Query:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG
        AAIYRLH+GCQGSGKKRFVTVTRTQHTGMPSASDQVRLE+LIGARDEDNDFSVAEGSNIKSRG +RSREKK TKV GL+TLEFDQSVS+KSRSKGSAGKG
Subjt:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG

Query:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
        SSQKMTGKKYADQPVSFVS GVMQPE ME + V+D ND DKGKDIVA SEMIEM TSNVNN DISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
Subjt:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ

Query:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG
        GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQAS +SSART S ALEK+K+IGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGI+NPLLPVRRPKARG
Subjt:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG

Query:  LGAKS
        LGAKS
Subjt:  LGAKS

A0A5A7U8V1 Protein SQS10.0e+0090.81Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY
        MAGGRRRTNHAK+SD FRKNKTNSARRRS+ PSNLFVDGGFL DW  Q SPPISAR          EGNSRAKGKSGSKS TLDRKKIASSSGTKQS+GY
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGY

Query:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA
        AIGYEYPSAPNQEDL SESR LQNDAE  LDDSQPFIL+NSKSNQIVAYVDENP LMADNLEFTYDYGTSFVLG+SSHRGLGF DDDE VTKQNTDDDSA
Subjt:  AIGYEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSA

Query:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES
        TQ+EEQG LCT SLS  KETGTDE+VDGRVGVETANEMVAEASPSNK S+DM SP+NSGFLSIGGVRLYTQDVSDE SDDDGESSDG+SEYSEPLESDES
Subjt:  TQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDES

Query:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL
        SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGS+NILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKT SRNKRSVVSRDHWSALAL
Subjt:  SEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALAL

Query:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL
        DDMLVKD RSTS RKQKNA QFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQI+L+LEHMVLNKEDMYAFQPMHPRDCSQVRRL
Subjt:  DDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRL

Query:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG
        AAIYRLH+GCQGSGKKRFVTVTRTQHTGMPSASDQVRLE+LIGARDEDNDFSVAEGSNIKSRG +RSREKK TK+ GL+TLEFDQSVS+KSRSKGSAGKG
Subjt:  AAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKG

Query:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
        SSQKMTGKKYADQPVSFVS GVMQPE ME + V+D ND DKGKDIVA SEMIEM TSNVNN DISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ
Subjt:  SSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQ

Query:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG
        GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQAS +SSARTGS ALEK+K+IGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGI+NPLLPVRRPKARG
Subjt:  GMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARG

Query:  LGAKS
        LGAKS
Subjt:  LGAKS

A0A6J1KLR0 Protein SQS10.0e+0077.31Show/hide
Query:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPS----NLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQ
        MAGGRRRTNHAK SD FRKNK +S RRRS+  S    NLFVDGGFL DW  Q +PP SAR          EGNSR+KG+SGSKS  LDRKK ASSSGTKQ
Subjt:  MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPS----NLFVDGGFLFDW--QCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQ

Query:  SNGYAIGYEYPSAPNQE-DLHSESRVLQNDAERRLDDSQPFILL--NSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQ
        SNG AIGYEYP AP+QE  LHSESR LQNDA+  LD+SQPFILL  NSK  QIVAYVD+ PPL  D LE TYDYGT F+LG+SSHRGLGFHD+DE V  Q
Subjt:  SNGYAIGYEYPSAPNQE-DLHSESRVLQNDAERRLDDSQPFILL--NSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQ

Query:  NTDDDSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSE
        NTDDDS T +EEQ  LCT SL S KETG+DERV+ R  VE A+EM+AEAS  NKYS    SPRNSGFLSIGGVRLYTQDVSDE SDDDGE S+GSSEYSE
Subjt:  NTDDDSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSE

Query:  PLESDESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRD
        PLESDESSE DSS EM+CSGSDIDDEVAEDYLEG+GG E+ILKSKWLVKQEL ES DD SSSSLDDTL+KL  IALQEASKEYGM KTPSR K  +VSRD
Subjt:  PLESDESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRD

Query:  HWSALALDDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRD
        +WS+LALDD+L+KD+RS SARK+KNA  FA SWPPKA  SKA  KYPGEKKKYRKE IAAKRRERMLNRGVDL QINLKLEHMVLN+EDM++FQPMHPRD
Subjt:  HWSALALDDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRD

Query:  CSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRS
        CSQVRRLAAIYRLH+GCQGSGKKRFVTVTRTQ+TG+PS+SDQVRL +LIGARD+DNDFSVAEG NIKS G NRSREKK  K  GL+ LE +QS SSKSR+
Subjt:  CSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRS

Query:  KGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGG
        KGSAGKGSSQK TGKKYADQPVSFVS GVMQP+++E    N V D DKGKDIV  SEM+E+TTSNV NMDISRDSIG+FE HT GFGSKMMAKMGFVEGG
Subjt:  KGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGG

Query:  GLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPV
        GLGKDGQGM  PIEV+KRPKSLGLG+EFS  ++++AGDNQ S  SS RT + AL K+KK+GAFE HTKGFGSKMMAKMGFVEGMGLGKDSQG++NPLLPV
Subjt:  GLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPV

Query:  RRPKARGLGAK
        RRPKARGLGAK
Subjt:  RRPKARGLGAK

SwissProt top hitse value%identityAlignment
A4UMC5 Tuftelin-interacting protein 111.8e-0733.33Show/hide
Query:  GKKYAD--QPVSFVSSGVMQPESMEEKPVNDVNDADK-GKDIVAVSEMIEMTTSNVNNMDISRD----------SIGSFELHTKGFGSKMMAKMGFVEGG
        GK+  D   PV+F+S+G ++  + EE  ++D  D +K GK      ++         N   S+             GS+E HTKG G K++ KMG+V G 
Subjt:  GKKYAD--QPVSFVSSGVMQPESMEEKPVNDVNDADK-GKDIVAVSEMIEMTTSNVNNMDISRD----------SIGSFELHTKGFGSKMMAKMGFVEGG

Query:  GLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGD
        GLGK+ QG+ +PIE  +R     +G   SE +T +  D
Subjt:  GLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGD

A7SBN6 Zinc finger CCCH-type with G patch domain-containing protein4.6e-0843.82Show/hide
Query:  SIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMTHPIEVMKRP--KSLGLGIEFSE-----------------ASTSAAGDNQASGI
        S+G +E HTKG GSK+MAKMG++ G GLGKDG+G   PIEV+  P  KSL    E  E                 AST++A   +AS +
Subjt:  SIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMTHPIEVMKRP--KSLGLGIEFSE-----------------ASTSAAGDNQASGI

B3MPC0 Zinc finger CCCH-type with G patch domain-containing protein6.0e-0850Show/hide
Query:  TKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLGA
        T+K+GA+E +T+G GSK+M KMG++ G GLG D +GI+ P+     PK R L A
Subjt:  TKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLGA

Q29RR5 Tuftelin-interacting protein 112.3e-0733.58Show/hide
Query:  GKKYAD--QPVSFVSSGV---------MQPESMEEKPV-NDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGG
        GK+  D   PV+F+S+G+         ++    EEKPV  D    D G   +      + +         S    GS+E HTKG G K++ KMG+V G G
Subjt:  GKKYAD--QPVSFVSSGV---------MQPESMEEKPV-NDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGG

Query:  LGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGD
        LGK+ QG+ +PIE  +R     +G   SE +T +  D
Subjt:  LGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGD

Q9ERA6 Tuftelin-interacting protein 111.8e-0732.85Show/hide
Query:  GKKYAD--QPVSFVSSGV---------MQPESMEEKPVNDVN-DADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGG
        GK+  D   PV+F+S+G+          +    EEKPV   +   D G   +      + +    +    S    GS+E HTKG G K++ KMG+V G G
Subjt:  GKKYAD--QPVSFVSSGV---------MQPESMEEKPVNDVN-DADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMAKMGFVEGGG

Query:  LGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGD
        LGK+ QG+ +PIE  +R     +G   SE +T +  D
Subjt:  LGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGD

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain8.1e-0828.51Show/hide
Query:  DNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSSQKM-----TGKKYAD--QPVSFVSSGVMQPE---SMEEKPVNDVN
        DND+        +     R  ++K+TK    N   +     S S S  S G GS +K      +G+K AD  +PV+FVS+G + P      + +  ND  
Subjt:  DNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSSQKM-----TGKKYAD--QPVSFVSSGVMQPE---SMEEKPVNDVN

Query:  DADKGKDIVAVSEMIEMTTS------------NVNNMDISRD------------------------------------------SIGSFELHTKGFGSKM
        D DK +D   + E +E+               N N  D   +                                           IG FE  TKG G K+
Subjt:  DADKGKDIVAVSEMIEMTTS------------NVNNMDISRD------------------------------------------SIGSFELHTKGFGSKM

Query:  MAKMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGI-EFSEA
        + KMG+ +GGGLGK+ QG+  PIE   RPK++G+G  +F EA
Subjt:  MAKMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGI-EFSEA

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein7.6e-0640.96Show/hide
Query:  EFSEASTSAAGDNQASG-ISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGL
        E S +S    G  Q  G + S          T     +E HT+G  SKMMA MG+ EGMGLG   QGI+NP+L    P  R L
Subjt:  EFSEASTSAAGDNQASG-ISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain4.9e-0545.45Show/hide
Query:  EKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLG
        EK  K   FE  + G G K++ KMG+ +G GLGK+ QGI+ P+    RPK  G+G
Subjt:  EKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain4.9e-0545.45Show/hide
Query:  EKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLG
        EK  K   FE  + G G K++ KMG+ +G GLGK+ QGI+ P+    RPK  G+G
Subjt:  EKTKKIGAFEVHTKGFGSKMMAKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLG

AT3G09850.1 D111/G-patch domain-containing protein1.7e-14343.78Show/hide
Query:  GGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDWQCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDR-KKIASSSGTKQSNGYAIG
        G   RTNH  A+ S     ++  R R    + LFV+GG L D++  P  S       + GK           G +S +++R K  AS+SG ++ +G    
Subjt:  GGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDWQCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDR-KKIASSSGTKQSNGYAIG

Query:  YEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMAD-NLEFTYDYGTSFVLGESSHRGLGFHDDDE-------HVTKQNT
        Y+YPS   +E L   S V     E    +  P +L  S++ QIVA++D+ P       + + Y+Y  S+VLG+ SH+GLGF DD +        + K   
Subjt:  YEYPSAPNQEDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMAD-NLEFTYDYGTSFVLGESSHRGLGFHDDDE-------HVTKQNT

Query:  DDDSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYS---
        D   +   EE+G+    S       G D+            E++ +   + K        RNSGF+SIGG++LYT+DVS E SD + E +D   + S   
Subjt:  DDDSATQMEEQGELCTRSLSSGKETGTDERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYS---

Query:  EPLESDESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQEL----LESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRS
            S E SE DSS +M  S S+IDD+VA+DYLEG+GGSE +L + WL +Q L    L S D SSS S D   +KL GI LQ+AS EYG  KT +R+  S
Subjt:  EPLESDESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSENILKSKWLVKQEL----LESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRS

Query:  VVSRDHWSALALDD-MLVKDTRSTSAR--KQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYA
           +   + LA+DD M VKD RS S +  K+K   +F +SWP +A  SK +R +PGE KK+RKE IA KRRERML RGVDL  IN +LE+ VL   DM+ 
Subjt:  VVSRDHWSALALDD-MLVKDTRSTSAR--KQKNALQFARSWPPKASTSKATRKYPGEKKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYA

Query:  FQPMHPRDCSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQ
        FQ MH RDCSQVRRLA +YRL + C GSGKK FVTVTRT  T MPSASD++R+EKLIGA DED DF+V+ G  +K + G+  R+K K       T E   
Subjt:  FQPMHPRDCSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFSVAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQ

Query:  SVSSKSRSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMA
            + R+K S GK SS       YADQPVSFVSSG++  E    K        +K        E+ E T    N  D     IG+FE+HT+GFGSKMMA
Subjt:  SVSSKSRSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNMDISRDSIGSFELHTKGFGSKMMA

Query:  KMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGIEFS----EASTSAAGDNQASGISSARTG-------------SSALEKTKKIGAFEVHTKGFGSKMM
        KMGF++GGGLGKDG+G+  PIE ++RPKSLGLG++FS    + S S+  +N A    S+ +G              S   + K++GAFE HT GFGS+MM
Subjt:  KMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGIEFS----EASTSAAGDNQASGISSARTG-------------SSALEKTKKIGAFEVHTKGFGSKMM

Query:  AKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLGAK
        A+MGFVEG GLG++SQGI+NPL+ VRRP+ARG+GA+
Subjt:  AKMGFVEGMGLGKDSQGIINPLLPVRRPKARGLGAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGAGGAAGACGAAGAACGAACCATGCTAAAGCCAGTGATAGCTTCAGGAAGAACAAGACTAATTCAGCCAGAAGAAGATCGAACACGCCATCAAATTTGTTTGT
TGATGGAGGGTTTTTATTCGATTGGCAGTGTTCGCCCCCCATTTCTGCTCGAGAGGGAAATTCTAGGGCGAAGGGCAAGTCTGAGGGAAATTCTAGGGCGAAGGGCAAGT
CTGGATCAAAATCGGATACCTTAGATCGTAAAAAGATTGCTTCATCAAGTGGGACTAAACAGTCCAATGGTTATGCTATTGGCTACGAATACCCTTCTGCTCCGAATCAG
GAAGATTTGCATTCAGAATCCCGGGTTCTGCAGAATGATGCAGAGCGTCGCTTGGACGACTCGCAGCCTTTTATATTACTTAATTCCAAAAGTAATCAAATTGTGGCATA
TGTAGATGAAAATCCACCATTAATGGCAGATAATTTAGAATTTACTTATGATTATGGTACTAGTTTTGTGTTAGGTGAGAGTTCACATAGAGGACTAGGGTTTCACGATG
ATGATGAACATGTTACAAAGCAGAACACAGATGATGATTCAGCCACACAAATGGAAGAACAAGGAGAATTATGTACTCGTTCATTATCCTCTGGGAAGGAAACTGGTACA
GATGAGAGGGTTGATGGCAGGGTAGGAGTTGAAACGGCTAATGAGATGGTAGCTGAAGCGTCACCTTCCAACAAATATTCAGAAGACATGTCATCTCCAAGAAATTCAGG
TTTCTTGTCAATTGGTGGTGTGAGATTATACACCCAAGATGTGTCTGATGAAGGAAGTGATGATGATGGGGAGTCATCTGATGGAAGTTCCGAGTACTCTGAGCCTTTAG
AGTCAGATGAATCGTCTGAAGGTGATAGCTCTGTGGAAATGTCTTGCAGTGGTTCAGATATTGATGACGAGGTCGCTGAAGATTATCTTGAAGGGGTTGGTGGAAGTGAG
AATATTTTAAAATCCAAATGGTTAGTAAAACAAGAGTTGCTAGAGTCTGGTGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAAAGAAATTAGGCGGTATTGCTTT
GCAGGAAGCATCAAAAGAATATGGAATGATAAAAACTCCTTCAAGAAATAAAAGATCCGTTGTTTCTAGAGATCATTGGTCAGCACTGGCTCTAGATGACATGCTAGTAA
AAGATACTAGATCTACATCAGCTAGAAAGCAAAAGAATGCTTTGCAGTTTGCTCGTTCTTGGCCTCCAAAAGCTTCAACAAGTAAAGCTACTAGAAAGTATCCCGGTGAA
AAGAAGAAATATCGTAAAGAAGCAATTGCAGCAAAGCGTAGAGAAAGAATGCTTAATCGGGGTGTTGATCTAATGCAAATAAATCTGAAATTGGAGCACATGGTTCTCAA
TAAAGAAGATATGTACGCTTTCCAACCTATGCATCCCCGTGATTGTTCTCAGGTACGACGATTGGCAGCAATTTACCGCTTGCATAATGGATGCCAAGGTTCTGGTAAAA
AAAGGTTTGTAACGGTAACTCGAACACAGCACACAGGAATGCCATCAGCAAGTGATCAAGTTCGCCTTGAAAAGCTAATCGGGGCAAGAGACGAGGATAATGACTTTTCG
GTTGCTGAAGGCTCAAACATAAAATCACGAGGCGGCAACAGAAGCAGAGAAAAGAAAAAAACCAAAGTTCATGGTTTGAACACATTGGAATTTGATCAATCTGTAAGCAG
TAAGTCGAGGTCAAAGGGTTCTGCAGGTAAGGGGTCGAGTCAGAAAATGACTGGCAAAAAATATGCTGATCAACCAGTCTCATTTGTATCAAGTGGAGTAATGCAACCAG
AATCAATGGAAGAGAAACCGGTCAATGATGTTAATGATGCAGACAAAGGTAAGGACATTGTAGCGGTATCCGAAATGATTGAGATGACCACTAGTAATGTTAATAATATG
GATATAAGTAGGGATAGTATTGGTTCATTTGAGCTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGAAGGCGGAGGATTAGGGAAAGATGG
TCAAGGAATGACTCATCCCATTGAAGTGATGAAAAGACCTAAATCACTAGGGTTAGGCATCGAGTTCTCCGAGGCCTCAACATCTGCTGCTGGTGATAATCAGGCAAGTG
GGATATCGTCTGCTAGAACGGGGTCCAGTGCTTTGGAAAAAACTAAAAAAATTGGTGCTTTTGAAGTGCACACTAAAGGATTTGGGTCGAAGATGATGGCAAAGATGGGA
TTTGTTGAAGGCATGGGATTGGGAAAAGATTCACAAGGGATAATCAACCCCTTGCTTCCTGTTAGGCGGCCTAAAGCTCGAGGATTGGGAGCCAAAAGTTAG
mRNA sequenceShow/hide mRNA sequence
GCCACCGCAAGGTTCCATTTAAGAGTTCGGGAATTGGGTTTACGGCCTATCTTGCACTGGTGTACGATGGCTGGAGGAAGACGAAGAACGAACCATGCTAAAGCCAGTGA
TAGCTTCAGGAAGAACAAGACTAATTCAGCCAGAAGAAGATCGAACACGCCATCAAATTTGTTTGTTGATGGAGGGTTTTTATTCGATTGGCAGTGTTCGCCCCCCATTT
CTGCTCGAGAGGGAAATTCTAGGGCGAAGGGCAAGTCTGAGGGAAATTCTAGGGCGAAGGGCAAGTCTGGATCAAAATCGGATACCTTAGATCGTAAAAAGATTGCTTCA
TCAAGTGGGACTAAACAGTCCAATGGTTATGCTATTGGCTACGAATACCCTTCTGCTCCGAATCAGGAAGATTTGCATTCAGAATCCCGGGTTCTGCAGAATGATGCAGA
GCGTCGCTTGGACGACTCGCAGCCTTTTATATTACTTAATTCCAAAAGTAATCAAATTGTGGCATATGTAGATGAAAATCCACCATTAATGGCAGATAATTTAGAATTTA
CTTATGATTATGGTACTAGTTTTGTGTTAGGTGAGAGTTCACATAGAGGACTAGGGTTTCACGATGATGATGAACATGTTACAAAGCAGAACACAGATGATGATTCAGCC
ACACAAATGGAAGAACAAGGAGAATTATGTACTCGTTCATTATCCTCTGGGAAGGAAACTGGTACAGATGAGAGGGTTGATGGCAGGGTAGGAGTTGAAACGGCTAATGA
GATGGTAGCTGAAGCGTCACCTTCCAACAAATATTCAGAAGACATGTCATCTCCAAGAAATTCAGGTTTCTTGTCAATTGGTGGTGTGAGATTATACACCCAAGATGTGT
CTGATGAAGGAAGTGATGATGATGGGGAGTCATCTGATGGAAGTTCCGAGTACTCTGAGCCTTTAGAGTCAGATGAATCGTCTGAAGGTGATAGCTCTGTGGAAATGTCT
TGCAGTGGTTCAGATATTGATGACGAGGTCGCTGAAGATTATCTTGAAGGGGTTGGTGGAAGTGAGAATATTTTAAAATCCAAATGGTTAGTAAAACAAGAGTTGCTAGA
GTCTGGTGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAAAGAAATTAGGCGGTATTGCTTTGCAGGAAGCATCAAAAGAATATGGAATGATAAAAACTCCTTCAA
GAAATAAAAGATCCGTTGTTTCTAGAGATCATTGGTCAGCACTGGCTCTAGATGACATGCTAGTAAAAGATACTAGATCTACATCAGCTAGAAAGCAAAAGAATGCTTTG
CAGTTTGCTCGTTCTTGGCCTCCAAAAGCTTCAACAAGTAAAGCTACTAGAAAGTATCCCGGTGAAAAGAAGAAATATCGTAAAGAAGCAATTGCAGCAAAGCGTAGAGA
AAGAATGCTTAATCGGGGTGTTGATCTAATGCAAATAAATCTGAAATTGGAGCACATGGTTCTCAATAAAGAAGATATGTACGCTTTCCAACCTATGCATCCCCGTGATT
GTTCTCAGGTACGACGATTGGCAGCAATTTACCGCTTGCATAATGGATGCCAAGGTTCTGGTAAAAAAAGGTTTGTAACGGTAACTCGAACACAGCACACAGGAATGCCA
TCAGCAAGTGATCAAGTTCGCCTTGAAAAGCTAATCGGGGCAAGAGACGAGGATAATGACTTTTCGGTTGCTGAAGGCTCAAACATAAAATCACGAGGCGGCAACAGAAG
CAGAGAAAAGAAAAAAACCAAAGTTCATGGTTTGAACACATTGGAATTTGATCAATCTGTAAGCAGTAAGTCGAGGTCAAAGGGTTCTGCAGGTAAGGGGTCGAGTCAGA
AAATGACTGGCAAAAAATATGCTGATCAACCAGTCTCATTTGTATCAAGTGGAGTAATGCAACCAGAATCAATGGAAGAGAAACCGGTCAATGATGTTAATGATGCAGAC
AAAGGTAAGGACATTGTAGCGGTATCCGAAATGATTGAGATGACCACTAGTAATGTTAATAATATGGATATAAGTAGGGATAGTATTGGTTCATTTGAGCTGCACACCAA
GGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGAAGGCGGAGGATTAGGGAAAGATGGTCAAGGAATGACTCATCCCATTGAAGTGATGAAAAGACCTAAAT
CACTAGGGTTAGGCATCGAGTTCTCCGAGGCCTCAACATCTGCTGCTGGTGATAATCAGGCAAGTGGGATATCGTCTGCTAGAACGGGGTCCAGTGCTTTGGAAAAAACT
AAAAAAATTGGTGCTTTTGAAGTGCACACTAAAGGATTTGGGTCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTGGGAAAAGATTCACAAGGGATAAT
CAACCCCTTGCTTCCTGTTAGGCGGCCTAAAGCTCGAGGATTGGGAGCCAAAAGTTAGATATGCAAGAATAGATGTTCGTTTCATCAGCTAATACATTATTCTCTTAGAT
TCCTCTCATTCTGTGTAACAACACTGTAAGTGCAGTCTACTCCTGCTATTTTCGTTGACTCTTCCTCTTAAATTAGTGATAATAAAAAGACAAACTTTCTTTTATTTACT
ATTAAGAATAATTCCTCTCATTCT
Protein sequenceShow/hide protein sequence
MAGGRRRTNHAKASDSFRKNKTNSARRRSNTPSNLFVDGGFLFDWQCSPPISAREGNSRAKGKSEGNSRAKGKSGSKSDTLDRKKIASSSGTKQSNGYAIGYEYPSAPNQ
EDLHSESRVLQNDAERRLDDSQPFILLNSKSNQIVAYVDENPPLMADNLEFTYDYGTSFVLGESSHRGLGFHDDDEHVTKQNTDDDSATQMEEQGELCTRSLSSGKETGT
DERVDGRVGVETANEMVAEASPSNKYSEDMSSPRNSGFLSIGGVRLYTQDVSDEGSDDDGESSDGSSEYSEPLESDESSEGDSSVEMSCSGSDIDDEVAEDYLEGVGGSE
NILKSKWLVKQELLESGDDSSSSSLDDTLKKLGGIALQEASKEYGMIKTPSRNKRSVVSRDHWSALALDDMLVKDTRSTSARKQKNALQFARSWPPKASTSKATRKYPGE
KKKYRKEAIAAKRRERMLNRGVDLMQINLKLEHMVLNKEDMYAFQPMHPRDCSQVRRLAAIYRLHNGCQGSGKKRFVTVTRTQHTGMPSASDQVRLEKLIGARDEDNDFS
VAEGSNIKSRGGNRSREKKKTKVHGLNTLEFDQSVSSKSRSKGSAGKGSSQKMTGKKYADQPVSFVSSGVMQPESMEEKPVNDVNDADKGKDIVAVSEMIEMTTSNVNNM
DISRDSIGSFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMTHPIEVMKRPKSLGLGIEFSEASTSAAGDNQASGISSARTGSSALEKTKKIGAFEVHTKGFGSKMMAKMG
FVEGMGLGKDSQGIINPLLPVRRPKARGLGAKS