; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G12970 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G12970
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein SQS1
Genome locationClcChr06:23887512..23893339
RNA-Seq ExpressionClc06G12970
SyntenyClc06G12970
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050035.1 Zinc finger protein, putative isoform 1 [Cucumis melo var. makuwa]0.0e+0085.77Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD
        EGNSR+KGKSGSKSA LDRKKIASSSGTKQS+G AIGYEYP APNQE L SESRGLQNDAECSLD+SQPFIL+N K+NQIVAYVDENP L ADNLEFTYD
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD

Query:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV
        YGTSFVLGDSSHRGLGF D+DELV  QNTDDDSATQV EQGGLC GSLS  KE GTDEKVD RVGVETANE +AEASPSNK SD + SP+NSGFLSIGGV
Subjt:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV

Query:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG
        RLYTQDVSDEE DDD ESSDG+SEYSEPLESD SS  DSSVEMSC  SDIDDEVAEDYLEG+GGS+NILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGG
Subjt:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG

Query:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL
        I LQEASKEYG++KT SR K SVVSRDHWSALALDD+LVKD RSTS RKQKNASQFARSWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDL
Subjt:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL

Query:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR
        MQI+L+LEHMVL +EDM+AFQPMHPRDCSQVRRLAAIYR HSGCQGSGKKRFVTVTRTQ+TGMPSASDQVRLEQLIGA DEDNDFSVAEGSNIKS+G +R
Subjt:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR

Query:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR
        SREKKNTK+RGLH LE D+S S+K RSKGSAGK SSQKMTGKKYADQPVSFVSCGVMQPE M+     +  DVDK KD+VA SEMIE+  S+V N DISR
Subjt:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR

Query:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK
        D IG+FELHTKGFGSKMMAKMGFVEGGGLGKDGQGM  PIEVMKRPKSLGLGIEFSEASTSAA DNQAS  S+ARTGSGAL KSK+IGAFE HTKGFGSK
Subjt:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK

Query:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        MMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGAK
Subjt:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_008443898.1 PREDICTED: uncharacterized protein LOC103487380 isoform X2 [Cucumis melo]0.0e+0086.04Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD
        EGNSR+KGKSGSKSA LDRKKIASSSGTKQSNG AIGYEYP APNQE L SESRGLQNDAECSLD+SQPFILLN K+NQIVAYVDENP L ADNLEFTYD
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD

Query:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV
        YGTSFVLGDSSHRGLGF D+DELV  QNTDDDSATQV EQGGLC GSLS  KE GTDEKVD RVGVETANE +AEASPSNK SD + SP+NSGFLSIGGV
Subjt:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV

Query:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG
        RLYTQDVSDEE DDD ESSDG+SEYSEPLESD SS  DSSVEMSC  SDIDDEVAEDYLEG+GGS+NILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGG
Subjt:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG

Query:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL
        I LQEASKEYG++KT SR K SVVSRDHWSALALDD+LVKD RSTS RKQKNASQFARSWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDL
Subjt:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL

Query:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR
        MQI+L+LEHMVL +EDM+AFQPMHPRDCSQVRRLAAIYR HSGCQGSGKKRFVTVTRTQ+TGMPSASDQVRLEQLIGA DEDNDFSVAEGSNIKS+G +R
Subjt:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR

Query:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR
        SREKKNTKVRGLH LE D+S S+K RSKGSAGK SSQKMTGKKYADQPVSFVSCGVMQPE M+     +  DVDK KD+VA SEMIE+  S+V N DISR
Subjt:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR

Query:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK
        D IG+FELHTKGFGSKMMAKMGFVEGGGLGKDGQGM  PIEVMKRPKSLGLGIEFSEASTSAA DNQAS  S+ART SGAL KSK+IGAFE HTKGFGSK
Subjt:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK

Query:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        MMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGAK
Subjt:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_011655819.1 uncharacterized protein LOC101215133 [Cucumis sativus]0.0e+0085.64Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD
        EGNSR+KGKSGSKS  LDRKKIASSSGTKQSNG AIGYEYP APNQE LHSESR LQNDAE   D+SQPFILLN K+NQIVAYVDENPPL ADNLEFTYD
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD

Query:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV
        YGTSFVLG+SSHRGLGFHD+DE V  QNTDDDSATQV EQG LC  SLSSGKE GTDE+VD RVGVETANE +AEASPSNKYS+ + SPRNSGFLSIGGV
Subjt:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV

Query:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG
        RLYTQDVSDE  DDD ESSDGSSEYSEPLESD SS  DSSVEMSC  SDIDDEVAEDYLEG+GGSENILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGG
Subjt:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG

Query:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL
        I LQEASKEYG++KTPSR K SVVSRDHWSALALDD+LVK++RSTSARKQKNA QFARSWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDL
Subjt:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL

Query:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR
        MQINLKLEHMVL +EDM+AFQPMHPRDCSQVRRLAAIYR H+GCQGSGKKRFVTVTRTQ+TGMPSASDQVRLE+LIGA DEDNDFSVAEGSNIKS+GGNR
Subjt:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR

Query:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR
        SREKK TKV GL+ LE D+S SSK RSKGSAGK SSQKMTGKKYADQPVSFVS GVMQPE M+E   ++V D DK KD+VA SEMIE+  S+V NMDISR
Subjt:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR

Query:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK
        D IG+FELHTKGFGSKMMAKMGFVEGGGLGKDGQGM  PIEVMKRPKSLGLGIEFSEASTSAA DNQAS  S+ARTGS AL K+KKIGAFE HTKGFGSK
Subjt:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK

Query:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        MMAKMGFVEGMGLGKDSQG++NPLLPVRRPKARGLGAK
Subjt:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0082.61Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQE-SLHSESRGLQNDAECSLDNSQPFILL--NCKNNQIVAYVDENPPLKADNLEF
        EGNSRSKG+SGSKSA+LDRKK ASSSGTKQSNGNAIGYEYPPAP+QE  LHSESRGLQNDA+CSLDNSQPFILL  N K+ QIVAYVD+ PPLK D LE 
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQE-SLHSESRGLQNDAECSLDNSQPFILL--NCKNNQIVAYVDENPPLKADNLEF

Query:  TYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSI
        TYDYGT F+LGDSSHRGLGFHDEDELV+NQNTDDDS T V EQ GLC GSL S KE G+DE+V+CR  VE A+E LAEAS  NKYS G CSPRNSGFLSI
Subjt:  TYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSI

Query:  GGVRLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEK
        GGVRLYTQDVSDEE DDD E S+GSSEYSEPLESD SS SDSS EM+C  SDIDDEVAEDYLEGIGG E+ILKSKWLVKQEL ESDDD SSSSLDDTLEK
Subjt:  GGVRLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEK

Query:  LGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRG
        L  I LQEASKEYG+ KTPSR KS +VSRD+WS+LALDDLL+KDSRS SARK+KNA+ FA SWPPKAP SKA  KYPGEKKKYRKETIAAKRRERMLNRG
Subjt:  LGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRG

Query:  VDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQG
        VDL QINLKLEHMVL REDMF+FQPMHPRDCSQVRRLAAIYR HSGCQGSGKKRFVTVTRTQYTG+PS+SDQVRL QLIGA D+DNDFSVAEG NIKS G
Subjt:  VDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQG

Query:  GNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMD
         NRSREKKN K  GL +LEL++S SSK R+KGSAGK SSQK TGKKYADQPVSFVSCGVMQP+ + EITTSNVKDVDK KD+V ASEM+E+  S+VKNMD
Subjt:  GNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMD

Query:  ISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGF
        ISRD IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA PIEV+KRPKSLGLG+EFS  ++++A DNQ SR S+ RT +GALGKSKK+GAFEEHTKGF
Subjt:  ISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGF

Query:  GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG
        GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG
Subjt:  GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG

XP_038876900.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120069254 [Benincasa hispida]0.0e+0089.73Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD
        EGNS +KGKSGSKSANLDRK+IASSSGTKQ++GNAIGYEYPPAPNQES HSESRGLQ DAECSLDNSQPFILLN  NNQIVAYVDENPPLKADNLEFTYD
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD

Query:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV
        YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQV  +GGLCAGSLSS KE G  E+V+CRVGVETANE LAEASPSNKY     SPRNSGFLSIGGV
Subjt:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV

Query:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG
        RLYTQDVSDE GDDD ESSDGSS YSEPLESD SS SDSSVEMSC  SDIDDEVA+DYLEGIGGSENIL SKWL+KQELVESDDDSS+SSLDDTLEKLGG
Subjt:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG

Query:  ITLQEASKEYGIMKTPSR-KKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVD
        I LQEASKEYGIMKTPSR KKSSVVSRDHWSALALDDLLVKD+RSTSARKQKN + FA SWPPKAP  K+ARKYPGEKKKYRKETIAAKRRERMLNRGVD
Subjt:  ITLQEASKEYGIMKTPSR-KKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVD

Query:  LMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGN
        LM+INLKLEHMVL REDMFAFQPMHPRDCSQVRRLAAIYR H G QGSGKKRFVTVTRTQ+TGMPSASDQVRLEQLIGA DED+DF VAEGSNIKSQG N
Subjt:  LMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGN

Query:  RSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDIS
        RSREKKNTKV GLH LELDKSASSK RSKGSAGK  SQKMTGKKYADQPVSFVSCGVMQPEPMDE TTSN +DVDKSKD+VAASEM E+  SDVKNMDIS
Subjt:  RSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDIS

Query:  RDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGS
        RD IGAFEL+TKGFGSKMMAKMG+VEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAA DNQASR SAAR+GSGALGKSKKIGAFEEHTKGFGS
Subjt:  RDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGS

Query:  KMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG
        KMMAKMGFVEGMGLGKDSQG+VNPLLP+RRPKARGLGAKG
Subjt:  KMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG

TrEMBL top hitse value%identityAlignment
A0A0A0LTU7 Protein SQS10.0e+0081.55Show/hide
Query:  MASGTTRAIPVEEDQTRRH-CLWMAGFYLIGSFGYRRRVLHEVRFSLSFSSTSESLHLLNHTGIEGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAI
        +ASG TR I  EED+TR   CL M GFY IGS   R   L                       +EGNSR+KGKSGSKS  LDRKKIASSSGTKQSNG AI
Subjt:  MASGTTRAIPVEEDQTRRH-CLWMAGFYLIGSFGYRRRVLHEVRFSLSFSSTSESLHLLNHTGIEGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAI

Query:  GYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQ
        GYEYP APNQE LHSESR LQNDAE   D+SQPFILLN K+NQIVAYVDENPPL ADNLEFTYDYGTSFVLG+SSHRGLGFHD+DE V  QNTDDDSATQ
Subjt:  GYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQ

Query:  VGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGVRLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSG
        V EQG LC  SLSSGKE GTDE+VD RVGVETANE +AEASPSNKYS+ + SPRNSGFLSIGGVRLYTQDVSDE  DDD ESSDGSSEYSEPLESD SS 
Subjt:  VGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGVRLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSG

Query:  SDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDD
         DSSVEMSC  SDIDDEVAEDYLEG+GGSENILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGGI LQEASKEYG++KTPSR K SVVSRDHWSALALDD
Subjt:  SDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDD

Query:  LLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAA
        +LVK++RSTSARKQKNA QFARSWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDLMQINLKLEHMVL +EDM+AFQPMHPRDCSQVRRLAA
Subjt:  LLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAA

Query:  IYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSS
        IYR H+GCQGSGKKRFVTVTRTQ+TGMPSASDQVRLE+LIGA DEDNDFSVAEGSNIKS+GGNRSREKK TKV GL+ LE D+S SSK RSKGSAGK SS
Subjt:  IYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSS

Query:  QKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGM
        QKMTGKKYADQPVSFVS GVMQPE M+E   ++V D DK KD+VA SEMIE+  S+V NMDISRD IG+FELHTKGFGSKMMAKMGFVEGGGLGKDGQGM
Subjt:  QKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGM

Query:  ARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG
          PIEVMKRPKSLGLGIEFSEASTSAA DNQAS  S+ARTGS AL K+KKIGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQG++NPLLPVRRPKARGLG
Subjt:  ARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLG

Query:  AK
        AK
Subjt:  AK

A0A1S3B932 uncharacterized protein LOC103487380 isoform X10.0e+0083.13Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD
        EGNSR+KGKSGSKSA LDRKKIASSSGTKQSNG AIGYEYP APNQE L SESRGLQNDAECSLD+SQPFILLN K+NQIVAYVDENP L ADNLEFTYD
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD

Query:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV
        YGTSFVLGDSSHRGLGF D+DELV  QNTDDDSATQV EQGGLC GSLS  KE GTDEKVD RVGVETANE +AEASPSNK SD + SP+NSGFLSIGGV
Subjt:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV

Query:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG
        RLYTQDVSDEE DDD ESSDG+SEYSEPLESD SS  DSSVEMSC  SDIDDEVAEDYLEG+GGS+NILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGG
Subjt:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG

Query:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL
        I LQEASKEYG++KT SR K SVVSRDHWSALALDD+LVKD RSTS RKQKNASQFARSWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDL
Subjt:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL

Query:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSG---------KKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGS
        MQI+L+LEHMVL +EDM+AFQPMHPRDCSQ+     I R+    Q +          KKRFVTVTRTQ+TGMPSASDQVRLEQLIGA DEDNDFSVAEGS
Subjt:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSG---------KKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGS

Query:  NIKSQGGNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANS
        NIKS+G +RSREKKNTKVRGLH LE D+S S+K RSKGSAGK SSQKMTGKKYADQPVSFVSCGVMQPE M+     +  DVDK KD+VA SEMIE+  S
Subjt:  NIKSQGGNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANS

Query:  DVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFE
        +V N DISRD IG+FELHTKGFGSKMMAKMGFVEGGGLGKDGQGM  PIEVMKRPKSLGLGIEFSEASTSAA DNQAS  S+ART SGAL KSK+IGAFE
Subjt:  DVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFE

Query:  EHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
         HTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGAK
Subjt:  EHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

A0A1S3B9V9 Protein SQS10.0e+0086.04Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD
        EGNSR+KGKSGSKSA LDRKKIASSSGTKQSNG AIGYEYP APNQE L SESRGLQNDAECSLD+SQPFILLN K+NQIVAYVDENP L ADNLEFTYD
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD

Query:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV
        YGTSFVLGDSSHRGLGF D+DELV  QNTDDDSATQV EQGGLC GSLS  KE GTDEKVD RVGVETANE +AEASPSNK SD + SP+NSGFLSIGGV
Subjt:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV

Query:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG
        RLYTQDVSDEE DDD ESSDG+SEYSEPLESD SS  DSSVEMSC  SDIDDEVAEDYLEG+GGS+NILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGG
Subjt:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG

Query:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL
        I LQEASKEYG++KT SR K SVVSRDHWSALALDD+LVKD RSTS RKQKNASQFARSWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDL
Subjt:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL

Query:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR
        MQI+L+LEHMVL +EDM+AFQPMHPRDCSQVRRLAAIYR HSGCQGSGKKRFVTVTRTQ+TGMPSASDQVRLEQLIGA DEDNDFSVAEGSNIKS+G +R
Subjt:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR

Query:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR
        SREKKNTKVRGLH LE D+S S+K RSKGSAGK SSQKMTGKKYADQPVSFVSCGVMQPE M+     +  DVDK KD+VA SEMIE+  S+V N DISR
Subjt:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR

Query:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK
        D IG+FELHTKGFGSKMMAKMGFVEGGGLGKDGQGM  PIEVMKRPKSLGLGIEFSEASTSAA DNQAS  S+ART SGAL KSK+IGAFE HTKGFGSK
Subjt:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK

Query:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        MMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGAK
Subjt:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

A0A5A7U8V1 Protein SQS10.0e+0085.77Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD
        EGNSR+KGKSGSKSA LDRKKIASSSGTKQS+G AIGYEYP APNQE L SESRGLQNDAECSLD+SQPFIL+N K+NQIVAYVDENP L ADNLEFTYD
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYD

Query:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV
        YGTSFVLGDSSHRGLGF D+DELV  QNTDDDSATQV EQGGLC GSLS  KE GTDEKVD RVGVETANE +AEASPSNK SD + SP+NSGFLSIGGV
Subjt:  YGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGV

Query:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG
        RLYTQDVSDEE DDD ESSDG+SEYSEPLESD SS  DSSVEMSC  SDIDDEVAEDYLEG+GGS+NILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGG
Subjt:  RLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGG

Query:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL
        I LQEASKEYG++KT SR K SVVSRDHWSALALDD+LVKD RSTS RKQKNASQFARSWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDL
Subjt:  ITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDL

Query:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR
        MQI+L+LEHMVL +EDM+AFQPMHPRDCSQVRRLAAIYR HSGCQGSGKKRFVTVTRTQ+TGMPSASDQVRLEQLIGA DEDNDFSVAEGSNIKS+G +R
Subjt:  MQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNR

Query:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR
        SREKKNTK+RGLH LE D+S S+K RSKGSAGK SSQKMTGKKYADQPVSFVSCGVMQPE M+     +  DVDK KD+VA SEMIE+  S+V N DISR
Subjt:  SREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMDISR

Query:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK
        D IG+FELHTKGFGSKMMAKMGFVEGGGLGKDGQGM  PIEVMKRPKSLGLGIEFSEASTSAA DNQAS  S+ARTGSGAL KSK+IGAFE HTKGFGSK
Subjt:  DRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGFGSK

Query:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        MMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGAK
Subjt:  MMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

A0A6J1KLR0 Protein SQS10.0e+0082.61Show/hide
Query:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQE-SLHSESRGLQNDAECSLDNSQPFILL--NCKNNQIVAYVDENPPLKADNLEF
        EGNSRSKG+SGSKSA+LDRKK ASSSGTKQSNGNAIGYEYPPAP+QE  LHSESRGLQNDA+CSLDNSQPFILL  N K+ QIVAYVD+ PPLK D LE 
Subjt:  EGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPPAPNQE-SLHSESRGLQNDAECSLDNSQPFILL--NCKNNQIVAYVDENPPLKADNLEF

Query:  TYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSI
        TYDYGT F+LGDSSHRGLGFHDEDELV+NQNTDDDS T V EQ GLC GSL S KE G+DE+V+CR  VE A+E LAEAS  NKYS G CSPRNSGFLSI
Subjt:  TYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSI

Query:  GGVRLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEK
        GGVRLYTQDVSDEE DDD E S+GSSEYSEPLESD SS SDSS EM+C  SDIDDEVAEDYLEGIGG E+ILKSKWLVKQEL ESDDD SSSSLDDTLEK
Subjt:  GGVRLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEK

Query:  LGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRG
        L  I LQEASKEYG+ KTPSR KS +VSRD+WS+LALDDLL+KDSRS SARK+KNA+ FA SWPPKAP SKA  KYPGEKKKYRKETIAAKRRERMLNRG
Subjt:  LGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRG

Query:  VDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQG
        VDL QINLKLEHMVL REDMF+FQPMHPRDCSQVRRLAAIYR HSGCQGSGKKRFVTVTRTQYTG+PS+SDQVRL QLIGA D+DNDFSVAEG NIKS G
Subjt:  VDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQG

Query:  GNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMD
         NRSREKKN K  GL +LEL++S SSK R+KGSAGK SSQK TGKKYADQPVSFVSCGVMQP+ + EITTSNVKDVDK KD+V ASEM+E+  S+VKNMD
Subjt:  GNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDVKNMD

Query:  ISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGF
        ISRD IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA PIEV+KRPKSLGLG+EFS  ++++A DNQ SR S+ RT +GALGKSKK+GAFEEHTKGF
Subjt:  ISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGSAARTGSGALGKSKKIGAFEEHTKGF

Query:  GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG
        GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG
Subjt:  GSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG

SwissProt top hitse value%identityAlignment
A1XD94 Tuftelin-interacting protein 118.7e-0745.16Show/hide
Query:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD
        G++E HTKG G K++ KMG+V G GLGK+ QG+  PIE  +R     +G   SE +T +  D
Subjt:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD

A1XD95 Tuftelin-interacting protein 118.7e-0745.16Show/hide
Query:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD
        G++E HTKG G K++ KMG+V G GLGK+ QG+  PIE  +R     +G   SE +T +  D
Subjt:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD

A1XD97 Tuftelin-interacting protein 118.7e-0745.16Show/hide
Query:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD
        G++E HTKG G K++ KMG+V G GLGK+ QG+  PIE  +R     +G   SE +T +  D
Subjt:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD

A4UMC6 Tuftelin-interacting protein 116.7e-0745.16Show/hide
Query:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD
        G++E HTKG G K++ KMG+V G GLGK+ QG+  PIE  +R     +G   SE +T +  D
Subjt:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD

Q9UBB9 Tuftelin-interacting protein 118.7e-0745.16Show/hide
Query:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD
        G++E HTKG G K++ KMG+V G GLGK+ QG+  PIE  +R     +G   SE +T +  D
Subjt:  GAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADD

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain1.1e-0739.39Show/hide
Query:  IGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGI-EFSEASTS-------------AADDNQASRGSAARTGSGALGKSKKI
        IG FE  TKG G K++ KMG+ +GGGLGK+ QG+  PIE   RPK++G+G  +F EA                +  +N+ S G     G   L K KK+
Subjt:  IGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGI-EFSEASTS-------------AADDNQASRGSAARTGSGALGKSKKI

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein6.4e-0538.1Show/hide
Query:  SEASTSAADDNQASRG----SAARTGSGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        SE S S+  D  + +G     +     G    +     +E HT+G  SKMMA MG+ EGMGLG   QG++NP+L    P  R L
Subjt:  SEASTSAADDNQASRG----SAARTGSGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.9e-0530.97Show/hide
Query:  KKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKSKDVVAASEMIEIANSDVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARP
        K+   +PV F S G      +D++   N  +  +D++ D +    + +   +D  ++    ++   FE  + G G K++ KMG+ +G GLGK+ QG+  P
Subjt:  KKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKSKDVVAASEMIEIANSDVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARP

Query:  IEVMKRPKSLGLG
        IEV  RPK++G+G
Subjt:  IEVMKRPKSLGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.9e-0530.97Show/hide
Query:  KKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKSKDVVAASEMIEIANSDVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARP
        K+   +PV F S G      +D++   N  +  +D++ D +    + +   +D  ++    ++   FE  + G G K++ KMG+ +G GLGK+ QG+  P
Subjt:  KKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKSKDVVAASEMIEIANSDVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARP

Query:  IEVMKRPKSLGLG
        IEV  RPK++G+G
Subjt:  IEVMKRPKSLGLG

AT3G09850.1 D111/G-patch domain-containing protein1.1e-14246.53Show/hide
Query:  SKGKSGSKSANLDR-KKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENP-PLKADNLEFTYDYGT
        S GK G +S +++R K  AS+SG ++ +GN   Y+YP    +E L   S G+++D      N  P +L   +  QIVA++D+ P   K   + + Y+Y  
Subjt:  SKGKSGSKSANLDR-KKIASSSGTKQSNGNAIGYEYPPAPNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENP-PLKADNLEFTYDYGT

Query:  SFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGVRLY
        S+VLGD SH+GLGF D+ +   + +     A  +G+QG    GS S  +E    + VD   G +  +E + +   + K        RNSGF+SIGG++LY
Subjt:  SFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGSLSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGVRLY

Query:  TQDVSDEEGDDDRESSDGSSEYSEPLESDGS---SGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQEL----VESDDDSSSSSLDDTLE
        T+DVS EE D + E +D   + S    S GS   S SDSS +M    S+IDD+VA+DYLEGIGGSE +L + WL +Q L    + SDD SSS S D    
Subjt:  TQDVSDEEGDDDRESSDGSSEYSEPLESDGS---SGSDSSVEMSCGESDIDDEVAEDYLEGIGGSENILKSKWLVKQEL----VESDDDSSSSSLDDTLE

Query:  KLGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLL-VKDSRSTSAR--KQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERM
        KL GI LQ+AS EYG  KT     S        + LA+DDL+ VKD RS S +  K+K  ++F +SWP +AP SK +R +PGE KK+RKE IA KRRERM
Subjt:  KLGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLL-VKDSRSTSAR--KQKNASQFARSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERM

Query:  LNRGVDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNI
        L RGVDL  IN +LE+ VL   DM  FQ MH RDCSQVRRLA +YR  S C GSGKK FVTVTRT  T MPSASD++R+E+LIGA DED DF+V+ G   
Subjt:  LNRGVDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTGMPSASDQVRLEQLIGATDEDNDFSVAEGSNI

Query:  KSQGGNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDV
        KS   +R + K + K R         +   + R+K S GK SS       YADQPVSFVS G++  E            V K+     A E+ E      
Subjt:  KSQGGNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKSKDVVAASEMIEIANSDV

Query:  KNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFS----EASTSAADDNQASRGSAARTG---------SGA
           D     IGAFE+HT+GFGSKMMAKMGF++GGGLGKDG+G+A+PIE ++RPKSLGLG++FS    + S S+ ++N A R  ++ +G         +GA
Subjt:  KNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFS----EASTSAADDNQASRGSAARTG---------SGA

Query:  LG----KSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG
         G    + K++GAFE+HT GFGS+MMA+MGFVEG GLG++SQG+VNPL+ VRRP+ARG+GA+G
Subjt:  LG----KSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAGGAACAACAAGAGCAATTCCGGTAGAAGAAGATCAGACCCGTCGTCACTGTTTGTGGATGGCGGGTTTTTATCTGATTGGCAGTTTCGGATATCGC
CGCCGAGTTCTGCACGAGGTGCGATTTTCTCTTTCCTTCTCATCCACTTCCGAGTCTCTGCATTTGCTTAATCATACTGGTATTGAGGGAAATTCTAGGTCAAAG
GGTAAGTCTGGATCAAAATCGGCTAATTTAGATCGTAAAAAAATTGCTTCATCAAGTGGGACTAAACAATCCAATGGTAATGCTATTGGCTACGAATACCCTCCT
GCTCCGAATCAGGAAAGTTTGCATTCGGAATCCCGGGGTCTACAGAATGATGCAGAGTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTAATTGCAAGAAT
AATCAAATTGTCGCATATGTAGATGAAAATCCACCATTGAAGGCAGATAATTTGGAATTTACTTATGATTACGGTACTAGTTTTGTGTTAGGTGACAGTTCACAT
AGAGGACTGGGGTTTCACGATGAGGACGAACTTGTTAAAAATCAGAACACAGATGACGATTCAGCCACTCAAGTGGGAGAACAAGGAGGATTGTGTGCTGGTTCA
TTATCCTCCGGGAAGGAAAATGGTACAGATGAGAAGGTTGATTGTAGGGTAGGGGTTGAGACGGCTAATGAGACATTAGCAGAAGCATCACCCTCCAACAAATAT
TCAGATGGTGTGTGCTCTCCAAGAAATTCAGGTTTCTTGTCCATTGGTGGTGTGAGATTATACACCCAAGATGTATCTGATGAAGAGGGTGATGATGATAGGGAG
TCATCTGATGGAAGCTCCGAGTACTCTGAACCTCTAGAGTCAGATGGATCATCTGGAAGTGATAGCTCTGTAGAAATGTCTTGCGGTGAATCAGATATTGATGAC
GAGGTGGCTGAAGATTATCTCGAAGGGATTGGTGGAAGTGAAAATATTTTAAAATCCAAATGGTTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCT
AGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGCGGCATTACTTTGCAGGAAGCATCTAAAGAATATGGAATTATGAAAACTCCTTCAAGAAAGAAAAGCTCC
GTTGTTTCTAGAGATCATTGGTCAGCATTGGCACTAGATGACTTGCTAGTAAAAGATTCTAGATCTACATCAGCTAGAAAGCAAAAGAATGCTTCTCAGTTTGCT
CGTTCTTGGCCTCCAAAAGCTCCAACAAGTAAAGCTGCTAGAAAGTATCCCGGTGAAAAGAAGAAATATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGA
ATGCTTAATCGGGGTGTTGATCTAATGCAAATAAATCTGAAATTGGAACACATGGTTCTCTACAGAGAAGATATGTTCGCTTTCCAACCTATGCATCCCCGTGAT
TGTTCCCAGGTACGACGATTGGCAGCAATTTACCGCTTTCATAGTGGATGCCAAGGTTCTGGTAAGAAAAGGTTTGTAACAGTAACTCGAACACAGTACACAGGA
ATGCCATCAGCAAGTGATCAAGTTCGCCTTGAACAGCTAATCGGAGCAACAGATGAGGATAATGATTTTTCTGTTGCTGAAGGCTCAAATATAAAATCACAGGGC
GGCAACAGAAGCAGAGAAAAGAAGAATACCAAAGTGCGTGGTTTGCACATGTTGGAGCTTGATAAATCCGCAAGTAGTAAGTTGAGGTCGAAGGGTTCTGCAGGT
AAGGTATCAAGTCAGAAAATGACCGGCAAAAAGTATGCTGATCAACCCGTCTCATTTGTGTCATGTGGAGTAATGCAACCAGAACCAATGGATGAGATAACGACT
AGCAATGTTAAGGACGTAGATAAAAGTAAGGATGTCGTAGCTGCATCCGAAATGATCGAGATTGCCAATAGTGATGTCAAGAATATGGATATAAGTAGGGATAGG
ATTGGTGCATTTGAGCTGCACACGAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTCGTAGAAGGCGGAGGATTAGGGAAAGATGGTCAAGGAATGGCT
CGTCCCATTGAAGTGATGAAACGGCCAAAATCACTTGGATTAGGCATCGAGTTCTCCGAGGCCTCAACTTCTGCTGCTGATGACAACCAGGCAAGCCGGGGATCT
GCCGCTAGAACTGGATCCGGAGCCTTGGGAAAATCCAAAAAAATTGGTGCTTTTGAAGAGCACACTAAAGGATTTGGGTCGAAGATGATGGCAAAGATGGGATTT
GTTGAAGGCATGGGATTGGGAAAAGATTCACAAGGAATGGTCAACCCTTTGCTTCCTGTTAGACGGCCTAAAGCACGAGGGTTGGGTGCCAAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
GAATAAACGGGCCTGAAAGAGGAGTTGGGCTTTGAAGCTTACTTGCCCAGAAAGCTAGTGGATCCGATTCTTTGTTATCCGTCCACCGAGTCGACGCCACCGCAA
GGTTTAGCTGTGGCGGTTTCTTTGCTTCGGAGTTCGGGAATTGACTTGTAGCCTACCCTGCACAAGTCCTCGACACACACGCCGTCGTTCTGTTCTGGTGGCGGT
GGTAGCTCTCCGATGGCCGGAGGAAGACGAAGAACGAACCATTCCAAAGCTAGTGATGGCTTCAGGAACAACAAGAGCAATTCCGGTAGAAGAAGATCAGACCCG
TCGTCACTGTTTGTGGATGGCGGGTTTTTATCTGATTGGCAGTTTCGGATATCGCCGCCGAGTTCTGCACGAGGTGCGATTTTCTCTTTCCTTCTCATCCACTTC
CGAGTCTCTGCATTTGCTTAATCATACTGGTATTGAGGGAAATTCTAGGTCAAAGGGTAAGTCTGGATCAAAATCGGCTAATTTAGATCGTAAAAAAATTGCTTC
ATCAAGTGGGACTAAACAATCCAATGGTAATGCTATTGGCTACGAATACCCTCCTGCTCCGAATCAGGAAAGTTTGCATTCGGAATCCCGGGGTCTACAGAATGA
TGCAGAGTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTAATTGCAAGAATAATCAAATTGTCGCATATGTAGATGAAAATCCACCATTGAAGGCAGATAA
TTTGGAATTTACTTATGATTACGGTACTAGTTTTGTGTTAGGTGACAGTTCACATAGAGGACTGGGGTTTCACGATGAGGACGAACTTGTTAAAAATCAGAACAC
AGATGACGATTCAGCCACTCAAGTGGGAGAACAAGGAGGATTGTGTGCTGGTTCATTATCCTCCGGGAAGGAAAATGGTACAGATGAGAAGGTTGATTGTAGGGT
AGGGGTTGAGACGGCTAATGAGACATTAGCAGAAGCATCACCCTCCAACAAATATTCAGATGGTGTGTGCTCTCCAAGAAATTCAGGTTTCTTGTCCATTGGTGG
TGTGAGATTATACACCCAAGATGTATCTGATGAAGAGGGTGATGATGATAGGGAGTCATCTGATGGAAGCTCCGAGTACTCTGAACCTCTAGAGTCAGATGGATC
ATCTGGAAGTGATAGCTCTGTAGAAATGTCTTGCGGTGAATCAGATATTGATGACGAGGTGGCTGAAGATTATCTCGAAGGGATTGGTGGAAGTGAAAATATTTT
AAAATCCAAATGGTTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGCGGCATTACTTTGCA
GGAAGCATCTAAAGAATATGGAATTATGAAAACTCCTTCAAGAAAGAAAAGCTCCGTTGTTTCTAGAGATCATTGGTCAGCATTGGCACTAGATGACTTGCTAGT
AAAAGATTCTAGATCTACATCAGCTAGAAAGCAAAAGAATGCTTCTCAGTTTGCTCGTTCTTGGCCTCCAAAAGCTCCAACAAGTAAAGCTGCTAGAAAGTATCC
CGGTGAAAAGAAGAAATATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGCTTAATCGGGGTGTTGATCTAATGCAAATAAATCTGAAATTGGAACA
CATGGTTCTCTACAGAGAAGATATGTTCGCTTTCCAACCTATGCATCCCCGTGATTGTTCCCAGGTACGACGATTGGCAGCAATTTACCGCTTTCATAGTGGATG
CCAAGGTTCTGGTAAGAAAAGGTTTGTAACAGTAACTCGAACACAGTACACAGGAATGCCATCAGCAAGTGATCAAGTTCGCCTTGAACAGCTAATCGGAGCAAC
AGATGAGGATAATGATTTTTCTGTTGCTGAAGGCTCAAATATAAAATCACAGGGCGGCAACAGAAGCAGAGAAAAGAAGAATACCAAAGTGCGTGGTTTGCACAT
GTTGGAGCTTGATAAATCCGCAAGTAGTAAGTTGAGGTCGAAGGGTTCTGCAGGTAAGGTATCAAGTCAGAAAATGACCGGCAAAAAGTATGCTGATCAACCCGT
CTCATTTGTGTCATGTGGAGTAATGCAACCAGAACCAATGGATGAGATAACGACTAGCAATGTTAAGGACGTAGATAAAAGTAAGGATGTCGTAGCTGCATCCGA
AATGATCGAGATTGCCAATAGTGATGTCAAGAATATGGATATAAGTAGGGATAGGATTGGTGCATTTGAGCTGCACACGAAGGGTTTTGGGTCAAAAATGATGGC
AAAAATGGGATTCGTAGAAGGCGGAGGATTAGGGAAAGATGGTCAAGGAATGGCTCGTCCCATTGAAGTGATGAAACGGCCAAAATCACTTGGATTAGGCATCGA
GTTCTCCGAGGCCTCAACTTCTGCTGCTGATGACAACCAGGCAAGCCGGGGATCTGCCGCTAGAACTGGATCCGGAGCCTTGGGAAAATCCAAAAAAATTGGTGC
TTTTGAAGAGCACACTAAAGGATTTGGGTCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTGGGAAAAGATTCACAAGGAATGGTCAACCCTTT
GCTTCCTGTTAGACGGCCTAAAGCACGAGGGTTGGGTGCCAAAGGTTAGCTATTTACAAGATTTAACTGTTCGTTTCATCGTATTGCTAATACATCTATTCTCTT
AGATGCTGTCATTCTGTGTCAATGTTGCTTATTAACAAAGCAAATGTTAGTCCTCTCCTGTTATTTAAGTTGTATCTTATTTTCTTATATTAATAATAATGAGAA
CAAAATACTTAAAGAATAAGGAGTGTCTTGGGGTTGGGTTTCTTCTGGCCTTCGGTCTATCAAACTCTGACCACAGGCCAATTGACGAGGATAGAACTGAAGGCT
AAGGCAATGACAAGACAAGATTGAGATGAGAATTTAAAACTGGACTCATAAATGTAAAAACAAATAATAAATACTAAAACATATATTTACGCTATGTTAGAACAT
AACCTTAGATACAAGTTTTGAGCTTTTTCTTCAAAATAATTCTTTTTTCTTCTTTTCTTTTATCCTCTCTTTGTCCTTTCCCTTGTTCTCTCTTACTTTTCCTCT
TCTTTTGTTTTGTTTTTTTCTCTCTCGTTCTCACTCGCTTTTTTACTTCTCACATCTTATTTCTTCGTTGTGGAGATGAACATGGAGATTTAATGTGATCGTTTT
TTTTTCTTTCTTTCTTTCTTCTTCTTCTTTCTGCTTTCTTTCTTTCTCCTGATATCCATGAGTGTTTGAACCAACTTTACACCTGCCTTGACTAATAATCTCATG
AGACAACTCTGTTTTGACTCTACAATTTTTTTTGGGTGTCAAGGAAACGCGTAGGATATTAATCCTAGGTAGGTGACCACTATGGATTGAACGGATGACCTCTTA
ATCTTTCATCAAGATCATGCCCCTTATTTATTAGTAGGCCAATCTATGGTGGTTTTGTGTTTGGGAAAAGAGAGGTCTTCAAAAGGGTTTGAACTTCAATGGTTG
TGTGACTATGGATCTTCCGTTGTGAAGATTGACGGTCTCGACACTGCATTTAGGTCAGTAGAAAAATGCTGAGTAAAAAATGTTAAACATGTCACATGGGTTCTT
ATTCTTAAGGCCAAATGGCATGTTAGTTGTTACTGTTCTTTGGGCATTGGCATTATTGGAATCAATGAAGAAGAGCTTGATCTACAAGCAAGAAGTCACCAATGA
GGTGGTTCAATCCAAATAAGGAAGTCACCAATGAGGTGGTTCAATCCAAATAAGG
Protein sequenceShow/hide protein sequence
MASGTTRAIPVEEDQTRRHCLWMAGFYLIGSFGYRRRVLHEVRFSLSFSSTSESLHLLNHTGIEGNSRSKGKSGSKSANLDRKKIASSSGTKQSNGNAIGYEYPP
APNQESLHSESRGLQNDAECSLDNSQPFILLNCKNNQIVAYVDENPPLKADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDDSATQVGEQGGLCAGS
LSSGKENGTDEKVDCRVGVETANETLAEASPSNKYSDGVCSPRNSGFLSIGGVRLYTQDVSDEEGDDDRESSDGSSEYSEPLESDGSSGSDSSVEMSCGESDIDD
EVAEDYLEGIGGSENILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGITLQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLLVKDSRSTSARKQKNASQFA
RSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLKLEHMVLYREDMFAFQPMHPRDCSQVRRLAAIYRFHSGCQGSGKKRFVTVTRTQYTG
MPSASDQVRLEQLIGATDEDNDFSVAEGSNIKSQGGNRSREKKNTKVRGLHMLELDKSASSKLRSKGSAGKVSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITT
SNVKDVDKSKDVVAASEMIEIANSDVKNMDISRDRIGAFELHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVMKRPKSLGLGIEFSEASTSAADDNQASRGS
AARTGSGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKG