; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10008129 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10008129
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein SQS1
Genome locationChr10:20093992..20098295
RNA-Seq ExpressionHG10008129
SyntenyHG10008129
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050035.1 Zinc finger protein, putative isoform 1 [Cucumis melo var. makuwa]0.0e+0084.21Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K+SD F+KNK NS RRRSDP S    NLFVDGGFLSDWQFQLSPP SA+EGNSRAKGKSG KSA LDRKK ASSSGTKQS+G AIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ
        P AP Q  L SES+GLQNDAECSLD+SQPFIL+NSK+NQIVAYVDENP LMADNLEFTYDYGTSFVLGDSSHRGLGF D+DELV  QNTDD SATQV EQ
Subjt:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ

Query:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS
        GGLC GSLS  KETGTDE+VD RVGVETANEM+ E SPSNK S D+  P NSGFLSIGGVRLYTQD+SDEE DDDGESSDG+SEYSEPLESDESSE D S
Subjt:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS

Query:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK
        VEMSCS SDIDDEVAEDYLEG+GGS+NILKSKWLVK+EL+ES DDSSSSSLDDTL+KLGGIALQEASKEYG++KT SR K SVVSRDHWSALALDD++VK
Subjt:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK

Query:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL
        DIRSTS RKQKNASQFA SWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDLMQI+LRLEHMVLN+EDM+AFQPMHPRDCSQVRRLAAIYRL
Subjt:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL

Query:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT
        HSGCQGSGKK+FV VTRTQ+TGMPSAS QVRLEQLIGARDEDNDFSVAEGSNI+S+G +RSREKK+TK+RGL +L+ DQS S+KSRSKGS+GKGSSQKMT
Subjt:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT

Query:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI
        GKKYADQPVSFVSCGVMQPE M+     +  DVDK KDIVA SEMIEM T++V N DISRDSIG+FE+HTKGFGSKMMAKMGFVEG GLGKDGQGM  PI
Subjt:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI

Query:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK
        EVMKRPKSLGLGIEFSE STSAAGDNQAS   + RTG+G L KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQGIVNPLLPVRRPKARGLGAK
Subjt:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK

XP_008443898.1 PREDICTED: uncharacterized protein LOC103487380 isoform X2 [Cucumis melo]0.0e+0084.46Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K+SD F+KNK NS RRRSDP S    NLFVDGGFLSDWQFQLSPP SA+EGNSRAKGKSG KSA LDRKK ASSSGTKQSNG AIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ
        P AP Q  L SES+GLQNDAECSLD+SQPFILLNSK+NQIVAYVDENP LMADNLEFTYDYGTSFVLGDSSHRGLGF D+DELV  QNTDD SATQV EQ
Subjt:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ

Query:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS
        GGLC GSLS  KETGTDE+VD RVGVETANEM+ E SPSNK S D+  P NSGFLSIGGVRLYTQD+SDEE DDDGESSDG+SEYSEPLESDESSE D S
Subjt:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS

Query:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK
        VEMSCS SDIDDEVAEDYLEG+GGS+NILKSKWLVK+EL+ES DDSSSSSLDDTL+KLGGIALQEASKEYG++KT SR K SVVSRDHWSALALDD++VK
Subjt:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK

Query:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL
        DIRSTS RKQKNASQFA SWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDLMQI+LRLEHMVLN+EDM+AFQPMHPRDCSQVRRLAAIYRL
Subjt:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL

Query:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT
        HSGCQGSGKK+FV VTRTQ+TGMPSAS QVRLEQLIGARDEDNDFSVAEGSNI+S+G +RSREKK+TKVRGL +L+ DQS S+KSRSKGS+GKGSSQKMT
Subjt:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT

Query:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI
        GKKYADQPVSFVSCGVMQPE M+     +  DVDK KDIVA SEMIEM T++V N DISRDSIG+FE+HTKGFGSKMMAKMGFVEG GLGKDGQGM  PI
Subjt:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI

Query:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK
        EVMKRPKSLGLGIEFSE STSAAGDNQAS   + RT +G L KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQGIVNPLLPVRRPKARGLGAK
Subjt:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK

XP_011655819.1 uncharacterized protein LOC101215133 [Cucumis sativus]0.0e+0082.8Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQ----------EGNSRAKGKSGPKSANLDRKKSASSSGTKQ
        MAGGRRRTNH KASD F+KNK NS RRRSD  S    NLFVDGGFL DW  Q SPP SA+          EGNSRAKGKSG KS  LDRKK ASSSGTKQ
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQ----------EGNSRAKGKSGPKSANLDRKKSASSSGTKQ

Query:  SNGNAIGYEYPPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTD
        SNG AIGYEYP AP Q  LHSES+ LQNDAE   D+SQPFILLNSK+NQIVAYVDENPPLMADNLEFTYDYGTSFVLG+SSHRGLGFHD+DE V  QNTD
Subjt:  SNGNAIGYEYPPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTD

Query:  DVSATQVREQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLE
        D SATQV EQG LC  SLSSGKETGTDERVD RVGVETANEM+ E SPSNKYS D+S P NSGFLSIGGVRLYTQD+SDE  DDDGESSDGSSEYSEPLE
Subjt:  DVSATQVREQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLE

Query:  SDESSESDGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWS
        SDESSE D SVEMSCS SDIDDEVAEDYLEG+GGSENILKSKWLVK+EL+ES DDSSSSSLDDTL+KLGGIALQEASKEYG++KTPSR K SVVSRDHWS
Subjt:  SDESSESDGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWS

Query:  ALALDDLVVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQ
        ALALDD++VK+ RSTSARKQKNA QFA SWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDLMQINL+LEHMVLN+EDM+AFQPMHPRDCSQ
Subjt:  ALALDDLVVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQ

Query:  VRRLAAIYRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGS
        VRRLAAIYRLH+GCQGSGKK+FV VTRTQ+TGMPSAS QVRLE+LIGARDEDNDFSVAEGSNI+S+GGNRSREKK TKV GL +L+ DQS SSKSRSKGS
Subjt:  VRRLAAIYRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGS

Query:  SGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLG
        +GKGSSQKMTGKKYADQPVSFVS GVMQPE M+E   ++V D DK KDIVA SEMIEM T++V NMDISRDSIG+FE+HTKGFGSKMMAKMGFVEG GLG
Subjt:  SGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLG

Query:  KDGQGMARPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRP
        KDGQGM  PIEVMKRPKSLGLGIEFSE STSAAGDNQAS   + RTG+  L K+KKIGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQGI+NPLLPVRRP
Subjt:  KDGQGMARPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRP

Query:  KARGLGAK
        KARGLGAK
Subjt:  KARGLGAK

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0081.55Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K SDGF+KNK +SGRRRSDPSSS+RGNLFVDGGFLSDWQFQ +PPSSA+EGNSR+KG+SG KSA+LDRKK+ASSSGTKQSNGNAIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQ-GSLHSESQGLQNDAECSLDNSQPFILL--NSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQV
        PPAP+Q G LHSES+GLQNDA+CSLDNSQPFILL  NSK+ QIVAYVD+ PPL  D LE TYDYGT F+LGDSSHRGLGFHDEDELV+NQNTDD S T V
Subjt:  PPAPYQ-GSLHSESQGLQNDAECSLDNSQPFILL--NSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQV

Query:  REQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSES
         EQ GLC GSL S KETG+DERV+CR  VE A+EML E S  NKYS     P NSGFLSIGGVRLYTQD+SDEE DDDGE S+GSSEYSEPLESDESSES
Subjt:  REQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSES

Query:  DGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDL
        D S EM+CS SDIDDEVAEDYLEGIGG E+ILKSKWLVK+EL ESDDD SSSSLDDTLEKL  IALQEASKEYG+ KTPSR KS +VSRD+WS+LALDDL
Subjt:  DGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDL

Query:  VVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAI
        ++KD RS SARK+KNA+ FA SWPPKAP SKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINL+LEHMVLNREDMF+FQPMHPRDCSQVRRLAAI
Subjt:  VVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQ
        YRLHSGCQGSGKK+FV VTRTQYTG+PS+S QVRL QLIGARD+DNDFSVAEG NI+S G NRSREKK+ K  GL  L+L+QS SSKSR+KGS+GKGSSQ
Subjt:  YRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQ

Query:  KMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMA
        K TGKKYADQPVSFVSCGVMQP+ + EITTSNVKDVDK KDIV ASEM+E+ T++VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEG GLGKDGQGMA
Subjt:  KMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMA

Query:  RPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA
         PIEV+KRPKSLGLG+EFS  ++++AGDNQ SR  + RT TG LGKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQG+VNPLLPVRRPKARGLGA
Subjt:  RPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA

Query:  KG
        KG
Subjt:  KG

XP_038876900.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120069254 [Benincasa hispida]0.0e+0088.12Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH KASD FKKNK NSGRRRSDP SSIRGNLFVDGGFLSDWQFQLSPPSSA+EGNS AKGKSG KSANLDRK+ ASSSGTKQ++GNAIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ
        PPAP Q S HSES+GLQ DAECSLDNSQPFILLNS NNQIVAYVDENPPL ADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDD SATQV  +
Subjt:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ

Query:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS
        GGLCAGSLSS KETG  ERV+CRVGVETANEML E SPSNKYS     P NSGFLSIGGVRLYTQD+SDE GDDDGESSDGSS YSEPLESDESSESD S
Subjt:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS

Query:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSR-KKSSVVSRDHWSALALDDLVV
        VEMSCS SDIDDEVA+DYLEGIGGSENIL SKWL+K+ELVESDDDSS+SSLDDTLEKLGGIALQEASKEYGIMKTPSR KKSSVVSRDHWSALALDDL+V
Subjt:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSR-KKSSVVSRDHWSALALDDLVV

Query:  KDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYR
        KD RSTSARKQKN + FA SWPPKAP  K+ARKYPGEKKKYRKETIAAKRRERMLNRGVDLM+INL+LEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYR
Subjt:  KDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYR

Query:  LHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKM
        LH G QGSGKK+FV VTRTQ+TGMPSAS QVRLEQLIGARDED+DF VAEGSNI+SQG NRSREKK+TKV GL +L+LD+SASSKSRSKGS+GKG SQKM
Subjt:  LHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKM

Query:  TGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARP
        TGKKYADQPVSFVSCGVMQPEPMDE TTSN +DVDK+KDIVAASEM EM T+DVKNMDISRDSIGAFE++TKGFGSKMMAKMG+VEG GLGKDGQGMARP
Subjt:  TGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARP

Query:  IEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAKG
        IEVMKRPKSLGLGIEFSE STSAAGDNQASR    R+G+G LGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGIVNPLLP+RRPKARGLGAKG
Subjt:  IEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAKG

TrEMBL top hitse value%identityAlignment
A0A1S3B932 uncharacterized protein LOC103487380 isoform X10.0e+0081.82Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K+SD F+KNK NS RRRSDP S    NLFVDGGFLSDWQFQLSPP SA+EGNSRAKGKSG KSA LDRKK ASSSGTKQSNG AIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ
        P AP Q  L SES+GLQNDAECSLD+SQPFILLNSK+NQIVAYVDENP LMADNLEFTYDYGTSFVLGDSSHRGLGF D+DELV  QNTDD SATQV EQ
Subjt:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ

Query:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS
        GGLC GSLS  KETGTDE+VD RVGVETANEM+ E SPSNK S D+  P NSGFLSIGGVRLYTQD+SDEE DDDGESSDG+SEYSEPLESDESSE D S
Subjt:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS

Query:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK
        VEMSCS SDIDDEVAEDYLEG+GGS+NILKSKWLVK+EL+ES DDSSSSSLDDTL+KLGGIALQEASKEYG++KT SR K SVVSRDHWSALALDD++VK
Subjt:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK

Query:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLA-AIYR
        DIRSTS RKQKNASQFA SWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDLMQI+LRLEHMVLN+EDM+AFQPMHPRDCSQ+  +    ++
Subjt:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLA-AIYR

Query:  LHSGCQGSG----KKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGS
          + C        KK+FV VTRTQ+TGMPSAS QVRLEQLIGARDEDNDFSVAEGSNI+S+G +RSREKK+TKVRGL +L+ DQS S+KSRSKGS+GKGS
Subjt:  LHSGCQGSG----KKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGS

Query:  SQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQG
        SQKMTGKKYADQPVSFVSCGVMQPE M+     +  DVDK KDIVA SEMIEM T++V N DISRDSIG+FE+HTKGFGSKMMAKMGFVEG GLGKDGQG
Subjt:  SQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQG

Query:  MARPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGL
        M  PIEVMKRPKSLGLGIEFSE STSAAGDNQAS   + RT +G L KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQGIVNPLLPVRRPKARGL
Subjt:  MARPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGL

Query:  GAK
        GAK
Subjt:  GAK

A0A1S3B9V9 Protein SQS10.0e+0084.46Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K+SD F+KNK NS RRRSDP S    NLFVDGGFLSDWQFQLSPP SA+EGNSRAKGKSG KSA LDRKK ASSSGTKQSNG AIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ
        P AP Q  L SES+GLQNDAECSLD+SQPFILLNSK+NQIVAYVDENP LMADNLEFTYDYGTSFVLGDSSHRGLGF D+DELV  QNTDD SATQV EQ
Subjt:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ

Query:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS
        GGLC GSLS  KETGTDE+VD RVGVETANEM+ E SPSNK S D+  P NSGFLSIGGVRLYTQD+SDEE DDDGESSDG+SEYSEPLESDESSE D S
Subjt:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS

Query:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK
        VEMSCS SDIDDEVAEDYLEG+GGS+NILKSKWLVK+EL+ES DDSSSSSLDDTL+KLGGIALQEASKEYG++KT SR K SVVSRDHWSALALDD++VK
Subjt:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK

Query:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL
        DIRSTS RKQKNASQFA SWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDLMQI+LRLEHMVLN+EDM+AFQPMHPRDCSQVRRLAAIYRL
Subjt:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL

Query:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT
        HSGCQGSGKK+FV VTRTQ+TGMPSAS QVRLEQLIGARDEDNDFSVAEGSNI+S+G +RSREKK+TKVRGL +L+ DQS S+KSRSKGS+GKGSSQKMT
Subjt:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT

Query:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI
        GKKYADQPVSFVSCGVMQPE M+     +  DVDK KDIVA SEMIEM T++V N DISRDSIG+FE+HTKGFGSKMMAKMGFVEG GLGKDGQGM  PI
Subjt:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI

Query:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK
        EVMKRPKSLGLGIEFSE STSAAGDNQAS   + RT +G L KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQGIVNPLLPVRRPKARGLGAK
Subjt:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK

A0A5A7U8V1 Protein SQS10.0e+0084.21Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K+SD F+KNK NS RRRSDP S    NLFVDGGFLSDWQFQLSPP SA+EGNSRAKGKSG KSA LDRKK ASSSGTKQS+G AIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ
        P AP Q  L SES+GLQNDAECSLD+SQPFIL+NSK+NQIVAYVDENP LMADNLEFTYDYGTSFVLGDSSHRGLGF D+DELV  QNTDD SATQV EQ
Subjt:  PPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQ

Query:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS
        GGLC GSLS  KETGTDE+VD RVGVETANEM+ E SPSNK S D+  P NSGFLSIGGVRLYTQD+SDEE DDDGESSDG+SEYSEPLESDESSE D S
Subjt:  GGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGS

Query:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK
        VEMSCS SDIDDEVAEDYLEG+GGS+NILKSKWLVK+EL+ES DDSSSSSLDDTL+KLGGIALQEASKEYG++KT SR K SVVSRDHWSALALDD++VK
Subjt:  VEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVK

Query:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL
        DIRSTS RKQKNASQFA SWPPKA TSKA RKYPGEKKKYRKE IAAKRRERMLNRGVDLMQI+LRLEHMVLN+EDM+AFQPMHPRDCSQVRRLAAIYRL
Subjt:  DIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRL

Query:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT
        HSGCQGSGKK+FV VTRTQ+TGMPSAS QVRLEQLIGARDEDNDFSVAEGSNI+S+G +RSREKK+TK+RGL +L+ DQS S+KSRSKGS+GKGSSQKMT
Subjt:  HSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMT

Query:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI
        GKKYADQPVSFVSCGVMQPE M+     +  DVDK KDIVA SEMIEM T++V N DISRDSIG+FE+HTKGFGSKMMAKMGFVEG GLGKDGQGM  PI
Subjt:  GKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPI

Query:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK
        EVMKRPKSLGLGIEFSE STSAAGDNQAS   + RTG+G L KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQGIVNPLLPVRRPKARGLGAK
Subjt:  EVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAK

A0A6J1KLR0 Protein SQS10.0e+0081.55Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K SDGF+KNK +SGRRRSDPSSS+RGNLFVDGGFLSDWQFQ +PPSSA+EGNSR+KG+SG KSA+LDRKK+ASSSGTKQSNGNAIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQ-GSLHSESQGLQNDAECSLDNSQPFILL--NSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQV
        PPAP+Q G LHSES+GLQNDA+CSLDNSQPFILL  NSK+ QIVAYVD+ PPL  D LE TYDYGT F+LGDSSHRGLGFHDEDELV+NQNTDD S T V
Subjt:  PPAPYQ-GSLHSESQGLQNDAECSLDNSQPFILL--NSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQV

Query:  REQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSES
         EQ GLC GSL S KETG+DERV+CR  VE A+EML E S  NKYS     P NSGFLSIGGVRLYTQD+SDEE DDDGE S+GSSEYSEPLESDESSES
Subjt:  REQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSES

Query:  DGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDL
        D S EM+CS SDIDDEVAEDYLEGIGG E+ILKSKWLVK+EL ESDDD SSSSLDDTLEKL  IALQEASKEYG+ KTPSR KS +VSRD+WS+LALDDL
Subjt:  DGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDL

Query:  VVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAI
        ++KD RS SARK+KNA+ FA SWPPKAP SKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINL+LEHMVLNREDMF+FQPMHPRDCSQVRRLAAI
Subjt:  VVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQ
        YRLHSGCQGSGKK+FV VTRTQYTG+PS+S QVRL QLIGARD+DNDFSVAEG NI+S G NRSREKK+ K  GL  L+L+QS SSKSR+KGS+GKGSSQ
Subjt:  YRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQ

Query:  KMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMA
        K TGKKYADQPVSFVSCGVMQP+ + EITTSNVKDVDK KDIV ASEM+E+ T++VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEG GLGKDGQGMA
Subjt:  KMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMA

Query:  RPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA
         PIEV+KRPKSLGLG+EFS  ++++AGDNQ SR  + RT TG LGKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQG+VNPLLPVRRPKARGLGA
Subjt:  RPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA

Query:  KG
        KG
Subjt:  KG

A0A6J1KSJ4 Protein SQS10.0e+0080.65Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY
        MAGGRRRTNH K SDGF+KNK +SGRRRSDPSSS+RGNLFVDGGFLSDWQFQ +PPSSA+EGNSR+KG+SG KSA+LDRKK+ASSSGTKQSNGNAIGYEY
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEY

Query:  PPAPYQ-GSLHSESQGLQNDAECSLDNSQPFILL--NSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQV
        PPAP+Q G LHSES+GLQNDA+CSLDNSQPFILL  NSK+ QIVAYVD+ PPL  D LE TYDYGT F+LGDSSHRGLGFHDEDELV+NQNTDD S T V
Subjt:  PPAPYQ-GSLHSESQGLQNDAECSLDNSQPFILL--NSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQV

Query:  REQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSES
         EQ GLC GSL S KETG+DERV+CR  VE A+EML E S  NKYS     P NSGFLSIGGVRLYTQD+SDEE DDDGE S+GSSEYSEPLESDESSES
Subjt:  REQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSES

Query:  DGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDL
        D S EM+CS SDIDDEVAEDYLEGIGG E+ILKSKWLVK+EL ESDDD SSSSLDDTLEKL  IALQEASKEYG+ KTPSR KS +VSRD+WS+LALDDL
Subjt:  DGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDL

Query:  VVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAI
        ++KD RS SARK+KNA+ FA SWPPKAP SKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINL+LEHMVLNREDMF+FQPMHPRDCSQVRRLAAI
Subjt:  VVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQ
        YRLHSGCQGSGKK+FV VTRTQYTG+PS+S QVRL QLIGARD+DNDFSVAEG NI+S G NRSREKK+ K  GL  L+L+QS SSKSR+KGS+GKGSSQ
Subjt:  YRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQ

Query:  KMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMA
        K TGKKYADQPVSFVSCGVMQP+ + EITTSNVKDVDK KDIV ASEM+E+ T++VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEG GLGKDGQGMA
Subjt:  KMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMA

Query:  RPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA
         PIEV+KRPKSLGLGIEFSE STS AGDNQ S G   RT TGTLGKSKK+GAFEEHTKGFGSKMMAKMGFVEG GLG+D QG+  P+  ++RPK+ GLG 
Subjt:  RPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

SwissProt top hitse value%identityAlignment
A7SBN6 Zinc finger CCCH-type with G patch domain-containing protein2.7e-0834.95Show/hide
Query:  PMDEITTSNVKDVDKNKDIVAASEMIEMATTDV---KNMDISR---------DSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPIEVMKRPKS
        P+D+I     ++V+ + +  + S+  + +++     ++ D+ R          S+G +E HTKG GSK+MAKMG++ G+GLGKDG+G   PIEV+  P+ 
Subjt:  PMDEITTSNVKDVDKNKDIVAASEMIEMATTDV---KNMDISR---------DSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPIEVMKRPKS

Query:  LGL
          L
Subjt:  LGL

B3MPC0 Zinc finger CCCH-type with G patch domain-containing protein2.7e-0850Show/hide
Query:  TLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA
        T   ++K+GA+EE+T+G GSK+M KMG++ G GLG D +GIV P+     PK R L A
Subjt:  TLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA

B4JCG4 Zinc finger CCCH-type with G patch domain-containing protein1.7e-0746.55Show/hide
Query:  TLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA
        T   ++K+GA+E++T+G GSK+MA MG++ G GLG D +GIV P+     P+ R L A
Subjt:  TLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGA

Q0IIX9 Tuftelin-interacting protein 111.0e-0741.18Show/hide
Query:  PTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAKG
        P+ RT  G +  +   G++E HTKG G K++ KMG+V+G GLG+++QGI+ P+   +R     +GA G
Subjt:  PTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAKG

Q29RR5 Tuftelin-interacting protein 112.3e-0746.77Show/hide
Query:  GAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLGIEFSETSTSAAGD
        G++E HTKG G K++ KMG+V GRGLGK+ QG+  PIE  +R     +G   SE +T +  D
Subjt:  GAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLGIEFSETSTSAAGD

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain1.4e-0754Show/hide
Query:  IGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLG
        IG FE  TKG G K++ KMG+ +G GLGK+ QG+  PIE   RPK++G+G
Subjt:  IGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLG

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein7.5e-0639.29Show/hide
Query:  SETSTSAAGDNQASRG----PTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGL
        SE S S+  D  + +G     +     G    +     +E HT+G  SKMMA MG+ EGMGLG   QGI+NP+L    P  R L
Subjt:  SETSTSAAGDNQASRG----PTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain7.5e-0633.33Show/hide
Query:  KSRSKGSSGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKNKD----IVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKM
        +S S      GS +K   K+   +PV F S G      +D++   N  +  +D+N D    I    ++ + A    KN     +    FE  + G G K+
Subjt:  KSRSKGSSGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKNKD----IVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKM

Query:  MAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLG
        + KMG+ +GRGLGK+ QG+  PIEV  RPK++G+G
Subjt:  MAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain7.5e-0633.33Show/hide
Query:  KSRSKGSSGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKNKD----IVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKM
        +S S      GS +K   K+   +PV F S G      +D++   N  +  +D+N D    I    ++ + A    KN     +    FE  + G G K+
Subjt:  KSRSKGSSGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKD--VDKNKD----IVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSKM

Query:  MAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLG
        + KMG+ +GRGLGK+ QG+  PIEV  RPK++G+G
Subjt:  MAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLG

AT3G09850.1 D111/G-patch domain-containing protein4.5e-14444.05Show/hide
Query:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGN------------LFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDR-KKSASSSG
        M GG RR ++   S G  K K      R++ +S+   +            LFV+GG LSD++   +  + ++ G+S  KG    +S +++R K SAS+SG
Subjt:  MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGN------------LFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDR-KKSASSSG

Query:  TKQSNGNAIGYEYPPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMAD-NLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKN
         ++ +GN   Y+YP   ++  L   S G+++D      N  P +L  S+  QIVA++D+ P       + + Y+Y  S+VLGD SH+GLGF D+ +   +
Subjt:  TKQSNGNAIGYEYPPAPYQGSLHSESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMAD-NLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKN

Query:  QNTDDVSATQVREQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYS
         +     A  + +QG    GS S  +E    + VD   G +  +E++ +   + K         NSGF+SIGG++LYT+D+S EE D + E +D   + S
Subjt:  QNTDDVSATQVREQGGLCAGSLSSGKETGTDERVDCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYS

Query:  ---EPLESDESSESDGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEEL----VESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRK
               S E SESD S +M  S S+IDD+VA+DYLEGIGGSE +L + WL ++ L    + SDD SSS S D    KL GI LQ+AS EYG  KT    
Subjt:  ---EPLESDESSESDGSVEMSCSDSDIDDEVAEDYLEGIGGSENILKSKWLVKEEL----VESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRK

Query:  KSSVVSRDHWSALALDDLV-VKDIRSTSAR--KQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNRED
         S        + LA+DDL+ VKD RS S +  K+K  ++F  SWP +AP SK +R +PGE KK+RKE IA KRRERML RGVDL  IN +LE+ VL   D
Subjt:  KSSVVSRDHWSALALDDLV-VKDIRSTSAR--KQKNASQFAHSWPPKAPTSKAARKYPGEKKKYRKETIAAKRRERMLNRGVDLMQINLRLEHMVLNRED

Query:  MFAFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLD
        M  FQ MH RDCSQVRRLA +YRL S C GSGKK FV VTRT  T MPSAS ++R+E+LIGA DED DF+V+ G   +S   +R + K   K R  R   
Subjt:  MFAFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEGSNIRSQGGNRSREKKHTKVRGLRSLD

Query:  LDQSASSKSRSKGSSGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSK
               + R+K S+GK SS       YADQPVSFVS G++  E    +  ++V+ V        A E+ E         D     IGAFEVHT+GFGSK
Subjt:  LDQSASSKSRSKGSSGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISRDSIGAFEVHTKGFGSK

Query:  MMAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLGIEFS----ETSTSAAGDNQASRGPTPRTG---------TGTLG----KSKKIGAFEEHTKGFGS
        MMAKMGF++G GLGKDG+G+A+PIE ++RPKSLGLG++FS    + S S+  +N A R  +  +G          G  G    + K++GAFE+HT GFGS
Subjt:  MMAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLGIEFS----ETSTSAAGDNQASRGPTPRTG---------TGTLG----KSKKIGAFEEHTKGFGS

Query:  KMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAKG
        +MMA+MGFVEG GLGR+SQGIVNPL+ VRRP+ARG+GA+G
Subjt:  KMMAKMGFVEGMGLGRDSQGIVNPLLPVRRPKARGLGAKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGGAGGAAGACGAAGAACGAACCATCCCAAAGCCAGTGATGGCTTCAAGAAGAACAAGATTAATTCCGGTAGAAGAAGATCGGACCCGTCGTCATCAATCCGAGG
AAATTTGTTTGTCGATGGAGGGTTTTTATCCGATTGGCAGTTTCAGTTATCGCCGCCGAGTTCTGCTCAAGAGGGAAATTCTAGGGCAAAGGGCAAGTCTGGACCAAAAT
CAGCTAATTTAGATCGTAAAAAGAGTGCTTCATCAAGCGGGACTAAACAATCCAATGGTAATGCTATTGGCTACGAATATCCTCCTGCCCCGTATCAGGGAAGTTTGCAT
TCGGAATCCCAGGGCCTTCAGAATGATGCAGAGTGTAGTTTGGACAACTCGCAGCCTTTTATATTACTTAATTCTAAGAATAATCAAATTGTCGCATATGTAGATGAAAA
CCCACCATTGATGGCAGATAATTTGGAATTTACTTACGATTATGGTACTAGTTTTGTGTTAGGTGACAGTTCACATAGAGGACTAGGGTTCCACGATGAGGATGAACTTG
TTAAAAATCAGAACACAGATGATGTTTCAGCCACACAAGTGAGAGAACAAGGAGGATTGTGTGCTGGTTCATTATCCTCCGGGAAGGAAACTGGTACTGATGAGAGGGTT
GATTGCAGGGTAGGGGTTGAGACGGCTAATGAGATGCTAACTGAACCATCACCCTCCAACAAATATTCAGGTGATGTGAGCCCTCCAGGAAATTCAGGTTTCTTGTCAAT
TGGTGGTGTGAGATTATACACCCAAGATATATCTGATGAAGAGGGTGATGATGATGGGGAGTCATCTGATGGAAGTTCTGAGTACTCTGAGCCTCTAGAGTCAGATGAAT
CGTCTGAAAGTGATGGCTCTGTAGAAATGTCTTGCAGTGATTCAGATATTGATGACGAGGTGGCTGAAGATTATCTTGAAGGGATTGGTGGAAGTGAAAATATTTTAAAA
TCCAAATGGTTAGTAAAAGAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGTGGCATTGCTTTGCAGGAAGCATC
TAAAGAATATGGAATTATGAAAACTCCTTCAAGAAAGAAAAGCTCCGTTGTTTCTAGAGATCATTGGTCAGCATTGGCTCTAGATGACTTGGTAGTAAAAGATATTAGAT
CTACATCAGCTAGAAAGCAAAAGAATGCTTCTCAGTTTGCTCATTCTTGGCCTCCAAAAGCTCCAACAAGTAAAGCTGCTAGAAAGTATCCCGGTGAAAAGAAGAAATAT
CGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGCTTAATCGGGGTGTTGATCTAATGCAAATAAATCTGAGATTGGAGCACATGGTTCTCAACAGAGAAGATAT
GTTCGCTTTCCAACCTATGCATCCCCGTGATTGTTCCCAGGTACGGCGGTTGGCAGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGAAAAAAAAAGTTTGTAA
TAGTAACTCGAACACAGTACACAGGAATGCCATCAGCAAGTGGTCAAGTTCGCCTTGAACAGCTAATTGGAGCAAGAGACGAGGATAATGATTTTTCTGTTGCCGAGGGC
TCAAATATAAGATCACAGGGCGGCAACAGAAGCAGAGAAAAGAAGCATACCAAAGTGCGTGGTTTGCGCTCATTGGATCTTGATCAATCTGCTAGTAGTAAGTCAAGGTC
GAAGGGCTCTTCAGGTAAGGGATCAAGTCAGAAAATGACCGGTAAAAAGTATGCTGATCAACCTGTCTCGTTCGTATCATGTGGAGTAATGCAACCGGAACCAATGGATG
AGATAACGACTAGCAATGTTAAGGATGTAGATAAAAACAAGGACATTGTAGCTGCATCCGAAATGATCGAAATGGCCACTACTGATGTGAAGAATATGGATATAAGTAGG
GATAGTATAGGTGCATTTGAGGTGCACACCAAGGGATTTGGATCAAAGATGATGGCAAAAATGGGATTTGTAGAAGGCAGAGGATTAGGGAAAGATGGTCAAGGAATGGC
CCGCCCCATTGAAGTGATGAAACGGCCAAAATCACTTGGATTAGGCATCGAGTTCTCCGAGACCTCGACTTCTGCTGCTGGTGACAACCAAGCAAGTCGAGGACCCACGC
CTAGAACTGGAACTGGAACTTTGGGAAAATCTAAAAAAATTGGTGCTTTTGAAGAGCACACTAAAGGATTTGGATCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGC
ATGGGACTGGGAAGAGATTCACAAGGAATTGTCAACCCTTTGCTTCCTGTTAGACGGCCTAAAGCACGAGGGTTGGGTGCTAAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGGAGGAAGACGAAGAACGAACCATCCCAAAGCCAGTGATGGCTTCAAGAAGAACAAGATTAATTCCGGTAGAAGAAGATCGGACCCGTCGTCATCAATCCGAGG
AAATTTGTTTGTCGATGGAGGGTTTTTATCCGATTGGCAGTTTCAGTTATCGCCGCCGAGTTCTGCTCAAGAGGGAAATTCTAGGGCAAAGGGCAAGTCTGGACCAAAAT
CAGCTAATTTAGATCGTAAAAAGAGTGCTTCATCAAGCGGGACTAAACAATCCAATGGTAATGCTATTGGCTACGAATATCCTCCTGCCCCGTATCAGGGAAGTTTGCAT
TCGGAATCCCAGGGCCTTCAGAATGATGCAGAGTGTAGTTTGGACAACTCGCAGCCTTTTATATTACTTAATTCTAAGAATAATCAAATTGTCGCATATGTAGATGAAAA
CCCACCATTGATGGCAGATAATTTGGAATTTACTTACGATTATGGTACTAGTTTTGTGTTAGGTGACAGTTCACATAGAGGACTAGGGTTCCACGATGAGGATGAACTTG
TTAAAAATCAGAACACAGATGATGTTTCAGCCACACAAGTGAGAGAACAAGGAGGATTGTGTGCTGGTTCATTATCCTCCGGGAAGGAAACTGGTACTGATGAGAGGGTT
GATTGCAGGGTAGGGGTTGAGACGGCTAATGAGATGCTAACTGAACCATCACCCTCCAACAAATATTCAGGTGATGTGAGCCCTCCAGGAAATTCAGGTTTCTTGTCAAT
TGGTGGTGTGAGATTATACACCCAAGATATATCTGATGAAGAGGGTGATGATGATGGGGAGTCATCTGATGGAAGTTCTGAGTACTCTGAGCCTCTAGAGTCAGATGAAT
CGTCTGAAAGTGATGGCTCTGTAGAAATGTCTTGCAGTGATTCAGATATTGATGACGAGGTGGCTGAAGATTATCTTGAAGGGATTGGTGGAAGTGAAAATATTTTAAAA
TCCAAATGGTTAGTAAAAGAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGTGGCATTGCTTTGCAGGAAGCATC
TAAAGAATATGGAATTATGAAAACTCCTTCAAGAAAGAAAAGCTCCGTTGTTTCTAGAGATCATTGGTCAGCATTGGCTCTAGATGACTTGGTAGTAAAAGATATTAGAT
CTACATCAGCTAGAAAGCAAAAGAATGCTTCTCAGTTTGCTCATTCTTGGCCTCCAAAAGCTCCAACAAGTAAAGCTGCTAGAAAGTATCCCGGTGAAAAGAAGAAATAT
CGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGCTTAATCGGGGTGTTGATCTAATGCAAATAAATCTGAGATTGGAGCACATGGTTCTCAACAGAGAAGATAT
GTTCGCTTTCCAACCTATGCATCCCCGTGATTGTTCCCAGGTACGGCGGTTGGCAGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGAAAAAAAAAGTTTGTAA
TAGTAACTCGAACACAGTACACAGGAATGCCATCAGCAAGTGGTCAAGTTCGCCTTGAACAGCTAATTGGAGCAAGAGACGAGGATAATGATTTTTCTGTTGCCGAGGGC
TCAAATATAAGATCACAGGGCGGCAACAGAAGCAGAGAAAAGAAGCATACCAAAGTGCGTGGTTTGCGCTCATTGGATCTTGATCAATCTGCTAGTAGTAAGTCAAGGTC
GAAGGGCTCTTCAGGTAAGGGATCAAGTCAGAAAATGACCGGTAAAAAGTATGCTGATCAACCTGTCTCGTTCGTATCATGTGGAGTAATGCAACCGGAACCAATGGATG
AGATAACGACTAGCAATGTTAAGGATGTAGATAAAAACAAGGACATTGTAGCTGCATCCGAAATGATCGAAATGGCCACTACTGATGTGAAGAATATGGATATAAGTAGG
GATAGTATAGGTGCATTTGAGGTGCACACCAAGGGATTTGGATCAAAGATGATGGCAAAAATGGGATTTGTAGAAGGCAGAGGATTAGGGAAAGATGGTCAAGGAATGGC
CCGCCCCATTGAAGTGATGAAACGGCCAAAATCACTTGGATTAGGCATCGAGTTCTCCGAGACCTCGACTTCTGCTGCTGGTGACAACCAAGCAAGTCGAGGACCCACGC
CTAGAACTGGAACTGGAACTTTGGGAAAATCTAAAAAAATTGGTGCTTTTGAAGAGCACACTAAAGGATTTGGATCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGC
ATGGGACTGGGAAGAGATTCACAAGGAATTGTCAACCCTTTGCTTCCTGTTAGACGGCCTAAAGCACGAGGGTTGGGTGCTAAAGGTTAG
Protein sequenceShow/hide protein sequence
MAGGRRRTNHPKASDGFKKNKINSGRRRSDPSSSIRGNLFVDGGFLSDWQFQLSPPSSAQEGNSRAKGKSGPKSANLDRKKSASSSGTKQSNGNAIGYEYPPAPYQGSLH
SESQGLQNDAECSLDNSQPFILLNSKNNQIVAYVDENPPLMADNLEFTYDYGTSFVLGDSSHRGLGFHDEDELVKNQNTDDVSATQVREQGGLCAGSLSSGKETGTDERV
DCRVGVETANEMLTEPSPSNKYSGDVSPPGNSGFLSIGGVRLYTQDISDEEGDDDGESSDGSSEYSEPLESDESSESDGSVEMSCSDSDIDDEVAEDYLEGIGGSENILK
SKWLVKEELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIMKTPSRKKSSVVSRDHWSALALDDLVVKDIRSTSARKQKNASQFAHSWPPKAPTSKAARKYPGEKKKY
RKETIAAKRRERMLNRGVDLMQINLRLEHMVLNREDMFAFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKKFVIVTRTQYTGMPSASGQVRLEQLIGARDEDNDFSVAEG
SNIRSQGGNRSREKKHTKVRGLRSLDLDQSASSKSRSKGSSGKGSSQKMTGKKYADQPVSFVSCGVMQPEPMDEITTSNVKDVDKNKDIVAASEMIEMATTDVKNMDISR
DSIGAFEVHTKGFGSKMMAKMGFVEGRGLGKDGQGMARPIEVMKRPKSLGLGIEFSETSTSAAGDNQASRGPTPRTGTGTLGKSKKIGAFEEHTKGFGSKMMAKMGFVEG
MGLGRDSQGIVNPLLPVRRPKARGLGAKG