; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022583 (gene) of Chayote v1 genome

Gene IDSed0022583
OrganismSechium edule (Chayote v1)
DescriptionProtein SQS1
Genome locationLG12:33254814..33259945
RNA-Seq ExpressionSed0022583
SyntenySed0022583
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain
IPR001374 - R3H domain
IPR034082 - Protein SQS1, R3H domain
IPR036867 - R3H domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587849.1 Zinc finger CCCH-type with G patch domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0074.8Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGGFLSDWQFQ SPPSS+RE +SRSKGKS SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS S+GLQ DA+CSLDNSQPFILLNSNSK+++I+AYVDDKPP KEDD++FTYDYGT+F+LGDSSHRGLGFH+ED LVRN N+DDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++TG+ ERVECRE VE+A EMLAEASSPNKYS+GVCS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSE LE+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGGSEHILKSKWL+KQELAE DDD S+SSLDDTLEKL GIALQE SKEYG KKTPSRKKS+IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QLIG RDEDNDFSVAEG NI+S G N+SREKKNA       +E ++S S KSR++GSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++NVKNMDISR+ IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------
        MACPIEVIKRPKSLGLGVEFSE+STSAGD +Q S GSA RTTGALGKS ++GAFEEHTK                                         
Subjt:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------

Query:  ----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAK
                                    GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMVNPLLP RRPKARGLGAK
Subjt:  ----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAK

XP_022929809.1 uncharacterized protein LOC111436305 [Cucurbita moschata]0.0e+0074.72Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGGFLSDWQFQ SPPSS+RE +SRSKGKS SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS S+GLQ DA+CSLDNSQPFILLNSNSK+++I+AYVDDKPP KEDD++FTYDYGT+F+LGDSSHRGLGFH+ED LVRN N+DDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++TG+ ERVECRE VE+A EMLAEASSPNKYS+GVCS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSE LE+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGGSEHILKSKWL+KQELAE DDD S+SSLDDTLEKL GIALQE SKEYG KKTPSRKKS+IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QLIG RD DNDFSVAEG NI+S G N+SREKKNA       +E ++S S KSR++GSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++NVKNMDISR+ IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------
        MACPIEVIKRPKSLGLGVEFSE+STSAGD +Q S GSA RTTGALGKS ++GAFEEHTK                                         
Subjt:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------

Query:  ----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG
                                    GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMVNPLLP RRPKARGLGAKG
Subjt:  ----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG

XP_023003174.1 uncharacterized protein LOC111496865 isoform X1 [Cucurbita maxima]0.0e+0074.52Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGGFLSDWQFQ +PPSS+RE +SRSKG+S SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS SRGLQ DA+CSLDNSQPFILLNSNSK+++I+AYVDDKPP KED L+ TYDYGT F+LGDSSHRGLGFHDED LVRN NTDDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++TG+ ERVECRE VE+A EMLAEASSPNKYS+G CS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSEPLE+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGG EHILKSKWL+KQELAE DDD S+SSLDDTLEKL  IALQE SKEYG KKTPSR KS IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QLIG RD+DNDFSVAEG NI+S G N+SREKKNA       +E ++S S KSR+KGSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++NVKNMDISR+ IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------
        MACPIEVIKRPKSLGLG+EFSE+STSAGD +Q S GSA RTTG LGKSK++GAFEEHTK                                         
Subjt:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------

Query:  -----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG
                                     GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMVNPLLPVRRPKARGLGAKG
Subjt:  -----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0079.1Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGGFLSDWQFQ +PPSS+RE +SRSKG+S SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS SRGLQ DA+CSLDNSQPFILLNSNSK+++I+AYVDDKPP KED L+ TYDYGT F+LGDSSHRGLGFHDED LVRN NTDDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++TG+ ERVECRE VE+A EMLAEASSPNKYS+G CS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSEPLE+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGG EHILKSKWL+KQELAE DDD S+SSLDDTLEKL  IALQE SKEYG KKTPSR KS IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QLIG RD+DNDFSVAEG NI+S G N+SREKKNA       +E ++S S KSR+KGSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++NVKNMDISR+ IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFS-ESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMV
        MACPIEVIKRPKSLGLGVEFS E+STSAGD+ ++ R S   TTGAL              GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMV
Subjt:  MACPIEVIKRPKSLGLGVEFS-ESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMV

Query:  NPLLPVRRPKARGLGAKG
        NPLLPVRRPKARGLGAKG
Subjt:  NPLLPVRRPKARGLGAKG

XP_023530152.1 uncharacterized protein LOC111792798 [Cucurbita pepo subsp. pepo]0.0e+0075.2Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGG LSDWQFQ SPPSS+RE +SRSKGKS SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS SRGLQ DA+CSLDNSQPFILLNSNSK+S+I+AYVDDKPP KEDD++FTYDYGT+F+LGDSSHRGLGFHDED LVRN NTDDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++ GT ERVECREGVE+A EMLAEASSPNKYS+GVCS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSEP E+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGGSEHILKSKWL+KQELAE DDD S+SSLDDTLEKL  IALQE SKEYG KKTPSRKKS+IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QL+G RDEDNDFSVAEG NI+S G N+SREKKNA       +E + S S KSR++GSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++ VKNMDISRE IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFS-ESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK----------------------------------------
        MACPIEVIKRPKSLGLGVEFS E+STSAGD +Q SRGSA RTTGALGKS ++GAFEEHTK                                        
Subjt:  MACPIEVIKRPKSLGLGVEFS-ESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK----------------------------------------

Query:  -----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG
                                     GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMVNPLLPVRRPKARGLGAKG
Subjt:  -----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG

TrEMBL top hitse value%identityAlignment
A0A1S3B9V9 Protein SQS18.9e-30572.52Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  KSS  F KNK+  ARRRSD  S      NLFVDGGFLSDWQFQ SPP S+RE +SR+KGKS SKS +LD +K ASSSGTKQS+   I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQEALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSPT
        EYP A +QE L S SRGLQ DAECSLD+SQPFILL  NSK+++I+AYVD+ P    D+L+FTYDYGTSFVLGDSSHRGLGF D+D LV   NTDDDS+  
Subjt:  EYPLAHHQEALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSPT

Query:  PVEEQGLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDESSE
          E+ GLCT SL  E++TGT E+V+ R GVE A+EM+AEAS  NK S  + S +NSGFLSIGGVRLYTQDVSDEESDD+GE SDG S+YSEPLE+DESSE
Subjt:  PVEEQGLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDESSE

Query:  SDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLALDD
         DSS EM CS SDIDDEVAEDYLEG+GGS++ILKSKWL+KQEL E  DDSS+SSLDDTL+KLGGIALQE SKEYG  KT SR K S+VSRD WS+LALDD
Subjt:  SDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLALDD

Query:  LLVKSCRSASARKKKNAAHFASSWPPK---SKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQLAA
        +LVK  RS S RK+KNA+ FA SWPPK   SKA+RKYPGEKKKYRKE IAAKRRERMLNRGVDL QI+L+LEHMVLN+EDM++FQPMHPRDCSQVR+LAA
Subjt:  LLVKSCRSASARKKKNAAHFASSWPPK---SKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKN-------AMEFDRSASGKSRSKGSAGKGFS
        IYRLHSGCQGSGKKRFVTVTRTQ+TGMPS+SD VRL+QLIG RDEDNDFSVAEGSNI+S+G ++SREKKN        +EFD+S S KSRSKGSAGKG S
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKN-------AMEFDRSASGKSRSKGSAGKGFS

Query:  QKKTSNKYADQPVSFISSGMMQPNAVE---IEPSSNVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM
        QK T  KYADQPVSF+S G+MQP  +E   +  +++VDKGKDIVA SEMIEM ++NV N DISR+ IG+FE+HTKGFGSKMMAKMGFVEGGGLGKDGQGM
Subjt:  QKKTSNKYADQPVSFISSGMMQPNAVE---IEPSSNVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM

Query:  ACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNP
          PIEV+KRPKSLGLG+EFSE+STSA   +QAS  S+ RT        + GA E     KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQG+VNP
Subjt:  ACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNP

Query:  LLPVRRPKARGLGAK
        LLPVRRPKARGLGAK
Subjt:  LLPVRRPKARGLGAK

A0A5A7U8V1 Protein SQS14.4e-30472.39Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  KSS  F KNK+  ARRRSD  S      NLFVDGGFLSDWQFQ SPP S+RE +SR+KGKS SKS +LD +K ASSSGTKQS    I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQEALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSPT
        EYP A +QE L S SRGLQ DAECSLD+SQPFIL+  NSK+++I+AYVD+ P    D+L+FTYDYGTSFVLGDSSHRGLGF D+D LV   NTDDDS+  
Subjt:  EYPLAHHQEALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSPT

Query:  PVEEQGLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDESSE
          E+ GLCT SL  E++TGT E+V+ R GVE A+EM+AEAS  NK S  + S +NSGFLSIGGVRLYTQDVSDEESDD+GE SDG S+YSEPLE+DESSE
Subjt:  PVEEQGLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDESSE

Query:  SDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLALDD
         DSS EM CS SDIDDEVAEDYLEG+GGS++ILKSKWL+KQEL E  DDSS+SSLDDTL+KLGGIALQE SKEYG  KT SR K S+VSRD WS+LALDD
Subjt:  SDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLALDD

Query:  LLVKSCRSASARKKKNAAHFASSWPPK---SKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQLAA
        +LVK  RS S RK+KNA+ FA SWPPK   SKA+RKYPGEKKKYRKE IAAKRRERMLNRGVDL QI+L+LEHMVLN+EDM++FQPMHPRDCSQVR+LAA
Subjt:  LLVKSCRSASARKKKNAAHFASSWPPK---SKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKN-------AMEFDRSASGKSRSKGSAGKGFS
        IYRLHSGCQGSGKKRFVTVTRTQ+TGMPS+SD VRL+QLIG RDEDNDFSVAEGSNI+S+G ++SREKKN        +EFD+S S KSRSKGSAGKG S
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKN-------AMEFDRSASGKSRSKGSAGKGFS

Query:  QKKTSNKYADQPVSFISSGMMQPNAVE---IEPSSNVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM
        QK T  KYADQPVSF+S G+MQP  +E   +  +++VDKGKDIVA SEMIEM ++NV N DISR+ IG+FE+HTKGFGSKMMAKMGFVEGGGLGKDGQGM
Subjt:  QKKTSNKYADQPVSFISSGMMQPNAVE---IEPSSNVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM

Query:  ACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNP
          PIEV+KRPKSLGLG+EFSE+STSA   +QAS  S+ RT          GA E     KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLG+DSQG+VNP
Subjt:  ACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNP

Query:  LLPVRRPKARGLGAK
        LLPVRRPKARGLGAK
Subjt:  LLPVRRPKARGLGAK

A0A6J1EPV3 Protein SQS10.0e+0074.72Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGGFLSDWQFQ SPPSS+RE +SRSKGKS SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS S+GLQ DA+CSLDNSQPFILLNSNSK+++I+AYVDDKPP KEDD++FTYDYGT+F+LGDSSHRGLGFH+ED LVRN N+DDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++TG+ ERVECRE VE+A EMLAEASSPNKYS+GVCS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSE LE+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGGSEHILKSKWL+KQELAE DDD S+SSLDDTLEKL GIALQE SKEYG KKTPSRKKS+IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QLIG RD DNDFSVAEG NI+S G N+SREKKNA       +E ++S S KSR++GSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++NVKNMDISR+ IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------
        MACPIEVIKRPKSLGLGVEFSE+STSAGD +Q S GSA RTTGALGKS ++GAFEEHTK                                         
Subjt:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------

Query:  ----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG
                                    GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMVNPLLP RRPKARGLGAKG
Subjt:  ----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG

A0A6J1KLR0 Protein SQS10.0e+0079.1Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGGFLSDWQFQ +PPSS+RE +SRSKG+S SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS SRGLQ DA+CSLDNSQPFILLNSNSK+++I+AYVDDKPP KED L+ TYDYGT F+LGDSSHRGLGFHDED LVRN NTDDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++TG+ ERVECRE VE+A EMLAEASSPNKYS+G CS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSEPLE+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGG EHILKSKWL+KQELAE DDD S+SSLDDTLEKL  IALQE SKEYG KKTPSR KS IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QLIG RD+DNDFSVAEG NI+S G N+SREKKNA       +E ++S S KSR+KGSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++NVKNMDISR+ IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFS-ESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMV
        MACPIEVIKRPKSLGLGVEFS E+STSAGD+ ++ R S   TTGAL              GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMV
Subjt:  MACPIEVIKRPKSLGLGVEFS-ESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMV

Query:  NPLLPVRRPKARGLGAKG
        NPLLPVRRPKARGLGAKG
Subjt:  NPLLPVRRPKARGLGAKG

A0A6J1KSJ4 Protein SQS10.0e+0074.52Show/hide
Query:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY
        M GGRRR N  K S GF KNK    RRRSD SSS   R NLFVDGGFLSDWQFQ +PPSS+RE +SRSKG+S SKS  LD +K ASSSGTKQS+ N I Y
Subjt:  MTGGRRRPN-PKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAY

Query:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP
        EYP A HQE  LHS SRGLQ DA+CSLDNSQPFILLNSNSK+++I+AYVDDKPP KED L+ TYDYGT F+LGDSSHRGLGFHDED LVRN NTDDD SP
Subjt:  EYPLAHHQE-ALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSP

Query:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES
        T VEEQ GLCT SLPS+++TG+ ERVECRE VE+A EMLAEASSPNKYS+G CS +NSGFLSIGGVRLYTQDVSDEESDD+GELS+G+S+YSEPLE+DES
Subjt:  TPVEEQ-GLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDES

Query:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL
        SESDSSAEM CS SDIDDEVAEDYLEGIGG EHILKSKWL+KQELAE DDD S+SSLDDTLEKL  IALQE SKEYG KKTPSR KS IVSRD+WSSLAL
Subjt:  SESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLAL

Query:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL
        DDLL+K  RSASARKKKNAAHFA SWP   PKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMVLNREDMFSFQPMHPRDCSQVR+L
Subjt:  DDLLVKSCRSASARKKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQL

Query:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG
        AAIYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSD VRL QLIG RD+DNDFSVAEG NI+S G N+SREKKNA       +E ++S S KSR+KGSAGKG
Subjt:  AAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNA-------MEFDRSASGKSRSKGSAGKG

Query:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG
         SQKKT  KYADQPVSF+S G+MQP+AVEI  S+  +VDKGKDIV ASEM+E+T++NVKNMDISR+ IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQG
Subjt:  FSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSS--NVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQG

Query:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------
        MACPIEVIKRPKSLGLG+EFSE+STSAGD +Q S GSA RTTG LGKSK++GAFEEHTK                                         
Subjt:  MACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTK-----------------------------------------

Query:  -----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG
                                     GKSK++GAFEEHTKGFGSKMMAKMGFVEGMGLG+DSQGMVNPLLPVRRPKARGLGAKG
Subjt:  -----------------------------GKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG

SwissProt top hitse value%identityAlignment
A4UMC6 Tuftelin-interacting protein 114.3e-0638.96Show/hide
Query:  KSKQVGAFEEHTKG------KSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG
        K K  G F+   KG           G++E HTKG G K++ KMG+V G GLG+++QG++NP+   +R     +GA G
Subjt:  KSKQVGAFEEHTKG------KSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG

A7SBN6 Zinc finger CCCH-type with G patch domain-containing protein3.0e-0738.78Show/hide
Query:  LGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGL
        LG E  ES  S  DS Q+  G ++ +  A+ +   V  +     G    +G +E+HTKG GSK+MAKMG++ G GLG+D +G V P+  V  P+ + L
Subjt:  LGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGL

B3N8L3 Zinc finger CCCH-type with G patch domain-containing protein3.3e-0646.3Show/hide
Query:  SKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGA
        ++++GA+EE T+G GSK+M KMG++ G GLG D +G+V P+     P+ R L A
Subjt:  SKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGA

Q5ZII9 Tuftelin-interacting protein 114.3e-0638.96Show/hide
Query:  KSKQVGAFEEHTKG------KSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG
        K K  G F+   KG           G++E HTKG G K++ KMG+V G GLG+++QG++NP+   +R     +GA G
Subjt:  KSKQVGAFEEHTKG------KSKQIGAFEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG

Q6K687 Zinc finger CCCH domain-containing protein 181.9e-0631.58Show/hide
Query:  MTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM--ACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQ
        + S N+  +         +E HT+G  SKMMAKMG+ EG GLG  GQGM    P++V+   +SL   V  SE + S G   + SRG         GK K+
Subjt:  MTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM--ACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQ

Query:  VGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGL--GRDSQGMVN
           F E  +    +    E     F +  +      EG  +   +DS G  N
Subjt:  VGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGFVEGMGL--GRDSQGMVN

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain1.7e-0527.31Show/hide
Query:  DNDFSVA--EGSN-IRSQGGNKSREKKNAMEFDRSASGKSRSKGSAGKGFSQK----KTSNKYAD--QPVSFISSGMMQPNAVEIEPSS---NVDKGKDI
        DND+     EG   +  +   K ++ KN   +   A   S S  S G G  +K    + S + AD  +PV+F+S+G + PN  EI+  S   N +K +D 
Subjt:  DNDFSVA--EGSN-IRSQGGNKSREKKNAMEFDRSASGKSRSKGSAGKGFSQK----KTSNKYAD--QPVSFISSGMMQPNAVEIEPSS---NVDKGKDI

Query:  VAASEMIE-------------------MTSNNVKNMD------ISREY---------------------------------IGAFEVHTKGFGSKMMAKM
        +  ++MI+                     +N   + D      + ++                                  IG FE  TKG G K++ KM
Subjt:  VAASEMIE-------------------MTSNNVKNMD------ISREY---------------------------------IGAFEVHTKGFGSKMMAKM

Query:  GFVEGGGLGKDGQGMACPIEVIKRPKSLGLGV-EFSES
        G+ +GGGLGK+ QG+  PIE   RPK++G+G  +F E+
Subjt:  GFVEGGGLGKDGQGMACPIEVIKRPKSLGLGV-EFSES

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein2.0e-0654.35Show/hide
Query:  FEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGL
        +E HT+G  SKMMA MG+ EGMGLG   QG++NP+L    P  R L
Subjt:  FEEHTKGFGSKMMAKMGFVEGMGLGRDSQGMVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain3.2e-0429.41Show/hide
Query:  KSREKKNAMEFDRSASGKSRSKGSAGKGFSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSSNVDKGKDIVAASEMIEMTSN--NVKNMDISREYIGAFE
        K R+ K    +       S S  S G   S++K   K   +PV F S+G +     +   +  +D+  D +    + +  ++  +V+  +  +E    FE
Subjt:  KSREKKNAMEFDRSASGKSRSKGSAGKGFSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSSNVDKGKDIVAASEMIEMTSN--NVKNMDISREYIGAFE

Query:  VHTKGFGSKMMAKMGFVEGGGLGKDGQGMACPIEVIKRPKSLGLGV-EFSESS
          + G G K++ KMG+ +G GLGK+ QG+  PIEV  RPK++G+G  +F E +
Subjt:  VHTKGFGSKMMAKMGFVEGGGLGKDGQGMACPIEVIKRPKSLGLGV-EFSESS

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain3.2e-0429.41Show/hide
Query:  KSREKKNAMEFDRSASGKSRSKGSAGKGFSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSSNVDKGKDIVAASEMIEMTSN--NVKNMDISREYIGAFE
        K R+ K    +       S S  S G   S++K   K   +PV F S+G +     +   +  +D+  D +    + +  ++  +V+  +  +E    FE
Subjt:  KSREKKNAMEFDRSASGKSRSKGSAGKGFSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSSNVDKGKDIVAASEMIEMTSN--NVKNMDISREYIGAFE

Query:  VHTKGFGSKMMAKMGFVEGGGLGKDGQGMACPIEVIKRPKSLGLGV-EFSESS
          + G G K++ KMG+ +G GLGK+ QG+  PIEV  RPK++G+G  +F E +
Subjt:  VHTKGFGSKMMAKMGFVEGGGLGKDGQGMACPIEVIKRPKSLGLGV-EFSESS

AT3G09850.1 D111/G-patch domain-containing protein3.5e-14444.51Show/hide
Query:  GGRRRPNPKSSHGFNKNKSIPARRRSDTSSSS----------SNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRK-EASSSGTKQ
        GGRRR N   S G NK KS     R++ +S++           N   LFV+GG LSD++  P+  + SR  SS  KG    +S S++  K  AS+SG ++
Subjt:  GGRRRPNPKSSHGFNKNKSIPARRRSDTSSSS----------SNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRK-EASSSGTKQ

Query:  SHVNGIAYEYPLAHHQEALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKP-PAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNP
           N   Y+YP    +E L   S G++ D      N  P +L    S+ ++I+A++D  P  +K   + + Y+Y  S+VLGD SH+GLGF D+     + 
Subjt:  SHVNGIAYEYPLAHHQEALHSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKP-PAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNP

Query:  NTDDDSSPTPVEEQGLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYS-
        +    S P  + +QG    S   EE+   G+ V+   G +  DE++ +     K        +NSGF+SIGG++LYT+DVS EESD   E++D +   S 
Subjt:  NTDDDSSPTPVEEQGLCTASLPSEEKTGTGERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYS-

Query:  --EPLETDESSESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEF----DDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKK
              + E SESDSS +MF S S+IDD+VA+DYLEGIGGSE +L + WL +Q L +     DD SS+ S D    KL GI LQ+ S EYG KKT     
Subjt:  --EPLETDESSESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEHILKSKWLIKQELAEF----DDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKK

Query:  SSIVSRDDWSSLALDDLL-VKSCRSASAR--KKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDM
        S        + LA+DDL+ VK  RS S +  KKK  A F  SWP   PKSK SR +PGE KK+RKE IA KRRERML RGVDL  IN +LE+ VL   DM
Subjt:  SSIVSRDDWSSLALDDLL-VKSCRSASAR--KKKNAAHFASSWP---PKSKASRKYPGEKKKYRKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDM

Query:  FSFQPMHPRDCSQVRQLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNAMEFDRSASGK
          FQ MH RDCSQVR+LA +YRL S C GSGKK FVTVTRT  T MPS+SD +R+++LIG  DED DF+V+ G  ++ + G+  R+K       R    +
Subjt:  FSFQPMHPRDCSQVRQLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEGSNIRSQGGNKSREKKNAMEFDRSASGK

Query:  SRSKGSAGKGFSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSSNVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGG
             S GK        + YADQPVSF+SSG++       + S  V+K      A E+ E T       D     IGAFEVHT+GFGSKMMAKMGF++GG
Subjt:  SRSKGSAGKGFSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSSNVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHTKGFGSKMMAKMGFVEGG

Query:  GLGKDGQGMACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKG----KSKQIGAFEEHTKGFGSKMMAKMGFVEG
        GLGKDG+G+A PIE ++RPKSLGLG++FS  +     S   +  +    + + GK  +    +    G    + K++GAFE+HT GFGS+MMA+MGFVEG
Subjt:  GLGKDGQGMACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKG----KSKQIGAFEEHTKGFGSKMMAKMGFVEG

Query:  MGLGRDSQGMVNPLLPVRRPKARGLGAKG
         GLGR+SQG+VNPL+ VRRP+ARG+GA+G
Subjt:  MGLGRDSQGMVNPLLPVRRPKARGLGAKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGGCGGAAGGCGAAGACCGAACCCTAAATCCAGCCATGGCTTCAACAAAAACAAGAGCATTCCCGCCCGAAGAAGATCCGACACGTCGTCGTCGTCGTCAAATCG
ACGGAATTTGTTCGTCGATGGCGGTTTCCTCTCTGATTGGCAGTTTCAGCCCTCTCCGCCTAGTTCGTCTCGAGAGAAAAGTTCTAGGTCGAAAGGGAAATCTGTATCAA
AATCGGATAGTTTAGATCATAGAAAAGAAGCTTCGTCTAGTGGGACTAAACAATCTCATGTTAATGGTATTGCCTACGAATATCCCCTTGCTCATCATCAGGAAGCGTTG
CATTCGGGATCACGGGGTTTGCAGATTGATGCAGAGTGTAGTTTAGACAACTCACAGCCTTTTATATTACTTAATTCTAATTCTAAGAATAGTAAAATTATCGCTTATGT
GGATGATAAACCACCTGCAAAGGAAGATGATTTGAAATTTACTTATGATTATGGTACAAGTTTTGTATTAGGTGACAGTTCTCATAGAGGACTAGGGTTTCATGACGAGG
ATGGACTTGTTAGGAATCCAAACACCGATGATGATTCATCACCCACACCAGTGGAAGAACAAGGATTGTGTACTGCTTCATTGCCCTCCGAGGAGAAAACTGGTACCGGT
GAGAGGGTTGAATGCAGGGAAGGGGTTGAGATGGCTGATGAGATGCTAGCAGAAGCGTCATCCCCCAACAAATATTCATATGGTGTGTGCTCTTCACAAAATTCGGGTTT
CTTGTCAATTGGTGGTGTGAGATTATACACCCAAGATGTATCTGATGAAGAAAGTGATGATAATGGAGAGTTATCAGATGGAAATTCTCAATACTCCGAGCCTTTAGAGA
CGGATGAATCATCTGAAAGTGATAGTTCTGCAGAAATGTTTTGCAGTGATTCAGATATTGACGACGAGGTGGCTGAAGACTATCTTGAAGGGATTGGTGGAAGTGAACAT
ATTTTAAAATCCAAATGGTTAATAAAACAAGAGTTGGCGGAATTTGATGATGATAGTTCTACCAGTAGCCTTGATGATACTTTAGAAAAATTAGGTGGCATTGCTTTGCA
GGAAGGGTCTAAAGAATATGGAACTAAGAAAACTCCTTCAAGGAAGAAAAGCTCCATTGTTTCTAGAGACGATTGGTCATCATTGGCTTTAGATGACTTGCTAGTAAAAA
GTTGTAGATCTGCATCAGCTAGAAAGAAGAAGAATGCTGCTCATTTTGCTTCTTCTTGGCCTCCAAAAAGTAAAGCTTCAAGAAAGTATCCTGGTGAAAAAAAGAAATAT
CGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGCTTAATCGGGGTGTTGATCTATGGCAGATAAATCTGAAATTGGAGCACATGGTTCTGAATAGAGAAGATAT
GTTTTCTTTCCAACCTATGCATCCTCGTGATTGTTCCCAGGTACGACAGCTGGCAGCTATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGTAAAAAAAGGTTTGTAA
CAGTAACTCGAACACAATACACAGGAATGCCATCATCAAGTGATCACGTTCGCCTACAACAGCTAATAGGAACAAGAGACGAGGACAATGATTTTTCTGTTGCTGAAGGC
TCAAATATAAGATCACAGGGCGGCAACAAAAGCAGAGAAAAGAAGAATGCCATGGAGTTCGATCGATCTGCAAGTGGCAAGTCGAGGTCAAAGGGTTCTGCAGGTAAGGG
ATTTAGTCAGAAAAAGACTAGCAATAAGTATGCGGATCAACCGGTCTCATTTATTTCATCCGGAATGATGCAACCAAATGCGGTTGAGATAGAACCTAGCAGCAATGTAG
ATAAGGGTAAGGATATTGTAGCTGCATCAGAAATGATTGAGATGACTAGTAACAATGTCAAGAACATGGATATTAGTAGGGAATACATTGGTGCATTTGAGGTGCACACC
AAGGGGTTTGGGTCGAAAATGATGGCAAAAATGGGATTTGTAGAAGGTGGAGGACTGGGGAAAGATGGTCAAGGAATGGCCTGCCCCATTGAAGTGATCAAGAGGCCGAA
ATCACTTGGGTTAGGCGTCGAGTTCTCGGAGTCCTCGACTTCTGCCGGCGATAGCCACCAAGCAAGCCGAGGATCCGCCACAAGAACTACAGGAGCCTTGGGAAAATCAA
AACAAGTCGGTGCTTTTGAAGAGCACACGAAAGGAAAATCAAAACAAATCGGTGCTTTTGAAGAGCACACGAAAGGATTCGGGTCGAAAATGATGGCGAAGATGGGGTTT
GTTGAAGGCATGGGATTGGGAAGAGATTCACAAGGAATGGTGAACCCTTTGCTCCCTGTTAGGCGGCCTAAAGCACGAGGGTTGGGTGCCAAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCGGCGGAAGGCGAAGACCGAACCCTAAATCCAGCCATGGCTTCAACAAAAACAAGAGCATTCCCGCCCGAAGAAGATCCGACACGTCGTCGTCGTCGTCAAATCG
ACGGAATTTGTTCGTCGATGGCGGTTTCCTCTCTGATTGGCAGTTTCAGCCCTCTCCGCCTAGTTCGTCTCGAGAGAAAAGTTCTAGGTCGAAAGGGAAATCTGTATCAA
AATCGGATAGTTTAGATCATAGAAAAGAAGCTTCGTCTAGTGGGACTAAACAATCTCATGTTAATGGTATTGCCTACGAATATCCCCTTGCTCATCATCAGGAAGCGTTG
CATTCGGGATCACGGGGTTTGCAGATTGATGCAGAGTGTAGTTTAGACAACTCACAGCCTTTTATATTACTTAATTCTAATTCTAAGAATAGTAAAATTATCGCTTATGT
GGATGATAAACCACCTGCAAAGGAAGATGATTTGAAATTTACTTATGATTATGGTACAAGTTTTGTATTAGGTGACAGTTCTCATAGAGGACTAGGGTTTCATGACGAGG
ATGGACTTGTTAGGAATCCAAACACCGATGATGATTCATCACCCACACCAGTGGAAGAACAAGGATTGTGTACTGCTTCATTGCCCTCCGAGGAGAAAACTGGTACCGGT
GAGAGGGTTGAATGCAGGGAAGGGGTTGAGATGGCTGATGAGATGCTAGCAGAAGCGTCATCCCCCAACAAATATTCATATGGTGTGTGCTCTTCACAAAATTCGGGTTT
CTTGTCAATTGGTGGTGTGAGATTATACACCCAAGATGTATCTGATGAAGAAAGTGATGATAATGGAGAGTTATCAGATGGAAATTCTCAATACTCCGAGCCTTTAGAGA
CGGATGAATCATCTGAAAGTGATAGTTCTGCAGAAATGTTTTGCAGTGATTCAGATATTGACGACGAGGTGGCTGAAGACTATCTTGAAGGGATTGGTGGAAGTGAACAT
ATTTTAAAATCCAAATGGTTAATAAAACAAGAGTTGGCGGAATTTGATGATGATAGTTCTACCAGTAGCCTTGATGATACTTTAGAAAAATTAGGTGGCATTGCTTTGCA
GGAAGGGTCTAAAGAATATGGAACTAAGAAAACTCCTTCAAGGAAGAAAAGCTCCATTGTTTCTAGAGACGATTGGTCATCATTGGCTTTAGATGACTTGCTAGTAAAAA
GTTGTAGATCTGCATCAGCTAGAAAGAAGAAGAATGCTGCTCATTTTGCTTCTTCTTGGCCTCCAAAAAGTAAAGCTTCAAGAAAGTATCCTGGTGAAAAAAAGAAATAT
CGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGCTTAATCGGGGTGTTGATCTATGGCAGATAAATCTGAAATTGGAGCACATGGTTCTGAATAGAGAAGATAT
GTTTTCTTTCCAACCTATGCATCCTCGTGATTGTTCCCAGGTACGACAGCTGGCAGCTATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGTAAAAAAAGGTTTGTAA
CAGTAACTCGAACACAATACACAGGAATGCCATCATCAAGTGATCACGTTCGCCTACAACAGCTAATAGGAACAAGAGACGAGGACAATGATTTTTCTGTTGCTGAAGGC
TCAAATATAAGATCACAGGGCGGCAACAAAAGCAGAGAAAAGAAGAATGCCATGGAGTTCGATCGATCTGCAAGTGGCAAGTCGAGGTCAAAGGGTTCTGCAGGTAAGGG
ATTTAGTCAGAAAAAGACTAGCAATAAGTATGCGGATCAACCGGTCTCATTTATTTCATCCGGAATGATGCAACCAAATGCGGTTGAGATAGAACCTAGCAGCAATGTAG
ATAAGGGTAAGGATATTGTAGCTGCATCAGAAATGATTGAGATGACTAGTAACAATGTCAAGAACATGGATATTAGTAGGGAATACATTGGTGCATTTGAGGTGCACACC
AAGGGGTTTGGGTCGAAAATGATGGCAAAAATGGGATTTGTAGAAGGTGGAGGACTGGGGAAAGATGGTCAAGGAATGGCCTGCCCCATTGAAGTGATCAAGAGGCCGAA
ATCACTTGGGTTAGGCGTCGAGTTCTCGGAGTCCTCGACTTCTGCCGGCGATAGCCACCAAGCAAGCCGAGGATCCGCCACAAGAACTACAGGAGCCTTGGGAAAATCAA
AACAAGTCGGTGCTTTTGAAGAGCACACGAAAGGAAAATCAAAACAAATCGGTGCTTTTGAAGAGCACACGAAAGGATTCGGGTCGAAAATGATGGCGAAGATGGGGTTT
GTTGAAGGCATGGGATTGGGAAGAGATTCACAAGGAATGGTGAACCCTTTGCTCCCTGTTAGGCGGCCTAAAGCACGAGGGTTGGGTGCCAAAGGTTAG
Protein sequenceShow/hide protein sequence
MTGGRRRPNPKSSHGFNKNKSIPARRRSDTSSSSSNRRNLFVDGGFLSDWQFQPSPPSSSREKSSRSKGKSVSKSDSLDHRKEASSSGTKQSHVNGIAYEYPLAHHQEAL
HSGSRGLQIDAECSLDNSQPFILLNSNSKNSKIIAYVDDKPPAKEDDLKFTYDYGTSFVLGDSSHRGLGFHDEDGLVRNPNTDDDSSPTPVEEQGLCTASLPSEEKTGTG
ERVECREGVEMADEMLAEASSPNKYSYGVCSSQNSGFLSIGGVRLYTQDVSDEESDDNGELSDGNSQYSEPLETDESSESDSSAEMFCSDSDIDDEVAEDYLEGIGGSEH
ILKSKWLIKQELAEFDDDSSTSSLDDTLEKLGGIALQEGSKEYGTKKTPSRKKSSIVSRDDWSSLALDDLLVKSCRSASARKKKNAAHFASSWPPKSKASRKYPGEKKKY
RKETIAAKRRERMLNRGVDLWQINLKLEHMVLNREDMFSFQPMHPRDCSQVRQLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDHVRLQQLIGTRDEDNDFSVAEG
SNIRSQGGNKSREKKNAMEFDRSASGKSRSKGSAGKGFSQKKTSNKYADQPVSFISSGMMQPNAVEIEPSSNVDKGKDIVAASEMIEMTSNNVKNMDISREYIGAFEVHT
KGFGSKMMAKMGFVEGGGLGKDGQGMACPIEVIKRPKSLGLGVEFSESSTSAGDSHQASRGSATRTTGALGKSKQVGAFEEHTKGKSKQIGAFEEHTKGFGSKMMAKMGF
VEGMGLGRDSQGMVNPLLPVRRPKARGLGAKG