; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007723 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007723
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein SQS1
Genome locationscaffold2:1167969..1172744
RNA-Seq ExpressionSpg007723
SyntenySpg007723
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain
IPR001374 - R3H domain
IPR034082 - Protein SQS1, R3H domain
IPR036867 - R3H domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587849.1 Zinc finger CCCH-type with G patch domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0081.25Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGGFLSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ES+GLQNDA+CSL NSQPFILLNSN  ++QIVAYVD+KPPLK DD++FTYDYGT+F+LGDSSHRGLGFH+EDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CR  VE+ +E L EASSPNKYS GVC P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S E  ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL ESDDD SSSSLDDTLEKL GIALQEASKEYG+KKTP RKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                      GSNR+REKKNAK SGL ILEL+QS SSKSR++GSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS

Query:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        QKKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS+VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLGVEFSEASTSAGD NQ S GS  RTTGALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_022929809.1 uncharacterized protein LOC111436305 [Cucurbita moschata]0.0e+0081.35Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGGFLSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ES+GLQNDA+CSL NSQPFILLNSN  ++QIVAYVD+KPPLK DD++FTYDYGT+F+LGDSSHRGLGFH+EDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CR  VE+ +E L EASSPNKYS GVC P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S E  ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL ESDDD SSSSLDDTLEKL GIALQEASKEYG+KKTP RKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ---------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                     GSNR+REKKNAK SGL ILEL+QS SSKSR++GSAGKGSSQ
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ---------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ

Query:  KKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS+VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFSEASTSAGD NQ S GS  RTTGALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_023003174.1 uncharacterized protein LOC111496865 isoform X1 [Cucurbita maxima]0.0e+0081.12Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ESRGLQNDA+CSL NSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CR  VE+ +E L EASSPNKYS G C P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL ESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTP R K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                      GSNR+REKKNAK SGL ILEL+QS SSKSR+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS

Query:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        QKKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS+VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLG+EFSEASTSAGD NQ S GS  RTTG LGKSKK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0082.54Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ESRGLQNDA+CSL NSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CR  VE+ +E L EASSPNKYS G C P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL ESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTP R K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                      GSNR+REKKNAK SGL ILEL+QS SSKSR+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS

Query:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        QKKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS+VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
         PIEVIKRPKSLGLGVEFS EASTSAGDN ++ R S   TTGALGKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
Subjt:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  KG
        KG
Subjt:  KG

XP_023530152.1 uncharacterized protein LOC111792798 [Cucurbita pepo subsp. pepo]0.0e+0081.27Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGG LSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ESRGLQNDA+CSL NSQPFILLNSN  +SQIVAYVD+KPP+K DD++FTYDYGT+F+LGDSSHRGLGFHDEDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KE GTDERV+CR  VE+ +E L EASSPNKYS GVC P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S EPSESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL ESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTP RKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                      GSNR+REKKNAK SGL ILEL+ S SSKSR++GSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS

Query:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        QKKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS VKNMDISR++IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
         PIEVIKRPKSLGLGVEFS EASTSAGD NQ SRGS  RTTGALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG 
Subjt:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A1S3B9V9 Protein SQS10.0e+0076.85Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK+SD FRKNK+NS RR+SDP S    NLFVDGGFLSDWQ Q SPP SAREGNSR+KG    KSA LDRKK ASSSGTKQSNG AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQEGLHHESRGLQNDAECSLGNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPVEEE
        P + +QE L  ESRGLQNDAECSL +SQPFILLNS ++QIVAYVDE P L AD+L+FTYDYGTSFVLGDSSHRGLGF D+DE V   N DDDS T VEE+
Subjt:  PPSLHQEGLHHESRGLQNDAECSLGNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPVEEE

Query:  EGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSESDS
         GLCTGSLS EKETGTDE+VD R  VE  NE + EAS  NK SD +  PKNSGFLSIGGVRLYTQDVSDEESDD+ E SDG+S+ S EP ESDESSE DS
Subjt:  EGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSESDS

Query:  SAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDDLLV
        S EMSCS SDIDDEVAEDYLEG+GGS++ILKSKWLVKQEL+ES DDSSSSSLDDTL+KLGGIALQEASKEYG+ KT  R K S+VSRD WS+LALDD+LV
Subjt:  SAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDDLLV

Query:  KDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAIYR
        KD RSTS +K+KNA+ FA SWPPKA  SKA RKYPGEKKKYRKE IAAKRRERMLNRGVDL QI+L+LE MVL +EDM++FQPMHPRDCSQ+RRLAAIYR
Subjt:  KDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAIYR

Query:  LHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQKK
        LHSGCQGSGKKRFVTVT+TQ+TGMPS+SDQ+RLEQ                      GS+R+REKKN K+ GL  LE DQS S+KSRSKGSAGKGSSQK 
Subjt:  LHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQKK

Query:  TGKKYADQPVSFVSCGVMQPEAVEI-TTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARP
        TGKKYADQPVSFVSCGVMQPE +E+    D  DVDK KDIVA SEMIEM TS+V N DISRD+IG+FE+HTKGFGSKMMAKMGFVEGGGLGKDGQGM  P
Subjt:  TGKKYADQPVSFVSCGVMQPEAVEI-TTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARP

Query:  IEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRT-TGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        IEV+KRPKSLGLG+EFSEASTSA  +NQAS  S+ RT +GAL KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGAK
Subjt:  IEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRT-TGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

A0A6J1BYW1 Protein SQS10.0e+0079.9Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRT HAKASDGFRKNK N+GRR+SDPSSSIRGNLFVDGGFLSD Q QSSPPSSAREGNSRSKG    KS NLDR KTAS+SG+K+SNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQEGLHHESRGLQNDAECSLGNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPV-RNPNIDDDSPTPVEE
        PP+ HQEGLH ES+GL NDA+CSL NSQPFILL+S N+QIVAYVDEKP LK DDL+FTYDYGTSFVLGDSSHRGLGFHD+D+ + R+P+ DD SP  VEE
Subjt:  PPSLHQEGLHHESRGLQNDAECSLGNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPV-RNPNIDDDSPTPVEE

Query:  EEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSESD
        +EGLC GSLSSEKE GTDERV+CR   +M NE L E S+PNKY+D VC  KNSGFLSIGG+RLYTQDVS EESDD+ E SDGSS+ S E  ES ESSESD
Subjt:  EEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSESD

Query:  SSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPP-RKKNSIVSRDDWSSLALDDL
        SS EMSC+DSDIDDEVAEDYLEGIGGSE+IL SKWLVKQELVESDDDSSSS  DDTLEKLGGIALQEASKE+G+KKTPP RKK+SIVS DDWSSLALDDL
Subjt:  SSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPP-RKKNSIVSRDDWSSLALDDL

Query:  LVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAI
        LVKD+R+TSAKKKKNA    CSWPPKAPKSK  RKYPGEKKKYRKETIAAKRRERM++RGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAAI
Subjt:  LVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAI

Query:  YRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRL----------------------EQGSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ
        YRLHSGCQGSGKKRFVTVT+TQYTGMPSSSDQ+RL                       QG NR+REKKNA++S   ILEL QS S KSRSKGSAGK SSQ
Subjt:  YRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRL----------------------EQGSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ

Query:  KKTG-KKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKD-IVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM
        KKTG KKYAD+PVSFVSCGVMQPE VEITTSD+ D DK KD I  A E I+++TS+VKN DI+ D IGAFEVHTKGFGSKMMAKMGFV GGGLGKDGQGM
Subjt:  KKTG-KKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKD-IVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM

Query:  ARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        ARPIEVIKRPKSLGLGVEFSEA +S G  NQ SRGST   TGA GK+KKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
Subjt:  ARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  K
        K
Subjt:  K

A0A6J1EPV3 Protein SQS10.0e+0081.35Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGGFLSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ES+GLQNDA+CSL NSQPFILLNSN  ++QIVAYVD+KPPLK DD++FTYDYGT+F+LGDSSHRGLGFH+EDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CR  VE+ +E L EASSPNKYS GVC P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S E  ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL ESDDD SSSSLDDTLEKL GIALQEASKEYG+KKTP RKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ---------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                     GSNR+REKKNAK SGL ILEL+QS SSKSR++GSAGKGSSQ
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ---------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ

Query:  KKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS+VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFSEASTSAGD NQ S GS  RTTGALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

A0A6J1KLR0 Protein SQS10.0e+0082.54Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ESRGLQNDA+CSL NSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CR  VE+ +E L EASSPNKYS G C P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL ESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTP R K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                      GSNR+REKKNAK SGL ILEL+QS SSKSR+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS

Query:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        QKKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS+VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
         PIEVIKRPKSLGLGVEFS EASTSAGDN ++ R S   TTGALGKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
Subjt:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

Query:  KG
        KG
Subjt:  KG

A0A6J1KSJ4 Protein SQS10.0e+0081.12Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY
        MAGGRRRTNHAK SDGFRKNK +SGRR+SDPSSS+RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSNG+AIGYEY
Subjt:  MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNGSAIGYEY

Query:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV
        PP+ HQE GLH ESRGLQNDA+CSL NSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN N DDDSPT V
Subjt:  PPSLHQE-GLHHESRGLQNDAECSLGNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CR  VE+ +E L EASSPNKYS G C P+NSGFLSIGGVRLYTQDVSDEESDD+ E S+GSS+ S EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL ESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTP R K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNA  FA SWPPKAPKSKA  KYPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVT+TQYTG+PSSSDQ+RL Q                      GSNR+REKKNAK SGL ILEL+QS SSKSR+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQ----------------------GSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSS

Query:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        QKKTGKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+TTS+VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLG+EFSEASTSAGD NQ S GS  RTTG LGKSKK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

SwissProt top hitse value%identityAlignment
A7SBN6 Zinc finger CCCH-type with G patch domain-containing protein3.8e-0738.78Show/hide
Query:  NIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRP--KSLGLGVEFSEASTSAGDNNQASRG--STTRTTGALGKSKKIGAFEEHTKG
        ++G +E HTKG GSK+MAKMG++ G GLGKDG+G   PIEV+  P  KSL    E  E +       +  +     + T+ + GK+  +  F  H  G
Subjt:  NIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRP--KSLGLGVEFSEASTSAGDNNQASRG--STTRTTGALGKSKKIGAFEEHTKG

B3MPC0 Zinc finger CCCH-type with G patch domain-containing protein1.3e-0750Show/hide
Query:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        ++K+GA+EE+T+G GSK+M KMG++ G GLG D +G+V P+     PK R L A
Subjt:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

B4HWD7 Zinc finger CCCH-type with G patch domain-containing protein8.4e-0730Show/hide
Query:  DQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIK
        D+  S  S   M+  + +   SD+ D+++++     + M+E++    K      D +GA+E  T+G GSK+M KMG++ G GLG +G+G+  P+  +++ 
Subjt:  DQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIK

Query:  RPKSLGLGVEFSEASTSAGD
        + +SL   +E  EA+    D
Subjt:  RPKSLGLGVEFSEASTSAGD

B4Q8A7 Zinc finger CCCH-type with G patch domain-containing protein2.9e-0730.83Show/hide
Query:  DQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIK
        D+  S  S   M+  + +   SD+ D+++++     + M+E++    K      D +GA+E  T+G GSK+M KMG++ G GLG DG+G+  P+  +++ 
Subjt:  DQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIK

Query:  RPKSLGLGVEFSEASTSAGD
        + +SL   +E  EA+    D
Subjt:  RPKSLGLGVEFSEASTSAGD

Q6K687 Zinc finger CCCH domain-containing protein 181.1e-0635.77Show/hide
Query:  FEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTK---GFGSKM
        +E HT+G  SKMMAKMG+ EG GLG  GQGM  PI  +V+   +SL   V  SE + S G   + SRG   +      +  +    EE  +    F +  
Subjt:  FEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTK---GFGSKM

Query:  MAKMGFVEGMGL--GKDSQGMVN
        +      EG  +   KDS G  N
Subjt:  MAKMGFVEGMGL--GKDSQGMVN

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain1.6e-0528.15Show/hide
Query:  YTGMPSSSDQLRLEQGSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQKKTGKKYAD--QPVSFVSCGVMQPEAVEITT-----SDLKDVDK
        Y G     D+   ++   + ++ KN    G  I     S S  S   GS  K    + +G+K AD  +PV+FVS G + P   EI       +D KD DK
Subjt:  YTGMPSSSDQLRLEQGSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQKKTGKKYAD--QPVSFVSCGVMQPEAVEITT-----SDLKDVDK

Query:  SKDIVAASEMIEMT------------------TSDVKNM------------------------------------DISRDNIGAFEVHTKGFGSKMMAKM
         +D     E +E+                     D  N+                                    +    +IG FE  TKG G K++ KM
Subjt:  SKDIVAASEMIEMT------------------TSDVKNM------------------------------------DISRDNIGAFEVHTKGFGSKMMAKM

Query:  GFVEGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA
        G+ +GGGLGK+ QG+  PIE   RPK++G+G  +F EA
Subjt:  GFVEGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein1.6e-0539.29Show/hide
Query:  SEASTSAGDNNQASRG-----STTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        SE S S+  +  + +G     ST    G    +     +E HT+G  SKMMA MG+ EGMGLG   QG++NP+L    P  R L
Subjt:  SEASTSAGDNNQASRG-----STTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain3.1e-0431.54Show/hide
Query:  KSRSKGSAGKGSSQKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTT--SDVKNMDISRDNIGAFEVHTKGFGSKMMAKMG
        +S S      GS +KK  K    +PV F S G +  + ++    + K +D++ D +    + +     + V+  +  ++N   FE  + G G K++ KMG
Subjt:  KSRSKGSAGKGSSQKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTT--SDVKNMDISRDNIGAFEVHTKGFGSKMMAKMG

Query:  FVEGGGLGKDGQGMARPIEVIKRPKSLGLG
        + +G GLGK+ QG+  PIEV  RPK++G+G
Subjt:  FVEGGGLGKDGQGMARPIEVIKRPKSLGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain3.1e-0431.54Show/hide
Query:  KSRSKGSAGKGSSQKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTT--SDVKNMDISRDNIGAFEVHTKGFGSKMMAKMG
        +S S      GS +KK  K    +PV F S G +  + ++    + K +D++ D +    + +     + V+  +  ++N   FE  + G G K++ KMG
Subjt:  KSRSKGSAGKGSSQKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTT--SDVKNMDISRDNIGAFEVHTKGFGSKMMAKMG

Query:  FVEGGGLGKDGQGMARPIEVIKRPKSLGLG
        + +G GLGK+ QG+  PIEV  RPK++G+G
Subjt:  FVEGGGLGKDGQGMARPIEVIKRPKSLGLG

AT3G09850.1 D111/G-patch domain-containing protein3.0e-14545.62Show/hide
Query:  MAGGRRRTNHAKASDGFRKNKS---------NSGRRKSDPSSSIRGN---LFVDGGFLSDWQLQSSPPSSAREGNSRSKG-KSANLDR-KKTASSSGTKQ
        M GG RR ++   S G  K KS         NS    S      R N   LFV+GG LSD++   +  + +R G+S  KG +S +++R K +AS+SG ++
Subjt:  MAGGRRRTNHAKASDGFRKNKS---------NSGRRKSDPSSSIRGN---LFVDGGFLSDWQLQSSPPSSAREGNSRSKG-KSANLDR-KKTASSSGTKQ

Query:  SNGSAIGYEYPPSLHQEGLHHESRGLQNDAECSLGNSQPFILLNSNNSQIVAYVDEKP-PLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNI
         +G+   Y+YP    +EGL   S G+++D      N  P +L  S  +QIVA++D+ P   K   + + Y+Y  S+VLGD SH+GLGF D+ +   + ++
Subjt:  SNGSAIGYEYPPSLHQEGLHHESRGLQNDAECSLGNSQPFILLNSNNSQIVAYVDEKP-PLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNI

Query:  DDDSPTPVEEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDE-
               + +  G   GS S E+E    +         MP+       +P +        +NSGF+SIGG++LYT+DVS EESD EEE +D   D S + 
Subjt:  DDDSPTPVEEEEGLCTGSLSSEKETGTDERVDCRAVVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDE-

Query:  -PSESDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQEL----VESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNS
          S S E SESDSS +M  S S+IDD+VA+DYLEGIGGSE +L + WL +Q L    + SDD SSS S D    KL GI LQ+AS EYG KKT   +   
Subjt:  -PSESDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQEL----VESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNS

Query:  IVSRDDWSSLALDDLL-VKDTRSTSAK--KKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFS
               + LA+DDL+ VKD RS S K  KKK    F  SWP +APKSK  R +PGE KK+RKE IA KRRERML RGVDL  IN +LE  VL+  DM  
Subjt:  IVSRDDWSSLALDDLL-VKDTRSTSAK--KKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFS

Query:  FQPMHPRDCSQIRRLAAIYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQGSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ----
        FQ MH RDCSQ+RRLA +YRL S C GSGKK FVTVT+T  T MPS+SD+LR+E+      E  +  +SG   ++    +  + ++K SA K  ++    
Subjt:  FQPMHPRDCSQIRRLAAIYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQGSNRNREKKNAKISGLRILELDQSASSKSRSKGSAGKGSSQ----

Query:  --KKTGKK--YADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQ
          K  GKK  YADQPVSFVS G++  E     TS    V+K      A E+ E T       D     IGAFEVHT+GFGSKMMAKMGF++GGGLGKDG+
Subjt:  --KKTGKK--YADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLGVEFS----EASTSAGDNNQASRGST----------TRTTGALG----KSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDS
        G+A+PIE ++RPKSLGLG++FS    + S S+ +NN A R  +          T   GA G    + K++GAFE+HT GFGS+MMA+MGFVEG GLG++S
Subjt:  GMARPIEVIKRPKSLGLGVEFS----EASTSAGDNNQASRGST----------TRTTGALG----KSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDS

Query:  QGMVNPLLPVRRPKARGLGAKG
        QG+VNPL+ VRRP+ARG+GA+G
Subjt:  QGMVNPLLPVRRPKARGLGAKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGGGAGACGAAGAACGAACCATGCCAAAGCCAGTGATGGCTTCAGGAAGAACAAGAGTAATTCCGGCAGAAGAAAGTCAGACCCTTCCTCATCTATTCGAGG
AAATTTGTTCGTCGATGGCGGTTTTTTATCTGATTGGCAGCTCCAGTCATCGCCGCCTAGTTCTGCTCGAGAGGGAAATTCTAGGTCGAAGGGGAAGTCTGCTAATTTAG
ATCGTAAAAAGACTGCTTCATCAAGTGGGACTAAACAATCAAATGGCAGTGCTATTGGCTACGAATACCCTCCTTCTCTTCATCAGGAAGGGTTGCATCATGAATCACGG
GGACTGCAGAATGATGCAGAGTGTAGCTTGGGCAACTCACAGCCTTTTATATTACTTAATTCTAATAATAGTCAAATTGTTGCATATGTAGATGAAAAACCACCTTTAAA
GGCAGATGATTTGAAATTTACTTATGATTATGGCACAAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGATTAGGGTTCCACGATGAGGATGAACCTGTTAGAAATCCAA
ACATAGATGATGATTCACCCACACCAGTGGAAGAAGAAGAAGGATTGTGTACTGGTTCATTGTCCTCTGAGAAGGAAACTGGTACTGATGAGAGGGTTGATTGCAGGGCA
GTGGTTGAGATGCCTAATGAAACGCTAGTAGAAGCATCATCACCCAACAAATATTCAGATGGTGTGTGCTTTCCAAAAAATTCAGGTTTCTTGTCAATTGGTGGTGTGAG
ATTATACACCCAAGATGTATCTGATGAAGAAAGTGATGATGAGGAGGAGCCATCAGATGGAAGTTCTGATCGCTCTGATGAGCCTTCAGAGTCAGATGAATCGTCTGAAA
GTGATAGCTCTGCAGAAATGTCTTGCAGTGATTCAGATATTGATGACGAGGTGGCTGAAGATTATCTTGAAGGGATTGGTGGAAGTGAAAGTATTTTAAAATCCAAATGG
TTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGCAGCCTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAAGCATCTAAAGAATA
TGGAATTAAGAAAACCCCTCCAAGGAAGAAAAATTCCATTGTTTCAAGAGATGATTGGTCATCATTGGCTCTAGATGACTTGCTAGTAAAAGATACTAGATCTACATCAG
CTAAAAAGAAGAAGAATGCTACTGATTTTGCCTGTTCTTGGCCTCCAAAGGCTCCAAAAAGTAAAGCTGGTAGAAAGTATCCTGGTGAAAAGAAGAAATATCGTAAAGAA
ACAATTGCAGCAAAGCGTCGAGAAAGAATGCTCAATCGGGGTGTCGATCTAAGGCAAATAAATTTGAAATTGGAGCAGATGGTTCTGAAGAGAGAAGATATGTTTTCTTT
CCAACCTATGCATCCCCGTGATTGTTCCCAGATACGGCGATTGGCAGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGGAAAAAAAGGTTTGTTACAGTAACTC
AAACACAATACACAGGAATGCCATCATCAAGTGATCAACTTCGCCTAGAACAGGGCAGCAATAGAAACAGAGAAAAGAAGAATGCCAAAATAAGTGGTTTGCGCATATTG
GAGCTTGATCAATCTGCAAGCAGTAAGTCGAGGTCAAAGGGTTCTGCAGGTAAGGGATCAAGTCAGAAAAAGACTGGCAAAAAGTATGCTGATCAACCAGTCTCATTTGT
ATCGTGTGGAGTGATGCAACCAGAAGCGGTCGAGATAACAACTAGCGATCTTAAGGACGTAGATAAAAGCAAGGATATTGTAGCTGCATCAGAAATGATTGAGATGACTA
CTAGCGACGTCAAGAATATGGACATAAGTAGGGACAATATTGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGAAGGT
GGAGGATTGGGGAAAGATGGTCAAGGAATGGCTCGCCCCATTGAAGTGATCAAACGACCAAAATCGCTTGGGTTAGGCGTCGAGTTCTCTGAGGCCTCCACTTCTGCTGG
TGATAACAACCAGGCAAGTCGAGGATCCACCACTAGAACAACTGGAGCTTTGGGAAAATCTAAAAAAATTGGTGCTTTTGAAGAACACACTAAAGGATTTGGGTCGAAGA
TGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTGGGAAAAGATTCACAAGGAATGGTCAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCACGAGGGTTGGGTGCC
AAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAGGGAGACGAAGAACGAACCATGCCAAAGCCAGTGATGGCTTCAGGAAGAACAAGAGTAATTCCGGCAGAAGAAAGTCAGACCCTTCCTCATCTATTCGAGG
AAATTTGTTCGTCGATGGCGGTTTTTTATCTGATTGGCAGCTCCAGTCATCGCCGCCTAGTTCTGCTCGAGAGGGAAATTCTAGGTCGAAGGGGAAGTCTGCTAATTTAG
ATCGTAAAAAGACTGCTTCATCAAGTGGGACTAAACAATCAAATGGCAGTGCTATTGGCTACGAATACCCTCCTTCTCTTCATCAGGAAGGGTTGCATCATGAATCACGG
GGACTGCAGAATGATGCAGAGTGTAGCTTGGGCAACTCACAGCCTTTTATATTACTTAATTCTAATAATAGTCAAATTGTTGCATATGTAGATGAAAAACCACCTTTAAA
GGCAGATGATTTGAAATTTACTTATGATTATGGCACAAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGATTAGGGTTCCACGATGAGGATGAACCTGTTAGAAATCCAA
ACATAGATGATGATTCACCCACACCAGTGGAAGAAGAAGAAGGATTGTGTACTGGTTCATTGTCCTCTGAGAAGGAAACTGGTACTGATGAGAGGGTTGATTGCAGGGCA
GTGGTTGAGATGCCTAATGAAACGCTAGTAGAAGCATCATCACCCAACAAATATTCAGATGGTGTGTGCTTTCCAAAAAATTCAGGTTTCTTGTCAATTGGTGGTGTGAG
ATTATACACCCAAGATGTATCTGATGAAGAAAGTGATGATGAGGAGGAGCCATCAGATGGAAGTTCTGATCGCTCTGATGAGCCTTCAGAGTCAGATGAATCGTCTGAAA
GTGATAGCTCTGCAGAAATGTCTTGCAGTGATTCAGATATTGATGACGAGGTGGCTGAAGATTATCTTGAAGGGATTGGTGGAAGTGAAAGTATTTTAAAATCCAAATGG
TTAGTAAAACAAGAGTTGGTAGAGTCTGATGATGATAGTTCTAGCAGCAGCCTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAAGCATCTAAAGAATA
TGGAATTAAGAAAACCCCTCCAAGGAAGAAAAATTCCATTGTTTCAAGAGATGATTGGTCATCATTGGCTCTAGATGACTTGCTAGTAAAAGATACTAGATCTACATCAG
CTAAAAAGAAGAAGAATGCTACTGATTTTGCCTGTTCTTGGCCTCCAAAGGCTCCAAAAAGTAAAGCTGGTAGAAAGTATCCTGGTGAAAAGAAGAAATATCGTAAAGAA
ACAATTGCAGCAAAGCGTCGAGAAAGAATGCTCAATCGGGGTGTCGATCTAAGGCAAATAAATTTGAAATTGGAGCAGATGGTTCTGAAGAGAGAAGATATGTTTTCTTT
CCAACCTATGCATCCCCGTGATTGTTCCCAGATACGGCGATTGGCAGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGGAAAAAAAGGTTTGTTACAGTAACTC
AAACACAATACACAGGAATGCCATCATCAAGTGATCAACTTCGCCTAGAACAGGGCAGCAATAGAAACAGAGAAAAGAAGAATGCCAAAATAAGTGGTTTGCGCATATTG
GAGCTTGATCAATCTGCAAGCAGTAAGTCGAGGTCAAAGGGTTCTGCAGGTAAGGGATCAAGTCAGAAAAAGACTGGCAAAAAGTATGCTGATCAACCAGTCTCATTTGT
ATCGTGTGGAGTGATGCAACCAGAAGCGGTCGAGATAACAACTAGCGATCTTAAGGACGTAGATAAAAGCAAGGATATTGTAGCTGCATCAGAAATGATTGAGATGACTA
CTAGCGACGTCAAGAATATGGACATAAGTAGGGACAATATTGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATGGCAAAAATGGGATTTGTAGAAGGT
GGAGGATTGGGGAAAGATGGTCAAGGAATGGCTCGCCCCATTGAAGTGATCAAACGACCAAAATCGCTTGGGTTAGGCGTCGAGTTCTCTGAGGCCTCCACTTCTGCTGG
TGATAACAACCAGGCAAGTCGAGGATCCACCACTAGAACAACTGGAGCTTTGGGAAAATCTAAAAAAATTGGTGCTTTTGAAGAACACACTAAAGGATTTGGGTCGAAGA
TGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTGGGAAAAGATTCACAAGGAATGGTCAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCACGAGGGTTGGGTGCC
AAAGGTTAG
Protein sequenceShow/hide protein sequence
MAGGRRRTNHAKASDGFRKNKSNSGRRKSDPSSSIRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKGKSANLDRKKTASSSGTKQSNGSAIGYEYPPSLHQEGLHHESR
GLQNDAECSLGNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNIDDDSPTPVEEEEGLCTGSLSSEKETGTDERVDCRA
VVEMPNETLVEASSPNKYSDGVCFPKNSGFLSIGGVRLYTQDVSDEESDDEEEPSDGSSDRSDEPSESDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKW
LVKQELVESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPPRKKNSIVSRDDWSSLALDDLLVKDTRSTSAKKKKNATDFACSWPPKAPKSKAGRKYPGEKKKYRKE
TIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAIYRLHSGCQGSGKKRFVTVTQTQYTGMPSSSDQLRLEQGSNRNREKKNAKISGLRIL
ELDQSASSKSRSKGSAGKGSSQKKTGKKYADQPVSFVSCGVMQPEAVEITTSDLKDVDKSKDIVAASEMIEMTTSDVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEG
GGLGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSTTRTTGALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
KG