; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033708 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033708
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein SQS1
Genome locationchr3:1268777..1273376
RNA-Seq ExpressionLag0033708
SyntenyLag0033708
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain
IPR001374 - R3H domain
IPR034082 - Protein SQS1, R3H domain
IPR036867 - R3H domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587849.1 Zinc finger CCCH-type with G patch domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0081.75Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGGFLSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ES+GLQNDA+CSLDNSQPFILLNSN  ++QIVAYVD+KPPLK DD++FTYDYGT+F+LGDSSHRGLGFH+EDE VRN N+D+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CRE VE  +E L EASSPNKYS GVCSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + E  ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL +SDDD SSSSLDDTLEKL GIALQEASKEYG+KKTPSRKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QLIGARDEDNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+QS SSK+R++GSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLGVEFSEASTSAGD NQ S GSA RTTGA+GKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QG+  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK

XP_022929809.1 uncharacterized protein LOC111436305 [Cucurbita moschata]0.0e+0081.62Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGGFLSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ES+GLQNDA+CSLDNSQPFILLNSN  ++QIVAYVD+KPPLK DD++FTYDYGT+F+LGDSSHRGLGFH+EDE VRN N+D+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CRE VE  +E L EASSPNKYS GVCSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + E  ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL +SDDD SSSSLDDTLEKL GIALQEASKEYG+KKTPSRKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QLIGARD DNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+QS SSK+R++GSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLGVEFSEASTSAGD NQ S GSA RTTGA+GKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QG+  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK

XP_023003174.1 uncharacterized protein LOC111496865 isoform X1 [Cucurbita maxima]0.0e+0081.62Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ESRGLQNDA+CSLDNSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN NTD+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CRE VE  +E L EASSPNKYS G CSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL +SDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QLIGARD+DNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+QS SSK+R+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLG+EFSEASTSAGD NQ S GSA RTTG +GKSKK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QG+  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0082.92Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ESRGLQNDA+CSLDNSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN NTD+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CRE VE  +E L EASSPNKYS G CSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL +SDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QLIGARD+DNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+QS SSK+R+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA
         PIEVIKRPKSLGLGVEFS EASTSAGDN ++ R S   TTGA+GKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGA
Subjt:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA

Query:  KG
        KG
Subjt:  KG

XP_023530152.1 uncharacterized protein LOC111792798 [Cucurbita pepo subsp. pepo]0.0e+0081.77Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGG LSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ESRGLQNDA+CSLDNSQPFILLNSN  +SQIVAYVD+KPP+K DD++FTYDYGT+F+LGDSSHRGLGFHDEDE VRN NTD+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KE GTDERV+CRE VE  +E L EASSPNKYS GVCSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + EPSESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL +SDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSRKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QL+GARDEDNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+ S SSK+R++GSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS+VKNMDISR++IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA
         PIEVIKRPKSLGLGVEFS EASTSAGD NQ SRGSA RTTGA+GKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QG+  P+  ++RPK+ GLG 
Subjt:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA

Query:  K
        +
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A1S3B9V9 Protein SQS10.0e+0077.44Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSSRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEYP
        MAGGRRRTN AK SD FRKNK+NS RR+SDP S   NLFVDGGFLSDWQ Q SPP SAREGNSR+KG    KSA LDRKK ASSSGTKQSN  AIGYEYP
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSSRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEYP

Query:  PVLHQEGLHYESRGLQNDAECSLDNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPVEEEE
           +QE L  ESRGLQNDAECSLD+SQPFILLNS ++QIVAYVDE P L AD+L+FTYDYGTSFVLGDSSHRGLGF D+DE V   NTD+DS T VEE+ 
Subjt:  PVLHQEGLHYESRGLQNDAECSLDNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPVEEEE

Query:  GLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSESDSS
        GLCTGSLS EKETGTDE+VD R  VET NE + EAS  NK SD + SPKNSGFLSIGGVRLYTQDVSDEESDDDGE SDG+S+ + EP ESDESSE DSS
Subjt:  GLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSESDSS

Query:  AEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDDLLVK
         EMSCS SDIDDEVAEDYLEG+GGS++ILKSKWLVKQEL++S DDSSSSSLDDTL+KLGGIALQEASKEYG+ KT SR K S+VSRD WS+LALDD+LVK
Subjt:  AEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDDLLVK

Query:  DTRSTSAKKKKNAADFACSWPPKAPKVKLVE---SEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAIYRL
        D RSTS +K+KNA+ FA SWPPKA   K       EKKKYRKE IAAKRRERMLNRGVDL QI+L+LE MVL +EDM++FQPMHPRDCSQ+RRLAAIYRL
Subjt:  DTRSTSAKKKKNAADFACSWPPKAPKVKLVE---SEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAIYRL

Query:  HSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSSQRKT
        HSGCQGSGKKRFVTVTRTQ+TGMPS+SDQ+RLEQLIGARDEDNDFSVAEGSN++S+ S+RSREKKN K+ GL  LE DQS S+K+RSKGSAGKGSSQ+ T
Subjt:  HSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSSQRKT

Query:  GKKYADQPVSFVSCGVMQPEAVEI----TTSDIN-------ASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI
        GKKYADQPVSFVSCGVMQPE +E+      +D++        SEMIEM TS V N DISRD+IG+FE+HTKGFGSKMMAKMGFVEGGGLGKDGQGM  PI
Subjt:  GKKYADQPVSFVSCGVMQPEAVEI----TTSDIN-------ASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI

Query:  EVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRT-TGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK
        EV+KRPKSLGLG+EFSEASTSA  +NQAS  S+ RT +GA+ KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK
Subjt:  EVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRT-TGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK

A0A6J1BYW1 Protein SQS10.0e+0080.15Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRT  AK SDGFRKNK N+GRR+SDPSSS RGNLFVDGGFLSD Q QSSPPSSAREGNSRSKG    KS NLDR KTAS+SG+K+SN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQEGLHYESRGLQNDAECSLDNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPV-RNPNTDEDSPTPVEE
        PP  HQEGLH ES+GL NDA+CSLDNSQPFILL+S N+QIVAYVDEKP LK DDL+FTYDYGTSFVLGDSSHRGLGFHD+D+ + R+P+TD+ SP  VEE
Subjt:  PPVLHQEGLHYESRGLQNDAECSLDNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPV-RNPNTDEDSPTPVEE

Query:  EEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSESD
        +EGLC GSLSSEKE GTDERV+CR   +  NE L E S+PNKY+D VCS KNSGFLSIGG+RLYTQDVS EESDDDGE SDGSS+ + E  ES ESSESD
Subjt:  EEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSESD

Query:  SSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKT-PSRKKNSIVSRDDWSSLALDDL
        SS EMSC+DSDIDDEVAEDYLEGIGGSE+IL SKWLVKQELV+SDDDSSSS  DDTLEKLGGIALQEASKE+G+KKT PSRKK+SIVS DDWSSLALDDL
Subjt:  SSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKT-PSRKKNSIVSRDDWSSLALDDL

Query:  LVKDTRSTSAKKKKNAADFACSWPPKAPKVKLVE---SEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAI
        LVKD+R+TSAKKKKNAA   CSWPPKAPK K+      EKKKYRKETIAAKRRERM++RGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAAI
Subjt:  LVKDTRSTSAKKKKNAADFACSWPPKAPKVKLVE---SEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAI

Query:  YRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSSQ
        YRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQ+RL QLIGARDED+DFSVA+GSN++SQ  NRSREKKNA++S   ILEL QS S K+RSKGSAGK SSQ
Subjt:  YRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSSQ

Query:  RKTG-KKYADQPVSFVSCGVMQPEAVEITTSDI-----------NASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM
        +KTG KKYAD+PVSFVSCGVMQPE VEITTSD+            A E I+++TSEVKN DI+ D IGAFEVHTKGFGSKMMAKMGFV GGGLGKDGQGM
Subjt:  RKTG-KKYADQPVSFVSCGVMQPEAVEITTSDI-----------NASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGM

Query:  ARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA
        ARPIEVIKRPKSLGLGVEFSEA +S G  NQ SRGS    TGA GK+KKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGA
Subjt:  ARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA

Query:  K
        K
Subjt:  K

A0A6J1EPV3 Protein SQS10.0e+0081.62Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGGFLSDWQ Q+SPPSSAREGNSRSKG    KSA+LDRKK ASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ES+GLQNDA+CSLDNSQPFILLNSN  ++QIVAYVD+KPPLK DD++FTYDYGT+F+LGDSSHRGLGFH+EDE VRN N+D+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CRE VE  +E L EASSPNKYS GVCSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + E  ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGGSE ILKSKWLVKQEL +SDDD SSSSLDDTLEKL GIALQEASKEYG+KKTPSRKK++IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL +INLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QLIGARD DNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+QS SSK+R++GSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLGVEFSEASTSAGD NQ S GSA RTTGA+GKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QG+  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK

A0A6J1KLR0 Protein SQS10.0e+0082.92Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ESRGLQNDA+CSLDNSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN NTD+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CRE VE  +E L EASSPNKYS G CSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL +SDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QLIGARD+DNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+QS SSK+R+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA
         PIEVIKRPKSLGLGVEFS EASTSAGDN ++ R S   TTGA+GKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGA
Subjt:  RPIEVIKRPKSLGLGVEFS-EASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA

Query:  KG
        KG
Subjt:  KG

A0A6J1KSJ4 Protein SQS10.0e+0081.62Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY
        MAGGRRRTN AKPSDGFRKNK +SGRR+SDPSSS RGNLFVDGGFLSDWQ Q++PPSSAREGNSRSKG    KSA+LDRKKTASSSGTKQSN +AIGYEY
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSS-RGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKG----KSANLDRKKTASSSGTKQSNSSAIGYEY

Query:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV
        PP  HQE GLH ESRGLQNDA+CSLDNSQPFILLNSN  ++QIVAYVD+KPPLK D L+ TYDYGT F+LGDSSHRGLGFHDEDE VRN NTD+DSPT V
Subjt:  PPVLHQE-GLHYESRGLQNDAECSLDNSQPFILLNSN--NSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPV

Query:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE
        EE+EGLCTGSL S+KETG+DERV+CRE VE  +E L EASSPNKYS G CSP+NSGFLSIGGVRLYTQDVSDEESDDDGE S+GSS+ + EP ESDESSE
Subjt:  EEEEGLCTGSLSSEKETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSE

Query:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD
        SDSSAEM+CS SDIDDEVAEDYLEGIGG E ILKSKWLVKQEL +SDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR K+ IVSRD+WSSLALDD
Subjt:  SDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDD

Query:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA
        LL+KD+RS SA+KKKNAA FA SWPPKAPK K       EKKKYRKETIAAKRRERMLNRGVDL QINLKLE MVL REDMFSFQPMHPRDCSQ+RRLAA
Subjt:  LLVKDTRSTSAKKKKNAADFACSWPPKAPKVKL---VESEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAA

Query:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS
        IYRLHSGCQGSGKKRFVTVTRTQYTG+PSSSDQ+RL QLIGARD+DNDFSVAEG N++S  SNRSREKKNAK SGL ILEL+QS SSK+R+KGSAGKGSS
Subjt:  IYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSS

Query:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        Q+KTGKKYADQPVSFVSCGVMQP+AVEITTS+          ++ASEM+E+TTS VKNMDISRD+IGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA
Subjt:  QRKTGKKYADQPVSFVSCGVMQPEAVEITTSD----------INASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK
         PIEVIKRPKSLGLG+EFSEASTSAGD NQ S GSA RTTG +GKSKK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QG+  P+  ++RPK+ GLG +
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAK

SwissProt top hitse value%identityAlignment
B3MPC0 Zinc finger CCCH-type with G patch domain-containing protein4.5e-0851.85Show/hide
Query:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA
        ++K+GA+EE+T+G GSK+M KMG++ G GLG D +GIV P+     PK R L A
Subjt:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA

B4JCG4 Zinc finger CCCH-type with G patch domain-containing protein2.9e-0748.15Show/hide
Query:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA
        ++K+GA+E++T+G GSK+MA MG++ G GLG D +GIV P+     P+ R L A
Subjt:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGA

Q5U2Y6 Tuftelin-interacting protein 113.8e-0731.43Show/hide
Query:  GKKYAD--QPVSFVSCGVMQPEAVEITTSDINASEMIEMTTSEVKNMDISRD---------------------------NIGAFEVHTKGFGSKMMAKMG
        GK+  D   PV+F+S G+ +  A E  + D +A E        VK  D  +D                           + G++E HTKG G K++ KMG
Subjt:  GKKYAD--QPVSFVSCGVMQPEAVEITTSDINASEMIEMTTSEVKNMDISRD---------------------------NIGAFEVHTKGFGSKMMAKMG

Query:  FVEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSA
        +V G GLGK+ QG+  PIE  +R     +G   SE +T +
Subjt:  FVEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSA

Q5ZII9 Tuftelin-interacting protein 116.5e-0735.14Show/hide
Query:  KDGQGMARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKK---IGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVR
        +D     +P++  + PK      EF       G N + S+       G VG +K     G++E HTKG G K++ KMG+V G GLGK++QGI+NP+   +
Subjt:  KDGQGMARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKK---IGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVR

Query:  RPKARGLGAKG
        R     +GA G
Subjt:  RPKARGLGAKG

Q6DI35 Tuftelin-interacting protein 115.0e-0741.79Show/hide
Query:  SATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAKG
        ++ R  G +   + +G +E+HT+G G K++ KMG+V G GLGK++QGIVNP+    R     +GA G
Subjt:  SATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGLGAKG

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain9.3e-0953.45Show/hide
Query:  NIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA
        +IG FE  TKG G K++ KMG+ +GGGLGK+ QG+  PIE   RPK++G+G  +F EA
Subjt:  NIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein3.9e-0756.52Show/hide
Query:  FEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGL
        +E HT+G  SKMMA MG+ EGMGLG   QGI+NP+L    P  R L
Subjt:  FEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain4.0e-0431.97Show/hide
Query:  SAGKGSSQRKTGKKYADQPVSFVSCGVMQPEAVEITTS---DINASEMIEMT-------TSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLG
        S   G S+RK  K+   +PV F S G +     +   +   D N   ++ +         + V+  +  ++N   FE  + G G K++ KMG+ +G GLG
Subjt:  SAGKGSSQRKTGKKYADQPVSFVSCGVMQPEAVEITTS---DINASEMIEMT-------TSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLG

Query:  KDGQGMARPIEVIKRPKSLGLG
        K+ QG+  PIEV  RPK++G+G
Subjt:  KDGQGMARPIEVIKRPKSLGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain4.0e-0431.97Show/hide
Query:  SAGKGSSQRKTGKKYADQPVSFVSCGVMQPEAVEITTS---DINASEMIEMT-------TSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLG
        S   G S+RK  K+   +PV F S G +     +   +   D N   ++ +         + V+  +  ++N   FE  + G G K++ KMG+ +G GLG
Subjt:  SAGKGSSQRKTGKKYADQPVSFVSCGVMQPEAVEITTS---DINASEMIEMT-------TSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGFVEGGGLG

Query:  KDGQGMARPIEVIKRPKSLGLG
        K+ QG+  PIEV  RPK++G+G
Subjt:  KDGQGMARPIEVIKRPKSLGLG

AT3G09850.1 D111/G-patch domain-containing protein6.0e-14946.34Show/hide
Query:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSSRGN-------------LFVDGGFLSDWQLQSSPPSSAREGNSRSKG-KSANLDR-KKTASSSGTKQ
        M GG RR +    S G  K KS     +++ +S+ G+             LFV+GG LSD++   +  + +R G+S  KG +S +++R K +AS+SG ++
Subjt:  MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSSRGN-------------LFVDGGFLSDWQLQSSPPSSAREGNSRSKG-KSANLDR-KKTASSSGTKQ

Query:  SNSSAIGYEYPPVLHQEGLHYESRGLQNDAECSLDNSQPFILLNSNNSQIVAYVDEKP-PLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNT
         + +   Y+YP +  +EGL   S G+++D      N  P +L  S  +QIVA++D+ P   K   + + Y+Y  S+VLGD SH+GLGF D+ +    P+ 
Subjt:  SNSSAIGYEYPPVLHQEGLHYESRGLQNDAECSLDNSQPFILLNSNNSQIVAYVDEKP-PLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNT

Query:  DEDSPTPVEEEEGLCTGSLSSEK------ETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSS
            P  + ++     GS S E+      E+G ++     +VV+TP                    +NSGF+SIGG++LYT+DVS EESD + E +D   
Subjt:  DEDSPTPVEEEEGLCTGSLSSEK------ETGTDERVDCREVVETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSS

Query:  DCTDE--PSESDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQEL----VQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTP
        D + +   S S E SESDSS +M  S S+IDD+VA+DYLEGIGGSE +L + WL +Q L    + SDD SSS S D    KL GI LQ+AS EYG KKT 
Subjt:  DCTDE--PSESDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWLVKQEL----VQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTP

Query:  SRKKNSIVSRDDWSSLALDDLL-VKDTRSTSAK--KKKNAADFACSWPPKAPKVKLVES---EKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLK
        +R   S   +   + LA+DDL+ VKD RS S K  KKK  A F  SWP +APK K   S   E KK+RKE IA KRRERML RGVDL  IN +LE  VL+
Subjt:  SRKKNSIVSRDDWSSLALDDLL-VKDTRSTSAK--KKKNAADFACSWPPKAPKVKLVES---EKKKYRKETIAAKRRERMLNRGVDLRQINLKLEQMVLK

Query:  REDMFSFQPMHPRDCSQIRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLR
          DM  FQ MH RDCSQ+RRLA +YRL S C GSGKK FVTVTRT  T MPS+SD+LR+E+LIGA DED DF+V+ G   +S S +R + K +AK    R
Subjt:  REDMFSFQPMHPRDCSQIRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQSSNRSREKKNAKISGLR

Query:  ILELDQSASSKTRSKGSAGKGSSQRKTGKKYADQPVSFVSCGVMQPE-AVEITTSDINASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGF
                  + R+K S GK SS       YADQPVSFVS G++  E AV  T+ +  A E+ E T       D     IGAFEVHT+GFGSKMMAKMGF
Subjt:  ILELDQSASSKTRSKGSAGKGSSQRKTGKKYADQPVSFVSCGVMQPE-AVEITTSDINASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMMAKMGF

Query:  VEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFS----EASTSAGDNNQASRGSA----------TRTTGAVG----KSKKIGAFEEHTKGFGSKMMAKMG
        ++GGGLGKDG+G+A+PIE ++RPKSLGLG++FS    + S S+ +NN A R  +          T   GA G    + K++GAFE+HT GFGS+MMA+MG
Subjt:  VEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFS----EASTSAGDNNQASRGSA----------TRTTGAVG----KSKKIGAFEEHTKGFGSKMMAKMG

Query:  FVEGMGLGKDSQGIVNPLLPVRRPKARGLGAKG
        FVEG GLG++SQGIVNPL+ VRRP+ARG+GA+G
Subjt:  FVEGMGLGKDSQGIVNPLLPVRRPKARGLGAKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGGGAGACGAAGAACGAACCAAGCCAAACCCAGTGATGGCTTCAGGAAGAACAAGAGTAATTCCGGCAGAAGAAAGTCAGACCCTTCCTCATCTAGAGGAAA
TTTGTTCGTCGATGGCGGTTTTTTATCTGATTGGCAGCTCCAGTCATCGCCGCCTAGTTCTGCTCGAGAGGGAAATTCTAGGTCGAAGGGGAAGTCTGCTAATTTAGATC
GTAAAAAGACTGCTTCATCAAGTGGGACTAAACAATCAAATAGCAGTGCTATTGGCTACGAATACCCTCCTGTTCTTCATCAGGAAGGGTTGCATTATGAATCACGGGGA
CTGCAGAATGATGCAGAGTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTAATTCTAATAATAGTCAAATTGTCGCATATGTAGATGAAAAACCACCTTTAAAGGC
AGATGATTTGAAATTTACTTATGATTATGGCACAAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGATTAGGGTTCCACGATGAGGATGAACCTGTTAGAAATCCAAACA
CAGATGAGGATTCACCCACACCAGTGGAAGAAGAAGAAGGATTGTGTACTGGTTCATTGTCCTCTGAGAAGGAAACTGGTACTGATGAGAGGGTTGATTGCAGGGAAGTG
GTTGAGACGCCTAATGAAACGCTAGTAGAAGCATCATCACCCAACAAATATTCAGATGGTGTGTGCTCTCCAAAAAATTCAGGTTTCTTGTCAATTGGTGGTGTGAGATT
ATACACCCAAGATGTATCTGATGAAGAAAGTGATGATGATGGGGAGCCATCAGATGGAAGTTCTGATTGCACTGATGAGCCTTCTGAGTCAGATGAATCGTCTGAAAGTG
ATAGCTCTGCAGAAATGTCTTGCAGTGATTCAGATATCGATGACGAGGTGGCTGAAGATTATCTTGAAGGGATTGGTGGAAGTGAAAGTATTTTAAAATCCAAATGGTTA
GTAAAACAAGAGTTGGTACAGTCTGATGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAAGCATCTAAAGAATATGG
AATTAAGAAAACCCCTTCAAGGAAGAAAAATTCCATTGTTTCAAGAGATGATTGGTCATCATTGGCTCTAGATGACTTGCTAGTAAAAGATACTAGATCTACATCAGCTA
AAAAGAAGAAGAATGCTGCTGATTTTGCCTGTTCTTGGCCTCCAAAGGCTCCAAAAGTAAAGCTGGTAGAAAGTGAAAAGAAGAAATATCGTAAAGAAACAATTGCGGCA
AAGCGTCGAGAAAGAATGCTCAACCGGGGTGTCGATCTAAGGCAAATAAATTTGAAATTGGAGCAGATGGTTCTGAAGAGAGAAGATATGTTTTCTTTCCAACCTATGCA
TCCCCGTGATTGTTCCCAGATACGGCGATTGGCAGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGGAAAAAAAGGTTTGTAACAGTAACTCGAACACAATACA
CAGGAATGCCATCATCAAGTGATCAACTTCGCCTAGAACAGCTAATAGGAGCGAGAGACGAGGATAATGATTTTTCTGTTGCTGAAGGCTCAAACGTAAGATCACAGAGC
AGCAATAGAAGCAGAGAAAAGAAGAATGCCAAAATAAGTGGTTTGCGCATATTGGAGCTTGATCAATCTGCAAGCAGTAAGACGAGGTCGAAGGGTTCTGCAGGTAAGGG
ATCAAGTCAGAGAAAGACTGGCAAAAAGTATGCAGATCAACCAGTCTCATTTGTATCGTGTGGAGTGATGCAACCAGAAGCGGTCGAGATAACAACTAGCGATATTAATG
CATCAGAAATGATTGAGATGACCACTAGCGAAGTCAAGAATATGGACATAAGTAGGGACAATATTGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATG
GCAAAAATGGGATTTGTAGAAGGTGGAGGATTGGGGAAAGATGGTCAAGGAATGGCTCGCCCCATTGAAGTGATCAAACGACCAAAATCGCTTGGGTTAGGCGTCGAGTT
CTCTGAGGCCTCCACTTCTGCTGGTGATAACAACCAAGCAAGTCGAGGCTCCGCCACTAGAACAACTGGAGCTGTGGGAAAATCTAAAAAAATTGGTGCTTTTGAAGAAC
ACACTAAAGGATTCGGGTCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTAGGAAAGGATTCACAAGGAATCGTCAACCCTTTGCTTCCTGTTAGGCGG
CCTAAAGCACGAGGGTTGGGTGCCAAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAGGGAGACGAAGAACGAACCAAGCCAAACCCAGTGATGGCTTCAGGAAGAACAAGAGTAATTCCGGCAGAAGAAAGTCAGACCCTTCCTCATCTAGAGGAAA
TTTGTTCGTCGATGGCGGTTTTTTATCTGATTGGCAGCTCCAGTCATCGCCGCCTAGTTCTGCTCGAGAGGGAAATTCTAGGTCGAAGGGGAAGTCTGCTAATTTAGATC
GTAAAAAGACTGCTTCATCAAGTGGGACTAAACAATCAAATAGCAGTGCTATTGGCTACGAATACCCTCCTGTTCTTCATCAGGAAGGGTTGCATTATGAATCACGGGGA
CTGCAGAATGATGCAGAGTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTAATTCTAATAATAGTCAAATTGTCGCATATGTAGATGAAAAACCACCTTTAAAGGC
AGATGATTTGAAATTTACTTATGATTATGGCACAAGTTTTGTGTTAGGTGACAGTTCTCATAGAGGATTAGGGTTCCACGATGAGGATGAACCTGTTAGAAATCCAAACA
CAGATGAGGATTCACCCACACCAGTGGAAGAAGAAGAAGGATTGTGTACTGGTTCATTGTCCTCTGAGAAGGAAACTGGTACTGATGAGAGGGTTGATTGCAGGGAAGTG
GTTGAGACGCCTAATGAAACGCTAGTAGAAGCATCATCACCCAACAAATATTCAGATGGTGTGTGCTCTCCAAAAAATTCAGGTTTCTTGTCAATTGGTGGTGTGAGATT
ATACACCCAAGATGTATCTGATGAAGAAAGTGATGATGATGGGGAGCCATCAGATGGAAGTTCTGATTGCACTGATGAGCCTTCTGAGTCAGATGAATCGTCTGAAAGTG
ATAGCTCTGCAGAAATGTCTTGCAGTGATTCAGATATCGATGACGAGGTGGCTGAAGATTATCTTGAAGGGATTGGTGGAAGTGAAAGTATTTTAAAATCCAAATGGTTA
GTAAAACAAGAGTTGGTACAGTCTGATGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGCGGCATTGCTTTGCAGGAAGCATCTAAAGAATATGG
AATTAAGAAAACCCCTTCAAGGAAGAAAAATTCCATTGTTTCAAGAGATGATTGGTCATCATTGGCTCTAGATGACTTGCTAGTAAAAGATACTAGATCTACATCAGCTA
AAAAGAAGAAGAATGCTGCTGATTTTGCCTGTTCTTGGCCTCCAAAGGCTCCAAAAGTAAAGCTGGTAGAAAGTGAAAAGAAGAAATATCGTAAAGAAACAATTGCGGCA
AAGCGTCGAGAAAGAATGCTCAACCGGGGTGTCGATCTAAGGCAAATAAATTTGAAATTGGAGCAGATGGTTCTGAAGAGAGAAGATATGTTTTCTTTCCAACCTATGCA
TCCCCGTGATTGTTCCCAGATACGGCGATTGGCAGCAATTTACCGCTTGCATAGTGGATGCCAAGGTTCTGGGAAAAAAAGGTTTGTAACAGTAACTCGAACACAATACA
CAGGAATGCCATCATCAAGTGATCAACTTCGCCTAGAACAGCTAATAGGAGCGAGAGACGAGGATAATGATTTTTCTGTTGCTGAAGGCTCAAACGTAAGATCACAGAGC
AGCAATAGAAGCAGAGAAAAGAAGAATGCCAAAATAAGTGGTTTGCGCATATTGGAGCTTGATCAATCTGCAAGCAGTAAGACGAGGTCGAAGGGTTCTGCAGGTAAGGG
ATCAAGTCAGAGAAAGACTGGCAAAAAGTATGCAGATCAACCAGTCTCATTTGTATCGTGTGGAGTGATGCAACCAGAAGCGGTCGAGATAACAACTAGCGATATTAATG
CATCAGAAATGATTGAGATGACCACTAGCGAAGTCAAGAATATGGACATAAGTAGGGACAATATTGGTGCATTTGAGGTGCACACCAAGGGTTTTGGGTCAAAAATGATG
GCAAAAATGGGATTTGTAGAAGGTGGAGGATTGGGGAAAGATGGTCAAGGAATGGCTCGCCCCATTGAAGTGATCAAACGACCAAAATCGCTTGGGTTAGGCGTCGAGTT
CTCTGAGGCCTCCACTTCTGCTGGTGATAACAACCAAGCAAGTCGAGGCTCCGCCACTAGAACAACTGGAGCTGTGGGAAAATCTAAAAAAATTGGTGCTTTTGAAGAAC
ACACTAAAGGATTCGGGTCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTAGGAAAGGATTCACAAGGAATCGTCAACCCTTTGCTTCCTGTTAGGCGG
CCTAAAGCACGAGGGTTGGGTGCCAAAGGTTAG
Protein sequenceShow/hide protein sequence
MAGGRRRTNQAKPSDGFRKNKSNSGRRKSDPSSSRGNLFVDGGFLSDWQLQSSPPSSAREGNSRSKGKSANLDRKKTASSSGTKQSNSSAIGYEYPPVLHQEGLHYESRG
LQNDAECSLDNSQPFILLNSNNSQIVAYVDEKPPLKADDLKFTYDYGTSFVLGDSSHRGLGFHDEDEPVRNPNTDEDSPTPVEEEEGLCTGSLSSEKETGTDERVDCREV
VETPNETLVEASSPNKYSDGVCSPKNSGFLSIGGVRLYTQDVSDEESDDDGEPSDGSSDCTDEPSESDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSESILKSKWL
VKQELVQSDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKNSIVSRDDWSSLALDDLLVKDTRSTSAKKKKNAADFACSWPPKAPKVKLVESEKKKYRKETIAA
KRRERMLNRGVDLRQINLKLEQMVLKREDMFSFQPMHPRDCSQIRRLAAIYRLHSGCQGSGKKRFVTVTRTQYTGMPSSSDQLRLEQLIGARDEDNDFSVAEGSNVRSQS
SNRSREKKNAKISGLRILELDQSASSKTRSKGSAGKGSSQRKTGKKYADQPVSFVSCGVMQPEAVEITTSDINASEMIEMTTSEVKNMDISRDNIGAFEVHTKGFGSKMM
AKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSAGDNNQASRGSATRTTGAVGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGIVNPLLPVRR
PKARGLGAKG