; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016911 (gene) of Snake gourd v1 genome

Gene IDTan0016911
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein SQS1
Genome locationLG09:67155399..67160756
RNA-Seq ExpressionTan0016911
SyntenyTan0016911
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain
IPR001374 - R3H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587849.1 Zinc finger CCCH-type with G patch domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.34Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGGFLSDWQFQ+SPPSSAREGNSRSKGKSGSKSA+LDRKK ASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+S+GLQN A+CSLDNSQPFILL  NSK +QIVAYVDDKPPLK DD+EFTYDYGT+F+LGDSSHRGLGFH+EDELVRN N+DDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KETG+DERV+CRE VE+A+EM+AEASSPNKY + V SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSE LE+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDD SSSSLDDTLEKL GIALQEASKEYG+KKTPSRKKS+IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QLIGARDED DFSVAEG NIK  G NR    KNAK SGL ILE +QS SSKSR++G+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S+VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFSEASTSAGDNQ S GSA RTT ALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_022929809.1 uncharacterized protein LOC111436305 [Cucurbita moschata]0.0e+0085.21Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGGFLSDWQFQ+SPPSSAREGNSRSKGKSGSKSA+LDRKK ASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+S+GLQN A+CSLDNSQPFILL  NSK +QIVAYVDDKPPLK DD+EFTYDYGT+F+LGDSSHRGLGFH+EDELVRN N+DDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KETG+DERV+CRE VE+A+EM+AEASSPNKY + V SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSE LE+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDD SSSSLDDTLEKL GIALQEASKEYG+KKTPSRKKS+IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QLIGARD D DFSVAEG NIK  G NR    KNAK SGL ILE +QS SSKSR++G+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S+VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFSEASTSAGDNQ S GSA RTT ALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_023003174.1 uncharacterized protein LOC111496865 isoform X1 [Cucurbita maxima]0.0e+0085.09Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGGFLSDWQFQ++PPSSAREGNSRSKG+SGSKSA+LDRKKTASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+SRGLQN A+CSLDNSQPFILL  NSK +QIVAYVDDKPPLK D LE TYDYGT F+LGDSSHRGLGFHDEDELVRN NTDDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KETG+DERV+CRE VE+A+EM+AEASSPNKY +   SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSEPLE+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGG EHILKSKWLVKQELAESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR KS IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QLIGARD+D DFSVAEG NIK  G NR    KNAK SGL ILE +QS SSKSR+KG+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S+VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLG+EFSEASTSAGDNQ S GSA RTT  LGKSKK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_023003182.1 uncharacterized protein LOC111496865 isoform X2 [Cucurbita maxima]0.0e+0086.62Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGGFLSDWQFQ++PPSSAREGNSRSKG+SGSKSA+LDRKKTASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+SRGLQN A+CSLDNSQPFILL  NSK +QIVAYVDDKPPLK D LE TYDYGT F+LGDSSHRGLGFHDEDELVRN NTDDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KETG+DERV+CRE VE+A+EM+AEASSPNKY +   SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSEPLE+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGG EHILKSKWLVKQELAESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR KS IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QLIGARD+D DFSVAEG NIK  G NR    KNAK SGL ILE +QS SSKSR+KG+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S+VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFS-EASTSAGDNQASRGSATRTTQ-ALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFS EASTSAGDNQ SR S+ RTT  ALGKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
Subjt:  PIEVIKRPKSLGLGVEFS-EASTSAGDNQASRGSATRTTQ-ALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

XP_023530152.1 uncharacterized protein LOC111792798 [Cucurbita pepo subsp. pepo]0.0e+0085.23Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGG LSDWQFQ+SPPSSAREGNSRSKGKSGSKSA+LDRKK ASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+SRGLQN A+CSLDNSQPFILL  NSK SQIVAYVDDKPP+K DD+EFTYDYGT+F+LGDSSHRGLGFHDEDELVRN NTDDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KE GTDERV+CREGVE+A+EM+AEASSPNKY + V SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSEP E+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSRKKS+IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QL+GARDED DFSVAEG NIK  G NR    KNAK SGL ILE + S SSKSR++G+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S VKNMDISR+SIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFS-EASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFS EASTSAGDNQ SRGSA RTT ALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFS-EASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

TrEMBL top hitse value%identityAlignment
A0A1S3B9V9 Protein SQS10.0e+0080.6Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AK SD FRKNKTN  RRRSDP S    NLFVDGGFLSDWQFQ SPP SAREGNSR+KGKSGSKSA LDRKK ASSSGTKQSNG A GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQEGLHSDSRGLQNGAECSLDNSQPFILLNSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPAEEQ
        P AP QE L S+SRGLQN AECSLD+SQPFILLNSK++QIVAYVD+ P L AD+LEFTYDYGTSFVLGDSSHRGLGF D+DELV   NTDDDS T  EEQ
Subjt:  PPAPYQEGLHSDSRGLQNGAECSLDNSQPFILLNSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPAEEQ

Query:  EGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSESDSS
         GLCTG LS EKETGTDE+VD R GVE ANEMVAEAS  NK    + SPKNSGFLSIGGV+LYTQDVSDEES DDGESSDG+SEYSEPLE+DESSE DSS
Subjt:  EGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSESDSS

Query:  AEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDLLVK
         EMSCS SDIDDEVAEDYLEG+GGS++ILKSKWLVKQEL ES DDSSSSSLDDTL+KLGGIALQEASKEYG+ KT SR K S+VSRD WS+LALDD+LVK
Subjt:  AEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDLLVK

Query:  DYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAIYRL
        D RS S RK+KNA+ FA SWPPKA  SKA+R YPGEKKKYRKE IAAKRRERMLNRGVDL QI+L+LEHMV+N+EDM++FQPMHPRDCSQVRRLAAIYRL
Subjt:  DYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAIYRL

Query:  HSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQKKN
        HSGCQGSGKKR VTVTRTQ+TGMPS+SDQ+RL QLIGARDED DFSVAEGSNIK +G +R    KN K+ GL  LE DQS S+KSRSKG+AGKGSSQK  
Subjt:  HSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQKKN

Query:  GKKYADQPVSFVSCGVMQPEAVEI-TTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI
        GKKYADQPVSFVSCGVMQPE +E+    D  DVDK KDIVA SEMIEM +S+V N DISRDSIG+FE+HTKGFGSKMMAKMGFVEGGGLGKDGQGM  PI
Subjt:  GKKYADQPVSFVSCGVMQPEAVEI-TTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI

Query:  EVIKRPKSLGLGVEFSEASTS-AGDNQASRGSATRT-TQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
        EV+KRPKSLGLG+EFSEASTS AGDNQAS  S+ RT + AL KSK+IGAFE HTKGFGSKMMAKMGFVEGMGLGKDSQG+VNPLLPVRRPKARGLGAK+
Subjt:  EVIKRPKSLGLGVEFSEASTS-AGDNQASRGSATRT-TQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA

A0A6J1BYW1 Protein SQS10.0e+0081.88Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRT  AK SDGFRKNK N GRRRSDPSSSIRGNLFVDGGFLSD QFQSSPPSSAREGNSRSKG+SGSKS NLDR KTAS+SG+K+SNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQEGLHSDSRGLQNGAECSLDNSQPFILLNSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFH-DEDELVRNPNTDDDSPTPAEE
        PP  +QEGLHS+S+GL N A+CSLDNSQPFILL+SK +QIVAYVD+KP LK DDLEFTYDYGTSFVLGDSSHRGLGFH D+DEL R+P+TDD SP   EE
Subjt:  PPAPYQEGLHSDSRGLQNGAECSLDNSQPFILLNSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFH-DEDELVRNPNTDDDSPTPAEE

Query:  QEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSESDS
        QEGLC G LSSEKE GTDERV+CR   +M NE++AE S+PNKY   V S KNSGFLSIGG++LYTQDVS EES DDGE+SDGSS YSE LE+ ESSESDS
Subjt:  QEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSESDS

Query:  SAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKT-PSRKKSSIVSRDDWSSLALDDLL
        S EMSC+DSDIDDEVAEDYLEGIGGSE+IL SKWLVKQEL ESDDDSSSS  DDTLEKLGGIALQEASKE+G+KKT PSRKKSSIVS DDWSSLALDDLL
Subjt:  SAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKT-PSRKKSSIVSRDDWSSLALDDLL

Query:  VKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAIY
        VKD R+ SA+KKKNAA   CSWPPKAPKSK +R YPGEKKKYRKETIAAKRRERM++RGVDL QINLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAIY
Subjt:  VKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAIY

Query:  RLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQK
        RLHSGCQGSGKKR VTVTRTQYTGMPSSSDQ+RL QLIGARDED DFSVA+GSN+K QGGNR    KNA++S   ILE  QS S KSRSKG+AGK SSQK
Subjt:  RLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQK

Query:  KNG-KKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKD-IVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA
        K G KKYAD+PVSFVSCGVMQPE VEITTSD+ D DK KD I  A E I++++S+VKN DI+ D+IGAFEVHTKGFGSKMMAKMGFV GGGLGKDGQGMA
Subjt:  KNG-KKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKD-IVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMA

Query:  RPIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
        RPIEVIKRPKSLGLGVEFSEA +S G NQ SRGS    T A GK+KKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA
Subjt:  RPIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAKA

A0A6J1EPV3 Protein SQS10.0e+0085.21Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGGFLSDWQFQ+SPPSSAREGNSRSKGKSGSKSA+LDRKK ASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+S+GLQN A+CSLDNSQPFILL  NSK +QIVAYVDDKPPLK DD+EFTYDYGT+F+LGDSSHRGLGFH+EDELVRN N+DDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KETG+DERV+CRE VE+A+EM+AEASSPNKY + V SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSE LE+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDD SSSSLDDTLEKL GIALQEASKEYG+KKTPSRKKS+IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL +INLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QLIGARD D DFSVAEG NIK  G NR    KNAK SGL ILE +QS SSKSR++G+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S+VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFSEASTSAGDNQ S GSA RTT ALGKS K+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

A0A6J1KLR0 Protein SQS10.0e+0086.62Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGGFLSDWQFQ++PPSSAREGNSRSKG+SGSKSA+LDRKKTASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+SRGLQN A+CSLDNSQPFILL  NSK +QIVAYVDDKPPLK D LE TYDYGT F+LGDSSHRGLGFHDEDELVRN NTDDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KETG+DERV+CRE VE+A+EM+AEASSPNKY +   SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSEPLE+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGG EHILKSKWLVKQELAESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR KS IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QLIGARD+D DFSVAEG NIK  G NR    KNAK SGL ILE +QS SSKSR+KG+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S+VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFS-EASTSAGDNQASRGSATRTTQ-ALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLGVEFS EASTSAGDNQ SR S+ RTT  ALGKSKK+GAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
Subjt:  PIEVIKRPKSLGLGVEFS-EASTSAGDNQASRGSATRTTQ-ALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

A0A6J1KSJ4 Protein SQS10.0e+0085.09Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY
        MAGGRRRTN AKPSDGFRKNK + GRRRSDPSSS+RGNLFVDGGFLSDWQFQ++PPSSAREGNSRSKG+SGSKSA+LDRKKTASSSGTKQSNGNA GYEY
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEY

Query:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA
        PPAP+QE GLHS+SRGLQN A+CSLDNSQPFILL  NSK +QIVAYVDDKPPLK D LE TYDYGT F+LGDSSHRGLGFHDEDELVRN NTDDDSPT  
Subjt:  PPAPYQE-GLHSDSRGLQNGAECSLDNSQPFILL--NSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPA

Query:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES
        EEQEGLCTG L S+KETG+DERV+CRE VE+A+EM+AEASSPNKY +   SP+NSGFLSIGGV+LYTQDVSDEES DDGE S+GSSEYSEPLE+DESSES
Subjt:  EEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEES-DDGESSDGSSEYSEPLETDESSES

Query:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL
        DSSAEM+CS SDIDDEVAEDYLEGIGG EHILKSKWLVKQELAESDDD SSSSLDDTLEKL  IALQEASKEYG+KKTPSR KS IVSRD+WSSLALDDL
Subjt:  DSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDL

Query:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI
        L+KD RSASARKKKNAAHFA SWPPKAPKSKA   YPGEKKKYRKETIAAKRRERMLNRGVDL QINLKLEHMV+NREDMFSFQPMHPRDCSQVRRLAAI
Subjt:  LVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAI

Query:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ
        YRLHSGCQGSGKKR VTVTRTQYTG+PSSSDQ+RL+QLIGARD+D DFSVAEG NIK  G NR    KNAK SGL ILE +QS SSKSR+KG+AGKGSSQ
Subjt:  YRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNR----KNAKISGLRILEPDQSASSKSRSKGAAGKGSSQ

Query:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR
        KK GKKYADQPVSFVSCGVMQP+AVEITTS++KDVDK KDIV ASEM+E+T+S+VKNMDISRDSIGAFE HT GFGSKMMAKMGFVEGGGLGKDGQGMA 
Subjt:  KKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMAR

Query:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
        PIEVIKRPKSLGLG+EFSEASTSAGDNQ S GSA RTT  LGKSKK+GAFEEHTKGFGSKMMAKMGFVEG GLGKD QGM  P+  ++RPK+ GLG +
Subjt:  PIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK

SwissProt top hitse value%identityAlignment
A7SBN6 Zinc finger CCCH-type with G patch domain-containing protein4.6e-0840.82Show/hide
Query:  SIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRP--KSLGLGVEFSEASTSAGDNQASRGS---ATRTTQALGKSKKIGAFEEHTKG
        S+G +E HTKG GSK+MAKMG++ G GLGKDG+G   PIEV+  P  KSL    E  E +      +  +     A+ T+ + GK+  +  F  H  G
Subjt:  SIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRP--KSLGLGVEFSEASTSAGDNQASRGS---ATRTTQALGKSKKIGAFEEHTKG

B3MPC0 Zinc finger CCCH-type with G patch domain-containing protein1.7e-0750Show/hide
Query:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA
        ++K+GA+EE+T+G GSK+M KMG++ G GLG D +G+V P+     PK R L A
Subjt:  SKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGA

B4Q8A7 Zinc finger CCCH-type with G patch domain-containing protein1.1e-0630.83Show/hide
Query:  DQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIK
        D+  S  S   M+  + +   SD+ D+++++     + M+E++    K      D +GA+E  T+G GSK+M KMG++ G GLG DG+G+  P+  +++ 
Subjt:  DQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPI--EVIK

Query:  RPKSLGLGVEFSEASTSAGD
        + +SL   +E  EA+    D
Subjt:  RPKSLGLGVEFSEASTSAGD

Q0IIX9 Tuftelin-interacting protein 118.6e-0729.65Show/hide
Query:  GKKYAD--QPVSFVSCGVMQPEAVEITTSDI------------KDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGG
        GK+  D   PV+F+S G+ +P A E + SD             KD +  K  +      + +         S    G++E HTKG G K++ KMG+V+G 
Subjt:  GKKYAD--QPVSFVSCGVMQPEAVEITTSDI------------KDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGG

Query:  GLGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKM
        GLGK+ QG+  PIE  +R     +G   SE                RT Q++     + + EE  K F  +M
Subjt:  GLGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKM

Q9ERA6 Tuftelin-interacting protein 111.1e-0632.09Show/hide
Query:  GKKYAD--QPVSFVSCGVMQPEAVEITTSD-------IKDVDKSKDI----VAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGG
        GK+  D   PV+F+S G+ +  A E  + D       +K  D  KD+    +      + +         S    G++E HTKG G K++ KMG+V G G
Subjt:  GKKYAD--QPVSFVSCGVMQPEAVEITTSD-------IKDVDKSKDI----VAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGG

Query:  LGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSA
        LGK+ QG+  PIE  +R     +G   SE +T +
Subjt:  LGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSA

Arabidopsis top hitse value%identityAlignment
AT1G17070.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain4.7e-0840.82Show/hide
Query:  IGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA--------------STSAGDNQASRGSATRTTQALGKSKKI
        IG FE  TKG G K++ KMG+ +GGGLGK+ QG+  PIE   RPK++G+G  +F EA                S  +N+ S G   R  + L K KK+
Subjt:  IGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGV-EFSEA--------------STSAGDNQASRGSATRTTQALGKSKKI

AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein1.5e-0654.35Show/hide
Query:  FEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL
        +E HT+G  SKMMA MG+ EGMGLG   QG++NP+L    P  R L
Subjt:  FEEHTKGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGL

AT2G42330.1 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.4e-0431.36Show/hide
Query:  GSSQKKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQ
        G S++K  +K   +PV F S G +  + ++    + K +D++ D +    + +   +D  ++    +    FE  + G G K++ KMG+ +G GLGK+ Q
Subjt:  GSSQKKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLG
        G+  PIEV  RPK++G+G
Subjt:  GMARPIEVIKRPKSLGLG

AT2G42330.2 GC-rich sequence DNA-binding factor-like protein with Tuftelin interacting domain2.4e-0431.36Show/hide
Query:  GSSQKKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQ
        G S++K  +K   +PV F S G +  + ++    + K +D++ D +    + +   +D  ++    +    FE  + G G K++ KMG+ +G GLGK+ Q
Subjt:  GSSQKKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTKGFGSKMMAKMGFVEGGGLGKDGQ

Query:  GMARPIEVIKRPKSLGLG
        G+  PIEV  RPK++G+G
Subjt:  GMARPIEVIKRPKSLGLG

AT3G09850.1 D111/G-patch domain-containing protein3.6e-14945.26Show/hide
Query:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGN-------------LFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDR-KKTASSS
        M GG RR +    S G  K K +PG+      +S  G+             LFV+GG LSD++   +  + +R G+S  K   G +S +++R K +AS+S
Subjt:  MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGN-------------LFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDR-KKTASSS

Query:  GTKQSNGNAFGYEYPPAPYQEGLHSDSRGLQNGAECSLDNSQPFILLNSKTSQIVAYVDDKP-PLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVR
        G ++ +GN + Y+YP   ++EGL       ++G E    N  P +L  S+ +QIVA++D  P   K   + + Y+Y  S+VLGD SH+GLGF D+ +   
Subjt:  GTKQSNGNAFGYEYPPAPYQEGLHSDSRGLQNGAECSLDNSQPFILLNSKTSQIVAYVDDKP-PLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVR

Query:  NPNTDDDSPTPAEEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPK--NSGFLSIGGVKLYTQDVSDEES---------DD
         P+     P    +Q     G  S E+E    + VD   G +  +E++ +          V++PK  NSGF+SIGG+KLYT+DVS EES         DD
Subjt:  NPNTDDDSPTPAEEQEGLCTGLLSSEKETGTDERVDCREGVEMANEMVAEASSPNKYLYRVRSPK--NSGFLSIGGVKLYTQDVSDEES---------DD

Query:  GESSDGSSEYSEPLETDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQEL----AESDDDSSSSSLDDTLEKLGGIALQEASKEYG
         ES D  S  S      E SESDSS +M  S S+IDD+VA+DYLEGIGGSE +L + WL +Q L      SDD SSS S D    KL GI LQ+AS EYG
Subjt:  GESSDGSSEYSEPLETDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKSKWLVKQEL----AESDDDSSSSSLDDTLEKLGGIALQEASKEYG

Query:  IKKTPSRKKSSIVSRDDWSSLALDDLL-VKDYRSASAR--KKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLE
         KKT     S        + LA+DDL+ VKD RS S +  KKK  A F  SWP +APKSK SR++PGE KK+RKE IA KRRERML RGVDL  IN +LE
Subjt:  IKKTPSRKKSSIVSRDDWSSLALDDLL-VKDYRSASAR--KKKNAAHFACSWPPKAPKSKASRNYPGEKKKYRKETIAAKRRERMLNRGVDLRQINLKLE

Query:  HMVMNREDMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNRKNAKISGL
        + V+   DM  FQ MH RDCSQVRRLA +YRL S C GSGKK  VTVTRT  T MPS+SD++R+ +LIGA DED DF+V+ G   K    +RK AK S  
Subjt:  HMVMNREDMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGSNIKPQGGNRKNAKISGL

Query:  RILEPDQSASSKSRSKGAAGKGSSQKKNGKK--YADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTK
                    ++ +    +    K NGKK  YADQPVSFVS G++  E     TS    V+K      A E+ E T       D     IGAFEVHT+
Subjt:  RILEPDQSASSKSRSKGAAGKGSSQKKNGKK--YADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAFEVHTK

Query:  GFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFS----EASTSAGDNQASRGSATRTTQALGK---------------SKKIGAFEEHT
        GFGSKMMAKMGF++GGGLGKDG+G+A+PIE ++RPKSLGLG++FS    + S S+ +N  ++ + + ++   GK                K++GAFE+HT
Subjt:  GFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFS----EASTSAGDNQASRGSATRTTQALGK---------------SKKIGAFEEHT

Query:  KGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK
         GFGS+MMA+MGFVEG GLG++SQG+VNPL+ VRRP+ARG+GA+
Subjt:  KGFGSKMMAKMGFVEGMGLGKDSQGMVNPLLPVRRPKARGLGAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGGAAGAAGAAGAACGAACCGTGCCAAACCCAGCGATGGCTTCAGGAAGAACAAAACTAATCCCGGCAGAAGAAGATCAGACCCCTCGTCTTCAATTCGAGG
GAATTTGTTTGTTGATGGCGGTTTTCTATCTGATTGGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAATTCTAGGTCGAAGGGCAAGTCTGGATCAAAAT
CGGCTAATTTAGATCGTAAAAAAACTGCTTCATCAAGTGGGACTAAACAATCAAATGGTAATGCTTTTGGCTACGAATACCCTCCTGCTCCTTATCAGGAAGGGTTGCAT
TCTGATTCACGGGGTCTGCAGAATGGTGCAGAGTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTAATTCTAAGACTAGTCAAATTGTTGCATATGTAGATGACAA
ACCACCTTTAAAGGCAGACGATTTGGAATTTACTTATGATTATGGTACAAGTTTTGTGTTAGGCGACAGTTCTCATAGAGGACTAGGGTTTCATGATGAGGATGAACTTG
TTAGAAATCCAAACACTGATGATGATTCACCCACACCAGCGGAAGAACAAGAAGGATTGTGTACTGGTTTATTGTCTTCTGAGAAGGAAACTGGTACTGATGAGAGGGTT
GATTGCAGGGAAGGGGTTGAGATGGCTAATGAGATGGTAGCAGAAGCATCATCCCCCAACAAATATTTATATCGTGTGCGCTCTCCAAAAAATTCAGGTTTCTTGTCAAT
AGGTGGTGTGAAATTATACACCCAAGATGTATCTGATGAAGAAAGTGATGATGGGGAGTCATCAGATGGAAGTTCTGAATACTCTGAGCCTTTAGAGACAGATGAATCGT
CTGAAAGTGATAGCTCTGCAGAAATGTCTTGCAGTGATTCAGATATTGATGACGAGGTCGCTGAAGATTATCTTGAAGGGATTGGTGGTAGTGAACATATTTTAAAATCC
AAATGGTTAGTAAAACAAGAGTTGGCAGAGTCTGATGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGTGGCATTGCTTTGCAAGAAGCATCTAA
AGAATATGGAATTAAGAAAACTCCTTCAAGGAAGAAGAGTTCCATTGTTTCTAGAGATGATTGGTCATCATTGGCTCTAGATGATTTGCTAGTAAAAGATTATAGATCTG
CATCAGCTAGAAAGAAGAAGAATGCTGCTCATTTTGCTTGTTCTTGGCCTCCAAAGGCTCCAAAAAGTAAAGCTTCTAGAAATTATCCTGGTGAAAAGAAGAAATATCGT
AAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGCTTAATCGGGGTGTTGATCTAAGGCAAATAAACCTGAAATTGGAGCACATGGTTATGAACAGAGAAGATATGTT
TTCTTTCCAACCTATGCATCCCCGTGATTGTTCCCAGGTACGACGGTTGGCTGCGATTTACCGTTTGCATAGTGGGTGCCAAGGTTCTGGTAAAAAAAGGTTGGTAACAG
TAACTCGAACACAATACACGGGAATGCCATCATCGAGTGATCAAATTCGCCTCAATCAGCTAATAGGAGCAAGAGACGAGGATATTGATTTTTCTGTTGCTGAAGGCTCA
AATATAAAACCACAAGGCGGCAACAGGAAGAATGCCAAAATAAGTGGTTTGCGTATATTGGAGCCCGATCAATCTGCAAGCAGTAAGTCGAGGTCAAAGGGTGCTGCAGG
TAAGGGATCAAGTCAGAAAAAGAATGGCAAAAAGTATGCTGATCAACCAGTCTCATTTGTATCATGTGGAGTGATGCAACCAGAAGCAGTCGAGATAACAACTAGCGATA
TTAAGGACGTAGATAAAAGTAAGGATATCGTAGCTGCATCAGAAATGATCGAGATGACTAGTAGTGATGTCAAGAACATGGATATAAGTAGGGATAGTATTGGTGCATTT
GAGGTCCACACCAAGGGGTTTGGCTCGAAAATGATGGCAAAAATGGGATTTGTAGAAGGTGGAGGATTGGGAAAAGATGGTCAAGGAATGGCCCGCCCCATTGAAGTGAT
TAAAAGGCCAAAATCACTTGGGTTAGGCGTTGAGTTCTCAGAGGCCTCGACTTCTGCTGGTGATAACCAGGCAAGTCGAGGATCTGCCACAAGAACAACTCAAGCTTTGG
GAAAATCTAAAAAAATTGGTGCTTTTGAAGAGCACACCAAAGGATTTGGGTCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGGATTGGGAAAAGATTCACAA
GGAATGGTCAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCACGAGGGTTGGGTGCCAAAGCTTAG
mRNA sequenceShow/hide mRNA sequence
GAAAAGACTCTCTTAAGAATAAAGTTGGGCTTAAACGGGCCTGAAAGAGGAGTTGCAGGCTTACTTGCCCAGAAAGCAAGTGGATCCGATTCTTTGCTGTTCTCATTGTT
ATCCGTCCTCCGAGCCGACGCCACCGCATGCTTCCCCGCTGTGGCGGCTTGCTTCAGATTTCGGAAATTAGACTTGTAGTCTAATCTGGCCTGCCGCGCTTCTCCTTCTC
TGGCAGTGGCAGGTCTTCGATGGCCGGAGGAAGAAGAAGAACGAACCGTGCCAAACCCAGCGATGGCTTCAGGAAGAACAAAACTAATCCCGGCAGAAGAAGATCAGACC
CCTCGTCTTCAATTCGAGGGAATTTGTTTGTTGATGGCGGTTTTCTATCTGATTGGCAGTTCCAGTCCTCGCCGCCTAGTTCTGCTCGAGAGGGAAATTCTAGGTCGAAG
GGCAAGTCTGGATCAAAATCGGCTAATTTAGATCGTAAAAAAACTGCTTCATCAAGTGGGACTAAACAATCAAATGGTAATGCTTTTGGCTACGAATACCCTCCTGCTCC
TTATCAGGAAGGGTTGCATTCTGATTCACGGGGTCTGCAGAATGGTGCAGAGTGTAGCTTGGACAACTCACAGCCTTTTATATTACTTAATTCTAAGACTAGTCAAATTG
TTGCATATGTAGATGACAAACCACCTTTAAAGGCAGACGATTTGGAATTTACTTATGATTATGGTACAAGTTTTGTGTTAGGCGACAGTTCTCATAGAGGACTAGGGTTT
CATGATGAGGATGAACTTGTTAGAAATCCAAACACTGATGATGATTCACCCACACCAGCGGAAGAACAAGAAGGATTGTGTACTGGTTTATTGTCTTCTGAGAAGGAAAC
TGGTACTGATGAGAGGGTTGATTGCAGGGAAGGGGTTGAGATGGCTAATGAGATGGTAGCAGAAGCATCATCCCCCAACAAATATTTATATCGTGTGCGCTCTCCAAAAA
ATTCAGGTTTCTTGTCAATAGGTGGTGTGAAATTATACACCCAAGATGTATCTGATGAAGAAAGTGATGATGGGGAGTCATCAGATGGAAGTTCTGAATACTCTGAGCCT
TTAGAGACAGATGAATCGTCTGAAAGTGATAGCTCTGCAGAAATGTCTTGCAGTGATTCAGATATTGATGACGAGGTCGCTGAAGATTATCTTGAAGGGATTGGTGGTAG
TGAACATATTTTAAAATCCAAATGGTTAGTAAAACAAGAGTTGGCAGAGTCTGATGATGATAGTTCTAGCAGTAGCCTTGATGATACTTTAGAGAAATTAGGTGGCATTG
CTTTGCAAGAAGCATCTAAAGAATATGGAATTAAGAAAACTCCTTCAAGGAAGAAGAGTTCCATTGTTTCTAGAGATGATTGGTCATCATTGGCTCTAGATGATTTGCTA
GTAAAAGATTATAGATCTGCATCAGCTAGAAAGAAGAAGAATGCTGCTCATTTTGCTTGTTCTTGGCCTCCAAAGGCTCCAAAAAGTAAAGCTTCTAGAAATTATCCTGG
TGAAAAGAAGAAATATCGTAAAGAAACAATTGCAGCAAAGCGTCGAGAAAGAATGCTTAATCGGGGTGTTGATCTAAGGCAAATAAACCTGAAATTGGAGCACATGGTTA
TGAACAGAGAAGATATGTTTTCTTTCCAACCTATGCATCCCCGTGATTGTTCCCAGGTACGACGGTTGGCTGCGATTTACCGTTTGCATAGTGGGTGCCAAGGTTCTGGT
AAAAAAAGGTTGGTAACAGTAACTCGAACACAATACACGGGAATGCCATCATCGAGTGATCAAATTCGCCTCAATCAGCTAATAGGAGCAAGAGACGAGGATATTGATTT
TTCTGTTGCTGAAGGCTCAAATATAAAACCACAAGGCGGCAACAGGAAGAATGCCAAAATAAGTGGTTTGCGTATATTGGAGCCCGATCAATCTGCAAGCAGTAAGTCGA
GGTCAAAGGGTGCTGCAGGTAAGGGATCAAGTCAGAAAAAGAATGGCAAAAAGTATGCTGATCAACCAGTCTCATTTGTATCATGTGGAGTGATGCAACCAGAAGCAGTC
GAGATAACAACTAGCGATATTAAGGACGTAGATAAAAGTAAGGATATCGTAGCTGCATCAGAAATGATCGAGATGACTAGTAGTGATGTCAAGAACATGGATATAAGTAG
GGATAGTATTGGTGCATTTGAGGTCCACACCAAGGGGTTTGGCTCGAAAATGATGGCAAAAATGGGATTTGTAGAAGGTGGAGGATTGGGAAAAGATGGTCAAGGAATGG
CCCGCCCCATTGAAGTGATTAAAAGGCCAAAATCACTTGGGTTAGGCGTTGAGTTCTCAGAGGCCTCGACTTCTGCTGGTGATAACCAGGCAAGTCGAGGATCTGCCACA
AGAACAACTCAAGCTTTGGGAAAATCTAAAAAAATTGGTGCTTTTGAAGAGCACACCAAAGGATTTGGGTCGAAGATGATGGCAAAGATGGGATTTGTTGAAGGCATGGG
ATTGGGAAAAGATTCACAAGGAATGGTCAACCCTTTGCTTCCTGTTAGGCGGCCTAAAGCACGAGGGTTGGGTGCCAAAGCTTAGATTTGTAGAAGACTTCTGTTTGTTT
CATCATATGCTAATTCATCTGTTCCATTAGAATCTGTCAAAGTTCTTCATTAACAAGTTAAATTTATTCCTCTCCTGTTATTTCAACTGTACCTTGATCAATAGAAGACT
TGGTTTTCTTTGCTGCAAATAATTGGGAGGTGTTGTCGGGGTCGGACCCGACGGGTTTTCGACCTAACAAATGTCCTTGACAAATTAAAGACAAGAAATCAAGCTCTACC
TTGCTCTCACTTTCATAATTGAGCTGATTGAAACTTGAATGTGTGAAGTTATGTTTTGAGGTAGTT
Protein sequenceShow/hide protein sequence
MAGGRRRTNRAKPSDGFRKNKTNPGRRRSDPSSSIRGNLFVDGGFLSDWQFQSSPPSSAREGNSRSKGKSGSKSANLDRKKTASSSGTKQSNGNAFGYEYPPAPYQEGLH
SDSRGLQNGAECSLDNSQPFILLNSKTSQIVAYVDDKPPLKADDLEFTYDYGTSFVLGDSSHRGLGFHDEDELVRNPNTDDDSPTPAEEQEGLCTGLLSSEKETGTDERV
DCREGVEMANEMVAEASSPNKYLYRVRSPKNSGFLSIGGVKLYTQDVSDEESDDGESSDGSSEYSEPLETDESSESDSSAEMSCSDSDIDDEVAEDYLEGIGGSEHILKS
KWLVKQELAESDDDSSSSSLDDTLEKLGGIALQEASKEYGIKKTPSRKKSSIVSRDDWSSLALDDLLVKDYRSASARKKKNAAHFACSWPPKAPKSKASRNYPGEKKKYR
KETIAAKRRERMLNRGVDLRQINLKLEHMVMNREDMFSFQPMHPRDCSQVRRLAAIYRLHSGCQGSGKKRLVTVTRTQYTGMPSSSDQIRLNQLIGARDEDIDFSVAEGS
NIKPQGGNRKNAKISGLRILEPDQSASSKSRSKGAAGKGSSQKKNGKKYADQPVSFVSCGVMQPEAVEITTSDIKDVDKSKDIVAASEMIEMTSSDVKNMDISRDSIGAF
EVHTKGFGSKMMAKMGFVEGGGLGKDGQGMARPIEVIKRPKSLGLGVEFSEASTSAGDNQASRGSATRTTQALGKSKKIGAFEEHTKGFGSKMMAKMGFVEGMGLGKDSQ
GMVNPLLPVRRPKARGLGAKA