; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000338 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000338
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationscaffold44:1007589..1010528
RNA-Seq ExpressionMS000338
SyntenyMS000338
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067691.1 AT-hook motif nuclear-localized protein 4 [Cucumis melo var. makuwa]1.9e-15581.82Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        MEEK TGVS F V NDEAL +FQLAP TE  K+T E KV   A AA PP            AA  TPPAS       AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S A++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQ+VVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEV
        ITGEEMKAMY GG KP+LT PS+QE HN  SP PVTGFK+SSTDNLPL D+EPKTQSQSNCE+
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEV

XP_004148159.1 AT-hook motif nuclear-localized protein 4 [Cucumis sativus]3.1e-15882.47Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        MEEK TGVS F VTNDEAL +F+LAP TE LK+T E KV   A AA PP            AA  TPP S       AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S A++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQ+VVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC
        ITGEEMKAMY GG KP+LT PSYQE HN  SP PVTGFK+SSTDNLPL D+EPKTQSQSNCEVSC
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC

XP_008439113.1 PREDICTED: AT-hook motif nuclear-localized protein 4 [Cucumis melo]4.9e-15681.64Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        MEEK TGVS F V NDEAL +FQLAP TE  K+T E KV   A AA PP            AA  TPPAS       AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S A++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQ+VVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC
        I GEEMKAMY GG KP+LT PS++E HN  SP PVTGFK+SSTDNLPL D+EPKTQSQSNCEVSC
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC

XP_038880746.1 AT-hook motif nuclear-localized protein 4-like isoform X1 [Benincasa hispida]5.4e-15582.17Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        M+E  TGVS F VTNDEAL SFQLAP TE LK+T E KV   AAAA PP    AA  PPA+                AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S  ++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSN
        ITGEEMKAMY GG KP+LT  SYQEHN  SP+PVTGFK+SSTDNLPL D+EPKTQSQSN
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSN

XP_038880757.1 AT-hook motif nuclear-localized protein 4-like isoform X2 [Benincasa hispida]1.4e-15882.42Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        M+E  TGVS F VTNDEAL SFQLAP TE LK+T E KV   AAAA PP    AA  PPA+                AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S  ++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC
        ITGEEMKAMY GG KP+LT  SYQEHN  SP+PVTGFK+SSTDNLPL D+EPKTQSQSNCEVSC
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC

TrEMBL top hitse value%identityAlignment
A0A0A0LB41 AT-hook motif nuclear-localized protein1.5e-15882.47Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        MEEK TGVS F VTNDEAL +F+LAP TE LK+T E KV   A AA PP            AA  TPP S       AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S A++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQ+VVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC
        ITGEEMKAMY GG KP+LT PSYQE HN  SP PVTGFK+SSTDNLPL D+EPKTQSQSNCEVSC
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC

A0A1S3AYM6 AT-hook motif nuclear-localized protein2.4e-15681.64Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        MEEK TGVS F V NDEAL +FQLAP TE  K+T E KV   A AA PP            AA  TPPAS       AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S A++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQ+VVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC
        I GEEMKAMY GG KP+LT PS++E HN  SP PVTGFK+SSTDNLPL D+EPKTQSQSNCEVSC
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC

A0A5A7VQ33 AT-hook motif nuclear-localized protein9.1e-15681.82Show/hide
Query:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR
        MEEK TGVS F V NDEAL +FQLAP TE  K+T E KV   A AA PP            AA  TPPAS       AVSSTETKKKRGRPRKYGPDGKR
Subjt:  MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKR

Query:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
         LTLALSPMPISSSIPL GEFPNWKR+N++S A++KKPQRFE+ENPG RLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL
Subjt:  PLTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTL

Query:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT
        RQATSSGGTLTYEGRFEIL+LTGS+MPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQ+VVGSFLPGHQQEQKPKKPRNESTTIFFPP+NT
Subjt:  RQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINT

Query:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEV
        ITGEEMKAMY GG KP+LT PS+QE HN  SP PVTGFK+SSTDNLPL D+EPKTQSQSNCE+
Subjt:  ITGEEMKAMYGGGVKPVLTIPSYQE-HNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEV

A0A6J1G063 AT-hook motif nuclear-localized protein5.7e-15079.06Show/hide
Query:  EEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRP
        EEK  GVS F VTN+EAL +FQLAP TETLK T+EPKV     A T PAS    A PP                  AVS+T+TKKKRGRPRKY  DGKR 
Subjt:  EEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRP

Query:  LTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR
        LTLALSPMPISSSIPL GEFPNWKR+ND+SLA+IKKPQRFE+ENPG +LAYSVGANFTPHVITVNAGEDITMK+MS SQQESRAICILSANGTISNVTLR
Subjt:  LTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR

Query:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINTI
        QATSSGGTLTYEG FEIL+LTGSFMPTQNG TKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNE+TTIFFPPINTI
Subjt:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINTI

Query:  TGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC
        +GEEMKA Y GG+KP++T PS QEH   SP+ VT FK+SSTDNLPL ++EPKTQSQSNCEVSC
Subjt:  TGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC

A0A6J1HUD1 AT-hook motif nuclear-localized protein4.4e-15078.79Show/hide
Query:  EEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRP
        EEK  GVS F VTN+EAL +FQLAP TETLK T+EPKV A+ A A+ P +             A PP  T      AVS+T+TKKKRGRPRKY  DGKR 
Subjt:  EEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRP

Query:  LTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR
        LTLALSPMPISSSIPL GEFPNWKR+ND+SLA+IKKPQRFE+ENPG +LAYSVGANFTPHVITVNAGEDITMK+MS SQQESRAICILSANGTISNVTLR
Subjt:  LTLALSPMPISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR

Query:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINTI
        QATSSGGTLTYEG FEIL+LTGSFMPTQNG TKSRCGGMSVSLAGQDG+VVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNE TTIFFPPINTI
Subjt:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINTI

Query:  TGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC
        +GEEMKA Y GG+KP++T PS QEH   SP+ VT FK+SSTDNLPL ++EPKTQSQSNCEVSC
Subjt:  TGEEMKAMYGGGVKPVLTIPSYQEHNSLSPNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC

SwissProt top hitse value%identityAlignment
O49658 AT-hook motif nuclear-localized protein 21.0e-5548.1Show/hide
Query:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM
        G  V    A   F +AP +ET              + TPP S      PP         + T +A     SS   KK+RGRPRKYG DG     + LSP 
Subjt:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM

Query:  PISSSIPLAG---EFPNWKRENDLSLAVIKKPQ-----RFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR
        PISS+ P      +F     +          P      +++ EN G     S  ANFTPH+ITVNAGED+T +++SFSQQ S AIC+L ANG +S+VTLR
Subjt:  PISSSIPLAG---EFPNWKRENDLSLAVIKKPQ-----RFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR

Query:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNES
        Q  SSGGTLTYEGRFEILSL+G+FMP+ + GT+SR GGMSVSLA  DGRVVGGG+AGLLVAA P+QVVVG+FL G  Q+++  KP N +
Subjt:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNES

Q8VYJ2 AT-hook motif nuclear-localized protein 19.9e-5952.74Show/hide
Query:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM
        G  V   +A   F +A  +E+  +   P    S     P  S+   A PP   +  T   +TAA     +S    KKKRGRPRKYGPDG     +ALSP 
Subjt:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM

Query:  PISSSIPLAGEFPNWKRENDLSL----AVIKKPQRF-------EYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNV
        PISS+   +   P      D S     + +K    F       + EN G     SVG NFTPH+ITVN GED+TMK++SFSQQ  R+IC+LSANG IS+V
Subjt:  PISSSIPLAGEFPNWKRENDLSL----AVIKKPQRF-------EYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNV

Query:  TLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG-HQQEQKPKKPRNE
        TLRQ  SSGGTLTYEGRFEILSL+GSFMP  +GGT+SR GGMSVSLA  DGRVVGGGLAGLLVAA PVQVVVGSFL G   Q+QKPKK +++
Subjt:  TLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG-HQQEQKPKKPRNE

Q9FHM5 AT-hook motif nuclear-localized protein 43.8e-6649.44Show/hide
Query:  TDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWKR-------
        ++ P +     ++T  +SA AA  P  N A   PP S    +P   SS+E KKKRGRPRKY PDG   L + LSPMPISSS+PL  EF + KR       
Subjt:  TDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWKR-------

Query:  -----------------ENDLSLAVIKKPQRFEYENPGPRL-----AYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQAT
                          N+     +K PQ FE+ N  P       A  V  +FTPHV+TVNAGED+TMK+M+FSQQ SRAICILSANG ISNVTLRQ+ 
Subjt:  -----------------ENDLSLAVIKKPQRFEYENPGPRL-----AYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQAT

Query:  SSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG----HQQEQKPKKPRNESTTIFFPPINT
        +SGGTLTYEG FEILSLTGSF+P+++GGT+SR GGMSVSLAGQDGRV GGGLAGL +AAGPVQV+VGSF+ G     QQ+Q+ KK R E   I   P  T
Subjt:  SSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG----HQQEQKPKKPRNESTTIFFPPINT

Query:  I--------TGEEMKAMYGGGVKPV------LTIPSYQEHNSLSPNPVTGFKLSSTDN
                 + E+ KA YG   KPV      ++ P     +  S N V G+  ++T N
Subjt:  I--------TGEEMKAMYGGGVKPV------LTIPSYQEHNSLSPNPVTGFKLSSTDN

Q9LVB0 AT-hook motif nuclear-localized protein 61.8e-6850.46Show/hide
Query:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGK---RPLTLALSPMPISSSIPLAGEFPNWKR----ENDLSLAVIKKPQ
        +P    T    PPA ++   P   T  +   +  S  TKKKRGRPRKY PDG    R L   LSP PISSSIPL+G++  WKR    +    L  +KK  
Subjt:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGK---RPLTLALSPMPISSSIPLAGEFPNWKR----ENDLSLAVIKKPQ

Query:  RFEYENPGPR-----LAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTK
        +FEY +P P      L+  VGANFT H  TVN GED+TMKVM +SQQ SRAICILSA G+ISNVTL Q T++GGTLTYEGRFEILSL+GSFMPT+NGGTK
Subjt:  RFEYENPGPR-----LAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTK

Query:  SRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKK------------------------PRNESTTI--FFPPINTITGEEMKA
         R GGMS+SLAG +G + GGGLAG+L+AAGPVQVV+GSF+  HQ EQ  KK                        P    TT+    P +NT+  ++ +A
Subjt:  SRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKK------------------------PRNESTTI--FFPPINTITGEEMKA

Query:  MYGGGVKPVLTIP-SYQEHNSLSPN
          GG V+P+  +P S+Q  NS   N
Subjt:  MYGGGVKPVLTIP-SYQEHNSLSPN

Q9SB31 AT-hook motif nuclear-localized protein 39.3e-6557.71Show/hide
Query:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWK--RENDLSLAVIKKPQRFEYE
        T PA+AT AAA   NA  ATP + T        S+ + KKKRGRPRKY PDG   L + LSPMPISSS+PL  EFP  K  R    S   +KK Q F+++
Subjt:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWK--RENDLSLAVIKKPQRFEYE

Query:  N-------PGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRC
                 G   A  VGANFTPHV+ VNAGED+TMK+M+FSQQ SRAICILSANG ISNVTLRQ+ +SGGTLTYEGRFEILSLTGSFM   +GGT+SR 
Subjt:  N-------PGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRC

Query:  GGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQ----KPKKPR--NESTTIFFPPINTITGEEMKAMY
        GGMSV LAG DGRV GGGLAGL +AAGPVQV+VG+F+ G +Q Q    K ++ R   + ++I F     I+ EE KA +
Subjt:  GGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQ----KPKKPR--NESTTIFFPPINTITGEEMKAMY

Arabidopsis top hitse value%identityAlignment
AT4G12080.1 AT-hook motif nuclear-localized protein 17.0e-6052.74Show/hide
Query:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM
        G  V   +A   F +A  +E+  +   P    S     P  S+   A PP   +  T   +TAA     +S    KKKRGRPRKYGPDG     +ALSP 
Subjt:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM

Query:  PISSSIPLAGEFPNWKRENDLSL----AVIKKPQRF-------EYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNV
        PISS+   +   P      D S     + +K    F       + EN G     SVG NFTPH+ITVN GED+TMK++SFSQQ  R+IC+LSANG IS+V
Subjt:  PISSSIPLAGEFPNWKRENDLSL----AVIKKPQRF-------EYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNV

Query:  TLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG-HQQEQKPKKPRNE
        TLRQ  SSGGTLTYEGRFEILSL+GSFMP  +GGT+SR GGMSVSLA  DGRVVGGGLAGLLVAA PVQVVVGSFL G   Q+QKPKK +++
Subjt:  TLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG-HQQEQKPKKPRNE

AT4G22770.1 AT hook motif DNA-binding family protein7.3e-5748.1Show/hide
Query:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM
        G  V    A   F +AP +ET              + TPP S      PP         + T +A     SS   KK+RGRPRKYG DG     + LSP 
Subjt:  GFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPM

Query:  PISSSIPLAG---EFPNWKRENDLSLAVIKKPQ-----RFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR
        PISS+ P      +F     +          P      +++ EN G     S  ANFTPH+ITVNAGED+T +++SFSQQ S AIC+L ANG +S+VTLR
Subjt:  PISSSIPLAG---EFPNWKRENDLSLAVIKKPQ-----RFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLR

Query:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNES
        Q  SSGGTLTYEGRFEILSL+G+FMP+ + GT+SR GGMSVSLA  DGRVVGGG+AGLLVAA P+QVVVG+FL G  Q+++  KP N +
Subjt:  QATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNES

AT4G25320.1 AT hook motif DNA-binding family protein6.6e-6657.71Show/hide
Query:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWK--RENDLSLAVIKKPQRFEYE
        T PA+AT AAA   NA  ATP + T        S+ + KKKRGRPRKY PDG   L + LSPMPISSS+PL  EFP  K  R    S   +KK Q F+++
Subjt:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWK--RENDLSLAVIKKPQRFEYE

Query:  N-------PGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRC
                 G   A  VGANFTPHV+ VNAGED+TMK+M+FSQQ SRAICILSANG ISNVTLRQ+ +SGGTLTYEGRFEILSLTGSFM   +GGT+SR 
Subjt:  N-------PGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRC

Query:  GGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQ----KPKKPR--NESTTIFFPPINTITGEEMKAMY
        GGMSV LAG DGRV GGGLAGL +AAGPVQV+VG+F+ G +Q Q    K ++ R   + ++I F     I+ EE KA +
Subjt:  GGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQ----KPKKPR--NESTTIFFPPINTITGEEMKAMY

AT5G51590.1 AT hook motif DNA-binding family protein2.7e-6749.44Show/hide
Query:  TDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWKR-------
        ++ P +     ++T  +SA AA  P  N A   PP S    +P   SS+E KKKRGRPRKY PDG   L + LSPMPISSS+PL  EF + KR       
Subjt:  TDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMPISSSIPLAGEFPNWKR-------

Query:  -----------------ENDLSLAVIKKPQRFEYENPGPRL-----AYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQAT
                          N+     +K PQ FE+ N  P       A  V  +FTPHV+TVNAGED+TMK+M+FSQQ SRAICILSANG ISNVTLRQ+ 
Subjt:  -----------------ENDLSLAVIKKPQRFEYENPGPRL-----AYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQAT

Query:  SSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG----HQQEQKPKKPRNESTTIFFPPINT
        +SGGTLTYEG FEILSLTGSF+P+++GGT+SR GGMSVSLAGQDGRV GGGLAGL +AAGPVQV+VGSF+ G     QQ+Q+ KK R E   I   P  T
Subjt:  SSGGTLTYEGRFEILSLTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPG----HQQEQKPKKPRNESTTIFFPPINT

Query:  I--------TGEEMKAMYGGGVKPV------LTIPSYQEHNSLSPNPVTGFKLSSTDN
                 + E+ KA YG   KPV      ++ P     +  S N V G+  ++T N
Subjt:  I--------TGEEMKAMYGGGVKPV------LTIPSYQEHNSLSPNPVTGFKLSSTDN

AT5G62260.1 AT hook motif DNA-binding family protein1.3e-6950.46Show/hide
Query:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGK---RPLTLALSPMPISSSIPLAGEFPNWKR----ENDLSLAVIKKPQ
        +P    T    PPA ++   P   T  +   +  S  TKKKRGRPRKY PDG    R L   LSP PISSSIPL+G++  WKR    +    L  +KK  
Subjt:  TPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGK---RPLTLALSPMPISSSIPLAGEFPNWKR----ENDLSLAVIKKPQ

Query:  RFEYENPGPR-----LAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTK
        +FEY +P P      L+  VGANFT H  TVN GED+TMKVM +SQQ SRAICILSA G+ISNVTL Q T++GGTLTYEGRFEILSL+GSFMPT+NGGTK
Subjt:  RFEYENPGPR-----LAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILSLTGSFMPTQNGGTK

Query:  SRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKK------------------------PRNESTTI--FFPPINTITGEEMKA
         R GGMS+SLAG +G + GGGLAG+L+AAGPVQVV+GSF+  HQ EQ  KK                        P    TT+    P +NT+  ++ +A
Subjt:  SRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKK------------------------PRNESTTI--FFPPINTITGEEMKA

Query:  MYGGGVKPVLTIP-SYQEHNSLSPN
          GG V+P+  +P S+Q  NS   N
Subjt:  MYGGGVKPVLTIP-SYQEHNSLSPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGAAAGTTACCGGAGTTTCTGGGTTCGCAGTAACGAATGATGAAGCCCTAGGGAGCTTCCAGTTAGCCCCAACCACAGAAACCCTAAAAGCAACCGACGAGCC
TAAAGTGATGGCGAGCGCCGCGGCTGCTACACCACCAGCGAGCGCGACGGCGGCTGCTGCACCACCGGCGAACGCGGCTGATGCTACGCCACCGGCGAGCACGGCGGCTG
CTATACCACCGGCGGTTTCTAGTACGGAAACAAAGAAGAAAAGAGGGAGGCCGAGAAAGTATGGGCCGGACGGGAAGCGTCCGCTGACGCTGGCGTTGTCTCCGATGCCG
ATATCGTCGTCAATTCCACTGGCCGGAGAGTTTCCGAATTGGAAACGAGAGAACGATCTGTCTCTAGCGGTAATCAAGAAGCCGCAGAGATTCGAGTATGAGAATCCCGG
TCCGAGACTTGCATACTCTGTTGGTGCCAATTTTACTCCCCATGTGATCACTGTCAATGCTGGAGAGGACATTACGATGAAGGTCATGTCTTTCTCTCAGCAAGAATCCC
GAGCTATTTGTATTCTTTCTGCAAATGGCACGATTTCAAATGTTACACTTCGACAGGCAACTTCTTCTGGAGGTACCTTAACGTATGAGGGTCGTTTTGAAATACTCTCA
TTGACTGGATCGTTTATGCCGACGCAGAATGGAGGTACGAAGAGCAGATGTGGTGGGATGAGTGTCTCGCTGGCAGGTCAAGATGGTCGAGTTGTGGGCGGAGGACTAGC
TGGTTTGTTGGTAGCTGCTGGTCCTGTGCAAGTTGTGGTTGGTAGTTTCCTTCCGGGTCACCAGCAAGAACAGAAGCCAAAGAAGCCGAGGAACGAATCCACAACCATTT
TCTTTCCTCCCATCAACACCATCACTGGTGAAGAGATGAAGGCGATGTACGGCGGTGGTGTCAAGCCCGTTCTCACGATACCGTCCTATCAAGAACATAACTCACTGTCA
CCAAACCCAGTCACAGGCTTCAAATTATCCTCCACTGACAACTTACCTTTGCCTGACAAAGAACCCAAAACACAAAGTCAATCGAACTGTGAGGTTTCTTGT
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAGAAAGTTACCGGAGTTTCTGGGTTCGCAGTAACGAATGATGAAGCCCTAGGGAGCTTCCAGTTAGCCCCAACCACAGAAACCCTAAAAGCAACCGACGAGCC
TAAAGTGATGGCGAGCGCCGCGGCTGCTACACCACCAGCGAGCGCGACGGCGGCTGCTGCACCACCGGCGAACGCGGCTGATGCTACGCCACCGGCGAGCACGGCGGCTG
CTATACCACCGGCGGTTTCTAGTACGGAAACAAAGAAGAAAAGAGGGAGGCCGAGAAAGTATGGGCCGGACGGGAAGCGTCCGCTGACGCTGGCGTTGTCTCCGATGCCG
ATATCGTCGTCAATTCCACTGGCCGGAGAGTTTCCGAATTGGAAACGAGAGAACGATCTGTCTCTAGCGGTAATCAAGAAGCCGCAGAGATTCGAGTATGAGAATCCCGG
TCCGAGACTTGCATACTCTGTTGGTGCCAATTTTACTCCCCATGTGATCACTGTCAATGCTGGAGAGGACATTACGATGAAGGTCATGTCTTTCTCTCAGCAAGAATCCC
GAGCTATTTGTATTCTTTCTGCAAATGGCACGATTTCAAATGTTACACTTCGACAGGCAACTTCTTCTGGAGGTACCTTAACGTATGAGGGTCGTTTTGAAATACTCTCA
TTGACTGGATCGTTTATGCCGACGCAGAATGGAGGTACGAAGAGCAGATGTGGTGGGATGAGTGTCTCGCTGGCAGGTCAAGATGGTCGAGTTGTGGGCGGAGGACTAGC
TGGTTTGTTGGTAGCTGCTGGTCCTGTGCAAGTTGTGGTTGGTAGTTTCCTTCCGGGTCACCAGCAAGAACAGAAGCCAAAGAAGCCGAGGAACGAATCCACAACCATTT
TCTTTCCTCCCATCAACACCATCACTGGTGAAGAGATGAAGGCGATGTACGGCGGTGGTGTCAAGCCCGTTCTCACGATACCGTCCTATCAAGAACATAACTCACTGTCA
CCAAACCCAGTCACAGGCTTCAAATTATCCTCCACTGACAACTTACCTTTGCCTGACAAAGAACCCAAAACACAAAGTCAATCGAACTGTGAGGTTTCTTGT
Protein sequenceShow/hide protein sequence
MEEKVTGVSGFAVTNDEALGSFQLAPTTETLKATDEPKVMASAAAATPPASATAAAAPPANAADATPPASTAAAIPPAVSSTETKKKRGRPRKYGPDGKRPLTLALSPMP
ISSSIPLAGEFPNWKRENDLSLAVIKKPQRFEYENPGPRLAYSVGANFTPHVITVNAGEDITMKVMSFSQQESRAICILSANGTISNVTLRQATSSGGTLTYEGRFEILS
LTGSFMPTQNGGTKSRCGGMSVSLAGQDGRVVGGGLAGLLVAAGPVQVVVGSFLPGHQQEQKPKKPRNESTTIFFPPINTITGEEMKAMYGGGVKPVLTIPSYQEHNSLS
PNPVTGFKLSSTDNLPLPDKEPKTQSQSNCEVSC