; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G014440 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G014440
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationCmo_Chr15:9798955..9804781
RNA-Seq ExpressionCmoCh15G014440
SyntenyCmoCh15G014440
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017108.1 AT-hook motif nuclear-localized protein 3, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-15487.57Show/hide
Query:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSES-PPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR
        MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSES PPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR
Subjt:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSES-PPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR

Query:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
        GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPH                                         STSSGGTLTYEGRFEILSLSGSYMP
Subjt:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP

Query:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK
        SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK
Subjt:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK

Query:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
        PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
Subjt:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

XP_008467286.1 PREDICTED: AT-hook motif nuclear-localized protein 3-like [Cucumis melo]1.5e-14683.43Show/hide
Query:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR
        MEEKE GVDFGFAV+VSQAPES GMMDSRPEN+STD E+  PP QP ASVP+  A DGKKKRGRPRKYGP+GTVAPTLSPMPISSSIPL GEF  WKRGR
Subjt:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR

Query:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
        GRSVESIKK R+ +E+EIPGNKVAF AGADFTPHVITVNIGEDVNLK+MSFSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
Subjt:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP

Query:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK
        SE  GTKSRS G+SVSLAG DGRVMGG LAGMLIAAGPVQVVVGSFLPP HQ+E+KP+KSRMEP  NAI PPAD+LSGEGT  V GGV+ IVPSTL  D+
Subjt:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK

Query:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
          SLD  PA KTPQVN+K  FPQESRGVLNHSNHEVSC
Subjt:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

XP_022928888.1 AT-hook motif nuclear-localized protein 3-like [Cucurbita moschata]2.3e-184100Show/hide
Query:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG
        MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG
Subjt:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG

Query:  RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS
        RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS
Subjt:  RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS

Query:  ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP
        ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP
Subjt:  ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP

Query:  TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
        TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
Subjt:  TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

XP_022969936.1 AT-hook motif nuclear-localized protein 3-like [Cucurbita maxima]2.5e-17896.5Show/hide
Query:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPP------QPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPV
        MEEKEGVDFGFAVEVSQAPESLGMMD RPENTSTDSESPPP P      QPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPV
Subjt:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPP------QPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPV

Query:  WKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLS
        WKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLS
Subjt:  WKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLS

Query:  GSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPST
        GSYMPSES GTKSRSRGVSVSLAGSDGRVMGGELAG LIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPST
Subjt:  GSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPST

Query:  LIEDKPTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
        LIEDKP SLDQNPA KTPQVNEKLPFPQESRGVLNHSNHEVSC
Subjt:  LIEDKPTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

XP_023550576.1 AT-hook motif nuclear-localized protein 3-like [Cucurbita pepo subsp. pepo]2.9e-17998.22Show/hide
Query:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG
        MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSES  PPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG
Subjt:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG

Query:  RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS
        RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS
Subjt:  RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS

Query:  ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP
        ES GTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTL EDKP
Subjt:  ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP

Query:  TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
         SLDQNPA KTPQVNEKLPFPQESRGVLNHSNHEVSC
Subjt:  TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

TrEMBL top hitse value%identityAlignment
A0A0A0KN92 AT-hook motif nuclear-localized protein2.5e-14482.84Show/hide
Query:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR
        MEEKE GVDFGFAV+VSQAPES GMMD+RPEN+STD E+  PP QP ASVPT  A DGKKKRGRPRKYGP+GTVAPTLSPMPISSSIPL+GEF  WKRGR
Subjt:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR

Query:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
        GRSVESIKK R+ +E+EIPGNKVAF AGADFTPHVITVNIGEDVNLK+MSFSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
Subjt:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP

Query:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK
        SE  GTKSRS G+SVSLAG DGRVMGG LAGMLIAAGPVQVVVGSFLPP HQ+E+KP+KSRMEP  NA  PPA++LSGEGTN V GGV+ IV STL  D+
Subjt:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK

Query:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
          SLD  PA KTPQVN+K  FPQESRGVLNHSNHEVSC
Subjt:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

A0A1S3CTE0 AT-hook motif nuclear-localized protein7.1e-14783.43Show/hide
Query:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR
        MEEKE GVDFGFAV+VSQAPES GMMDSRPEN+STD E+  PP QP ASVP+  A DGKKKRGRPRKYGP+GTVAPTLSPMPISSSIPL GEF  WKRGR
Subjt:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR

Query:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
        GRSVESIKK R+ +E+EIPGNKVAF AGADFTPHVITVNIGEDVNLK+MSFSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
Subjt:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP

Query:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK
        SE  GTKSRS G+SVSLAG DGRVMGG LAGMLIAAGPVQVVVGSFLPP HQ+E+KP+KSRMEP  NAI PPAD+LSGEGT  V GGV+ IVPSTL  D+
Subjt:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK

Query:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
          SLD  PA KTPQVN+K  FPQESRGVLNHSNHEVSC
Subjt:  PTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

A0A5A7UGP9 AT-hook motif nuclear-localized protein2.1e-13882.52Show/hide
Query:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR
        MEEKE G DFGFAV+VSQAPES GMMDSRPEN+STD E+  PP QP ASVP+  A DGKKKRGRPRKYGP+GTVAPTLSPMPISSSIPL+GEF  WKRGR
Subjt:  MEEKE-GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGR

Query:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
        GRSVESIKK R+ +E+EIPGNKVAF AGADFTPHVITVNIGEDVNLK+MSFSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP
Subjt:  GRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMP

Query:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK
        SE  GTKSRS G+SVSLAG DGRVMGG LAGMLIAAGPVQVVVGSFLPP HQ+E+KP+KSRMEP  NAI PPAD+LSGEGT  V GGV+ IVPSTL  D+
Subjt:  SESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDK

Query:  PTSLDQNPASKTPQVNEKLPFPQESR
          SLD  PA KTPQVN+K  FPQESR
Subjt:  PTSLDQNPASKTPQVNEKLPFPQESR

A0A6J1EQD9 AT-hook motif nuclear-localized protein1.1e-184100Show/hide
Query:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG
        MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG
Subjt:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRG

Query:  RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS
        RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS
Subjt:  RSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPS

Query:  ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP
        ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP
Subjt:  ESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKP

Query:  TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
        TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
Subjt:  TSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

A0A6J1HZ62 AT-hook motif nuclear-localized protein1.2e-17896.5Show/hide
Query:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPP------QPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPV
        MEEKEGVDFGFAVEVSQAPESLGMMD RPENTSTDSESPPP P      QPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPV
Subjt:  MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPP------QPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPV

Query:  WKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLS
        WKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLS
Subjt:  WKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLS

Query:  GSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPST
        GSYMPSES GTKSRSRGVSVSLAGSDGRVMGGELAG LIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPST
Subjt:  GSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPST

Query:  LIEDKPTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC
        LIEDKP SLDQNPA KTPQVNEKLPFPQESRGVLNHSNHEVSC
Subjt:  LIEDKPTSLDQNPASKTPQVNEKLPFPQESRGVLNHSNHEVSC

SwissProt top hitse value%identityAlignment
O49658 AT-hook motif nuclear-localized protein 22.2e-6050.52Show/hide
Query:  GVDFGFAVEVSQAPESLGMMD-SRPENTSTDSESPPPPPQPLASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSG---EFPVW
        G D G  V  S AP    M   S   NT  +S +PPPPP P  S    AA DG      KK+RGRPRKYG +G  A TLSP PISS+ P +    +F   
Subjt:  GVDFGFAVEVSQAPESLGMMD-SRPENTSTDSESPPPPPQPLASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSG---EFPVW

Query:  KRGRGRSVESIKKP----RRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEIL
           RG+   +   P    R  Y+ E  G      A A+FTPH+ITVN GEDV  +I+SFSQQGS AI +L ANG+VS+VTLRQ  SSGGTLTYEGRFEIL
Subjt:  KRGRGRSVESIKKP----RRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEIL

Query:  SLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGT
        SLSG++MPS+S GT+SR+ G+SVSLA  DGRV+GG +AG+L+AA P+QVVVG+FL   +Q+E  PK       S+ ++P +  ++   T
Subjt:  SLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGT

Q8VYJ2 AT-hook motif nuclear-localized protein 11.2e-5843.6Show/hide
Query:  GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQP--------------LASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSS
        G D G  V  S AP    +      +  + +   PPPPQP              + +  T AA +G      KKKRGRPRKYGP+GTV   LSP PISS+
Subjt:  GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQP--------------LASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSS

Query:  IPLSGEFP-----------VWKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQ
         P     P             KR + +   S  + +  ++ E  G       G +FTPH+ITVN GEDV +KI+SFSQQG R+I +LSANG++S+VTLRQ
Subjt:  IPLSGEFP-----------VWKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQ

Query:  STSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR---MEPKSNAILPP
          SSGGTLTYEGRFEILSLSGS+MP++S GT+SR+ G+SVSLA  DGRV+GG LAG+L+AA PVQVVVGSFL     ++ KPKK++   M     A +P 
Subjt:  STSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR---MEPKSNAILPP

Query:  ADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTPQVN
           +S    +  +  V ++  +       TSL  +P +K   +N
Subjt:  ADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTPQVN

Q9FHM5 AT-hook motif nuclear-localized protein 48.3e-6047.83Show/hide
Query:  PPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRGRSVESIKKPRRG-----------------------YEFE--
        PP  L      ++++ KKKRGRPRKY P+G++A TLSPMPISSS+PL+ EF   KRGRGR     +   RG                       +EF   
Subjt:  PPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRGRSVESIKKPRRG-----------------------YEFE--

Query:  ---IPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVS
             G   A I    FTPHV+TVN GEDV +KIM+FSQQGSRAI ILSANG +SNVTLRQS +SGGTLTYEG FEILSL+GS++PSES GT+SR+ G+S
Subjt:  ---IPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVS

Query:  VSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTP
        VSLAG DGRV GG LAG+ IAAGPVQV+VGSF+    + + + ++ + + +    +P     + + +N   GG      +    +KP  +   P S  P
Subjt:  VSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTP

Q9LVB0 AT-hook motif nuclear-localized protein 66.0e-5854.58Show/hide
Query:  TSTDSESPPPPPQPLASVPT------VAATDG----KKKRGRPRKYGPNGT-----VAPTLSPMPISSSIPLSGEFPVWKRGRGRS----VESIKKPRRG
        T+  +  PPPP    A VPT        A+ G    KKKRGRPRKY P+G+     + PTLSP PISSSIPLSG++  WKRG+ +     +E +KK  + 
Subjt:  TSTDSESPPPPPQPLASVPT------VAATDG----KKKRGRPRKYGPNGT-----VAPTLSPMPISSSIPLSGEFPVWKRGRGRS----VESIKKPRRG

Query:  YEFEIPGNK-----VAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKS
        +E+  P        ++   GA+FT H  TVN GEDV +K+M +SQQGSRAI ILSA G +SNVTL Q T++GGTLTYEGRFEILSLSGS+MP+E+ GTK 
Subjt:  YEFEIPGNK-----VAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKS

Query:  RSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKK
        R+ G+S+SLAG +G + GG LAGMLIAAGPVQVV+GSF+  +HQ E   KK
Subjt:  RSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKK

Q9SB31 AT-hook motif nuclear-localized protein 31.5e-6150Show/hide
Query:  MEEKEGVD--------FGFAVEVSQAPESLG-MMDSRPENTSTDSESPPPPPQPLASVPTVAATDG-------------------KKKRGRPRKYGPNGT
        MEE+EG +        FG   +   A    G  MD  P   + +    PP   P A+    A T+                    KKKRGRPRKY P+GT
Subjt:  MEEKEGVD--------FGFAVEVSQAPESLG-MMDSRPENTSTDSESPPPPPQPLASVPTVAATDG-------------------KKKRGRPRKYGPNGT

Query:  VAPTLSPMPISSSIPLSGEFPVWKRGRGRSVES--IKK------PRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSAN
        +  TLSPMPISSS+PL+ EFP  KRGRGR   +  +KK       R   +  + G   A   GA+FTPHV+ VN GEDV +KIM+FSQQGSRAI ILSAN
Subjt:  VAPTLSPMPISSSIPLSGEFPVWKRGRGRSVES--IKK------PRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSAN

Query:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR
        G +SNVTLRQS +SGGTLTYEGRFEILSL+GS+M ++S GT+SR+ G+SV LAG DGRV GG LAG+ +AAGPVQV+VG+F+    Q + +  K R
Subjt:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR

Arabidopsis top hitse value%identityAlignment
AT4G12080.1 AT-hook motif nuclear-localized protein 18.5e-6043.6Show/hide
Query:  GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQP--------------LASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSS
        G D G  V  S AP    +      +  + +   PPPPQP              + +  T AA +G      KKKRGRPRKYGP+GTV   LSP PISS+
Subjt:  GVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQP--------------LASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSS

Query:  IPLSGEFP-----------VWKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQ
         P     P             KR + +   S  + +  ++ E  G       G +FTPH+ITVN GEDV +KI+SFSQQG R+I +LSANG++S+VTLRQ
Subjt:  IPLSGEFP-----------VWKRGRGRSVESIKKPRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQ

Query:  STSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR---MEPKSNAILPP
          SSGGTLTYEGRFEILSLSGS+MP++S GT+SR+ G+SVSLA  DGRV+GG LAG+L+AA PVQVVVGSFL     ++ KPKK++   M     A +P 
Subjt:  STSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR---MEPKSNAILPP

Query:  ADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTPQVN
           +S    +  +  V ++  +       TSL  +P +K   +N
Subjt:  ADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTPQVN

AT4G22770.1 AT hook motif DNA-binding family protein1.6e-6150.52Show/hide
Query:  GVDFGFAVEVSQAPESLGMMD-SRPENTSTDSESPPPPPQPLASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSG---EFPVW
        G D G  V  S AP    M   S   NT  +S +PPPPP P  S    AA DG      KK+RGRPRKYG +G  A TLSP PISS+ P +    +F   
Subjt:  GVDFGFAVEVSQAPESLGMMD-SRPENTSTDSESPPPPPQPLASVPTVAATDG------KKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSG---EFPVW

Query:  KRGRGRSVESIKKP----RRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEIL
           RG+   +   P    R  Y+ E  G      A A+FTPH+ITVN GEDV  +I+SFSQQGS AI +L ANG+VS+VTLRQ  SSGGTLTYEGRFEIL
Subjt:  KRGRGRSVESIKKP----RRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEIL

Query:  SLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGT
        SLSG++MPS+S GT+SR+ G+SVSLA  DGRV+GG +AG+L+AA P+QVVVG+FL   +Q+E  PK       S+ ++P +  ++   T
Subjt:  SLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGT

AT4G25320.1 AT hook motif DNA-binding family protein1.1e-6250Show/hide
Query:  MEEKEGVD--------FGFAVEVSQAPESLG-MMDSRPENTSTDSESPPPPPQPLASVPTVAATDG-------------------KKKRGRPRKYGPNGT
        MEE+EG +        FG   +   A    G  MD  P   + +    PP   P A+    A T+                    KKKRGRPRKY P+GT
Subjt:  MEEKEGVD--------FGFAVEVSQAPESLG-MMDSRPENTSTDSESPPPPPQPLASVPTVAATDG-------------------KKKRGRPRKYGPNGT

Query:  VAPTLSPMPISSSIPLSGEFPVWKRGRGRSVES--IKK------PRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSAN
        +  TLSPMPISSS+PL+ EFP  KRGRGR   +  +KK       R   +  + G   A   GA+FTPHV+ VN GEDV +KIM+FSQQGSRAI ILSAN
Subjt:  VAPTLSPMPISSSIPLSGEFPVWKRGRGRSVES--IKK------PRRGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSAN

Query:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR
        G +SNVTLRQS +SGGTLTYEGRFEILSL+GS+M ++S GT+SR+ G+SV LAG DGRV GG LAG+ +AAGPVQV+VG+F+    Q + +  K R
Subjt:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSR

AT5G51590.1 AT hook motif DNA-binding family protein5.9e-6147.83Show/hide
Query:  PPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRGRSVESIKKPRRG-----------------------YEFE--
        PP  L      ++++ KKKRGRPRKY P+G++A TLSPMPISSS+PL+ EF   KRGRGR     +   RG                       +EF   
Subjt:  PPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRGRSVESIKKPRRG-----------------------YEFE--

Query:  ---IPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVS
             G   A I    FTPHV+TVN GEDV +KIM+FSQQGSRAI ILSANG +SNVTLRQS +SGGTLTYEG FEILSL+GS++PSES GT+SR+ G+S
Subjt:  ---IPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVS

Query:  VSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTP
        VSLAG DGRV GG LAG+ IAAGPVQV+VGSF+    + + + ++ + + +    +P     + + +N   GG      +    +KP  +   P S  P
Subjt:  VSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTP

AT5G62260.1 AT hook motif DNA-binding family protein4.2e-5954.58Show/hide
Query:  TSTDSESPPPPPQPLASVPT------VAATDG----KKKRGRPRKYGPNGT-----VAPTLSPMPISSSIPLSGEFPVWKRGRGRS----VESIKKPRRG
        T+  +  PPPP    A VPT        A+ G    KKKRGRPRKY P+G+     + PTLSP PISSSIPLSG++  WKRG+ +     +E +KK  + 
Subjt:  TSTDSESPPPPPQPLASVPT------VAATDG----KKKRGRPRKYGPNGT-----VAPTLSPMPISSSIPLSGEFPVWKRGRGRS----VESIKKPRRG

Query:  YEFEIPGNK-----VAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKS
        +E+  P        ++   GA+FT H  TVN GEDV +K+M +SQQGSRAI ILSA G +SNVTL Q T++GGTLTYEGRFEILSLSGS+MP+E+ GTK 
Subjt:  YEFEIPGNK-----VAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKS

Query:  RSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKK
        R+ G+S+SLAG +G + GG LAGMLIAAGPVQVV+GSF+  +HQ E   KK
Subjt:  RSRGVSVSLAGSDGRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGAAAGAGGGAGTGGATTTTGGATTTGCAGTAGAAGTGAGCCAAGCACCAGAGAGCTTAGGAATGATGGATTCGAGACCTGAAAATACAAGCACAGATAGTGA
GTCACCGCCGCCGCCGCCGCAGCCGCTGGCGAGTGTTCCAACGGTGGCGGCTACAGATGGGAAGAAGAAAAGAGGAAGGCCGAGAAAGTACGGTCCCAATGGGACTGTAG
CTCCAACTTTGTCGCCAATGCCGATTTCGTCTTCGATTCCGCTGAGTGGAGAATTTCCGGTTTGGAAACGGGGAAGAGGGCGGTCAGTAGAGTCGATTAAGAAGCCGCGA
CGAGGGTACGAATTTGAGATTCCAGGTAACAAGGTTGCCTTCATTGCTGGAGCAGATTTCACACCTCACGTGATCACTGTTAATATTGGTGAGGATGTTAACTTGAAAAT
CATGTCATTTTCTCAACAAGGATCTCGGGCAATTTTTATACTCTCTGCTAATGGTATGGTTTCGAATGTTACACTTCGGCAGTCAACATCTTCTGGGGGCACGCTAACGT
ATGAGGGTCGATTTGAGATACTTTCGTTGTCTGGATCATATATGCCTTCTGAGAGCAGTGGAACAAAGAGCCGATCTCGAGGGGTGAGTGTTTCTTTGGCTGGCTCAGAT
GGCCGAGTGATGGGTGGAGAACTTGCCGGCATGCTGATAGCAGCTGGTCCAGTACAGGTGGTGGTGGGCAGTTTCCTACCCCCAGTTCACCAGAAGGAACATAAACCAAA
GAAGAGTAGGATGGAACCTAAATCGAACGCAATTTTACCTCCAGCCGATGTTCTTTCTGGCGAAGGGACGAACGGAGTCCTTGGCGGAGTGGAGGCCATAGTCCCATCTA
CTCTAATTGAAGATAAACCAACCTCTTTAGACCAAAATCCAGCGTCTAAAACTCCACAAGTCAACGAAAAGTTACCATTTCCACAAGAATCAAGAGGTGTACTCAACCAT
TCAAACCATGAGGTTTCTTGTTGA
mRNA sequenceShow/hide mRNA sequence
TTTTTTTTATGTTTTTTTTTTTTTTTTAAATTTTATTTTTACGATCTTTGGGTCAGTGATTTGTTGGATTTTCCCTTCACTGAAGCTCTGTGTAATAGTCTGTAATGGTG
GAGATTTAGGTTACTGAAACGAAACTGAGTTCAAGTTGTAGTACTGAAGCTCAAGTTCCTGATCTAGAAGCAGTTCTAGTGGAGGTTTCGAGTTTGATGTTGTAGTCATA
GGCTCTGGATTTCATCTCTCTTCATTGGCTGTGTTCTTGTTTTCTTGTGTTCTTATTTCGTTGTCTTCTTCATTGAGAGCTTCTTCGTTGATGTTATGGTTTGGAAGATC
GTCTGTGGTGTGAATAATGGAGGTTTTTAAGGTTTAAGCGGTGAGTTTGTGTTGTTTTGCTTTTGAACCGATCTTTTTTCTCTGTGTCACTGTAAAAATGGAGGAGAAAG
AGGGAGTGGATTTTGGATTTGCAGTAGAAGTGAGCCAAGCACCAGAGAGCTTAGGAATGATGGATTCGAGACCTGAAAATACAAGCACAGATAGTGAGTCACCGCCGCCG
CCGCCGCAGCCGCTGGCGAGTGTTCCAACGGTGGCGGCTACAGATGGGAAGAAGAAAAGAGGAAGGCCGAGAAAGTACGGTCCCAATGGGACTGTAGCTCCAACTTTGTC
GCCAATGCCGATTTCGTCTTCGATTCCGCTGAGTGGAGAATTTCCGGTTTGGAAACGGGGAAGAGGGCGGTCAGTAGAGTCGATTAAGAAGCCGCGACGAGGGTACGAAT
TTGAGATTCCAGGTAACAAGGTTGCCTTCATTGCTGGAGCAGATTTCACACCTCACGTGATCACTGTTAATATTGGTGAGGATGTTAACTTGAAAATCATGTCATTTTCT
CAACAAGGATCTCGGGCAATTTTTATACTCTCTGCTAATGGTATGGTTTCGAATGTTACACTTCGGCAGTCAACATCTTCTGGGGGCACGCTAACGTATGAGGGTCGATT
TGAGATACTTTCGTTGTCTGGATCATATATGCCTTCTGAGAGCAGTGGAACAAAGAGCCGATCTCGAGGGGTGAGTGTTTCTTTGGCTGGCTCAGATGGCCGAGTGATGG
GTGGAGAACTTGCCGGCATGCTGATAGCAGCTGGTCCAGTACAGGTGGTGGTGGGCAGTTTCCTACCCCCAGTTCACCAGAAGGAACATAAACCAAAGAAGAGTAGGATG
GAACCTAAATCGAACGCAATTTTACCTCCAGCCGATGTTCTTTCTGGCGAAGGGACGAACGGAGTCCTTGGCGGAGTGGAGGCCATAGTCCCATCTACTCTAATTGAAGA
TAAACCAACCTCTTTAGACCAAAATCCAGCGTCTAAAACTCCACAAGTCAACGAAAAGTTACCATTTCCACAAGAATCAAGAGGTGTACTCAACCATTCAAACCATGAGG
TTTCTTGTTGATCATAATCGTCCCCTGCACATCAATCTTTCATGCAGATTTGCTGGTGCATTGGCTTTCCTCGACCTCGATTATCAATGGTATGAGCTCATTGATTCTTT
AAAGGGCTAACTAAGTGTGTTTAAACAACTGACGTATATGTGGCCATCAGGTAACATCTCTAGATAGGAGTTTGATCCTTTAGGCTTATTTGTTCTGTTCTCAGCATGTA
GGTTCTTGTGTGTTGTATCCCAATTAACGTTATAGCAGTTTAGGTGCAGAAGTTTGTGTAAATTATTCATGTTCTAGAATTCTCTGTATCTGAAATAATAACATTTTAAG
TTAAATTTCAAACACTGTTTCCCAATGTTCTCTTTGTGTACGTATTCATTTATCTCAGGCCTGGCTGGCTTTTGTTAATAAGGTTCTGTAGAACACCA
Protein sequenceShow/hide protein sequence
MEEKEGVDFGFAVEVSQAPESLGMMDSRPENTSTDSESPPPPPQPLASVPTVAATDGKKKRGRPRKYGPNGTVAPTLSPMPISSSIPLSGEFPVWKRGRGRSVESIKKPR
RGYEFEIPGNKVAFIAGADFTPHVITVNIGEDVNLKIMSFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESSGTKSRSRGVSVSLAGSD
GRVMGGELAGMLIAAGPVQVVVGSFLPPVHQKEHKPKKSRMEPKSNAILPPADVLSGEGTNGVLGGVEAIVPSTLIEDKPTSLDQNPASKTPQVNEKLPFPQESRGVLNH
SNHEVSC