; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr008592 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr008592
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationtig00007026:24961..30563
RNA-Seq ExpressionSgr008592
SyntenySgr008592
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055123.1 AT-hook motif nuclear-localized protein 3-like [Cucumis melo var. makuwa]9.5e-13680.47Show/hide
Query:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR
        MEEKE G DFGFA+KVSQAPESF  MDSRPEN+S DG  P     AS P+A   DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPL GEF GW+RGRGR
Subjt:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR

Query:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES
        SVESIKKSRK+EYEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSE 
Subjt:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES

Query:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV
        GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFLPPGHQQENKP+KSR+EPT       A+ LSGE  K VFGGV
Subjt:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESR
        KP V STL GD+ ASLDPTPAF+TP VNDKS FP+ESR
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESR

XP_004143720.1 AT-hook motif nuclear-localized protein 3 [Cucumis sativus]2.7e-13879.94Show/hide
Query:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR
        MEEKE GVDFGFA+KVSQAPESF  MD+RPEN+S DG  P     AS PTA   DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPL GEF GW+RGRGR
Subjt:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR

Query:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES
        SVESIKKSRK+EYEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSE 
Subjt:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES

Query:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV
        GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFLPPGHQQENKP+KSR+EPT       AN LSGE    VFGGV
Subjt:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT
        KP VASTL GD+ ASLD  PAF+TP VNDKS FP+ESR  LN SNHEV+
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT

XP_008467286.1 PREDICTED: AT-hook motif nuclear-localized protein 3-like [Cucumis melo]2.9e-14080.23Show/hide
Query:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR
        MEEKE GVDFGFA+KVSQAPESF  MDSRPEN+S DG  P     AS P+A   DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPL GEF GW+RGRGR
Subjt:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR

Query:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES
        SVESIKKSRK+EYEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSE 
Subjt:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES

Query:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV
        GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFLPPGHQQENKP+KSR+EPT       A+ LSGE  K VFGGV
Subjt:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT
        KP V STL GD+ ASLDPTPAF+TP VNDKS FP+ESR  LN SNHEV+
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT

XP_022159759.1 AT-hook motif nuclear-localized protein 3-like [Momordica charantia]9.8e-14181.79Show/hide
Query:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGRSV
        MEEKEGVDFGFA+KV +APES   MDSRPEN+SADG PAVAAAA    AA TDGKKKRGRPRKYGPDG +APTLSPMPISSSIPLTGEF GW+RGRGRSV
Subjt:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGRSV

Query:  ESIKKSRKYEYEI-PGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESG
        ESIKKSRKYEYEI PGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQG RAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESG
Subjt:  ESIKKSRKYEYEI-PGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESG

Query:  GTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTS---TANTLSGEVIKGVFGGVKPT
        GTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFL PGHQQENKPKKSR EP +    AN  SGE  KGVFGGVK  
Subjt:  GTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTS---TANTLSGEVIKGVFGGVKPT

Query:  VASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT
          STL GDKAASLDPTPA RTPP N+K P+PEESR G NQSNHEV+
Subjt:  VASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT

XP_023532619.1 AT-hook motif nuclear-localized protein 3 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-13173.37Show/hide
Query:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADG-------VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWR
        MEEKEGVDFGFALKVSQAP+SF  MDSRPENTS D         PA A   +A T A TDGKKKRGRPRKYGPDGTV PTLSPMPISSSIPLTGEFPGW+
Subjt:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADG-------VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWR

Query:  RGRGRSVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSY
        RGRGRSVES+KKSRKY+YEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSY
Subjt:  RGRGRSVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSY

Query:  MPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGV
        MPS+ GGTKSRSGGMSVSLA PDGRVMGGGL+GMLIAAGPV               QVVVGSF PPGHQQENKPKKSR+EPTS A              +
Subjt:  MPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVTSLDRS----LIIRLIRCLF
         P  A    GDKA SLDP PAF T PVN+K P PEESRV LN SNHEV  L+       ++R+ R +F
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVTSLDRS----LIIRLIRCLF

TrEMBL top hitse value%identityAlignment
A0A0A0KN92 AT-hook motif nuclear-localized protein1.3e-13879.94Show/hide
Query:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR
        MEEKE GVDFGFA+KVSQAPESF  MD+RPEN+S DG  P     AS PTA   DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPL GEF GW+RGRGR
Subjt:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR

Query:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES
        SVESIKKSRK+EYEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSE 
Subjt:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES

Query:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV
        GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFLPPGHQQENKP+KSR+EPT       AN LSGE    VFGGV
Subjt:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT
        KP VASTL GD+ ASLD  PAF+TP VNDKS FP+ESR  LN SNHEV+
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT

A0A1S3CTE0 AT-hook motif nuclear-localized protein1.4e-14080.23Show/hide
Query:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR
        MEEKE GVDFGFA+KVSQAPESF  MDSRPEN+S DG  P     AS P+A   DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPL GEF GW+RGRGR
Subjt:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR

Query:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES
        SVESIKKSRK+EYEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSE 
Subjt:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES

Query:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV
        GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFLPPGHQQENKP+KSR+EPT       A+ LSGE  K VFGGV
Subjt:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT
        KP V STL GD+ ASLDPTPAF+TP VNDKS FP+ESR  LN SNHEV+
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT

A0A5A7UGP9 AT-hook motif nuclear-localized protein4.6e-13680.47Show/hide
Query:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR
        MEEKE G DFGFA+KVSQAPESF  MDSRPEN+S DG  P     AS P+A   DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPL GEF GW+RGRGR
Subjt:  MEEKE-GVDFGFALKVSQAPESFA-MDSRPENTSADG-VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGR

Query:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES
        SVESIKKSRK+EYEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSE 
Subjt:  SVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSES

Query:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV
        GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFLPPGHQQENKP+KSR+EPT       A+ LSGE  K VFGGV
Subjt:  GGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPT-----STANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESR
        KP V STL GD+ ASLDPTPAF+TP VNDKS FP+ESR
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESR

A0A6J1E398 AT-hook motif nuclear-localized protein4.8e-14181.79Show/hide
Query:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGRSV
        MEEKEGVDFGFA+KV +APES   MDSRPEN+SADG PAVAAAA    AA TDGKKKRGRPRKYGPDG +APTLSPMPISSSIPLTGEF GW+RGRGRSV
Subjt:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGRSV

Query:  ESIKKSRKYEYEI-PGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESG
        ESIKKSRKYEYEI PGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQG RAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESG
Subjt:  ESIKKSRKYEYEI-PGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESG

Query:  GTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTS---TANTLSGEVIKGVFGGVKPT
        GTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPV               QVVVGSFL PGHQQENKPKKSR EP +    AN  SGE  KGVFGGVK  
Subjt:  GTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTS---TANTLSGEVIKGVFGGVKPT

Query:  VASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT
          STL GDKAASLDPTPA RTPP N+K P+PEESR G NQSNHEV+
Subjt:  VASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHEVT

A0A6J1GZL5 AT-hook motif nuclear-localized protein9.9e-13176.08Show/hide
Query:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADG-------VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWR
        MEEKEGVDFGFALKVSQA +SF  MDSRPENTS D         P  A   +A T A TDGKKKRGRPRKYGPDGTV PTLSPMPISSSIPLTGEFPGW+
Subjt:  MEEKEGVDFGFALKVSQAPESFA-MDSRPENTSADG-------VPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWR

Query:  RGRGRSVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSY
        RGRGRSVES+KKSRKY+YEIPGNKV  FAGADFTPHVITVNIGEDVNLKVM+FSQQGSRAI ILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSY
Subjt:  RGRGRSVESIKKSRKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSY

Query:  MPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGV
        MPSE GGTKSRSGGMSVSLA PDGRVMGGGL+GMLIAAGPV               QVVVGSF PPGHQQENKPKKSR+EPTS A              +
Subjt:  MPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGV

Query:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHE
         P  A    GDKA SLDP PAFRT PV++K P PEESRV LN SNHE
Subjt:  KPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESRVGLNQSNHE

SwissProt top hitse value%identityAlignment
O49658 AT-hook motif nuclear-localized protein 22.9e-5549.33Show/hide
Query:  GVDFGFALKVSQAPESFAMDSR------PENTSADGVPAVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGW--
        G D G  +  S AP  F M  R      P N+ A   P     +  P+AA  DG      KK+RGRPRKYG DG  A TLSP PISS+ P T     +  
Subjt:  GVDFGFALKVSQAPESFAMDSR------PENTSADGVPAVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGW--

Query:  ---RRGRGRSVESIKKS---RKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEI
           +RG+ +       S    KY+ E  G    S A A+FTPH+ITVN GEDV  ++++FSQQGS AI +L ANG+VS+VTLRQ  SSGGTLTYEGRFEI
Subjt:  ---RRGRGRSVESIKKS---RKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEI

Query:  LSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPK-------KSRIEPTST
        LSLSG++MPS+S GT+SR+GGMSVSLA PDGRV+GGG+AG+L+AA P+               QVVVG+FL   +QQE  PK        S + PTS+
Subjt:  LSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPK-------KSRIEPTST

Q8VYJ2 AT-hook motif nuclear-localized protein 14.5e-5647.02Show/hide
Query:  GVDFGFALKVSQAPESFAMDSRPEN-----TSADGVP-------------AVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSS
        G D G  +  S AP  F +  R E+     TS    P              ++   +  T A  +G      KKKRGRPRKYGPDGTV   LSP PISS+
Subjt:  GVDFGFALKVSQAPESFAMDSRPEN-----TSADGVP-------------AVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSS

Query:  IPLTGEFP-----------GWRRGRGRSVESIKKSRKYEYEIP--GNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLR
         P     P             +R + +   S  ++ KY +++   G       G +FTPH+ITVN GEDV +K+++FSQQG R+I +LSANG++S+VTLR
Subjt:  IPLTGEFP-----------GWRRGRGRSVESIKKSRKYEYEIP--GNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLR

Query:  QSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKK
        Q  SSGGTLTYEGRFEILSLSGS+MP++SGGT+SR+GGMSVSLA PDGRV+GGGLAG+L+AA PV               QVVVGSFL     Q+ KPKK
Subjt:  QSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKK

Query:  SR
        ++
Subjt:  SR

Q9FHM5 AT-hook motif nuclear-localized protein 41.0e-6044.18Show/hide
Query:  MEEKEGVDF-----GFALKVSQAP-ESFAMDSRPENTSADGVPAVAAAASAPTAAPTDG------------------KKKRGRPRKYGPDGTVAPTLSPM
        MEE+EG +       F LK  + P        R EN +   V   + +++A    P++                   KKKRGRPRKY PDG++A TLSPM
Subjt:  MEEKEGVDF-----GFALKVSQAP-ESFAMDSRPENTSADGVPAVAAAASAPTAAPTDG------------------KKKRGRPRKYGPDGTVAPTLSPM

Query:  PISSSIPLTGEFPGWRRGRGR------------------------SVESIKKSRKYEYEIPGNKVTSFAGA-------DFTPHVITVNIGEDVNLKVMTF
        PISSS+PLT EF   +RGRGR                            +K  + +E+    N  TS  G         FTPHV+TVN GEDV +K+MTF
Subjt:  PISSSIPLTGEFPGWRRGRGR------------------------SVESIKKSRKYEYEIPGNKVTSFAGA-------DFTPHVITVNIGEDVNLKVMTF

Query:  SQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTF
        SQQGSRAI ILSANG +SNVTLRQS +SGGTLTYEG FEILSL+GS++PSESGGT+SR+GGMSVSLAG DGRV GGGLAG+ IAAGPV            
Subjt:  SQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTF

Query:  NGCQVVVGSFL---PPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPAFRTPPVN
           QV+VGSF+       QQ+ + KK R E      T     I   FGG      +    +K   + P P    PPV+
Subjt:  NGCQVVVGSFL---PPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPAFRTPPVN

Q9LVB0 AT-hook motif nuclear-localized protein 61.5e-6253.14Show/hide
Query:  VSQAPESFAMDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGT-----VAPTLSPMPISSSIPLTGEFPGWRRGRGRS----VESIKKS
        V+  P   A  S P  T+     A A+  S PT      KKKRGRPRKY PDG+     + PTLSP PISSSIPL+G++  W+RG+ +     +E +KKS
Subjt:  VSQAPESFAMDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGT-----VAPTLSPMPISSSIPLTGEFPGWRRGRGRS----VESIKKS

Query:  RKYEYEIPGNK-----VTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGT
         K+EY  P        ++ + GA+FT H  TVN GEDV +KVM +SQQGSRAI ILSA G +SNVTL Q T++GGTLTYEGRFEILSLSGS+MP+E+GGT
Subjt:  RKYEYEIPGNK-----VTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGT

Query:  KSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRI
        K R+GGMS+SLAGP+G + GGGLAGMLIAAGPV               QVV+GSF+     ++N+ KK R+
Subjt:  KSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRI

Q9SB31 AT-hook motif nuclear-localized protein 32.6e-6746.4Show/hide
Query:  MEEKEGVDF------GFALK----VSQAPESFAMD--SRPENTSADGVPAVAAAASAPTAA--------------PTDG------KKKRGRPRKYGPDGT
        MEE+EG +        F LK     + +   ++MD   RPEN +   VP     A+A  AA              PT+       KKKRGRPRKY PDGT
Subjt:  MEEKEGVDF------GFALK----VSQAPESFAMD--SRPENTSADGVPAVAAAASAPTAA--------------PTDG------KKKRGRPRKYGPDGT

Query:  VAPTLSPMPISSSIPLTGEFPGWRRGRGRSVES--IKKSRKYEYE-------IPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSAN
        +  TLSPMPISSS+PLT EFP  +RGRGR   +  +KKS+ ++++       + G     F GA+FTPHV+ VN GEDV +K+MTFSQQGSRAI ILSAN
Subjt:  VAPTLSPMPISSSIPLTGEFPGWRRGRGRSVES--IKKSRKYEYE-------IPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSAN

Query:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPG
        G +SNVTLRQS +SGGTLTYEGRFEILSL+GS+M ++SGGT+SR+GGMSV LAGPDGRV GGGLAG+ +AAGPV               QV+VG+F+   
Subjt:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPG

Query:  HQQENKPKKSR-----IEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPA---FRTPPVND-KSPF
         Q + +  K R      +P+S +  +S E  K  F  +  +VA          ++ T A   + T  VN  K PF
Subjt:  HQQENKPKKSR-----IEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPA---FRTPPVND-KSPF

Arabidopsis top hitse value%identityAlignment
AT4G12080.1 AT-hook motif nuclear-localized protein 13.2e-5747.02Show/hide
Query:  GVDFGFALKVSQAPESFAMDSRPEN-----TSADGVP-------------AVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSS
        G D G  +  S AP  F +  R E+     TS    P              ++   +  T A  +G      KKKRGRPRKYGPDGTV   LSP PISS+
Subjt:  GVDFGFALKVSQAPESFAMDSRPEN-----TSADGVP-------------AVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSS

Query:  IPLTGEFP-----------GWRRGRGRSVESIKKSRKYEYEIP--GNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLR
         P     P             +R + +   S  ++ KY +++   G       G +FTPH+ITVN GEDV +K+++FSQQG R+I +LSANG++S+VTLR
Subjt:  IPLTGEFP-----------GWRRGRGRSVESIKKSRKYEYEIP--GNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLR

Query:  QSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKK
        Q  SSGGTLTYEGRFEILSLSGS+MP++SGGT+SR+GGMSVSLA PDGRV+GGGLAG+L+AA PV               QVVVGSFL     Q+ KPKK
Subjt:  QSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKK

Query:  SR
        ++
Subjt:  SR

AT4G22770.1 AT hook motif DNA-binding family protein2.1e-5649.33Show/hide
Query:  GVDFGFALKVSQAPESFAMDSR------PENTSADGVPAVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGW--
        G D G  +  S AP  F M  R      P N+ A   P     +  P+AA  DG      KK+RGRPRKYG DG  A TLSP PISS+ P T     +  
Subjt:  GVDFGFALKVSQAPESFAMDSR------PENTSADGVPAVAAAASAPTAAPTDG------KKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGW--

Query:  ---RRGRGRSVESIKKS---RKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEI
           +RG+ +       S    KY+ E  G    S A A+FTPH+ITVN GEDV  ++++FSQQGS AI +L ANG+VS+VTLRQ  SSGGTLTYEGRFEI
Subjt:  ---RRGRGRSVESIKKS---RKYEYEIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEI

Query:  LSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPK-------KSRIEPTST
        LSLSG++MPS+S GT+SR+GGMSVSLA PDGRV+GGG+AG+L+AA P+               QVVVG+FL   +QQE  PK        S + PTS+
Subjt:  LSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPK-------KSRIEPTST

AT4G25320.1 AT hook motif DNA-binding family protein1.8e-6846.4Show/hide
Query:  MEEKEGVDF------GFALK----VSQAPESFAMD--SRPENTSADGVPAVAAAASAPTAA--------------PTDG------KKKRGRPRKYGPDGT
        MEE+EG +        F LK     + +   ++MD   RPEN +   VP     A+A  AA              PT+       KKKRGRPRKY PDGT
Subjt:  MEEKEGVDF------GFALK----VSQAPESFAMD--SRPENTSADGVPAVAAAASAPTAA--------------PTDG------KKKRGRPRKYGPDGT

Query:  VAPTLSPMPISSSIPLTGEFPGWRRGRGRSVES--IKKSRKYEYE-------IPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSAN
        +  TLSPMPISSS+PLT EFP  +RGRGR   +  +KKS+ ++++       + G     F GA+FTPHV+ VN GEDV +K+MTFSQQGSRAI ILSAN
Subjt:  VAPTLSPMPISSSIPLTGEFPGWRRGRGRSVES--IKKSRKYEYE-------IPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSAN

Query:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPG
        G +SNVTLRQS +SGGTLTYEGRFEILSL+GS+M ++SGGT+SR+GGMSV LAGPDGRV GGGLAG+ +AAGPV               QV+VG+F+   
Subjt:  GMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPG

Query:  HQQENKPKKSR-----IEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPA---FRTPPVND-KSPF
         Q + +  K R      +P+S +  +S E  K  F  +  +VA          ++ T A   + T  VN  K PF
Subjt:  HQQENKPKKSR-----IEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPA---FRTPPVND-KSPF

AT5G51590.1 AT hook motif DNA-binding family protein7.4e-6244.18Show/hide
Query:  MEEKEGVDF-----GFALKVSQAP-ESFAMDSRPENTSADGVPAVAAAASAPTAAPTDG------------------KKKRGRPRKYGPDGTVAPTLSPM
        MEE+EG +       F LK  + P        R EN +   V   + +++A    P++                   KKKRGRPRKY PDG++A TLSPM
Subjt:  MEEKEGVDF-----GFALKVSQAP-ESFAMDSRPENTSADGVPAVAAAASAPTAAPTDG------------------KKKRGRPRKYGPDGTVAPTLSPM

Query:  PISSSIPLTGEFPGWRRGRGR------------------------SVESIKKSRKYEYEIPGNKVTSFAGA-------DFTPHVITVNIGEDVNLKVMTF
        PISSS+PLT EF   +RGRGR                            +K  + +E+    N  TS  G         FTPHV+TVN GEDV +K+MTF
Subjt:  PISSSIPLTGEFPGWRRGRGR------------------------SVESIKKSRKYEYEIPGNKVTSFAGA-------DFTPHVITVNIGEDVNLKVMTF

Query:  SQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTF
        SQQGSRAI ILSANG +SNVTLRQS +SGGTLTYEG FEILSL+GS++PSESGGT+SR+GGMSVSLAG DGRV GGGLAG+ IAAGPV            
Subjt:  SQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTF

Query:  NGCQVVVGSFL---PPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPAFRTPPVN
           QV+VGSF+       QQ+ + KK R E      T     I   FGG      +    +K   + P P    PPV+
Subjt:  NGCQVVVGSFL---PPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPAFRTPPVN

AT5G62260.1 AT hook motif DNA-binding family protein1.0e-6353.14Show/hide
Query:  VSQAPESFAMDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGT-----VAPTLSPMPISSSIPLTGEFPGWRRGRGRS----VESIKKS
        V+  P   A  S P  T+     A A+  S PT      KKKRGRPRKY PDG+     + PTLSP PISSSIPL+G++  W+RG+ +     +E +KKS
Subjt:  VSQAPESFAMDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGT-----VAPTLSPMPISSSIPLTGEFPGWRRGRGRS----VESIKKS

Query:  RKYEYEIPGNK-----VTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGT
         K+EY  P        ++ + GA+FT H  TVN GEDV +KVM +SQQGSRAI ILSA G +SNVTL Q T++GGTLTYEGRFEILSLSGS+MP+E+GGT
Subjt:  RKYEYEIPGNK-----VTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGT

Query:  KSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRI
        K R+GGMS+SLAGP+G + GGGLAGMLIAAGPV               QVV+GSF+     ++N+ KK R+
Subjt:  KSRSGGMSVSLAGPDGRVMGGGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGAAAGAAGGCGTGGATTTTGGGTTTGCACTAAAGGTGAGCCAAGCTCCAGAGAGCTTCGCGATGGATTCGAGACCTGAAAATACAAGCGCCGATGGTGTGCC
GGCAGTAGCGGCGGCGGCGAGTGCTCCAACGGCAGCGCCTACGGATGGAAAGAAGAAAAGAGGGAGGCCGAGAAAGTACGGACCGGATGGGACTGTAGCACCAACATTGT
CGCCAATGCCGATTTCGTCGTCGATTCCGCTAACAGGAGAATTTCCAGGTTGGAGACGGGGAAGAGGGAGGTCAGTAGAGTCAATTAAGAAGTCGCGGAAGTACGAGTAT
GAGATTCCAGGTAACAAGGTTACCTCCTTTGCTGGTGCAGATTTTACACCTCACGTGATCACTGTCAATATTGGCGAGGATGTTAACTTGAAAGTAATGACATTTTCTCA
ACAAGGATCTAGAGCTATTTTTATACTTTCTGCAAATGGTATGGTTTCAAATGTTACACTTCGGCAGTCAACATCTTCTGGGGGTACTCTAACATACGAGGGTCGTTTTG
AAATACTTTCATTGTCTGGATCATATATGCCTTCTGAGAGTGGCGGAACAAAGAGCCGATCTGGAGGGATGAGTGTCTCTTTGGCTGGCCCAGATGGCCGAGTGATGGGT
GGAGGTCTTGCTGGCATGTTGATAGCAGCTGGTCCAGTGCAGATGTATCATGGTAAAATAAAATTGCATACCTTCAATGGTTGTCAGGTGGTGGTCGGCAGTTTCCTACC
ACCAGGTCACCAGCAGGAAAATAAACCGAAGAAGAGTCGGATTGAACCTACATCAACTGCAAATACTCTTTCTGGCGAAGTGATAAAGGGAGTCTTTGGAGGAGTGAAGC
CCACCGTCGCGTCTACTCTTACTGGAGATAAAGCAGCTTCTTTAGACCCAACTCCAGCTTTTAGAACTCCACCAGTCAACGATAAATCACCTTTTCCAGAAGAATCAAGG
GTTGGCCTCAACCAATCAAACCATGAGGTAACGTCTCTAGATAGGAGTTTGATCATTAGGCTTATTCGTTGTCTGTTCTCAGAAC
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAGAAAGAAGGCGTGGATTTTGGGTTTGCACTAAAGGTGAGCCAAGCTCCAGAGAGCTTCGCGATGGATTCGAGACCTGAAAATACAAGCGCCGATGGTGTGCC
GGCAGTAGCGGCGGCGGCGAGTGCTCCAACGGCAGCGCCTACGGATGGAAAGAAGAAAAGAGGGAGGCCGAGAAAGTACGGACCGGATGGGACTGTAGCACCAACATTGT
CGCCAATGCCGATTTCGTCGTCGATTCCGCTAACAGGAGAATTTCCAGGTTGGAGACGGGGAAGAGGGAGGTCAGTAGAGTCAATTAAGAAGTCGCGGAAGTACGAGTAT
GAGATTCCAGGTAACAAGGTTACCTCCTTTGCTGGTGCAGATTTTACACCTCACGTGATCACTGTCAATATTGGCGAGGATGTTAACTTGAAAGTAATGACATTTTCTCA
ACAAGGATCTAGAGCTATTTTTATACTTTCTGCAAATGGTATGGTTTCAAATGTTACACTTCGGCAGTCAACATCTTCTGGGGGTACTCTAACATACGAGGGTCGTTTTG
AAATACTTTCATTGTCTGGATCATATATGCCTTCTGAGAGTGGCGGAACAAAGAGCCGATCTGGAGGGATGAGTGTCTCTTTGGCTGGCCCAGATGGCCGAGTGATGGGT
GGAGGTCTTGCTGGCATGTTGATAGCAGCTGGTCCAGTGCAGATGTATCATGGTAAAATAAAATTGCATACCTTCAATGGTTGTCAGGTGGTGGTCGGCAGTTTCCTACC
ACCAGGTCACCAGCAGGAAAATAAACCGAAGAAGAGTCGGATTGAACCTACATCAACTGCAAATACTCTTTCTGGCGAAGTGATAAAGGGAGTCTTTGGAGGAGTGAAGC
CCACCGTCGCGTCTACTCTTACTGGAGATAAAGCAGCTTCTTTAGACCCAACTCCAGCTTTTAGAACTCCACCAGTCAACGATAAATCACCTTTTCCAGAAGAATCAAGG
GTTGGCCTCAACCAATCAAACCATGAGGTAACGTCTCTAGATAGGAGTTTGATCATTAGGCTTATTCGTTGTCTGTTCTCAGAAC
Protein sequenceShow/hide protein sequence
MEEKEGVDFGFALKVSQAPESFAMDSRPENTSADGVPAVAAAASAPTAAPTDGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLTGEFPGWRRGRGRSVESIKKSRKYEY
EIPGNKVTSFAGADFTPHVITVNIGEDVNLKVMTFSQQGSRAIFILSANGMVSNVTLRQSTSSGGTLTYEGRFEILSLSGSYMPSESGGTKSRSGGMSVSLAGPDGRVMG
GGLAGMLIAAGPVQMYHGKIKLHTFNGCQVVVGSFLPPGHQQENKPKKSRIEPTSTANTLSGEVIKGVFGGVKPTVASTLTGDKAASLDPTPAFRTPPVNDKSPFPEESR
VGLNQSNHEVTSLDRSLIIRLIRCLFSEX