CuGenDBv2

Sed0023143 (gene) of Chayote v1 genome

Gene ID: Sed0023143
Organism: Sechium edule (Chayote v1)
Description: AT-hook motif nuclear-localized protein
Genome location: LG05:6958647..6963511
RNA-Seq Expression: Sed0023143
Synteny: Sed0023143
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
    GO:0003680 - AT DNA binding (molecular function)
InterPro domains: IPR005175 - PPC domain
    IPR039605 - AT-hook motif nuclear-localized protein
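
The genome location field packs the linkage group and the start and end coordinates into one string. A minimal Python sketch for splitting it, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API:

```python
import re

def parse_location(location):
    """Split 'SEQID:START..END' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return abs(end - start) + 1

seqid, start, end = parse_location("LG05:6958647..6963511")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene model spans a few kilobases on LG05, which is longer than the mRNA listed further down and would be consistent with the presence of introns.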


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008437162.1 PREDICTED: AT-hook motif nuclear-localized protein 1 [Cucumis melo] | e-value 2.6e-148 | identity 85.76%
Query:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGS-TPPAAAATTQPTSA-------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS
        MEGRD GG+SSGVTVVGS+APSEY+IAPRT+D PPQT GS TPPA  +T+ P+++       PPPTA  SVPGKKKRGRPRKYGPDGSV+ ALSPKPISS
Subjt:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGS-TPPAAAATTQPTSA-------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS

Query:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE
        SVPPPVIDFS+EK+GKVRP SAVSKSKFEVDNLGDWVPCS+GANFTPHIITVN GED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYE
Subjt:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE

Query:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP
        GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQ EQKPKKPK D IS A PTAAIPISCVDPK+NLSP
Subjt:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP

Query:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA
        S SFRG+NWS+LP DSRNK TDINVSLP+A
Subjt:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA

XP_022957745.1 AT-hook motif nuclear-localized protein 1-like [Cucurbita moschata] | e-value 2.2e-155 | identity 90.68%
Query:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVID
        MEGRDGG+SSGVTVVGS+APSEYQIAPRT D PPQT GSTP AA AA +QP S P PTAA SVPGKKKRGRPRKYGPDGSV+ ALSPKPISSSVPPPVID
Subjt:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVID

Query:  FSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL
        FS+EK+GKVRPASAVSK KFEVDNLGDWVPCSVGANFTPHIITVNTGED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL
Subjt:  FSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL

Query:  SGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGSFRGNN
        SGSFMPSDNGGTR RSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQQEQ PKKPKQD ISTASPTAAIPISCVDPK+NLSPS SFRG+N
Subjt:  SGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGSFRGNN

Query:  WSILPNDSRNKPTDINVSLPTA
        WS+LPNDSRNKPTDINV LP+A
Subjt:  WSILPNDSRNKPTDINVSLPTA

XP_022995666.1 AT-hook motif nuclear-localized protein 1-like [Cucurbita maxima] | e-value 3.4e-153 | identity 89.3%
Query:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQP-----TSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVP
        MEGRD G+SSGVTVVGS+APSEYQIAPRT D PPQT GSTP AA AA +QP      SAP PTAA SVPGKKKRGRPRKYGPDGSV+ ALSPKPISSSVP
Subjt:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQP-----TSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVP

Query:  PPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRF
        PPVIDFS+EK+GKVRPASAVSK KFEVDNLGDWVPCSVGANFTPHIITVNTGED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYEGRF
Subjt:  PPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRF

Query:  EILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGS
        EILSLSGSFMPSDNGGTR RSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQQEQKPKKPKQD IST SPTAAIPISCVDPK+NLSPS S
Subjt:  EILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGS

Query:  FRGNNWSILPNDSRNKPTDINVSLPTA
        FRG+NWS+LPNDSRNKPTDINV LP+A
Subjt:  FRGNNWSILPNDSRNKPTDINVSLPTA

XP_023532888.1 AT-hook motif nuclear-localized protein 1-like [Cucurbita pepo subsp. pepo] | e-value 3.1e-154 | identity 89.6%
Query:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQP-----TSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVP
        MEGRDGG+SSGVTVVGS+APSEYQIAPRT D PPQT GSTP AA AA +QP      SAP PTAA SVPGKKKRGRPRKYGPDGSV+ ALSPKPISSSVP
Subjt:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQP-----TSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVP

Query:  PPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRF
        PPVIDFS+EK+GKVRPASAVSK KFEVDNLGDWVPCSVGANFTPHIITVNTGED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYEGRF
Subjt:  PPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRF

Query:  EILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGS
        EILSLSGSFMPSDNGGTR RSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQQEQKPKKPKQD ISTASPTAAIPISCVDPK+NLSPS S
Subjt:  EILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGS

Query:  FRGNNWSILPNDSRNKPTDINVSLPTA
        FRG+NWS+LPNDS+NKPTDINV LP+A
Subjt:  FRGNNWSILPNDSRNKPTDINVSLPTA

XP_038906607.1 AT-hook motif nuclear-localized protein 1 [Benincasa hispida] | e-value 6.1e-150 | identity 86.67%
Query:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSA--------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS
        MEGRD GG+SSGVTVVGS+APSEY+IAPRT+D PPQT GSTPP A  +T   SA        PPPTAA SVPGKKKRGRPRKYGPDGSV+ ALSPKPISS
Subjt:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSA--------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS

Query:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE
        SVPPPVIDFS+EK+GKVRPA+AVSKSKFEVDNLGDWVPCSVGANFTPHIITVN GED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYE
Subjt:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE

Query:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP
        GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQ EQKPKKPK D IS A PTAAIPISCVDPK+NLSP
Subjt:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP

Query:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA
        S SFRG+NWS+LP DSRNK TDINVSLP+A
Subjt:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA
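
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. In this record the `Subjt:` lines repeat the query sequence, so the sketch below counts identities from the middle line rather than from the subject line. It is a rough illustration only; `midline_identity` is a hypothetical helper, and the figure it gives for a single block is not the per-hit % identity, which covers the whole alignment.

```python
def midline_identity(query, midline):
    """Return (identical, aligned) column counts for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identical, len(query)

# Final block of the Benincasa hispida alignment (query line and middle line).
query = "SGSFRGNNWSILPNDSRNKPTDINVSLPTA"
midline = "S SFRG+NWS+LP DSRNK TDINVSLP+A"

identical, aligned = midline_identity(query, midline)
print(f"{identical}/{aligned} identical = {100 * identical / aligned:.1f}%")
```

Summing the two counts over every block of a hit, then dividing, would give a whole-alignment estimate that can be compared with the listed % identity.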

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3ATC1 AT-hook motif nuclear-localized protein | e-value 1.2e-148 | identity 85.76%
Query:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGS-TPPAAAATTQPTSA-------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS
        MEGRD GG+SSGVTVVGS+APSEY+IAPRT+D PPQT GS TPPA  +T+ P+++       PPPTA  SVPGKKKRGRPRKYGPDGSV+ ALSPKPISS
Subjt:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGS-TPPAAAATTQPTSA-------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS

Query:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE
        SVPPPVIDFS+EK+GKVRP SAVSKSKFEVDNLGDWVPCS+GANFTPHIITVN GED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYE
Subjt:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE

Query:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP
        GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQ EQKPKKPK D IS A PTAAIPISCVDPK+NLSP
Subjt:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP

Query:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA
        S SFRG+NWS+LP DSRNK TDINVSLP+A
Subjt:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA

A0A5A7TMP3 AT-hook motif nuclear-localized protein | e-value 1.2e-148 | identity 85.76%
Query:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGS-TPPAAAATTQPTSA-------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS
        MEGRD GG+SSGVTVVGS+APSEY+IAPRT+D PPQT GS TPPA  +T+ P+++       PPPTA  SVPGKKKRGRPRKYGPDGSV+ ALSPKPISS
Subjt:  MEGRD-GGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGS-TPPAAAATTQPTSA-------PPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISS

Query:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE
        SVPPPVIDFS+EK+GKVRP SAVSKSKFEVDNLGDWVPCS+GANFTPHIITVN GED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYE
Subjt:  SVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYE

Query:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP
        GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQ EQKPKKPK D IS A PTAAIPISCVDPK+NLSP
Subjt:  GRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP

Query:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA
        S SFRG+NWS+LP DSRNK TDINVSLP+A
Subjt:  SGSFRGNNWSILPNDSRNKPTDINVSLPTA

A0A6J1H1D6 AT-hook motif nuclear-localized protein | e-value 1.0e-155 | identity 90.68%
Query:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVID
        MEGRDGG+SSGVTVVGS+APSEYQIAPRT D PPQT GSTP AA AA +QP S P PTAA SVPGKKKRGRPRKYGPDGSV+ ALSPKPISSSVPPPVID
Subjt:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVID

Query:  FSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL
        FS+EK+GKVRPASAVSK KFEVDNLGDWVPCSVGANFTPHIITVNTGED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL
Subjt:  FSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL

Query:  SGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGSFRGNN
        SGSFMPSDNGGTR RSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQQEQ PKKPKQD ISTASPTAAIPISCVDPK+NLSPS SFRG+N
Subjt:  SGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGSFRGNN

Query:  WSILPNDSRNKPTDINVSLPTA
        WS+LPNDSRNKPTDINV LP+A
Subjt:  WSILPNDSRNKPTDINVSLPTA

A0A6J1I241 AT-hook motif nuclear-localized protein | e-value 3.6e-148 | identity 85.93%
Query:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSAP-----------PPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPI
        MEGRDGG+SSGVTVVGS+APS+YQIAPRT+D PPQT GST P A   TQPTS P           PPTAA SVPGKKKRGRPRKYGPDGSVN ALSPKPI
Subjt:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSAP-----------PPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPI

Query:  SSSVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDW-VPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTL
        SSSVPPPVIDFS+EKRGKVRPAS VSK+KFEVDNLGDW VPCSVGANFTPHIITV+ GED+TMKIISFSQQGP+AIC+LSANGVISSVTLRQPDSSGGTL
Subjt:  SSSVPPPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDW-VPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTL

Query:  TYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKAN
        TYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQ EQKPKKPKQD  STA PTAAIPISCVDPK+N
Subjt:  TYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKAN

Query:  LSPSGSFRGNNWSILPNDSR-NKPTDINVSLPTA
        LSPS  FRG+NWS LP DSR NKPTDINVSLP+A
Subjt:  LSPSGSFRGNNWSILPNDSR-NKPTDINVSLPTA

A0A6J1K2J0 AT-hook motif nuclear-localized protein | e-value 1.7e-153 | identity 89.3%
Query:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQP-----TSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVP
        MEGRD G+SSGVTVVGS+APSEYQIAPRT D PPQT GSTP AA AA +QP      SAP PTAA SVPGKKKRGRPRKYGPDGSV+ ALSPKPISSSVP
Subjt:  MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAA-AATTQP-----TSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVP

Query:  PPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRF
        PPVIDFS+EK+GKVRPASAVSK KFEVDNLGDWVPCSVGANFTPHIITVNTGED+TMKIISFSQQGP+AICILSANGVISSVTLRQPDSSGGTLTYEGRF
Subjt:  PPVIDFSSEKRGKVRPASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRF

Query:  EILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGS
        EILSLSGSFMPSDNGGTR RSGGMSVSLASPDGRVVGGGVAGLLVAA PVQVVVGSFLSGNQQEQKPKKPKQD IST SPTAAIPISCVDPK+NLSPS S
Subjt:  EILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGS

Query:  FRGNNWSILPNDSRNKPTDINVSLPTA
        FRG+NWS+LPNDSRNKPTDINV LP+A
Subjt:  FRGNNWSILPNDSRNKPTDINVSLPTA

SwissProt top hits (e-value, % identity, alignment)
O49658 AT-hook motif nuclear-localized protein 2 | e-value 1.1e-82 | identity 58.66%
Query:  GSSSGVTVVGSEAPSEYQIAPR--TADKPPQTDGSTPPAAAATTQPTSAPPPTAAF----SVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPP--VI
        GS  GVTVV S APS++ +APR  T++ PP +    PP       P ++  P+AA     S P KK+RGRPRKYG DG+  T LSP PISS+ P    VI
Subjt:  GSSSGVTVVGSEAPSEYQIAPR--TADKPPQTDGSTPPAAAATTQPTSAPPPTAAF----SVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPP--VI

Query:  DFS--SEKRGKVRPA----SAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEG
        DFS  SEKRGK++PA    S+  + K++V+NLG+W P S  ANFTPHIITVN GED+T +IISFSQQG  AIC+L ANGV+SSVTLRQPDSSGGTLTYEG
Subjt:  DFS--SEKRGKVRPA----SAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEG

Query:  RFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP
        RFEILSLSG+FMPSD+ GTRSR+GGMSVSLASPDGRVVGGGVAGLLVAA P+QVVVG+FL G NQQEQ PK    + +S  SP      +  D +     
Subjt:  RFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP

Query:  SGSFRGNNWS-ILPNDSRNKPT-DINVSL
        + S   + W+   P+DSR+K + D N++L
Subjt:  SGSFRGNNWS-ILPNDSRNKPT-DINVSL

Q4V3E0 AT-hook motif nuclear-localized protein 7 | e-value 2.3e-67 | identity 51.9%
Query:  VGSEAPSEYQIAPRTADKPP-QTDGSTPPAAAATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPASA
        +G+E PS Y +APR +D P  Q  G + P       P  AP P++     GKK+RGRPRKY  +G+   + S   +   V   +  F  +K  K    + 
Subjt:  VGSEAPSEYQIAPRTADKPP-QTDGSTPPAAAATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPASA

Query:  VSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRS
           S  E   +G  V   VG+NFTPH+ITVNTGEDITM+IISFSQQGP+AICILSANGVIS+VTLRQPDS GGTLTYEGRFEILSLSGSFM ++N G++ 
Subjt:  VSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRS

Query:  RSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQE-QKPKKPKQDAISTASPTAAIPISCVDPKANL---------SPSGSFRGNNWSIL
        RSGGMSVSLA PDGRVVGGGVAGLL+AA P+QVVVGSF++ +QQ+ QKP+K + +    A  +   P S   P A++          P  SF  ++W+  
Subjt:  RSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQE-QKPKKPKQDAISTASPTAAIPISCVDPKANL---------SPSGSFRGNNWSIL

Query:  PNDSRNKPTDINVSLP
         +  RN  TDIN+SLP
Subjt:  PNDSRNKPTDINVSLP

Q8VYJ2 AT-hook motif nuclear-localized protein 1 | e-value 1.5e-95 | identity 62.17%
Query:  GSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSAPP----------PTAAF-SVPG---KKKRGRPRKYGPDGSVNTALSPKPISSS
        G+  G+TVV S+APS++ +A R+          TPP    ++  T+ PP           TAA   + G   KKKRGRPRKYGPDG+V  ALSPKPISS+
Subjt:  GSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSAPP----------PTAAF-SVPG---KKKRGRPRKYGPDGSVNTALSPKPISSS

Query:  -----VPPP---VIDFS-SEKRGKVRPASAVSKSKF--EVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQP
             +PPP   VIDFS SEKR KV+P ++ +++K+  +V+NLG+W PCSVG NFTPHIITVNTGED+TMKIISFSQQGP++IC+LSANGVISSVTLRQP
Subjt:  -----VPPP---VIDFS-SEKRGKVRPASAVSKSKF--EVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQP

Query:  DSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPI
        DSSGGTLTYEGRFEILSLSGSFMP+D+GGTRSR+GGMSVSLASPDGRVVGGG+AGLLVAA PVQVVVGSFL+G + Q+QKPKK K D    +SPTAAIPI
Subjt:  DSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPI

Query:  SCVDPKANLSPSGSFRGNN--W-SILPNDSRNKPTDINVSL
        S       +    S   NN  W + L +D RNK TDINV++
Subjt:  SCVDPKANLSPSGSFRGNN--W-SILPNDSRNKPTDINVSL

Q9LVB0 AT-hook motif nuclear-localized protein 6 | e-value 3.9e-59 | identity 51.48%
Query:  MEGRDGGSSSG-VTVVGSEA---PSEYQIAPRTADKPPQTDGST----PPAAAATTQPTSAPPPTAAFSV---PGKKKRGRPRKYGPDGSVN-----TAL
        ME +   S SG VTV G EA    +E+Q  P        T   T    PPA ++   PT+  P +A  S    P KKKRGRPRKY PDGS+N       L
Subjt:  MEGRDGGSSSG-VTVVGSEA---PSEYQIAPRTADKPPQTDGST----PPAAAATTQPTSAPPPTAAFSV---PGKKKRGRPRKYGPDGSVN-----TAL

Query:  SPKPISSSVPPPVIDFSSEKRGKV----RPASAVSKS-KFEVDNLGDWVP-----CSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVIS
        SP PISSS+  P+      KRGK     +P   V KS KFE  +     P     C VGANFT H  TVN GED+TMK++ +SQQG +AICILSA G IS
Subjt:  SPKPISSSVPPPVIDFSSEKRGKV----RPASAVSKS-KFEVDNLGDWVP-----CSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVIS

Query:  SVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASP
        +VTL QP ++GGTLTYEGRFEILSLSGSFMP++NGGT+ R+GGMS+SLA P+G + GGG+AG+L+AAGPVQVV+GSF+  +Q EQ  KK  +   + A P
Subjt:  SVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASP

Query:  TAAIP
            P
Subjt:  TAAIP

Q9SB31 AT-hook motif nuclear-localized protein 3 | e-value 2.3e-59 | identity 57.44%
Query:  PPQTDGSTPPAAAATTQ----PTSAPPPTAAFSVPG-KKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPAS----------AVSKS
        PP T  +    AAA T+    P S   PT   S    KKKRGRPRKY PDG++   LSP PISSSV P   +F   KRG+ R  S             +S
Subjt:  PPQTDGSTPPAAAATTQ----PTSAPPPTAAFSVPG-KKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPAS----------AVSKS

Query:  KFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGG
          + +  G      VGANFTPH++ VN GED+TMKI++FSQQG +AICILSANG IS+VTLRQ  +SGGTLTYEGRFEILSL+GSFM +D+GGTRSR+GG
Subjt:  KFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGG

Query:  MSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQ
        MSV LA PDGRV GGG+AGL +AAGPVQV+VG+F++G +Q Q
Subjt:  MSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQ

Arabidopsis top hits (e-value, % identity, alignment)
AT4G00200.1 AT hook motif DNA-binding family protein | e-value 1.6e-68 | identity 51.9%
Query:  VGSEAPSEYQIAPRTADKPP-QTDGSTPPAAAATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPASA
        +G+E PS Y +APR +D P  Q  G + P       P  AP P++     GKK+RGRPRKY  +G+   + S   +   V   +  F  +K  K    + 
Subjt:  VGSEAPSEYQIAPRTADKPP-QTDGSTPPAAAATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPASA

Query:  VSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRS
           S  E   +G  V   VG+NFTPH+ITVNTGEDITM+IISFSQQGP+AICILSANGVIS+VTLRQPDS GGTLTYEGRFEILSLSGSFM ++N G++ 
Subjt:  VSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRS

Query:  RSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQE-QKPKKPKQDAISTASPTAAIPISCVDPKANL---------SPSGSFRGNNWSIL
        RSGGMSVSLA PDGRVVGGGVAGLL+AA P+QVVVGSF++ +QQ+ QKP+K + +    A  +   P S   P A++          P  SF  ++W+  
Subjt:  RSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQE-QKPKKPKQDAISTASPTAAIPISCVDPKANL---------SPSGSFRGNNWSIL

Query:  PNDSRNKPTDINVSLP
         +  RN  TDIN+SLP
Subjt:  PNDSRNKPTDINVSLP

AT4G12080.1 AT-hook motif nuclear-localized protein 1 | e-value 1.1e-96 | identity 62.17%
Query:  GSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSAPP----------PTAAF-SVPG---KKKRGRPRKYGPDGSVNTALSPKPISSS
        G+  G+TVV S+APS++ +A R+          TPP    ++  T+ PP           TAA   + G   KKKRGRPRKYGPDG+V  ALSPKPISS+
Subjt:  GSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSAPP----------PTAAF-SVPG---KKKRGRPRKYGPDGSVNTALSPKPISSS

Query:  -----VPPP---VIDFS-SEKRGKVRPASAVSKSKF--EVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQP
             +PPP   VIDFS SEKR KV+P ++ +++K+  +V+NLG+W PCSVG NFTPHIITVNTGED+TMKIISFSQQGP++IC+LSANGVISSVTLRQP
Subjt:  -----VPPP---VIDFS-SEKRGKVRPASAVSKSKF--EVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQP

Query:  DSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPI
        DSSGGTLTYEGRFEILSLSGSFMP+D+GGTRSR+GGMSVSLASPDGRVVGGG+AGLLVAA PVQVVVGSFL+G + Q+QKPKK K D    +SPTAAIPI
Subjt:  DSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPI

Query:  SCVDPKANLSPSGSFRGNN--W-SILPNDSRNKPTDINVSL
        S       +    S   NN  W + L +D RNK TDINV++
Subjt:  SCVDPKANLSPSGSFRGNN--W-SILPNDSRNKPTDINVSL

AT4G22770.1 AT hook motif DNA-binding family protein | e-value 8.1e-84 | identity 58.66%
Query:  GSSSGVTVVGSEAPSEYQIAPR--TADKPPQTDGSTPPAAAATTQPTSAPPPTAAF----SVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPP--VI
        GS  GVTVV S APS++ +APR  T++ PP +    PP       P ++  P+AA     S P KK+RGRPRKYG DG+  T LSP PISS+ P    VI
Subjt:  GSSSGVTVVGSEAPSEYQIAPR--TADKPPQTDGSTPPAAAATTQPTSAPPPTAAF----SVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPP--VI

Query:  DFS--SEKRGKVRPA----SAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEG
        DFS  SEKRGK++PA    S+  + K++V+NLG+W P S  ANFTPHIITVN GED+T +IISFSQQG  AIC+L ANGV+SSVTLRQPDSSGGTLTYEG
Subjt:  DFS--SEKRGKVRPA----SAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEG

Query:  RFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP
        RFEILSLSG+FMPSD+ GTRSR+GGMSVSLASPDGRVVGGGVAGLLVAA P+QVVVG+FL G NQQEQ PK    + +S  SP      +  D +     
Subjt:  RFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSG-NQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSP

Query:  SGSFRGNNWS-ILPNDSRNKPT-DINVSL
        + S   + W+   P+DSR+K + D N++L
Subjt:  SGSFRGNNWS-ILPNDSRNKPT-DINVSL

AT4G25320.1 AT hook motif DNA-binding family protein | e-value 1.6e-60 | identity 57.44%
Query:  PPQTDGSTPPAAAATTQ----PTSAPPPTAAFSVPG-KKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPAS----------AVSKS
        PP T  +    AAA T+    P S   PT   S    KKKRGRPRKY PDG++   LSP PISSSV P   +F   KRG+ R  S             +S
Subjt:  PPQTDGSTPPAAAATTQ----PTSAPPPTAAFSVPG-KKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRPAS----------AVSKS

Query:  KFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGG
          + +  G      VGANFTPH++ VN GED+TMKI++FSQQG +AICILSANG IS+VTLRQ  +SGGTLTYEGRFEILSL+GSFM +D+GGTRSR+GG
Subjt:  KFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGG

Query:  MSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQ
        MSV LA PDGRV GGG+AGL +AAGPVQV+VG+F++G +Q Q
Subjt:  MSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQ

AT5G62260.1 AT hook motif DNA-binding family protein | e-value 2.8e-60 | identity 51.48%
Query:  MEGRDGGSSSG-VTVVGSEA---PSEYQIAPRTADKPPQTDGST----PPAAAATTQPTSAPPPTAAFSV---PGKKKRGRPRKYGPDGSVN-----TAL
        ME +   S SG VTV G EA    +E+Q  P        T   T    PPA ++   PT+  P +A  S    P KKKRGRPRKY PDGS+N       L
Subjt:  MEGRDGGSSSG-VTVVGSEA---PSEYQIAPRTADKPPQTDGST----PPAAAATTQPTSAPPPTAAFSV---PGKKKRGRPRKYGPDGSVN-----TAL

Query:  SPKPISSSVPPPVIDFSSEKRGKV----RPASAVSKS-KFEVDNLGDWVP-----CSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVIS
        SP PISSS+  P+      KRGK     +P   V KS KFE  +     P     C VGANFT H  TVN GED+TMK++ +SQQG +AICILSA G IS
Subjt:  SPKPISSSVPPPVIDFSSEKRGKV----RPASAVSKS-KFEVDNLGDWVP-----CSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVIS

Query:  SVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASP
        +VTL QP ++GGTLTYEGRFEILSLSGSFMP++NGGT+ R+GGMS+SLA P+G + GGG+AG+L+AAGPVQVV+GSF+  +Q EQ  KK  +   + A P
Subjt:  SVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASP

Query:  TAAIP
            P
Subjt:  TAAIP


Sequences
CDS sequence
ATGGAGGGAAGAGATGGTGGTTCAAGTAGTGGGGTTACTGTTGTTGGATCAGAAGCTCCATCGGAGTACCAAATAGCTCCGAGAACCGCCGATAAACCGCCGCAGACCGA
CGGATCAACGCCGCCGGCAGCGGCGGCGACGACTCAACCAACTTCGGCTCCGCCGCCCACGGCGGCCTTCTCCGTTCCGGGAAAGAAGAAAAGAGGGCGACCTAGAAAAT
ATGGACCTGACGGCTCTGTCAATACGGCTCTGTCTCCTAAGCCGATATCGTCGTCGGTGCCGCCGCCGGTGATCGATTTCTCGTCGGAAAAGAGGGGGAAAGTGCGGCCG
GCGAGTGCTGTGAGCAAAAGCAAGTTCGAAGTTGATAATCTAGGTGACTGGGTTCCGTGTTCTGTTGGTGCAAATTTTACACCTCATATCATCACTGTCAATACGGGCGA
GGATATCACAATGAAGATAATATCCTTTTCTCAACAAGGGCCTCAAGCTATATGCATTTTATCTGCGAACGGTGTGATCTCAAGTGTCACGCTCCGTCAGCCGGACTCCT
CTGGAGGAACACTAACATATGAGGGTCGATTCGAAATATTGTCCCTGTCCGGCTCATTCATGCCTAGTGATAATGGAGGAACAAGAAGTAGATCTGGTGGAATGAGTGTT
TCCTTGGCAAGTCCAGACGGGCGTGTCGTAGGCGGTGGGGTTGCTGGTCTATTAGTAGCTGCAGGGCCTGTTCAGGTTGTAGTAGGGAGTTTTCTATCTGGAAATCAACA
GGAGCAGAAACCTAAAAAACCAAAACAAGATGCCATTTCAACAGCCTCACCAACCGCTGCCATTCCGATATCTTGTGTTGATCCTAAGGCCAACCTCTCGCCTTCCGGTT
CCTTTCGTGGCAATAACTGGTCGATTCTGCCGAATGATTCAAGGAATAAGCCAACTGATATCAATGTATCTCTACCCACTGCATGA
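
One way to sanity-check the CDS against the protein sequence listed in this record is to translate it with the standard genetic code. The sketch below is a minimal illustration that assumes the standard nuclear code applies; `translate` is a hypothetical helper, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
import itertools

# Standard genetic code with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAGGGAAGAGATGGTGGTTCAAGTAGT"))  # first 10 codons of the CDS
print(translate("TCTCTACCCACTGCATGA"))              # last 6 codons, including the stop
```

Passing the full CDS as one string (line breaks are stripped by the function) should allow a direct comparison with the protein sequence.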
mRNA sequence
ATTATCTGCAGAAACAAGGTTTCATTATCTGCAGAAACAAGGTTTCATTATCTGAGCATTTTCCCTTCAAATCAAAAGCTGAGTCTTATTTATTTATTATTTTTTTTTTT
CTTCTTATCTTTCATAATATTTTTCAATTTCTGAAAAAAAAAAGTTTTTTTCTTCTTGAAAATAGATCAGCATGCTGGCCTTTTCCTTTTAGCAAAAATATTAAAGTTTT
GGAGGTAATATAAAGATTACAATCTCACTGTGATTCCTTTTCTCATAATTTTTCATGCTATTTTATTTTTCTTCCCACTCTTGAAAAATAAAATCCCCCCACAAAAAAAA
AAAAAAGTAAAAAAAGAACTTCCAATTTTTCAGTTGTAGTTCCATTTCATTTTTCTGGGTTTTGATTTTGAAGAAAATTTCTTCTTCTTTTCTCATGAGAAATTGAAAAT
TAGTGAATGAAAAATGAGTTTTAATAAATAGCTGAAGAAAATATTTGTTGGGTTTTTTTTTTTTTTGCCATTTTTGAGTAAATGGAGGGAAGAGATGGTGGTTCAAGTAG
TGGGGTTACTGTTGTTGGATCAGAAGCTCCATCGGAGTACCAAATAGCTCCGAGAACCGCCGATAAACCGCCGCAGACCGACGGATCAACGCCGCCGGCAGCGGCGGCGA
CGACTCAACCAACTTCGGCTCCGCCGCCCACGGCGGCCTTCTCCGTTCCGGGAAAGAAGAAAAGAGGGCGACCTAGAAAATATGGACCTGACGGCTCTGTCAATACGGCT
CTGTCTCCTAAGCCGATATCGTCGTCGGTGCCGCCGCCGGTGATCGATTTCTCGTCGGAAAAGAGGGGGAAAGTGCGGCCGGCGAGTGCTGTGAGCAAAAGCAAGTTCGA
AGTTGATAATCTAGGTGACTGGGTTCCGTGTTCTGTTGGTGCAAATTTTACACCTCATATCATCACTGTCAATACGGGCGAGGATATCACAATGAAGATAATATCCTTTT
CTCAACAAGGGCCTCAAGCTATATGCATTTTATCTGCGAACGGTGTGATCTCAAGTGTCACGCTCCGTCAGCCGGACTCCTCTGGAGGAACACTAACATATGAGGGTCGA
TTCGAAATATTGTCCCTGTCCGGCTCATTCATGCCTAGTGATAATGGAGGAACAAGAAGTAGATCTGGTGGAATGAGTGTTTCCTTGGCAAGTCCAGACGGGCGTGTCGT
AGGCGGTGGGGTTGCTGGTCTATTAGTAGCTGCAGGGCCTGTTCAGGTTGTAGTAGGGAGTTTTCTATCTGGAAATCAACAGGAGCAGAAACCTAAAAAACCAAAACAAG
ATGCCATTTCAACAGCCTCACCAACCGCTGCCATTCCGATATCTTGTGTTGATCCTAAGGCCAACCTCTCGCCTTCCGGTTCCTTTCGTGGCAATAACTGGTCGATTCTG
CCGAATGATTCAAGGAATAAGCCAACTGATATCAATGTATCTCTACCCACTGCATGATCTGACCTTCATAGCTTATTGTAAAAGTCGCACCCTCTCCAATTTGCCACCAA
ATCTTGTTTCATTGTCAATGTAACTTACCATTGTATTTCCATTGTAGAATTCCAAACTCCCATTTCTCCCCTGTTGATCCCTCTCTGTTAGCCTTTCACTTGGTAGAGTA
GCTTTAGTCTCCGGGACGCAACCGCGTCGAGTAGAATAGTTTAATGTAATGTGAAGACTTTCGCTTTGCTTTGTTTCAATGGTTTGATTTTATGATATACTTTCTTTTCG
TCCA
Protein sequence
MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGSTPPAAAATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSVNTALSPKPISSSVPPPVIDFSSEKRGKVRP
ASAVSKSKFEVDNLGDWVPCSVGANFTPHIITVNTGEDITMKIISFSQQGPQAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDNGGTRSRSGGMSV
SLASPDGRVVGGGVAGLLVAAGPVQVVVGSFLSGNQQEQKPKKPKQDAISTASPTAAIPISCVDPKANLSPSGSFRGNNWSILPNDSRNKPTDINVSLPTA
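
The description names an AT-hook motif, so it can be useful to locate it in the protein. The sketch below assumes the commonly cited Arg-Gly-Arg-Pro (RGRP) core as the search pattern. That pattern is a simplification chosen for illustration, not the InterPro signature used for this record, and `find_motif` is a hypothetical helper. Only the first 80 residues of the protein are included.

```python
import re

# First 80 residues of the Sed0023143 protein sequence.
PROTEIN_PREFIX = (
    "MEGRDGGSSSGVTVVGSEAPSEYQIAPRTADKPPQTDGST"
    "PPAAAATTQPTSAPPPTAAFSVPGKKKRGRPRKYGPDGSV"
)

def find_motif(protein, pattern="RGRP"):
    """Return (1-based start position, matched text) for each motif occurrence."""
    return [(m.start() + 1, m.group()) for m in re.finditer(pattern, protein)]

for position, text in find_motif(PROTEIN_PREFIX):
    print(position, text)
```

The match falls inside the `GKKKRGRPRKYGPDG` stretch, which is also where the query aligns most consistently with the Arabidopsis AHL proteins in the homology section.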