CuGenDBv2

Sed0023859 (gene) of Chayote v1 genome

Gene ID: Sed0023859
Organism: Sechium edule (Chayote v1)
Description: AT-hook motif nuclear-localized protein
Genome location: LG13:17102459..17105183
RNA-Seq Expression: Sed0023859
Synteny: Sed0023859
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
IPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein


Homology
GenBank top hits | e value | % identity
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma] | 1.2e-140 | 78.81
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        M+G+ETGVMT GEPF+IG QKSPVQSQQ  L G+HLPFGADGVYKP    A + SP YQS  GVGV+GN        EAF++MNTQSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV
        GPDGSMAV  A    +AAATQS GGFSPP  G+    GS   T+K+ARGRPPGSGKKQQLDALGSAGVGFTPHVITV+ GEDV+SKIMS SQNGPRAVC+
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV

Query:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL
        L+ANG+ISNVTLRQPAMSGGT+TY GRFEILSLSGLYLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGH ELKQAN +
Subjt:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL

Query:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW
        EQ  VTAPHKLAPIRAGM GASSP SRG  SESSGG GSPFNQS GACNN  SW
Subjt:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW

XP_022940600.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita moschata] | 1.6e-140 | 78.81
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        M+G+ETGVMT GEPF+IG QKSPVQSQQ  L G+HLPFGADGVYKP    A++ SP YQS   VGV+GN        EAF++MNTQSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV
        GPDGSMAV  A    +AAATQS GGFSPP  G+    GS   T+K+ARGRPPGSGKKQQLDALGSAGVGFTPHVITV+ GEDV+SKIMS SQNGPRAVC+
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV

Query:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL
        L+ANG+ISNVTLRQPAMSGGT+TY GRFEILSLSGLYLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGH ELKQAN +
Subjt:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL

Query:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW
        EQ  VTAPHKLAPIRAGM GASSPQSRG  SESSGG GSPFNQS GACNN  SW
Subjt:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima] | 1.2e-140 | 79.1
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        M+G+ETGVMT GEPF+IG QKSPVQSQQ  L G+HLPFGADGVYKP    A++ SP YQS  GVGV+GN        EAF+ MNTQSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV
        GPDGSMAVA A    +AAATQS GGFSPP  G+    GS   T+K+ARGRPPGSGKKQQLDALGSAGVGFTPHVITV+ GEDV+SKIMS SQNGPRAVC+
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV

Query:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL
        L+ANG+ISNVTLRQPAMSGGT+TY GRFEILSLSGLYLL ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG  ELKQAN +
Subjt:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL

Query:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW
        EQ  VTAPHKLAPIRAGMTGASSPQSRG  SESSGG GSPFNQS GACNN  SW
Subjt:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW

XP_023524512.1 AT-hook motif nuclear-localized protein 10-like [Cucurbita pepo subsp. pepo] | 6.1e-140 | 78.53
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        M+G+ETGVMT GEPF+IG QKSPVQSQQ  L G+HLPFGAD VYKP    A++ SP YQS  GVGV+GN        EAF++MNTQS  VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV
        GPDGSMAV  A    +AAATQS GGFSPP+ G+    GS   T+K+ARGRPPGSGKKQQLDALGSAGVGFTPHVITV+ GEDV+SKIMS SQNGPRAVC+
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV

Query:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL
        L+ANG+ISNVTLRQPAMSGGT+TY GRFEILSLSGLYLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG  ELKQAN +
Subjt:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL

Query:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW
        EQ  VTAPHKLAPIRAGMTGASSPQSRG  SESSGG GSPFNQS GACNN  SW
Subjt:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida] | 1.8e-139 | 79.89
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        MSG+ETGVM+ GEPF+IGLQK+ VQSQQ  +QGMHLPFGADGVYKPV    AAASP YQSS GVGV GN        EAF+NMN+QSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFS------PPSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRA
        GPDGSMA+APAV S  AAATQ SGGFS      PPSGG  S P+ +K+ARGRPPGS  KKQQLD  GSAGVGFTPHVITV+ GEDV+SKIMSFSQNGPRA
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFS------PPSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRA

Query:  VCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQA
        VC+LTANG+ISNVTLRQPAMSGGT+TY GRFEILSLSG YLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+TDGGH EL   
Subjt:  VCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQA

Query:  NHLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW
        N +EQ  VTAPHKLAPIRAGMTGASSP SRGT SESSGGPGSPFNQS GACNNN I W
Subjt:  NHLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW

TrEMBL top hits | e value | % identity
A0A0A0L0T7 AT-hook motif nuclear-localized protein | 3.6e-138 | 77.87
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        MSG+ETGV++ GE F+IGLQK+ V SQQP +Q MHLPFGADGVYKPVAT    ASP YQSS  VGV GN        +AF+NMN+QSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSP-----PSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAV
        GPDGSMAVAPAV    AAATQSSGGFSP     P  G  + P+++K+ RGRPPGS  KK  LD   SAGVGFTPHVITV+ GEDV+SKIMSFSQNGPRAV
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSP-----PSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAV

Query:  CVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQAN
        C+LTANG+ISNVTLRQPAMSGGT+TY GRFEILSLSG YLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGH EL+Q N
Subjt:  CVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQAN

Query:  HLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW
         +EQP V+APHKLAPIRAGMTGASSP SRGT SESSGGPGSPFNQSAGACNNN I W
Subjt:  HLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW

A0A1S3BWC6 AT-hook motif nuclear-localized protein | 3.0e-137 | 77.59
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        MSG+ETGV++ GE F+IGLQK+ VQSQQP +Q MHLPFGADGVYKPV     AASP YQSS  VGV GN        EAF+NMN+QSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSP-----PSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAV
        GPDGSM+VAPAV    AAATQSSGGFSP     P  G  + P+++K+ RGRPPGS  KK QLD+  S GVGFTPHVITV+ GEDV+SKIMSFSQNGPRAV
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSP-----PSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAV

Query:  CVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQAN
        C+LTANG+ISNVTLRQPAMSGGT+TY GRFEILSLSG YLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GH EL+Q N
Subjt:  CVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQAN

Query:  HLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW
         +EQP V+APHKLAPIRAGMTGASSP SRGT SESSGGPGSPFNQSAGACNNN I W
Subjt:  HLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW

A0A5A7VAQ2 AT-hook motif nuclear-localized protein | 3.0e-137 | 77.59
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        MSG+ETGV++ GE F+IGLQK+ VQSQQP +Q MHLPFGADGVYKPV     AASP YQSS  VGV GN        EAF+NMN+QSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSP-----PSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAV
        GPDGSM+VAPAV    AAATQSSGGFSP     P  G  + P+++K+ RGRPPGS  KK QLD+  S GVGFTPHVITV+ GEDV+SKIMSFSQNGPRAV
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSP-----PSGGLGSPPSTMKRARGRPPGSG-KKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAV

Query:  CVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQAN
        C+LTANG+ISNVTLRQPAMSGGT+TY GRFEILSLSG YLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GH EL+Q N
Subjt:  CVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQAN

Query:  HLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW
         +EQP V+APHKLAPIRAGMTGASSP SRGT SESSGGPGSPFNQSAGACNNN I W
Subjt:  HLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNN-ISW

A0A6J1FR30 AT-hook motif nuclear-localized protein | 7.8e-141 | 78.81
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        M+G+ETGVMT GEPF+IG QKSPVQSQQ  L G+HLPFGADGVYKP    A++ SP YQS   VGV+GN        EAF++MNTQSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV
        GPDGSMAV  A    +AAATQS GGFSPP  G+    GS   T+K+ARGRPPGSGKKQQLDALGSAGVGFTPHVITV+ GEDV+SKIMS SQNGPRAVC+
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV

Query:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL
        L+ANG+ISNVTLRQPAMSGGT+TY GRFEILSLSGLYLL ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGH ELKQAN +
Subjt:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL

Query:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW
        EQ  VTAPHKLAPIRAGM GASSPQSRG  SESSGG GSPFNQS GACNN  SW
Subjt:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW

A0A6J1IXR2 AT-hook motif nuclear-localized protein | 5.9e-141 | 79.1
Query:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY
        M+G+ETGVMT GEPF+IG QKSPVQSQQ  L G+HLPFGADGVYKP    A++ SP YQS  GVGV+GN        EAF+ MNTQSE VKRKRGRPRKY
Subjt:  MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGN--------EAFINMNTQSEAVKRKRGRPRKY

Query:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV
        GPDGSMAVA A    +AAATQS GGFSPP  G+    GS   T+K+ARGRPPGSGKKQQLDALGSAGVGFTPHVITV+ GEDV+SKIMS SQNGPRAVC+
Subjt:  GPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGL----GSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCV

Query:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL
        L+ANG+ISNVTLRQPAMSGGT+TY GRFEILSLSGLYLL ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG  ELKQAN +
Subjt:  LTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHL

Query:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW
        EQ  VTAPHKLAPIRAGMTGASSPQSRG  SESSGG GSPFNQS GACNN  SW
Subjt:  EQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFNQSAGACNNNISW

SwissProt top hits | e value | % identity
O22812 AT-hook motif nuclear-localized protein 10 | 3.3e-72 | 52.09
Query:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR
        MSG+ETG+M        F++ L  Q+   Q+Q    Q   L FG D    +YK    + +       +S G      + + G E+     T SE VK++R
Subjt:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR

Query:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR
        GRPRKYGPD G M++   ++  A + T S     P SGG G      ++ RGRPPGS  K+ +L ALGS G+GFTPHV+TV  GEDV+SKIM+ + NGPR
Subjt:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR

Query:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ
        AVCVL+ANG+ISNVTLRQ A SGGT+TY GRFEILSLSG + L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG   E + 
Subjt:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ

Query:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN
          H+ Q  +++P   ++AP +  MT  SSPQSRGT SESS  GG GSP +QS G   NN
Subjt:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN

O80834 AT-hook motif nuclear-localized protein 9 | 6.7e-49 | 49.78
Query:  SSGGVGVNGNEAFINM-----NTQSEAVKRKRGRPRKYGPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQQLDALG--
        ++GG G   +   +NM           +KRKRGRPRKYG DGS+++  A+SSS+ +                +P ++ KR RGRPPGSGKKQ++ ++G  
Subjt:  SSGGVGVNGNEAFINM-----NTQSEAVKRKRGRPRKYGPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQQLDALG--

Query:  ---SAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDG
           S+G+ FTPHVI V  GED+ SK+++FSQ GPRA+CVL+A+G++S  TL QP+ S G I Y GRFEIL+LS  Y++  +G  R+RTG LSVSL+ PDG
Subjt:  ---SAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDG

Query:  RVLGGGVAGLLTAASPVQVVVGSFV
        RV+GG + G L AASPVQV+VGSF+
Subjt:  RVLGGGVAGLLTAASPVQVVVGSFV

Q8GXB3 AT-hook motif nuclear-localized protein 5 | 1.7e-47 | 46.67
Query:  TQSEAVKRKRGRPRKYGPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQQLDALG-----SAGVGFTPHVITVQTGEDV
        ++   VK+KRGRPRKY PDG +++  +     +  ++ S           S P+  KRARGRPPG+G+KQ+L  LG     SAG+ F PHVI+V +GED+
Subjt:  TQSEAVKRKRGRPRKYGPDGSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQQLDALG-----SAGVGFTPHVITVQTGEDV

Query:  NSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
         SK++SFSQ  PRA+C+++  G++S+VTLR+PA +  ++T+ GRFEILSL G YL+ E GG +SRTGGLSVSLSGP+G V+GGG+ G+L AAS VQVV  
Subjt:  NSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFV-------TDGGHMELKQ-ANHLEQPT----VTAPHKLAPIRAGMTGASSPQS
        SFV        +  +  +KQ     ++PT     T P   AP  A  TG  +PQ+
Subjt:  SFV-------TDGGHMELKQ-ANHLEQPT----VTAPHKLAPIRAGMTGASSPQS

Q8VYJ2 AT-hook motif nuclear-localized protein 1 | 4.5e-53 | 53.25
Query:  VKRKRGRPRKYGPDGS-MAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKK----QQLDALG-----SAGVGFTPHVITVQTGEDV
        +K+KRGRPRKYGPDG+ +A++P   SSA A +       PPS  +    ++ KR++ +P  S  +     Q++ LG     S G  FTPH+ITV TGEDV
Subjt:  VKRKRGRPRKYGPDGS-MAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKK----QQLDALG-----SAGVGFTPHVITVQTGEDV

Query:  NSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
          KI+SFSQ GPR++CVL+ANG IS+VTLRQP  SGGT+TY GRFEILSLSG ++  ++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVG
Subjt:  NSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFVTDGGHMELKQANHLEQPTVTAPHKLAPI
        SF+    H + K   +     +++P    PI
Subjt:  SFVTDGGHMELKQANHLEQPTVTAPHKLAPI

Q9FIR1 AT-hook motif nuclear-localized protein 8 | 3.3e-48 | 48.26
Query:  INMNTQSEAVKRKRGRPRKYGPDGSMAVAPAVSS---SAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQQLDAL-GSAGVGFTPHVITVQTG
        I+   Q   VK+KRGRPRKY PDGS+A+  A +S   SAA+ +   GG     G   S    +KR RGRPPGS KK QLDAL G++GVGFTPHVI V TG
Subjt:  INMNTQSEAVKRKRGRPRKYGPDGSMAVAPAVSS---SAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQQLDAL-GSAGVGFTPHVITVQTG

Query:  EDVNSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQV
        ED+ SK+M+FS  G R +C+L+A+G++S V LRQ + S G +TY GRFEI++LSG  L +E  G  +R+G LSV+L+GPDG ++GG V G L AA+ VQV
Subjt:  EDVNSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQV

Query:  VVGSFVTDGGHMELKQANHLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGS
        +VGSFV         +A   +Q +V       P       AS+P +   F   S GP S
Subjt:  VVGSFVTDGGHMELKQANHLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGS

Arabidopsis top hits | e value | % identity
AT2G33620.1 AT hook motif DNA-binding family protein | 2.4e-73 | 52.09
Query:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR
        MSG+ETG+M        F++ L  Q+   Q+Q    Q   L FG D    +YK    + +       +S G      + + G E+     T SE VK++R
Subjt:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR

Query:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR
        GRPRKYGPD G M++   ++  A + T S     P SGG G      ++ RGRPPGS  K+ +L ALGS G+GFTPHV+TV  GEDV+SKIM+ + NGPR
Subjt:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR

Query:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ
        AVCVL+ANG+ISNVTLRQ A SGGT+TY GRFEILSLSG + L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG   E + 
Subjt:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ

Query:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN
          H+ Q  +++P   ++AP +  MT  SSPQSRGT SESS  GG GSP +QS G   NN
Subjt:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN

AT2G33620.2 AT hook motif DNA-binding family protein | 2.4e-73 | 52.09
Query:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR
        MSG+ETG+M        F++ L  Q+   Q+Q    Q   L FG D    +YK    + +       +S G      + + G E+     T SE VK++R
Subjt:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR

Query:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR
        GRPRKYGPD G M++   ++  A + T S     P SGG G      ++ RGRPPGS  K+ +L ALGS G+GFTPHV+TV  GEDV+SKIM+ + NGPR
Subjt:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR

Query:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ
        AVCVL+ANG+ISNVTLRQ A SGGT+TY GRFEILSLSG + L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG   E + 
Subjt:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ

Query:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN
          H+ Q  +++P   ++AP +  MT  SSPQSRGT SESS  GG GSP +QS G   NN
Subjt:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN

AT2G33620.3 AT hook motif DNA-binding family protein | 2.4e-73 | 52.09
Query:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR
        MSG+ETG+M        F++ L  Q+   Q+Q    Q   L FG D    +YK    + +       +S G      + + G E+     T SE VK++R
Subjt:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR

Query:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR
        GRPRKYGPD G M++   ++  A + T S     P SGG G      ++ RGRPPGS  K+ +L ALGS G+GFTPHV+TV  GEDV+SKIM+ + NGPR
Subjt:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR

Query:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ
        AVCVL+ANG+ISNVTLRQ A SGGT+TY GRFEILSLSG + L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG   E + 
Subjt:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ

Query:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN
          H+ Q  +++P   ++AP +  MT  SSPQSRGT SESS  GG GSP +QS G   NN
Subjt:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN

AT2G33620.4 AT hook motif DNA-binding family protein | 2.4e-73 | 52.09
Query:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR
        MSG+ETG+M        F++ L  Q+   Q+Q    Q   L FG D    +YK    + +       +S G      + + G E+     T SE VK++R
Subjt:  MSGAETGVMTG---GEPFSIGL--QKSPVQSQQPGLQGMHLPFGAD---GVYKPVATTAAAASPIYQSSGG------VGVNGNEAFINMNTQSEAVKRKR

Query:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR
        GRPRKYGPD G M++   ++  A + T S     P SGG G      ++ RGRPPGS  K+ +L ALGS G+GFTPHV+TV  GEDV+SKIM+ + NGPR
Subjt:  GRPRKYGPD-GSMAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQ-QLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPR

Query:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ
        AVCVL+ANG+ISNVTLRQ A SGGT+TY GRFEILSLSG + L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG   E + 
Subjt:  AVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQ

Query:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN
          H+ Q  +++P   ++AP +  MT  SSPQSRGT SESS  GG GSP +QS G   NN
Subjt:  ANHLEQPTVTAP--HKLAPIRAGMTGASSPQSRGTFSESS--GGPGSPFNQSAGACNNN

AT4G12080.1 AT-hook motif nuclear-localized protein 1 | 3.2e-54 | 53.25
Query:  VKRKRGRPRKYGPDGS-MAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKK----QQLDALG-----SAGVGFTPHVITVQTGEDV
        +K+KRGRPRKYGPDG+ +A++P   SSA A +       PPS  +    ++ KR++ +P  S  +     Q++ LG     S G  FTPH+ITV TGEDV
Subjt:  VKRKRGRPRKYGPDGS-MAVAPAVSSSAAAATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKK----QQLDALG-----SAGVGFTPHVITVQTGEDV

Query:  NSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
          KI+SFSQ GPR++CVL+ANG IS+VTLRQP  SGGT+TY GRFEILSLSG ++  ++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVG
Subjt:  NSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILSLSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFVTDGGHMELKQANHLEQPTVTAPHKLAPI
        SF+    H + K   +     +++P    PI
Subjt:  SFVTDGGHMELKQANHLEQPTVTAPHKLAPI
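
The % identity values listed with each hit can be related to the alignment rows. The following is a minimal Python sketch, assuming the values follow the usual BLAST convention (identical positions divided by alignment length, gap columns included) and that letters on the middle row mark identical residues while `+` and spaces mark non-identical columns. The function name `percent_identity` and the toy rows are illustrative and are not part of CuGenDB.

```python
def percent_identity(query_rows, midline_rows):
    """Return percent identity for one alignment.

    query_rows   -- the aligned query strings, one per block, gaps as '-'
    midline_rows -- the matching middle rows (letters = identical residues)
    Both lists are assumed to have the "Query:  " prefix / indentation removed.
    """
    # Alignment length: every query column counts, including gap columns.
    columns = sum(len(row) for row in query_rows)
    if columns == 0:
        raise ValueError("empty alignment")
    # Identical positions: only letters on the middle row ('+' and ' ' are not).
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / columns


# Toy example (not taken from the hits above): 8 identical columns out of 12.
toy_query = ["MSGAET", "GV--MT"]
toy_midline = ["M+G ET", "GV  MT"]
print(round(percent_identity(toy_query, toy_midline), 2))  # 66.67
```

To apply this to a hit, collect its `Query:` rows and the unlabeled rows directly beneath them, strip the leading label or indentation, and pass the two lists in block order.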


Sequences
CDS sequence
ATGTCAGGAGCTGAGACCGGAGTGATGACCGGCGGCGAACCCTTCAGCATCGGTCTCCAGAAGAGTCCAGTGCAGTCGCAGCAGCCGGGCTTGCAGGGTATGCATTTACC
CTTTGGCGCGGACGGGGTCTACAAGCCCGTCGCCACCACCGCTGCTGCTGCTTCACCCATTTACCAATCCTCCGGCGGCGTTGGAGTGAACGGTAACGAAGCTTTCATTA
ACATGAATACGCAGAGTGAGGCAGTAAAGAGGAAGAGAGGGAGGCCTAGAAAGTATGGACCAGATGGCAGTATGGCTGTGGCTCCTGCAGTTTCCTCCTCGGCCGCCGCT
GCAACTCAGTCCAGTGGAGGCTTTTCTCCTCCATCCGGAGGATTAGGCTCTCCTCCATCTACTATGAAGAGAGCTAGAGGCAGACCCCCTGGCTCTGGCAAAAAGCAGCA
GCTTGATGCTTTGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTGCAAACTGGAGAGGATGTAAATTCAAAAATAATGTCATTTTCACAGAATGGTCCTA
GAGCTGTTTGCGTCCTTACTGCAAATGGATCGATATCCAATGTCACTCTACGTCAACCAGCGATGTCTGGTGGAACCATAACTTACAACGGTCGATTTGAGATTTTGTCA
CTATCTGGGTTATATCTCCTCTTTGAGAATGGCGGTCAGCGGAGCCGAACTGGTGGCTTAAGTGTTTCATTGTCCGGACCAGATGGTAGAGTATTAGGTGGTGGGGTGGC
TGGTCTTCTAACGGCAGCCTCTCCTGTTCAGGTGGTGGTGGGAAGCTTCGTCACTGATGGGGGACACATGGAATTGAAACAAGCAAACCATTTGGAACAGCCGACTGTTA
CTGCTCCACATAAACTTGCTCCTATCCGTGCTGGAATGACGGGGGCGAGTAGCCCGCAATCACGTGGGACATTCAGTGAATCTTCAGGAGGCCCAGGGAGTCCATTTAAT
CAGAGTGCTGGAGCCTGCAATAACAACATATCTTGGGACTAA
mRNA sequence
ATGTCAGGAGCTGAGACCGGAGTGATGACCGGCGGCGAACCCTTCAGCATCGGTCTCCAGAAGAGTCCAGTGCAGTCGCAGCAGCCGGGCTTGCAGGGTATGCATTTACC
CTTTGGCGCGGACGGGGTCTACAAGCCCGTCGCCACCACCGCTGCTGCTGCTTCACCCATTTACCAATCCTCCGGCGGCGTTGGAGTGAACGGTAACGAAGCTTTCATTA
ACATGAATACGCAGAGTGAGGCAGTAAAGAGGAAGAGAGGGAGGCCTAGAAAGTATGGACCAGATGGCAGTATGGCTGTGGCTCCTGCAGTTTCCTCCTCGGCCGCCGCT
GCAACTCAGTCCAGTGGAGGCTTTTCTCCTCCATCCGGAGGATTAGGCTCTCCTCCATCTACTATGAAGAGAGCTAGAGGCAGACCCCCTGGCTCTGGCAAAAAGCAGCA
GCTTGATGCTTTGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTGCAAACTGGAGAGGATGTAAATTCAAAAATAATGTCATTTTCACAGAATGGTCCTA
GAGCTGTTTGCGTCCTTACTGCAAATGGATCGATATCCAATGTCACTCTACGTCAACCAGCGATGTCTGGTGGAACCATAACTTACAACGGTCGATTTGAGATTTTGTCA
CTATCTGGGTTATATCTCCTCTTTGAGAATGGCGGTCAGCGGAGCCGAACTGGTGGCTTAAGTGTTTCATTGTCCGGACCAGATGGTAGAGTATTAGGTGGTGGGGTGGC
TGGTCTTCTAACGGCAGCCTCTCCTGTTCAGGTGGTGGTGGGAAGCTTCGTCACTGATGGGGGACACATGGAATTGAAACAAGCAAACCATTTGGAACAGCCGACTGTTA
CTGCTCCACATAAACTTGCTCCTATCCGTGCTGGAATGACGGGGGCGAGTAGCCCGCAATCACGTGGGACATTCAGTGAATCTTCAGGAGGCCCAGGGAGTCCATTTAAT
CAGAGTGCTGGAGCCTGCAATAACAACATATCTTGGGACTAA
Protein sequence
MSGAETGVMTGGEPFSIGLQKSPVQSQQPGLQGMHLPFGADGVYKPVATTAAAASPIYQSSGGVGVNGNEAFINMNTQSEAVKRKRGRPRKYGPDGSMAVAPAVSSSAAA
ATQSSGGFSPPSGGLGSPPSTMKRARGRPPGSGKKQQLDALGSAGVGFTPHVITVQTGEDVNSKIMSFSQNGPRAVCVLTANGSISNVTLRQPAMSGGTITYNGRFEILS
LSGLYLLFENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHMELKQANHLEQPTVTAPHKLAPIRAGMTGASSPQSRGTFSESSGGPGSPFN
QSAGACNNNISWD
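
The CDS and protein sequences can be cross-checked by translation. The following is a minimal Python sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base. The helper name `translate` is illustrative. The two fragments are copied from the start and from the final line of the CDS sequence above.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGTCAGGAGCTGAGACCGGAGTGATGACC"
cds_end = "CAGAGTGCTGGAGCCTGCAATAACAACATATCTTGGGACTAA"
print(translate(cds_start))  # MSGAETGVMT
print(translate(cds_end))    # QSAGACNNNISWD
```

Both outputs match the first ten and the last thirteen residues of the protein sequence listed above. Passing the full CDS (with its line breaks) to `translate` gives the complete comparison.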