CuGenDBv2

Lag0033044 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033044
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: AT-hook motif nuclear-localized protein
Genome location: chr11:40332149..40335844
RNA-Seq Expression: Lag0033044
Synteny: Lag0033044
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
                     GO:0003680 - AT DNA binding (molecular function)
InterPro domains: IPR005175 - PPC domain
                  IPR017956 - AT hook, DNA-binding motif
                  IPR039605 - AT-hook motif nuclear-localized protein


Homology
GenBank top hits (e value, % identity, alignment)
KAG7036189.1 AT-hook motif nuclear-localized protein 13 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.5e-157; % identity: 76.94)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QMM PSA FPFNSVIAPASVPLDS+NVSPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYE--------------------------------------------------GQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDG
        LRH  TSGGSVTYE                                                  GQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDG
Subjt:  LRHPATSGGSVTYE--------------------------------------------------GQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDG

Query:  QVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMINFGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQ
        QVLGGGVAGMLMA SQVQV+VGSFLE DKKS NT MLNSGSSA+PSQMINFGG AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQ
Subjt:  QVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMINFGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQ

Query:  MYHQLWASQTQQ
        MYHQLWA QTQQ
Subjt:  MYHQLWASQTQQ

XP_022159894.1 AT-hook motif nuclear-localized protein 13-like [Momordica charantia] (e value: 1.0e-161; % identity: 87.53)
Query:  MDSLET-PPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKY
        MDSLET PPPLSA SNMAVGG TAYS AMSN+NNNASS IGLN  TTQMM P+  FPFNSVIAPASVPLDSLNV+PYDGSHSG+FN+DSGKKKRGRPRKY
Subjt:  MDSLET-PPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKY

Query:  APDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNA
         PD GNIAL LAPTTVASSV HGDL+ TPDS+QPAKKARGRPPGSGKKQMNA GSGGIGFTPHVVL KPGEDVAAKI+SF+QQGPR VFILSANGT+S+A
Subjt:  APDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNA

Query:  TLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMIN
        TLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMAGSQVQ+IVGSFLE+DKKSN+SMLNS SSA P QMIN
Subjt:  TLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMIN

Query:  FGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
        FG  AA AASPPSLGASSGESSAENGDSPL NRHPGMFNNTSQ I N+QMYH LWA QTQQ
Subjt:  FGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

XP_022931258.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita moschata] (e value: 1.8e-163; % identity: 87.29)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QM+ PSA FPFNSVIAPASVPLDS+NVSPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN
        LRH  TSGGSVTYEGQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMA SQVQV+VGSFLE DKKS NT MLNSGSSA+PSQMIN
Subjt:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN

Query:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
        FGG AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQMYHQLWA QTQQ
Subjt:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

XP_022995511.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita maxima] (e value: 2.2e-164; % identity: 87.57)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QM+PPSA FPFNSVIAPASVPLDS+NVSPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN
        LRH  TSGGSVTYEGQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMA SQVQV+VGSFLE DKKS NT MLNSGSSA+PSQMIN
Subjt:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN

Query:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
        FGG AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQMYHQLWA QTQQ
Subjt:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

XP_023532984.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.8e-164; % identity: 87.29)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QM+PPSA FPFNSVIAPASVPLDS+N+SPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN
        LRH  TSGGSVTYEGQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMA SQVQV+VGSFLE DKKS NT MLNSGSSA+PSQMIN
Subjt:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN

Query:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
        FGG AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQMYHQLWA QTQQ
Subjt:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1E3M0 AT-hook motif nuclear-localized protein (e value: 4.9e-162; % identity: 87.53)
Query:  MDSLET-PPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKY
        MDSLET PPPLSA SNMAVGG TAYS AMSN+NNNASS IGLN  TTQMM P+  FPFNSVIAPASVPLDSLNV+PYDGSHSG+FN+DSGKKKRGRPRKY
Subjt:  MDSLET-PPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKY

Query:  APDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNA
         PD GNIAL LAPTTVASSV HGDL+ TPDS+QPAKKARGRPPGSGKKQMNA GSGGIGFTPHVVL KPGEDVAAKI+SF+QQGPR VFILSANGT+S+A
Subjt:  APDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNA

Query:  TLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMIN
        TLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMAGSQVQ+IVGSFLE+DKKSN+SMLNS SSA P QMIN
Subjt:  TLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMIN

Query:  FGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
        FG  AA AASPPSLGASSGESSAENGDSPL NRHPGMFNNTSQ I N+QMYH LWA QTQQ
Subjt:  FGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

A0A6J1ETT7 AT-hook motif nuclear-localized protein (e value: 1.9e-153; % identity: 82.78)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QM+ PSA FPFNSVIAPASVPLDS+NVSPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMINF
        LRH  TSGGSVTYEGQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMA SQVQV+VGSFLE DKKSN +                
Subjt:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMINF

Query:  GGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
          AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQMYHQLWA QTQQ
Subjt:  GGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

A0A6J1EXY5 AT-hook motif nuclear-localized protein (e value: 8.9e-164; % identity: 87.29)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QM+ PSA FPFNSVIAPASVPLDS+NVSPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN
        LRH  TSGGSVTYEGQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMA SQVQV+VGSFLE DKKS NT MLNSGSSA+PSQMIN
Subjt:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN

Query:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
        FGG AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQMYHQLWA QTQQ
Subjt:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

A0A6J1JZ43 AT-hook motif nuclear-localized protein (e value: 2.2e-154; % identity: 83.06)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QM+PPSA FPFNSVIAPASVPLDS+NVSPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMINF
        LRH  TSGGSVTYEGQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMA SQVQV+VGSFLE DKKSN +                
Subjt:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMINF

Query:  GGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
          AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQMYHQLWA QTQQ
Subjt:  GGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

A0A6J1K244 AT-hook motif nuclear-localized protein (e value: 1.1e-164; % identity: 87.57)
Query:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA
        MDSL+TPP LSAPSNM VG PTAYS  MSN+NNNASS +GLN AT QM+PPSA FPFNSVIAPASVPLDS+NVSPYDGSHSGSFN DSGKKKRGRPRKY 
Subjt:  MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYA

Query:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT
        PD GNIAL LAPTT+ASSV HGDLSGTPD +QPAKKARGRPPGSGKKQMNA+GS G+GFTPHVV AKPGEDVAAKIL+FSQQGPRTVFILSANG+ISNAT
Subjt:  PDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNAT

Query:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN
        LRH  TSGGSVTYEGQYEIISLSGSF+LSENNGTRSRTGGLSVLL+GSDGQVLGGGVAGMLMA SQVQV+VGSFLE DKKS NT MLNSGSSA+PSQMIN
Subjt:  LRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKS-NTSMLNSGSSAAPSQMIN

Query:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ
        FGG AAAAAASPPSLGASSGESSA+NG SPLNNRHPGMF+N+SQ IHNMQMYHQLWA QTQQ
Subjt:  FGG-AAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQSIHNMQMYHQLWASQTQQ

SwissProt top hits (e value, % identity, alignment)
O22812 AT-hook motif nuclear-localized protein 10 (e value: 1.5e-51; % identity: 43.32)
Query:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP
        MA+     +S A    + N   + G +  T     P           P S   +S LN++   G   G     S   KK+RGRPRKY PD G ++L L P
Subjt:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP

Query:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS
           + +V+     G        +K RGRPPGS  K  ++ ALGS GIGFTPHV+    GEDV++KI++ +  GPR V +LSANG ISN TLR  ATSGG+
Subjt:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS

Query:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA
        VTYEG++EI+SLSGSF L ENNG RSRTGGLSV LS  DG VLGG VAG+L+A S VQ++VGSFL   E++ K +   +   S      AP+Q++     
Subjt:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA

Query:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT
            +SP S G  S  S      SP++    G +NNT
Subjt:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT

Q8GXB3 AT-hook motif nuclear-localized protein 5 (e value: 3.7e-42; % identity: 45.68)
Query:  KKKRGRPRKYAPDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNA-LG-----SGGIGFTPHVVLAKPGEDVAAKILSFSQQG
        KKKRGRPRKY PD G ++L L+P    S  +  D S   D + P K+ARGRPPG+G+KQ  A LG     S G+ F PHV+    GED+ +K+LSFSQ+ 
Subjt:  KKKRGRPRKYAPDGGNIALALAPTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNA-LG-----SGGIGFTPHVVLAKPGEDVAAKILSFSQQG

Query:  PRTVFILSANGTISNATLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL--EEDKKS
        PR + I+S  GT+S+ TLR PA++  S+T+EG++EI+SL GS+L++E  G++SRTGGLSV LSG +G V+GGG+ GML+A S VQV+  SF+     K +
Subjt:  PRTVFILSANGTISNATLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL--EEDKKS

Query:  NTSMLNSGSSAAPSQMINFGGAAAAAASPPSLGASSGESSAEN
        N +         P Q            S P   AS+G+ + +N
Subjt:  NTSMLNSGSSAAPSQMINFGGAAAAAASPPSLGASSGESSAEN

Q8VYJ2 AT-hook motif nuclear-localized protein 1 (e value: 8.9e-44; % identity: 40.94)
Query:  GGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAPTTVA
        GG T   +   +  + A  +   NQ+ T + PP    P +   AP  + + ++  +    +  G   +  G  KKKRGRPRKY PDG  +AL+  P + A
Subjt:  GGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAPTTVA

Query:  SSVAH--GDLSGTPDSDQPAKKARGRPPGSGKK-----QMNALG-----SGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHP
         + +H     S   D     K+++ +P  S  +     Q+  LG     S G  FTPH++    GEDV  KI+SFSQQGPR++ +LSANG IS+ TLR P
Subjt:  SSVAH--GDLSGTPDSDQPAKKARGRPPGSGKK-----QMNALG-----SGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHP

Query:  ATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL--------EEDKKSNTSMLNSGSSAAP
         +SGG++TYEG++EI+SLSGSF+ +++ GTRSRTGG+SV L+  DG+V+GGG+AG+L+A S VQV+VGSFL        +  K  +  ML+S ++A P
Subjt:  ATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL--------EEDKKSNTSMLNSGSSAAP

Q940I0 AT-hook motif nuclear-localized protein 13 (e value: 2.0e-59; % identity: 44.58)
Query:  NSNNNASSAI--GLNQATTQMMPPSALFPFNSVIAP--------------ASVPLDSLNVSPYDGS---------HSGSFNVD--SGKKKRGRPRKYAPD
        N N NA++A+  G N +T+Q M      PF   ++P                +   +L    +DGS         HS  F +D    KKKRGRPRKYA D
Subjt:  NSNNNASSAI--GLNQATTQMMPPSALFPFNSVIAP--------------ASVPLDSLNVSPYDGS---------HSGSFNVD--SGKKKRGRPRKYAPD

Query:  GG-------NIALALAPTTVASSVAH-----------GDLSG--TPDSDQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSFSQ
        GG       NIAL LAPT+   S ++           GD +G     SD PAK+ RGRPPGSGKKQ++AL G+GG+GFTPHV+  K GED+A KIL+F+ 
Subjt:  GG-------NIALALAPTTVASSVAH-----------GDLSG--TPDSDQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSFSQ

Query:  QGPRTVFILSANGTISNATLRHPATSG--GSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK
        QGPR + ILSA G ++N  LR    S   G+V YEG++EIISLSGSFL SE+NGT ++TG LSV L+G +G+++GG V GML+AGSQVQVIVGSF+ + +
Subjt:  QGPRTVFILSANGTISNATLRHPATSG--GSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK

Query:  KSNTSMLNSGS----SAAPSQMINFGGAAAAAASPPSLGAS-SGESSAEN-GDSPL-------NNRHPGMF-NNTSQSIHN--MQMYHQLWASQTQQ
        K   S   + +    ++AP+ M++FGG      SP S G   S ESS EN  +SPL       N+ + G+F N+T Q +H   MQMY  LW   + Q
Subjt:  KSNTSMLNSGS----SAAPSQMINFGGAAAAAASPPSLGAS-SGESSAEN-GDSPL-------NNRHPGMF-NNTSQSIHN--MQMYHQLWASQTQQ

Q9FIR1 AT-hook motif nuclear-localized protein 8 (e value: 2.3e-52; % identity: 48.44)
Query:  KKKRGRPRKYAPDGGNIALALAPTTVASSVAH--------GDLSGTPDS-DQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSF
        KKKRGRPRKY PD G+IAL LAPT+   S A         GD  G  +S D P K+ RGRPPGS KKQ++AL G+ G+GFTPHV+    GED+A+K+++F
Subjt:  KKKRGRPRKYAPDGGNIALALAPTTVASSVAH--------GDLSGTPDS-DQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSF

Query:  SQQGPRTVFILSANGTISNATLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK
        S QG RT+ ILSA+G +S   LR  + S G VTYEG++EII+LSGS L  E NG+ +R+G LSV L+G DG ++GG V G L+A +QVQVIVGSF+ E K
Subjt:  SQQGPRTVFILSANGTISNATLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK

Query:  KSNTSMLNSG------SSAAPSQMINFGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQS---IHNMQMYHQLWASQTQ
        K   S +N         ++AP+ M+NFG  +   +S  S    SG S A + D+  NN   G      Q     H MQMY  LW++  Q
Subjt:  KSNTSMLNSG------SSAAPSQMINFGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQS---IHNMQMYHQLWASQTQ

Arabidopsis top hits (e value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein (e value: 1.1e-52; % identity: 43.32)
Query:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP
        MA+     +S A    + N   + G +  T     P           P S   +S LN++   G   G     S   KK+RGRPRKY PD G ++L L P
Subjt:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP

Query:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS
           + +V+     G        +K RGRPPGS  K  ++ ALGS GIGFTPHV+    GEDV++KI++ +  GPR V +LSANG ISN TLR  ATSGG+
Subjt:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS

Query:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA
        VTYEG++EI+SLSGSF L ENNG RSRTGGLSV LS  DG VLGG VAG+L+A S VQ++VGSFL   E++ K +   +   S      AP+Q++     
Subjt:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA

Query:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT
            +SP S G  S  S      SP++    G +NNT
Subjt:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT

AT2G33620.2 AT hook motif DNA-binding family protein (e value: 1.1e-52; % identity: 43.32)
Query:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP
        MA+     +S A    + N   + G +  T     P           P S   +S LN++   G   G     S   KK+RGRPRKY PD G ++L L P
Subjt:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP

Query:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS
           + +V+     G        +K RGRPPGS  K  ++ ALGS GIGFTPHV+    GEDV++KI++ +  GPR V +LSANG ISN TLR  ATSGG+
Subjt:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS

Query:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA
        VTYEG++EI+SLSGSF L ENNG RSRTGGLSV LS  DG VLGG VAG+L+A S VQ++VGSFL   E++ K +   +   S      AP+Q++     
Subjt:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA

Query:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT
            +SP S G  S  S      SP++    G +NNT
Subjt:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT

AT2G33620.3 AT hook motif DNA-binding family protein (e value: 1.1e-52; % identity: 43.32)
Query:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP
        MA+     +S A    + N   + G +  T     P           P S   +S LN++   G   G     S   KK+RGRPRKY PD G ++L L P
Subjt:  MAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDS-LNVSPYDGSHSGSFNVDSG--KKKRGRPRKYAPDGGNIALALAP

Query:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS
           + +V+     G        +K RGRPPGS  K  ++ ALGS GIGFTPHV+    GEDV++KI++ +  GPR V +LSANG ISN TLR  ATSGG+
Subjt:  TTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKK--QMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGS

Query:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA
        VTYEG++EI+SLSGSF L ENNG RSRTGGLSV LS  DG VLGG VAG+L+A S VQ++VGSFL   E++ K +   +   S      AP+Q++     
Subjt:  VTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFL---EEDKKSNTSMLNSGS----SAAPSQMINFGGA

Query:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT
            +SP S G  S  S      SP++    G +NNT
Subjt:  AAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNT

AT4G17950.1 AT hook motif DNA-binding family protein (e value: 1.4e-60; % identity: 44.58)
Query:  NSNNNASSAI--GLNQATTQMMPPSALFPFNSVIAP--------------ASVPLDSLNVSPYDGS---------HSGSFNVD--SGKKKRGRPRKYAPD
        N N NA++A+  G N +T+Q M      PF   ++P                +   +L    +DGS         HS  F +D    KKKRGRPRKYA D
Subjt:  NSNNNASSAI--GLNQATTQMMPPSALFPFNSVIAP--------------ASVPLDSLNVSPYDGS---------HSGSFNVD--SGKKKRGRPRKYAPD

Query:  GG-------NIALALAPTTVASSVAH-----------GDLSG--TPDSDQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSFSQ
        GG       NIAL LAPT+   S ++           GD +G     SD PAK+ RGRPPGSGKKQ++AL G+GG+GFTPHV+  K GED+A KIL+F+ 
Subjt:  GG-------NIALALAPTTVASSVAH-----------GDLSG--TPDSDQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSFSQ

Query:  QGPRTVFILSANGTISNATLRHPATSG--GSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK
        QGPR + ILSA G ++N  LR    S   G+V YEG++EIISLSGSFL SE+NGT ++TG LSV L+G +G+++GG V GML+AGSQVQVIVGSF+ + +
Subjt:  QGPRTVFILSANGTISNATLRHPATSG--GSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK

Query:  KSNTSMLNSGS----SAAPSQMINFGGAAAAAASPPSLGAS-SGESSAEN-GDSPL-------NNRHPGMF-NNTSQSIHN--MQMYHQLWASQTQQ
        K   S   + +    ++AP+ M++FGG      SP S G   S ESS EN  +SPL       N+ + G+F N+T Q +H   MQMY  LW   + Q
Subjt:  KSNTSMLNSGS----SAAPSQMINFGGAAAAAASPPSLGAS-SGESSAEN-GDSPL-------NNRHPGMF-NNTSQSIHN--MQMYHQLWASQTQQ

AT5G46640.1 AT hook motif DNA-binding family protein (e value: 1.7e-53; % identity: 48.44)
Query:  KKKRGRPRKYAPDGGNIALALAPTTVASSVAH--------GDLSGTPDS-DQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSF
        KKKRGRPRKY PD G+IAL LAPT+   S A         GD  G  +S D P K+ RGRPPGS KKQ++AL G+ G+GFTPHV+    GED+A+K+++F
Subjt:  KKKRGRPRKYAPDGGNIALALAPTTVASSVAH--------GDLSGTPDS-DQPAKKARGRPPGSGKKQMNAL-GSGGIGFTPHVVLAKPGEDVAAKILSF

Query:  SQQGPRTVFILSANGTISNATLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK
        S QG RT+ ILSA+G +S   LR  + S G VTYEG++EII+LSGS L  E NG+ +R+G LSV L+G DG ++GG V G L+A +QVQVIVGSF+ E K
Subjt:  SQQGPRTVFILSANGTISNATLRHPATSGGSVTYEGQYEIISLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDK

Query:  KSNTSMLNSG------SSAAPSQMINFGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQS---IHNMQMYHQLWASQTQ
        K   S +N         ++AP+ M+NFG  +   +S  S    SG S A + D+  NN   G      Q     H MQMY  LW++  Q
Subjt:  KSNTSMLNSG------SSAAPSQMINFGGAAAAAASPPSLGASSGESSAENGDSPLNNRHPGMFNNTSQS---IHNMQMYHQLWASQTQ


Sequences
CDS sequence
ATGGATTCACTTGAAACTCCTCCGCCACTTTCAGCACCGTCCAACATGGCGGTCGGAGGACCGACGGCGTATTCTACGGCCATGTCCAACTCCAACAACAATGCCTCTTC
CGCGATTGGTCTCAATCAGGCCACGACTCAGATGATGCCGCCTTCTGCGCTCTTTCCGTTTAACTCCGTGATCGCCCCTGCATCCGTGCCTTTGGATTCTTTGAATGTTT
CTCCCTATGACGGATCGCATTCCGGGAGTTTCAACGTTGATTCGGGGAAGAAGAAGAGAGGCCGGCCGAGGAAGTACGCGCCTGATGGTGGTAACATTGCCTTGGCTTTG
GCGCCTACAACTGTTGCGTCTTCTGTTGCTCACGGGGATTTGAGCGGCACTCCTGATTCGGACCAGCCGGCGAAGAAAGCGAGGGGAAGGCCACCGGGCTCGGGGAAGAA
ACAGATGAATGCTCTTGGTTCAGGCGGCATTGGTTTTACTCCTCACGTTGTATTGGCGAAGCCTGGAGAGGATGTAGCAGCCAAAATTCTATCCTTCTCACAGCAAGGAC
CACGAACTGTCTTTATTCTCTCTGCAAATGGTACCATCAGTAATGCTACCCTTCGACACCCGGCTACCTCTGGTGGTTCTGTGACATATGAGGGCCAGTATGAGATCATC
TCTCTGTCAGGGTCCTTTTTGCTCTCGGAGAATAATGGAACTCGAAGTAGAACAGGTGGTTTGAGTGTGTTGCTGTCCGGGTCAGACGGACAAGTTCTTGGTGGTGGAGT
TGCAGGAATGCTAATGGCAGGTTCTCAAGTGCAGGTGATTGTGGGAAGTTTTCTCGAGGAGGATAAAAAATCCAACACAAGTATGCTGAATTCTGGATCTTCTGCTGCGC
CATCCCAAATGATAAACTTTGGTGGTGCAGCAGCGGCAGCAGCCAGCCCTCCGTCGCTAGGGGCATCGAGCGGCGAGTCATCTGCGGAAAATGGAGACAGCCCTCTTAAT
AATAGGCATCCTGGAATGTTCAACAACACCAGCCAATCGATCCACAATATGCAGATGTACCACCAACTATGGGCAAGCCAAACGCAGCAGTGA
mRNA sequence
ATGGATTCACTTGAAACTCCTCCGCCACTTTCAGCACCGTCCAACATGGCGGTCGGAGGACCGACGGCGTATTCTACGGCCATGTCCAACTCCAACAACAATGCCTCTTC
CGCGATTGGTCTCAATCAGGCCACGACTCAGATGATGCCGCCTTCTGCGCTCTTTCCGTTTAACTCCGTGATCGCCCCTGCATCCGTGCCTTTGGATTCTTTGAATGTTT
CTCCCTATGACGGATCGCATTCCGGGAGTTTCAACGTTGATTCGGGGAAGAAGAAGAGAGGCCGGCCGAGGAAGTACGCGCCTGATGGTGGTAACATTGCCTTGGCTTTG
GCGCCTACAACTGTTGCGTCTTCTGTTGCTCACGGGGATTTGAGCGGCACTCCTGATTCGGACCAGCCGGCGAAGAAAGCGAGGGGAAGGCCACCGGGCTCGGGGAAGAA
ACAGATGAATGCTCTTGGTTCAGGCGGCATTGGTTTTACTCCTCACGTTGTATTGGCGAAGCCTGGAGAGGATGTAGCAGCCAAAATTCTATCCTTCTCACAGCAAGGAC
CACGAACTGTCTTTATTCTCTCTGCAAATGGTACCATCAGTAATGCTACCCTTCGACACCCGGCTACCTCTGGTGGTTCTGTGACATATGAGGGCCAGTATGAGATCATC
TCTCTGTCAGGGTCCTTTTTGCTCTCGGAGAATAATGGAACTCGAAGTAGAACAGGTGGTTTGAGTGTGTTGCTGTCCGGGTCAGACGGACAAGTTCTTGGTGGTGGAGT
TGCAGGAATGCTAATGGCAGGTTCTCAAGTGCAGGTGATTGTGGGAAGTTTTCTCGAGGAGGATAAAAAATCCAACACAAGTATGCTGAATTCTGGATCTTCTGCTGCGC
CATCCCAAATGATAAACTTTGGTGGTGCAGCAGCGGCAGCAGCCAGCCCTCCGTCGCTAGGGGCATCGAGCGGCGAGTCATCTGCGGAAAATGGAGACAGCCCTCTTAAT
AATAGGCATCCTGGAATGTTCAACAACACCAGCCAATCGATCCACAATATGCAGATGTACCACCAACTATGGGCAAGCCAAACGCAGCAGTGA
Protein sequence
MDSLETPPPLSAPSNMAVGGPTAYSTAMSNSNNNASSAIGLNQATTQMMPPSALFPFNSVIAPASVPLDSLNVSPYDGSHSGSFNVDSGKKKRGRPRKYAPDGGNIALAL
APTTVASSVAHGDLSGTPDSDQPAKKARGRPPGSGKKQMNALGSGGIGFTPHVVLAKPGEDVAAKILSFSQQGPRTVFILSANGTISNATLRHPATSGGSVTYEGQYEII
SLSGSFLLSENNGTRSRTGGLSVLLSGSDGQVLGGGVAGMLMAGSQVQVIVGSFLEEDKKSNTSMLNSGSSAAPSQMINFGGAAAAAASPPSLGASSGESSAENGDSPLN
NRHPGMFNNTSQSIHNMQMYHQLWASQTQQ