; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0018659 (gene) of Chayote v1 genome

Gene IDSed0018659
OrganismSechium edule (Chayote v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationLG01:69906585..69910834
RNA-Seq ExpressionSed0018659
SyntenySed0018659
Gene Ontology termsGO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606248.1 AT-hook motif nuclear-localized protein 13, partial [Cucurbita argyrosperma subsp. sororia]1.4e-10262.36Show/hide
Query:  MAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTA
        M VG P AYSP +SN  NN++S + LNP+  QM+  S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  DG  AL LAPTT A
Subjt:  MAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTA

Query:  ---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYE
           GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH+  +GGS T  YE
Subjt:  ---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYE

Query:  GPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNLSRGAVAAAPNSP-
        G YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK+  +  M NSGSS+ PSQ +N    A AAA + P 
Subjt:  GPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNLSRGAVAAAPNSP-

Query:  ---SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
           SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  ---SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

XP_022159894.1 AT-hook motif nuclear-localized protein 13-like [Momordica charantia]1.2e-10160.17Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +++PP P SA  NMAVG   AYS ++SN  NN++S + LNP+ TQM+  +  FP+NS+IAPA++P +++N   YDGSHS + N+DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDL+  PDSE PAKK RGRP GSGKKQ    G G  GFT H V+ KPGEDV AK+VSF+QQGPR VFI+SANGT+ SATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNL-
           +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMAGSQVQ+++GSFLEDDK+ + SM NS SS+ P Q +N  
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNL-

Query:  SRGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
        +  A AA+P S   SSGESSA+NGD  + NR PG+ +N+ SQP+      +QMYH LWA
Subjt:  SRGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

XP_022931258.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita moschata]3.8e-10562.33Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +D+PP   SAP NM VG P AYSP +SN  NN++S + LNP+  QMI  S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL
        +  +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK+  +  M NSGSS+ PSQ +N 
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL

Query:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
           A AAA + P    SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

XP_022995511.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita maxima]4.5e-10662.6Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +D+PP   SAP NM VG P AYSP +SN  NN++S + LNP+  QMIP S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL
        +  +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK+  +  M NSGSS+ PSQ +N 
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL

Query:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
           A AAA + P    SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

XP_023532984.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita pepo subsp. pepo]5.9e-10662.6Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +D+PP   SAP NM VG P AYSP +SN  NN++S + LNP+  QMIP S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL
        +  +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK+  +  M NSGSS+ PSQ +N 
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL

Query:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
           A AAA + P    SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

TrEMBL top hitse value%identityAlignment
A0A6J1E3M0 AT-hook motif nuclear-localized protein5.6e-10260.17Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +++PP P SA  NMAVG   AYS ++SN  NN++S + LNP+ TQM+  +  FP+NS+IAPA++P +++N   YDGSHS + N+DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDL+  PDSE PAKK RGRP GSGKKQ    G G  GFT H V+ KPGEDV AK+VSF+QQGPR VFI+SANGT+ SATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNL-
           +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMAGSQVQ+++GSFLEDDK+ + SM NS SS+ P Q +N  
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNL-

Query:  SRGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
        +  A AA+P S   SSGESSA+NGD  + NR PG+ +N+ SQP+      +QMYH LWA
Subjt:  SRGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

A0A6J1ETT7 AT-hook motif nuclear-localized protein5.8e-9960.34Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +D+PP   SAP NM VG P AYSP +SN  NN++S + LNP+  QMI  S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNLS
        +  +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK                 + N +
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNLS

Query:  RGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
          A AA+P S   SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  RGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

A0A6J1EXY5 AT-hook motif nuclear-localized protein1.9e-10562.33Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +D+PP   SAP NM VG P AYSP +SN  NN++S + LNP+  QMI  S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL
        +  +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK+  +  M NSGSS+ PSQ +N 
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL

Query:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
           A AAA + P    SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

A0A6J1JZ43 AT-hook motif nuclear-localized protein6.8e-10060.61Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +D+PP   SAP NM VG P AYSP +SN  NN++S + LNP+  QMIP S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNLS
        +  +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK                 + N +
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNLS

Query:  RGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
          A AA+P S   SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  RGAVAAAPNS--PSSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

A0A6J1K244 AT-hook motif nuclear-localized protein2.2e-10662.6Show/hide
Query:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD
        +D+PP   SAP NM VG P AYSP +SN  NN++S + LNP+  QMIP S+ FP+NS+IAPA++P ++MN   YDGSHS S N DSGK+KRGRPRKY  D
Subjt:  MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTD

Query:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH
        G  AL LAPTT A   GHGDLSG PD E PAKK RGRP GSGKKQ    G    GFT H V AKPGEDV AK+++FSQQGPRTVFI+SANG+I +ATLRH
Subjt:  GGNALRLAPTTTA---GHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG--GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRH

Query:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL
        +  +GGS T  YEG YE+IS+SG F+LSENNG R++TGGLSVLLA  DG++ GGGV+GMLMA SQVQV++GSFLE+DK+  +  M NSGSS+ PSQ +N 
Subjt:  AGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRL-DPSMANSGSSSVPSQRVNL

Query:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
           A AAA + P    SSGESSADNG   +NNR PG+ SNS SQP+      MQMYH+LWA
Subjt:  SRGAVAAAPNSP----SSGESSADNGDGAVNNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

SwissProt top hitse value%identityAlignment
O22812 AT-hook motif nuclear-localized protein 101.5e-3538.63Show/hide
Query:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV
        MN P  +         +  K++RGRPRKY  D G  + L     A    +S         +K RGRP GS  K+      G    GFT H +    GEDV
Subjt:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV

Query:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL
         +K+++ +  GPR V ++SANG I + TLR +  +GG  TV YEG +E++S+SG F L ENNG R++TGGLSV L++PDG + GG V+G+L+A S VQ++
Subjt:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL

Query:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP
        +GSFL D +K     +   G SS    RV  ++  V   P+SP S     ESS   G G+ ++    G  +N+ + P
Subjt:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP

Q8VYJ2 AT-hook motif nuclear-localized protein 13.1e-3337.97Show/hide
Query:  SPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAP
        +PS  +V   S S+   N SPT + P     P +   AP  L  + +   T   +  E ++    K+KRGRPRKY  D G  + L+P   +     S  P
Subjt:  SPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAP

Query:  -------DSETPAKKPRGRPAGSGKK-----QTTTFGP-------GGFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSA
               D     K+ + +P  S  +     Q    G        G FT H +    GEDV  K++SFSQQGPR++ ++SANG I S TLR    +GG  
Subjt:  -------DSETPAKKPRGRPAGSGKK-----QTTTFGP-------GGFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSA

Query:  TVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFL
        T+ YEG +E++S+SG F+ +++ G R++TGG+SV LA+PDGR+ GGG++G+L+A S VQV++GSFL
Subjt:  TVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFL

Q940I0 AT-hook motif nuclear-localized protein 131.6e-4538.24Show/hide
Query:  NPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGN--------ALRLAPTTT--------------AGHGD
        +P P Q I   +L       +P+++ +   ++  +   H +       K+KRGRPRKY  DGG         AL LAPT+                G GD
Subjt:  NPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGN--------ALRLAPTTT--------------AGHGD

Query:  LSG--APDSETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYE
         +G  A  S+ PAK+ RGRP GSGKKQ    G     GFT H +  K GED+  K+++F+ QGPR + I+SA G + +  LR A  +  + TV YEG +E
Subjt:  LSG--APDSETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYE

Query:  MISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVP-SQRVNLSRGAVAAAPNSP-----
        +IS+SG F+ SE+NG  TKTG LSV LA  +GRI GG V GML+AGSQVQV++GSF+ D ++   S   + ++  P S   N+        P SP     
Subjt:  MISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVP-SQRVNLSRGAVAAAPNSP-----

Query:  -SSGESSADNGDGAV--------NNRQPGVLSNSNSQPMQYMQQQMQMYHKLW
          S ESS +N   +         N+   G+  NS  QP+   Q  MQMY  LW
Subjt:  -SSGESSADNGDGAV--------NNRQPGVLSNSNSQPMQYMQQQMQMYHKLW

Q9FIR1 AT-hook motif nuclear-localized protein 82.2e-3940.83Show/hide
Query:  KRKRGRPRKYVTDGGNALRLAPTT-----------TAGHGDLSGAPDS-ETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFS
        K+KRGRPRKY  DG  AL LAPT+             G GD  G  +S + P K+ RGRP GS KKQ    G     GFT H +    GED+ +KV++FS
Subjt:  KRKRGRPRKYVTDGGNALRLAPTT-----------TAGHGDLSGAPDS-ETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFS

Query:  QQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDD
         QG RT+ I+SA+G +    LR A  + G   V YEG +E+I++SG  +  E NG   ++G LSV LA PDG I GG V G L+A +QVQV++GSF+ + 
Subjt:  QQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDD

Query:  KRLDPSMANSG------SSSVPSQRVNLSRGAVAAAPNSPSSGESSADNGDGAV----NNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
        K+   S  N         +S P+  +N   G+V+  P+S SS E+  ++G  A+    NN   G       QP+     QMQMY  LW+
Subjt:  KRLDPSMANSG------SSSVPSQRVNLSRGAVAAAPNSPSSGESSADNGDGAV----NNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA

Q9SB31 AT-hook motif nuclear-localized protein 37.7e-3234.08Show/hide
Query:  LNPSPTQMIPSSSLFPYNSMIAPATLPSNAMN---APTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRP
        ++P P    P+  L P  ++ A AT+ +        P      +E+ + +  K+KRGRPRKY  DG   + L+P   +    L+    SE P +K RGR 
Subjt:  LNPSPTQMIPSSSLFPYNSMIAPATLPSNAMN---APTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRP

Query:  AGSGKK----------------------QTTTFGPGGFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEM
         G   +                       T  F    FT H ++   GEDV  K+++FSQQG R + I+SANG I + TLR +  +GG  T+ YEG +E+
Subjt:  AGSGKK----------------------QTTTFGPGGFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEM

Query:  ISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMA
        +S++G F+ +++ G R++ GG+SV LA PDGR+FGGG++G+ +A   VQV++G+F+   ++    +A
Subjt:  ISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMA

Arabidopsis top hitse value%identityAlignment
AT2G33620.1 AT hook motif DNA-binding family protein1.1e-3638.63Show/hide
Query:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV
        MN P  +         +  K++RGRPRKY  D G  + L     A    +S         +K RGRP GS  K+      G    GFT H +    GEDV
Subjt:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV

Query:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL
         +K+++ +  GPR V ++SANG I + TLR +  +GG  TV YEG +E++S+SG F L ENNG R++TGGLSV L++PDG + GG V+G+L+A S VQ++
Subjt:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL

Query:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP
        +GSFL D +K     +   G SS    RV  ++  V   P+SP S     ESS   G G+ ++    G  +N+ + P
Subjt:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP

AT2G33620.2 AT hook motif DNA-binding family protein1.1e-3638.63Show/hide
Query:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV
        MN P  +         +  K++RGRPRKY  D G  + L     A    +S         +K RGRP GS  K+      G    GFT H +    GEDV
Subjt:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV

Query:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL
         +K+++ +  GPR V ++SANG I + TLR +  +GG  TV YEG +E++S+SG F L ENNG R++TGGLSV L++PDG + GG V+G+L+A S VQ++
Subjt:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL

Query:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP
        +GSFL D +K     +   G SS    RV  ++  V   P+SP S     ESS   G G+ ++    G  +N+ + P
Subjt:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP

AT2G33620.3 AT hook motif DNA-binding family protein1.1e-3638.63Show/hide
Query:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV
        MN P  +         +  K++RGRPRKY  D G  + L     A    +S         +K RGRP GS  K+      G    GFT H +    GEDV
Subjt:  MNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPTTTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPG----GFTLHAVMAKPGEDV

Query:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL
         +K+++ +  GPR V ++SANG I + TLR +  +GG  TV YEG +E++S+SG F L ENNG R++TGGLSV L++PDG + GG V+G+L+A S VQ++
Subjt:  GAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVL

Query:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP
        +GSFL D +K     +   G SS    RV  ++  V   P+SP S     ESS   G G+ ++    G  +N+ + P
Subjt:  IGSFLED-DKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSS----GESSADNGDGA-VNNRQPGVLSNSNSQP

AT4G17950.1 AT hook motif DNA-binding family protein1.1e-4638.24Show/hide
Query:  NPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGN--------ALRLAPTTT--------------AGHGD
        +P P Q I   +L       +P+++ +   ++  +   H +       K+KRGRPRKY  DGG         AL LAPT+                G GD
Subjt:  NPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGN--------ALRLAPTTT--------------AGHGD

Query:  LSG--APDSETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYE
         +G  A  S+ PAK+ RGRP GSGKKQ    G     GFT H +  K GED+  K+++F+ QGPR + I+SA G + +  LR A  +  + TV YEG +E
Subjt:  LSG--APDSETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYE

Query:  MISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVP-SQRVNLSRGAVAAAPNSP-----
        +IS+SG F+ SE+NG  TKTG LSV LA  +GRI GG V GML+AGSQVQV++GSF+ D ++   S   + ++  P S   N+        P SP     
Subjt:  MISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVP-SQRVNLSRGAVAAAPNSP-----

Query:  -SSGESSADNGDGAV--------NNRQPGVLSNSNSQPMQYMQQQMQMYHKLW
          S ESS +N   +         N+   G+  NS  QP+   Q  MQMY  LW
Subjt:  -SSGESSADNGDGAV--------NNRQPGVLSNSNSQPMQYMQQQMQMYHKLW

AT5G46640.1 AT hook motif DNA-binding family protein1.6e-4040.83Show/hide
Query:  KRKRGRPRKYVTDGGNALRLAPTT-----------TAGHGDLSGAPDS-ETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFS
        K+KRGRPRKY  DG  AL LAPT+             G GD  G  +S + P K+ RGRP GS KKQ    G     GFT H +    GED+ +KV++FS
Subjt:  KRKRGRPRKYVTDGGNALRLAPTT-----------TAGHGDLSGAPDS-ETPAKKPRGRPAGSGKKQTTTFGPG---GFTLHAVMAKPGEDVGAKVVSFS

Query:  QQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDD
         QG RT+ I+SA+G +    LR A  + G   V YEG +E+I++SG  +  E NG   ++G LSV LA PDG I GG V G L+A +QVQV++GSF+ + 
Subjt:  QQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCFVLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDD

Query:  KRLDPSMANSG------SSSVPSQRVNLSRGAVAAAPNSPSSGESSADNGDGAV----NNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA
        K+   S  N         +S P+  +N   G+V+  P+S SS E+  ++G  A+    NN   G       QP+     QMQMY  LW+
Subjt:  KRLDPSMANSG------SSSVPSQRVNLSRGAVAAAPNSPSSGESSADNGDGAV----NNRQPGVLSNSNSQPMQYMQQQMQMYHKLWA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCACCACCTCAGCCACATTCAGCACCGCCAAACATGGCCGTCGGAGCACCGCCGGCGTATTCGCCGTCGATATCCAACGTCGGCAACAATTCCGCTTCAGCGAT
GGTTCTAAATCCCTCTCCGACTCAGATGATTCCGTCTTCTTCGCTATTTCCGTATAACTCTATGATCGCCCCTGCAACCCTACCTTCGAATGCTATGAATGCTCCAACGT
ACGACGGATCGCATTCCGAGAGTTTAAACGTCGATTCCGGCAAGAGGAAGAGAGGCCGGCCGAGGAAGTACGTGACGGACGGTGGCAATGCGTTGCGTTTGGCACCTACG
ACGACTGCCGGTCATGGAGATTTGAGCGGCGCTCCTGATTCGGAGACGCCGGCGAAGAAACCGAGGGGAAGGCCTGCCGGCTCGGGGAAGAAACAGACTACTACATTCGG
TCCTGGCGGTTTTACTCTTCATGCTGTAATGGCGAAGCCTGGAGAGGACGTAGGAGCCAAAGTTGTGTCCTTCTCGCAGCAAGGACCACGAACTGTCTTTATTATCTCTG
CAAATGGTACCATCCAAAGTGCTACCCTCCGACATGCAGGGAGAACTGGTGGTTCCGCGACAGTCGCATATGAGGGCCCTTATGAAATGATCTCTATGTCAGGCTGCTTT
GTGCTCTCAGAGAATAATGGAATTCGAACTAAAACAGGTGGTTTGAGCGTGTTGCTTGCTGCGCCAGACGGAAGGATTTTCGGTGGAGGAGTTTCAGGAATGCTAATGGC
AGGTTCCCAAGTACAGGTGCTTATAGGAAGTTTTCTTGAGGATGATAAAAGATTGGATCCAAGTATGGCGAATTCTGGATCTTCCTCCGTTCCATCTCAAAGGGTAAACT
TAAGTAGAGGCGCAGTAGCAGCAGCGCCCAATTCTCCGTCGAGTGGCGAGTCGTCTGCCGACAACGGAGACGGCGCTGTCAATAATAGGCAGCCTGGAGTGTTGAGTAAT
AGCAACAGCCAACCAATGCAGTATATGCAGCAGCAGATGCAGATGTACCACAAATTATGGGCACCAAGATAA
mRNA sequenceShow/hide mRNA sequence
CTAAACCTCCTACACTCTTTCTTCTTCTTGTTCTTCATCTTCATTTGTTTTCACGTTTCTGAGAAATTTGAAGAACAATTACAAGAATCTTGATAATCAAGTTTCTCTGT
TTCTTGAATCTATTTTTCTTTGTATGGATTCACCACCTCAGCCACATTCAGCACCGCCAAACATGGCCGTCGGAGCACCGCCGGCGTATTCGCCGTCGATATCCAACGTC
GGCAACAATTCCGCTTCAGCGATGGTTCTAAATCCCTCTCCGACTCAGATGATTCCGTCTTCTTCGCTATTTCCGTATAACTCTATGATCGCCCCTGCAACCCTACCTTC
GAATGCTATGAATGCTCCAACGTACGACGGATCGCATTCCGAGAGTTTAAACGTCGATTCCGGCAAGAGGAAGAGAGGCCGGCCGAGGAAGTACGTGACGGACGGTGGCA
ATGCGTTGCGTTTGGCACCTACGACGACTGCCGGTCATGGAGATTTGAGCGGCGCTCCTGATTCGGAGACGCCGGCGAAGAAACCGAGGGGAAGGCCTGCCGGCTCGGGG
AAGAAACAGACTACTACATTCGGTCCTGGCGGTTTTACTCTTCATGCTGTAATGGCGAAGCCTGGAGAGGACGTAGGAGCCAAAGTTGTGTCCTTCTCGCAGCAAGGACC
ACGAACTGTCTTTATTATCTCTGCAAATGGTACCATCCAAAGTGCTACCCTCCGACATGCAGGGAGAACTGGTGGTTCCGCGACAGTCGCATATGAGGGCCCTTATGAAA
TGATCTCTATGTCAGGCTGCTTTGTGCTCTCAGAGAATAATGGAATTCGAACTAAAACAGGTGGTTTGAGCGTGTTGCTTGCTGCGCCAGACGGAAGGATTTTCGGTGGA
GGAGTTTCAGGAATGCTAATGGCAGGTTCCCAAGTACAGGTGCTTATAGGAAGTTTTCTTGAGGATGATAAAAGATTGGATCCAAGTATGGCGAATTCTGGATCTTCCTC
CGTTCCATCTCAAAGGGTAAACTTAAGTAGAGGCGCAGTAGCAGCAGCGCCCAATTCTCCGTCGAGTGGCGAGTCGTCTGCCGACAACGGAGACGGCGCTGTCAATAATA
GGCAGCCTGGAGTGTTGAGTAATAGCAACAGCCAACCAATGCAGTATATGCAGCAGCAGATGCAGATGTACCACAAATTATGGGCACCAAGATAAACACACAACACAGCA
GTGAGGATCTTATTTTGCAGGGAAAACTGCTTTAGCCATTAGGGAGATGATATGATATGTGATGC
Protein sequenceShow/hide protein sequence
MDSPPQPHSAPPNMAVGAPPAYSPSISNVGNNSASAMVLNPSPTQMIPSSSLFPYNSMIAPATLPSNAMNAPTYDGSHSESLNVDSGKRKRGRPRKYVTDGGNALRLAPT
TTAGHGDLSGAPDSETPAKKPRGRPAGSGKKQTTTFGPGGFTLHAVMAKPGEDVGAKVVSFSQQGPRTVFIISANGTIQSATLRHAGRTGGSATVAYEGPYEMISMSGCF
VLSENNGIRTKTGGLSVLLAAPDGRIFGGGVSGMLMAGSQVQVLIGSFLEDDKRLDPSMANSGSSSVPSQRVNLSRGAVAAAPNSPSSGESSADNGDGAVNNRQPGVLSN
SNSQPMQYMQQQMQMYHKLWAPR