CuGenDBv2

CSPI02G02660 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G02660
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: AT-hook motif nuclear-localized protein
Genome location: Chr2:1835748..1837462
RNA-Seq Expression: CSPI02G02660
Synteny: CSPI02G02660
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0045824 - negative regulation of innate immune response (biological process)
GO:0050832 - defense response to fungus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR005175 - PPC domain
IPR014476 - AT-hook motif nuclear-localized protein 15-29
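
The genome location above is written in the form chromosome:start..end. As a minimal sketch, the following Python splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names parse_location and span_length are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'Chr2:1835748..1837462' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr2:1835748..1837462")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 1715 bp.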


Homology
GenBank top hits (e value, % identity, alignment)
KAG7030326.1 AT-hook motif nuclear-localized protein 20, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.3e-143; identity: 94.24%)
Query:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MANRWWTS QMGLPGVDHTSTSSSAMR PDLGISMNDN G V+S GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRP GSKNKPKPPIFVTRDSPNALKSH
Subjt:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVA FARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVL LQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN--GGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGG GAQGHTSA GGG GDGSPQGIGGGVGD SAM PLYNLPPNLLPN  GGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN--GGGGQLNQEAYSWAHGGRPSF

XP_004139388.1 AT-hook motif nuclear-localized protein 19 [Cucumis sativus] (e value: 1.7e-159; identity: 100%)
Query:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
Subjt:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

XP_008456342.1 PREDICTED: AT-hook motif nuclear-localized protein 19-like [Cucumis melo] (e value: 2.6e-155; identity: 98.64%)
Query:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
Subjt:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSA GGGAGD SPQGIGGGV GDPSAMTPLYNLPPNLLPNGGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

XP_023547116.1 AT-hook motif nuclear-localized protein 19-like [Cucurbita pepo subsp. pepo] (e value: 1.3e-143; identity: 93.92%)
Query:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MANRWWTS QMGLPGVDHTSTSSSAMR PDLGISMNDN G V+S GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRP GSKNKPKPPIFVTRDSPNALKSH
Subjt:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVA FARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVL LQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGG GAQGHTSA GGG GDGSPQGIGGGVGD SAM PLYNLPPNLLPN   GGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGGQLNQEAYSWAHGGRPSF

XP_038890801.1 AT-hook motif nuclear-localized protein 19-like [Benincasa hispida] (e value: 6.3e-154; identity: 97.61%)
Query:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
Subjt:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVL L GRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSA GGGA DGSPQGIG GVGDPSAM PLYNLPPNLLPNGGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LL85 AT-hook motif nuclear-localized protein (e value: 8.4e-160; identity: 100%)
Query:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
Subjt:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

A0A1S3C3P5 AT-hook motif nuclear-localized protein (e value: 1.2e-155; identity: 98.64%)
Query:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
Subjt:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSA GGGAGD SPQGIGGGV GDPSAMTPLYNLPPNLLPNGGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

A0A5A7VCE1 AT-hook motif nuclear-localized protein (e value: 1.2e-155; identity: 98.64%)
Query:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
Subjt:  MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSA GGGAGD SPQGIGGGV GDPSAMTPLYNLPPNLLPNGGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

A0A6J1G447 AT-hook motif nuclear-localized protein (e value: 8.4e-144; identity: 93.92%)
Query:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MANRWWTS QMGLPGVDHTSTSSSAMR PDLGISMNDN G V+S GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRP GSKNKPKPPIFVTRDSPNALKSH
Subjt:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVA FARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVL LQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGG GAQGHTSA GGG GDGSPQGIGGGVGD SAM PLYNLPPNLLPN   GGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGGQLNQEAYSWAHGGRPSF

A0A6J1KEB8 AT-hook motif nuclear-localized protein (e value: 9.3e-143; identity: 93.58%)
Query:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH
        MANRWWTS QMGLPGVDHTSTSSSAMR PDLGISMNDN G V+S GGDDDDDRDN GDEPKEGAVEVPTRRPRGRP GSKNKPKPPIFVTRDSPNALKSH
Subjt:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHS-GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSH

Query:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
        VMEISNGADIAESVA FARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVL LQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG
Subjt:  VMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAG

Query:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGGQLNQEAYSWAHGGRPSF
        PVMVIAATFSNATYERLPLEEEEEGGG GAQGHTSA GGG GDGSPQGIGGGVGD SAM PLYNLPPNLLPN   GGGGQ+NQEAYSWAHGGRPSF
Subjt:  PVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGGQLNQEAYSWAHGGRPSF

SwissProt top hits (e value, % identity, alignment)
O23620 AT-hook motif nuclear-localized protein 23 (e value: 7.5e-65; identity: 51.9%)
Query:  MRKPDLGISMNDNGGPVHSGG-------DDDDDRDN---------------GGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEI
        + +PDL +  N +   V  G        DD+D+ +N               GG     G  +V  RRPRGRPPGSKNKPKPP+ +TR+S N L++H++E+
Subjt:  MRKPDLGISMNDNGGPVHSGG-------DDDDDRDN---------------GGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEI

Query:  SNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMV
        +NG D+ + VA +ARRRQRG+ VLSGSGTVTNV++RQPSA GAV+ LQG FEILSL+G+FLP PAPPG+T LTI+LAGGQGQVVGGSVVG LTAAGPV+V
Subjt:  SNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMV

Query:  IAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        IAA+F+N  YERLPLEE+E+      Q     G  G G+  P+   GG G      P +NLP N+ PN    QL  E +    GGR  F
Subjt:  IAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

O49662 AT-hook motif nuclear-localized protein 24 (e value: 1.3e-61; identity: 58.1%)
Query:  GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS--
        GG  +    +GGD          TRRPRGRP GSKNKPKPPI +TRDS NAL++HVMEI +G D+ ESVA FARRRQRGV V+SG+G VTNVT+RQP   
Subjt:  GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS--

Query:  -APGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAG
         +PG+V++L GRFEILSL+G+FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  AGPV+V+AA+FSNA YERLPLEE+E       Q     GGGG  
Subjt:  -APGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAG

Query:  DGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQE-AYSWAHGGRPSF
          SP  +G  +     AM+    LPPNLL   G  QL Q+   S+   GRP +
Subjt:  DGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQE-AYSWAHGGRPSF

Q8GWQ2 AT-hook motif nuclear-localized protein 20 (e value: 1.1e-87; identity: 63.67%)
Query:  MANRWWTSGQMGLPG-VDHTSTS--------SSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRD
        MAN WWT+ Q GL G VDH+ +S         S + K DLGI+MN +        D+D D +   D+P+EGAVEV  RRPRGRPPGSKNKPK PIFVTRD
Subjt:  MANRWWTSGQMGLPG-VDHTSTS--------SSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRD

Query:  SPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSV
        SPNAL+SHV+EIS+G+D+A+++A F+RRRQRGV VLSG+G+V NVTLRQ +APG V++LQGRFEILSLTG FLPGP+PPGSTGLT+YLAG QGQVVGGSV
Subjt:  SPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSV

Query:  VGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMT-PLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP
        VGPL A G VMVIAATFSNATYERLP+EEEE+GGG   Q H      G GD SP  IG  + D S M  P YN+PP+L+PN G GQL  E Y+W H   P
Subjt:  VGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMT-PLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP

Q9SR17 AT-hook motif nuclear-localized protein 19 (e value: 2.0e-94; identity: 67.85%)
Query:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMN---DNGGPVH------SGGDDDDDRDN-GGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFV
        MAN WWT GQ+ L G++ T   SS ++KPDL ISMN   D+G   H         ++DDDRDN  GD  EP+EGAVE PTRRPRGRP GSKNKPKPPIFV
Subjt:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMN---DNGGPVH------SGGDDDDDRDN-GGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFV

Query:  TRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS------APG--AVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLA
        TRDSPNALKSHVMEI++G D+ E++A FARRRQRG+ +LSG+GTV NVTLRQPS      APG  AVLALQGRFEILSLTG+FLPGPAPPGSTGLTIYLA
Subjt:  TRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS------APG--AVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLA

Query:  GGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEE--EGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGG
        GGQGQVVGGSVVGPL AAGPVM+IAATFSNATYERLPLEEEE  E GG G  G    G  G G GSP   G G GD +   P+YN+P NL+ N   GGGG
Subjt:  GGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEE--EGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGG

Query:  QLN-QEAYSWA
        Q++ QEAY WA
Subjt:  QLN-QEAYSWA

Q9SZ70 AT-hook motif nuclear-localized protein 26 (e value: 1.7e-61; identity: 57.63%)
Query:  DNGGPVHSGGDDDDDRDNGGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVT
        DN    +SG +  +   +GG+      G+ E  TRRPRGRP GSKNKPK PI +TRDS NAL++HVMEI +G DI + +A FARRRQRGV V+SG+G+VT
Subjt:  DNGGPVHSGGDDDDDRDNGGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVT

Query:  NVTLRQP-SAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHT
        NVT+RQP S PG+V++L GRFEILSL+G+FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  +GPV+V+AA+FSNA YERLPLEE+E    V  QG  
Subjt:  NVTLRQP-SAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHT

Query:  SAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQL-----NQEAYSWAHGGRP
          GGGG G GSP  +G      +AM     LPPNLL   G  QL     N + Y W+ G  P
Subjt:  SAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQL-----NQEAYSWAHGGRP

Arabidopsis top hits (e value, % identity, alignment)
AT3G04570.1 AT-hook motif nuclear-localized protein 19 (e value: 1.4e-95; identity: 67.85%)
Query:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMN---DNGGPVH------SGGDDDDDRDN-GGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFV
        MAN WWT GQ+ L G++ T   SS ++KPDL ISMN   D+G   H         ++DDDRDN  GD  EP+EGAVE PTRRPRGRP GSKNKPKPPIFV
Subjt:  MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMN---DNGGPVH------SGGDDDDDRDN-GGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFV

Query:  TRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS------APG--AVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLA
        TRDSPNALKSHVMEI++G D+ E++A FARRRQRG+ +LSG+GTV NVTLRQPS      APG  AVLALQGRFEILSLTG+FLPGPAPPGSTGLTIYLA
Subjt:  TRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS------APG--AVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLA

Query:  GGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEE--EGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGG
        GGQGQVVGGSVVGPL AAGPVM+IAATFSNATYERLPLEEEE  E GG G  G    G  G G GSP   G G GD +   P+YN+P NL+ N   GGGG
Subjt:  GGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEE--EGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGG

Query:  QLN-QEAYSWA
        Q++ QEAY WA
Subjt:  QLN-QEAYSWA

AT4G12050.1 Predicted AT-hook DNA-binding family protein (e value: 1.2e-62; identity: 57.63%)
Query:  DNGGPVHSGGDDDDDRDNGGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVT
        DN    +SG +  +   +GG+      G+ E  TRRPRGRP GSKNKPK PI +TRDS NAL++HVMEI +G DI + +A FARRRQRGV V+SG+G+VT
Subjt:  DNGGPVHSGGDDDDDRDNGGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVT

Query:  NVTLRQP-SAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHT
        NVT+RQP S PG+V++L GRFEILSL+G+FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  +GPV+V+AA+FSNA YERLPLEE+E    V  QG  
Subjt:  NVTLRQP-SAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHT

Query:  SAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQL-----NQEAYSWAHGGRP
          GGGG G GSP  +G      +AM     LPPNLL   G  QL     N + Y W+ G  P
Subjt:  SAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQL-----NQEAYSWAHGGRP

AT4G14465.1 AT-hook motif nuclear-localized protein 20 (e value: 7.6e-89; identity: 63.67%)
Query:  MANRWWTSGQMGLPG-VDHTSTS--------SSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRD
        MAN WWT+ Q GL G VDH+ +S         S + K DLGI+MN +        D+D D +   D+P+EGAVEV  RRPRGRPPGSKNKPK PIFVTRD
Subjt:  MANRWWTSGQMGLPG-VDHTSTS--------SSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRD

Query:  SPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSV
        SPNAL+SHV+EIS+G+D+A+++A F+RRRQRGV VLSG+G+V NVTLRQ +APG V++LQGRFEILSLTG FLPGP+PPGSTGLT+YLAG QGQVVGGSV
Subjt:  SPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSV

Query:  VGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMT-PLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP
        VGPL A G VMVIAATFSNATYERLP+EEEE+GGG   Q H      G GD SP  IG  + D S M  P YN+PP+L+PN G GQL  E Y+W H   P
Subjt:  VGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMT-PLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP

AT4G17800.1 Predicted AT-hook DNA-binding family protein (e value: 5.3e-66; identity: 51.9%)
Query:  MRKPDLGISMNDNGGPVHSGG-------DDDDDRDN---------------GGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEI
        + +PDL +  N +   V  G        DD+D+ +N               GG     G  +V  RRPRGRPPGSKNKPKPP+ +TR+S N L++H++E+
Subjt:  MRKPDLGISMNDNGGPVHSGG-------DDDDDRDN---------------GGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEI

Query:  SNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMV
        +NG D+ + VA +ARRRQRG+ VLSGSGTVTNV++RQPSA GAV+ LQG FEILSL+G+FLP PAPPG+T LTI+LAGGQGQVVGGSVVG LTAAGPV+V
Subjt:  SNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMV

Query:  IAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
        IAA+F+N  YERLPLEE+E+      Q     G  G G+  P+   GG G      P +NLP N+ PN    QL  E +    GGR  F
Subjt:  IAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF

AT4G22810.1 Predicted AT-hook DNA-binding family protein (e value: 9.4e-63; identity: 58.1%)
Query:  GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS--
        GG  +    +GGD          TRRPRGRP GSKNKPKPPI +TRDS NAL++HVMEI +G D+ ESVA FARRRQRGV V+SG+G VTNVT+RQP   
Subjt:  GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS--

Query:  -APGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAG
         +PG+V++L GRFEILSL+G+FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  AGPV+V+AA+FSNA YERLPLEE+E       Q     GGGG  
Subjt:  -APGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAG

Query:  DGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQE-AYSWAHGGRPSF
          SP  +G  +     AM+    LPPNLL   G  QL Q+   S+   GRP +
Subjt:  DGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQE-AYSWAHGGRPSF
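
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, percent identity can be recomputed from that middle line as identical columns divided by alignment length. The helper name percent_identity is hypothetical, and the short strings in the usage line are a toy input, not taken from any hit on this page.

```python
def percent_identity(query_aln: str, midline: str) -> float:
    """Percent identity of one alignment, given the aligned query row and the
    BLAST-style midline (letter = identical, '+' = positive, ' ' = mismatch/gap).

    The aligned query row (including '-' gap characters) defines the alignment
    width; the midline is padded or trimmed to that width before counting.
    """
    width = len(query_aln)
    if width == 0:
        raise ValueError("empty alignment")
    mid = midline.ljust(width)[:width]
    identical = sum(1 for ch in mid if ch.isalpha())
    return 100.0 * identical / width


# Toy example: 6 identical columns out of 8.
print(percent_identity("MANRWWTS", "MANR W+S"))
```

For a multi-block alignment, the query rows and midlines of all blocks would need to be concatenated (after stripping the fixed-width `Query:` prefix) before calling the helper.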


Sequences
CDS sequence
ATGATGGCAAACCGGTGGTGGACCTCCGGCCAGATGGGTCTTCCCGGAGTTGATCATACTTCAACAAGCTCCTCTGCTATGAGAAAACCAGATCTGGGAATCTCCATGAA
CGACAACGGTGGTCCTGTTCACAGTGGTGGTGATGACGATGACGATAGGGATAACGGTGGGGATGAACCTAAAGAAGGAGCAGTTGAGGTTCCAACGCGTCGTCCCCGTG
GCCGACCGCCGGGATCCAAGAATAAGCCTAAGCCACCTATCTTTGTTACTAGAGATAGCCCTAATGCCCTAAAAAGCCATGTCATGGAGATTTCTAATGGAGCTGATATT
GCCGAGAGCGTTGCTCAATTCGCTCGACGGCGACAGAGAGGTGTTTCTGTGCTTAGTGGTAGCGGTACGGTTACAAATGTCACACTCCGACAACCGTCTGCGCCTGGTGC
AGTCTTAGCCCTCCAGGGACGATTCGAGATACTTTCTTTAACTGGAACTTTCCTCCCTGGACCAGCCCCACCTGGCTCAACCGGACTAACGATCTACTTGGCTGGTGGAC
AAGGGCAAGTGGTGGGTGGCAGCGTCGTCGGGCCACTCACCGCTGCTGGCCCAGTGATGGTGATTGCTGCAACATTTTCCAACGCAACATACGAAAGATTACCCTTAGAA
GAGGAAGAAGAAGGCGGCGGAGTAGGAGCACAAGGGCACACATCGGCAGGCGGTGGCGGCGCAGGCGACGGTTCACCACAAGGCATCGGAGGCGGAGTCGGGGACCCATC
AGCTATGACTCCACTGTACAATTTACCACCAAATTTACTACCGAATGGTGGCGGAGGGCAGTTGAACCAAGAGGCCTATTCTTGGGCTCACGGCGGCCGGCCGTCATTTT
AA
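
The CDS above and the protein sequence listed on this page are related by translation. A minimal Python sketch of that step follows, assuming the standard nuclear genetic code (NCBI translation table 1); the function name translate is hypothetical, and the two inputs in the usage lines are the first 21 and last 9 nucleotides of the CDS shown above.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order:
# codon (b1, b2, b3) maps to AMINO[16*i1 + 4*i2 + i3].
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are removed first, so the multi-line CDS can be
    pasted as-is. Codons containing characters other than A/C/G/T become 'X'.
    With to_stop=True, translation ends at the first stop codon, which is not
    included in the result.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino == "*" and to_stop:
            break
        protein.append(amino)
    return "".join(protein)


print(translate("ATGATGGCAAACCGGTGGTGG"))  # start of the CDS
print(translate("TCATTTTAA"))              # end of the CDS, including the stop codon
```

The two fragments translate to MMANRWW and SF, which agree with the start and end of the protein sequence given on this page.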
mRNA sequence
AGAACACTCACATTACCCTCAACATTTCTAAAAAAACCCCATTTCTCTCTTTCTCTCTCCTCACATACTTTCTTGTCTATCTCAACTCTCTAGGCATCTCTTTGCCTTTT
CTTTCCCTCTCCTTCCCCTTCATCATCACTTCAACCTCCCCTTTCCTTAATCCTTCCATTTAATTTACACCTCCCAATACTCTCTACTTATTAATTTAATCCTTTCTTCT
CTCTATTTTTTTCCTTTTTTTTAATTATTATTTCTTCTTCTTCTTCCCCCTTTCAACCCCACAATACCATCTTAATTCTTCTTCAGCAGATAAATTATTTTTTTTTATAA
AAGAATTTTAACTTTTAAAACTCTGAAAATTTTCAAGATCTGAAAAAAAAAAAAAAGATCTGAATTTTTTGATGATGGCAAACCGGTGGTGGACCTCCGGCCAGATGGGT
CTTCCCGGAGTTGATCATACTTCAACAAGCTCCTCTGCTATGAGAAAACCAGATCTGGGAATCTCCATGAACGACAACGGTGGTCCTGTTCACAGTGGTGGTGATGACGA
TGACGATAGGGATAACGGTGGGGATGAACCTAAAGAAGGAGCAGTTGAGGTTCCAACGCGTCGTCCCCGTGGCCGACCGCCGGGATCCAAGAATAAGCCTAAGCCACCTA
TCTTTGTTACTAGAGATAGCCCTAATGCCCTAAAAAGCCATGTCATGGAGATTTCTAATGGAGCTGATATTGCCGAGAGCGTTGCTCAATTCGCTCGACGGCGACAGAGA
GGTGTTTCTGTGCTTAGTGGTAGCGGTACGGTTACAAATGTCACACTCCGACAACCGTCTGCGCCTGGTGCAGTCTTAGCCCTCCAGGGACGATTCGAGATACTTTCTTT
AACTGGAACTTTCCTCCCTGGACCAGCCCCACCTGGCTCAACCGGACTAACGATCTACTTGGCTGGTGGACAAGGGCAAGTGGTGGGTGGCAGCGTCGTCGGGCCACTCA
CCGCTGCTGGCCCAGTGATGGTGATTGCTGCAACATTTTCCAACGCAACATACGAAAGATTACCCTTAGAAGAGGAAGAAGAAGGCGGCGGAGTAGGAGCACAAGGGCAC
ACATCGGCAGGCGGTGGCGGCGCAGGCGACGGTTCACCACAAGGCATCGGAGGCGGAGTCGGGGACCCATCAGCTATGACTCCACTGTACAATTTACCACCAAATTTACT
ACCGAATGGTGGCGGAGGGCAGTTGAACCAAGAGGCCTATTCTTGGGCTCACGGCGGCCGGCCGTCATTTTAAAGCTTCTGATGGGAAAAAAAAACAAAGGTTAGAAAAT
GTTTTAAGATTGGAGGCTGTTTTGTTATTGCAGCCATGGTTTAGGAAGTTGAAGGTATAAAAGATGAAAGAGATGAGTCTCTATATATTTCATGTCTTAGTGTTGATTAT
AAATATTCAGTGTGAAAAAAAAAACCCTTAAATTAATTGTTCTATGCAAATACAAGAAGGAGAGTTAATTAACTTCCTCTCTTTTTCTTCTAATTTAATTTTTATT
Protein sequence
MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADI
AESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLE
EEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF