CuGenDBv2

Sed0017687 (gene) of Chayote v1 genome

Gene ID: Sed0017687
Organism: Sechium edule (Chayote v1)
Description: AT-hook motif nuclear-localized protein
Genome location: LG01:18014971..18018226
RNA-Seq Expression: Sed0017687
Synteny: Sed0017687
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
                     GO:0003680 - AT DNA binding (molecular function)
InterPro domains: IPR005175 - PPC domain
                  IPR017956 - AT hook, DNA-binding motif
                  IPR039605 - AT-hook motif nuclear-localized protein
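
The genome location string in this record (LG01:18014971..18018226) follows a "sequence:start..end" pattern. The sketch below shows one way to split it into its parts and compute the span. It assumes the coordinates are 1-based and inclusive, which the page itself does not state; the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG01:18014971..18018226")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 3256 bp. This is the genomic extent of the gene, not the length of the CDS.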


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6608657.1 AT-hook motif nuclear-localized protein 10, partial [Cucurbita argyrosperma subsp. sororia]  e-value: 4.8e-123  % identity: 78.59
Query:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA
        S+  V+  +HLPF        A + SPTYQS GVGVSGNAG DVS  +AF +MN+QSEPVKRKRGRPRK+G DGSMAV    PSAAATQS GGFSPPP  
Subjt:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA

Query:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL
           +GG ASPT LKK RGRPPGS  KQQ+ ALGSAG+GFTPHVITVK GEDVSSKIMS SQNGPR VC+LSANGAISNVTLRQPAMSGGTVTYEGRFEIL
Subjt:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL

Query:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH
        SLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGG KELKQANQIEQ PV   TAPHKLAPIRAGMAG +SSPHS 
Subjt:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH

Query:  GTLSESS---GSPFNHSAGACNSTIPW
        G LSESS   GSPFN S GACN+T  W
Subjt:  GTLSESS---GSPFNHSAGACNSTIPW

KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma]  e-value: 5.6e-124  % identity: 78.9
Query:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA
        S+  V+  +HLPF        A + SPTYQS GVGVSGNAG DVS  +AF +MN+QSEPVKRKRGRPRK+G DGSMAV    PSAAATQS GGFSPPP  
Subjt:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA

Query:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL
           +GG ASPT LKK RGRPPGS  KQQ+ ALGSAG+GFTPHVITVK GEDVSSKIMS SQNGPR VC+LSANGAISNVTLRQPAMSGGTVTYEGRFEIL
Subjt:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL

Query:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH
        SLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQ PV   TAPHKLAPIRAGMAG +SSPHS 
Subjt:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH

Query:  GTLSESS---GSPFNHSAGACNSTIPW
        G LSESS   GSPFN S GACN+T  W
Subjt:  GTLSESS---GSPFNHSAGACNSTIPW

XP_022940600.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita moschata]  e-value: 2.4e-122  % identity: 78.29
Query:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA
        S+  V+  +HLPF        A++ SPTYQS  VGVSGNAG DVS  +AF +MN+QSEPVKRKRGRPRK+G DGSMAV    PSAAATQS GGFSPPP  
Subjt:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA

Query:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL
           +GG ASPT LKK RGRPPGS  KQQ+ ALGSAG+GFTPHVITVK GEDVSSKIMS SQNGPR VC+LSANGAISNVTLRQPAMSGGTVTYEGRFEIL
Subjt:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL

Query:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH
        SLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQ PV   TAPHKLAPIRAGMAG +SSP S 
Subjt:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH

Query:  GTLSESS---GSPFNHSAGACNSTIPW
        G LSESS   GSPFN S GACN+T  W
Subjt:  GTLSESS---GSPFNHSAGACNSTIPW

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima]  e-value: 5.8e-121  % identity: 77.68
Query:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA
        S+  V+  +HLPF        A++ SPTYQS GVGVSGNAG DVS  +AF  MN+QSEPVKRKRGRPRK+G DGSMAV    PSAAATQS GGFSPPP  
Subjt:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA

Query:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL
           +GG ASPT LKK RGRPPGS  KQQ+ ALGSAG+GFTPHVITVK GEDVSSKIMS SQNGPR VC+LSANGAISNVTLRQPAMSGGTVTYEGRFEIL
Subjt:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL

Query:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH
        SLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGG KELKQANQIEQ PV   TAPHKLAPIRAGM G +SSP S 
Subjt:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH

Query:  GTLSESS---GSPFNHSAGACNSTIPW
        G LSESS   GSPFN S GACN+T  W
Subjt:  GTLSESS---GSPFNHSAGACNSTIPW

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida]  e-value: 4.0e-122  % identity: 75.99
Query:  MSGSEIGVMSS----------------------MHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGS
        MSGSE GVMSS                      MHLPF A       AAASPTYQSSGVGV+GNAG D SA +AF NMNSQSEPVKRKRGRPRK+G DGS
Subjt:  MSGSEIGVMSS----------------------MHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGS

Query:  M----AVPSAAATQSSGGFSPPPAAA-AAGGLASPTSLKKGRGRPPGSA-NKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANG
        M    AV SAAATQ SGGFSPPP AA  +GG ASPT LKK RGRPPGS+  KQQ+   GSAG+GFTPHVITVK GEDVSSKIMSFSQNGPR VC+L+ANG
Subjt:  M----AVPSAAATQSSGGFSPPPAAA-AAGGLASPTSLKKGRGRPPGSA-NKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+ DGGHKEL   NQIEQLPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPV

Query:  VVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS---GSPFNHSAGACNST-IPW
           TAPHKLAPIRAGM G +SSP S GTLSESS   GSPFN S GACN+  IPW
Subjt:  VVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS---GSPFNHSAGACNST-IPW

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L0T7 AT-hook motif nuclear-localized protein  e-value: 6.3e-121  % identity: 79.03
Query:  SEIGVMSSMHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAVP----SAAATQSSGGFSPPPAA
        S+  VM SMHLPF A       A ASPTYQSS VGV+GNAG D SA DAF NMNSQSEPVKRKRGRPRK+G DGSMAV      AAATQSSGGFSP P A
Subjt:  SEIGVMSSMHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAVP----SAAATQSSGGFSPPPAA

Query:  A-AAGGLASPTSLKKGRGRPPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEI
        A  +G  ASPTSLKK RGRPPGS+ K+  +    SAG+GFTPHVITVK GEDVSSKIMSFSQNGPR VC+L+ANGAISNVTLRQPAMSGGTVTYEGRFEI
Subjt:  A-AAGGLASPTSLKKGRGRPPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEI

Query:  LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS
        LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGHKEL+Q NQIEQ PV   +APHKLAPIRAGM G +SSP S
Subjt:  LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS

Query:  HGTLSESS---GSPFNHSAGAC-NSTIPW
         GTLSESS   GSPFN SAGAC N+TIPW
Subjt:  HGTLSESS---GSPFNHSAGAC-NSTIPW

A0A1S3BWC6 AT-hook motif nuclear-localized protein  e-value: 5.3e-120  % identity: 78.42
Query:  SEIGVMSSMHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSM----AVPSAAATQSSGGFSPPPAA
        S+  VM SMHLPF A        AASPTYQSS VGV+GNAG D SA +AF NMNSQSEPVKRKRGRPRK+G DGSM    AV  AAATQSSGGFSP P A
Subjt:  SEIGVMSSMHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSM----AVPSAAATQSSGGFSPPPAA

Query:  A-AAGGLASPTSLKKGRGRPPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEI
        A  +GG  SPTSLKK RGRPPGS+ K+ Q+ +  S G+GFTPHVITVK GEDVSSKIMSFSQNGPR VC+L+ANGAISNVTLRQPAMSGGTVTYEGRFEI
Subjt:  A-AAGGLASPTSLKKGRGRPPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEI

Query:  LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS
        LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D GHKEL+Q NQIEQ PV   +APHKLAPIRAGM G +SSP S
Subjt:  LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS

Query:  HGTLSESS---GSPFNHSAGAC-NSTIPW
         GTLSESS   GSPFN SAGAC N+TIPW
Subjt:  HGTLSESS---GSPFNHSAGAC-NSTIPW

A0A5A7VAQ2 AT-hook motif nuclear-localized protein  e-value: 5.3e-120  % identity: 78.42
Query:  SEIGVMSSMHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSM----AVPSAAATQSSGGFSPPPAA
        S+  VM SMHLPF A        AASPTYQSS VGV+GNAG D SA +AF NMNSQSEPVKRKRGRPRK+G DGSM    AV  AAATQSSGGFSP P A
Subjt:  SEIGVMSSMHLPFAA-------AAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSM----AVPSAAATQSSGGFSPPPAA

Query:  A-AAGGLASPTSLKKGRGRPPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEI
        A  +GG  SPTSLKK RGRPPGS+ K+ Q+ +  S G+GFTPHVITVK GEDVSSKIMSFSQNGPR VC+L+ANGAISNVTLRQPAMSGGTVTYEGRFEI
Subjt:  A-AAGGLASPTSLKKGRGRPPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEI

Query:  LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS
        LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D GHKEL+Q NQIEQ PV   +APHKLAPIRAGM G +SSP S
Subjt:  LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS

Query:  HGTLSESS---GSPFNHSAGAC-NSTIPW
         GTLSESS   GSPFN SAGAC N+TIPW
Subjt:  HGTLSESS---GSPFNHSAGAC-NSTIPW

A0A6J1FR30 AT-hook motif nuclear-localized protein  e-value: 1.1e-122  % identity: 78.29
Query:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA
        S+  V+  +HLPF        A++ SPTYQS  VGVSGNAG DVS  +AF +MN+QSEPVKRKRGRPRK+G DGSMAV    PSAAATQS GGFSPPP  
Subjt:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA

Query:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL
           +GG ASPT LKK RGRPPGS  KQQ+ ALGSAG+GFTPHVITVK GEDVSSKIMS SQNGPR VC+LSANGAISNVTLRQPAMSGGTVTYEGRFEIL
Subjt:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL

Query:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH
        SLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQ PV   TAPHKLAPIRAGMAG +SSP S 
Subjt:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH

Query:  GTLSESS---GSPFNHSAGACNSTIPW
        G LSESS   GSPFN S GACN+T  W
Subjt:  GTLSESS---GSPFNHSAGACNSTIPW

A0A6J1IXR2 AT-hook motif nuclear-localized protein  e-value: 2.8e-121  % identity: 77.68
Query:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA
        S+  V+  +HLPF        A++ SPTYQS GVGVSGNAG DVS  +AF  MN+QSEPVKRKRGRPRK+G DGSMAV    PSAAATQS GGFSPPP  
Subjt:  SEIGVMSSMHLPFA-------AAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAV----PSAAATQSSGGFSPPPAA

Query:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL
           +GG ASPT LKK RGRPPGS  KQQ+ ALGSAG+GFTPHVITVK GEDVSSKIMS SQNGPR VC+LSANGAISNVTLRQPAMSGGTVTYEGRFEIL
Subjt:  AA-AGGLASPTSLKKGRGRPPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEIL

Query:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH
        SLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGG KELKQANQIEQ PV   TAPHKLAPIRAGM G +SSP S 
Subjt:  SLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSH

Query:  GTLSESS---GSPFNHSAGACNSTIPW
        G LSESS   GSPFN S GACN+T  W
Subjt:  GTLSESS---GSPFNHSAGACNSTIPW

SwissProt top hits (e-value, % identity, alignment)
O22812 AT-hook motif nuclear-localized protein 10  e-value: 1.1e-64  % identity: 53.82
Query:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR
        P  + +    YQ +  G +     ++   ++     + SEPVK++RGRPRK+G D G M++   P A +   S      P++   GG       +K RGR
Subjt:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR

Query:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT
        PPGS++K+ ++ ALGS G+GFTPHV+TV  GEDVSSKIM+ + NGPR VCVLSANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRT
Subjt:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH
        GGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ D G KE KQ   + Q+ +     P ++AP +  M  T SSP S GT+SESS     GSP + 
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH

Query:  SAGA-CNSTI--PW
        S G   N+TI  PW
Subjt:  SAGA-CNSTI--PW

O49658 AT-hook motif nuclear-localized protein 2  e-value: 1.2e-47  % identity: 47.87
Query:  NAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAVPSAAATQSSGGFSPPPAAAAAGGLA-----SPTSLKKGRGRP----PGS--ANKQQIHAL
        N+ T  +A D F      S P+K++RGRPRK+G DG      AA T      SP P ++AA   +     S TS K+G+ +P    P S    K Q+  L
Subjt:  NAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAVPSAAATQSSGGFSPPPAAAAAGGLA-----SPTSLKKGRGRP----PGS--ANKQQIHAL

Query:  G-----SAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSG
        G     SA   FTPH+ITV  GEDV+ +I+SFSQ G   +CVL ANG +S+VTLRQP  SGGT+TYEGRFEILSLSG+++ S++ G RSRTGG+SVSL+ 
Subjt:  G-----SAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSG

Query:  PDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESSGS
        PDGRV+GGGVAGLL AA+P+QVVVG+F+  GG       NQ EQ P      PH    + + +  TSS+   H T+   + S
Subjt:  PDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESSGS

O80834 AT-hook motif nuclear-localized protein 9  e-value: 7.3e-50  % identity: 50.7
Query:  GTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAVPSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGRPPGSANKQQIHALG-----SAGMGFT
        G ++ AP   P+      P+KRKRGRPRK+G DGS+++  ++++ S+              +    S K+GRGRPPGS  KQ++ ++G     S+GM FT
Subjt:  GTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAVPSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGRPPGSANKQQIHALG-----SAGMGFT

Query:  PHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGL
        PHVI V  GED++SK+++FSQ GPR +CVLSA+GA+S  TL QP+ S G + YEGRFEIL+LS SY+++ +G  R+RTG LSVSL+ PDGRV+GG + G 
Subjt:  PHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGL

Query:  LTAASPVQVVVGSFV
        L AASPVQV+VGSF+
Subjt:  LTAASPVQVVVGSFV

Q8GXB3 AT-hook motif nuclear-localized protein 5  e-value: 2.0e-47  % identity: 46.15
Query:  PNMNSQSEPVKRKRGRPRKFGSDGSMAVPSAAATQSSGGFSPPPAAAA----AGGLASPTSLKKGRGRPPGSANKQQIHALG-----SAGMGFTPHVITV
        P   S+   VK+KRGRPRK+  DG          Q S G SP P  +     +  ++ P + K+ RGRPPG+  KQ++  LG     SAG+ F PHVI+V
Subjt:  PNMNSQSEPVKRKRGRPRKFGSDGSMAVPSAAATQSSGGFSPPPAAAA----AGGLASPTSLKKGRGRPPGSANKQQIHALG-----SAGMGFTPHVITV

Query:  KTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASP
         +GED+ SK++SFSQ  PR +C++S  G +S+VTLR+PA +  ++T+EGRFEILSL GSYL++E GG +SRTGGLSVSLSGP+G V+GGG+ G+L AAS 
Subjt:  KTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASP

Query:  VQVVVGSFV-------NDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS
        VQVV  SFV       N+  +K +KQ  + +Q P   T +  +  P  A  A  S+  H+
Subjt:  VQVVVGSFV-------NDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHS

Q8VYJ2 AT-hook motif nuclear-localized protein 1  e-value: 2.3e-51  % identity: 48.66
Query:  VKRKRGRPRKFGSDGSMAV--PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGRPPGSANKQQIH---------ALGSAGMGFTPHVITVKTGEDVS
        +K+KRGRPRK+G DG++    P   ++  +    PPP++      AS    K+ + +P  S N+ + H         A  S G  FTPH+ITV TGEDV+
Subjt:  VKRKRGRPRKFGSDGSMAV--PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGRPPGSANKQQIH---------ALGSAGMGFTPHVITVKTGEDVS

Query:  SKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS
         KI+SFSQ GPR++CVLSANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVGS
Subjt:  SKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS

Query:  FVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESSGSPFNHS
        F+    H++ K           + ++P    PI        SS   H T+   S  P N++
Subjt:  FVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESSGSPFNHS

Arabidopsis top hits (e-value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein  e-value: 7.5e-66  % identity: 53.82
Query:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR
        P  + +    YQ +  G +     ++   ++     + SEPVK++RGRPRK+G D G M++   P A +   S      P++   GG       +K RGR
Subjt:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR

Query:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT
        PPGS++K+ ++ ALGS G+GFTPHV+TV  GEDVSSKIM+ + NGPR VCVLSANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRT
Subjt:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH
        GGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ D G KE KQ   + Q+ +     P ++AP +  M  T SSP S GT+SESS     GSP + 
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH

Query:  SAGA-CNSTI--PW
        S G   N+TI  PW
Subjt:  SAGA-CNSTI--PW

AT2G33620.2 AT hook motif DNA-binding family protein  e-value: 7.5e-66  % identity: 53.82
Query:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR
        P  + +    YQ +  G +     ++   ++     + SEPVK++RGRPRK+G D G M++   P A +   S      P++   GG       +K RGR
Subjt:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR

Query:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT
        PPGS++K+ ++ ALGS G+GFTPHV+TV  GEDVSSKIM+ + NGPR VCVLSANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRT
Subjt:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH
        GGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ D G KE KQ   + Q+ +     P ++AP +  M  T SSP S GT+SESS     GSP + 
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH

Query:  SAGA-CNSTI--PW
        S G   N+TI  PW
Subjt:  SAGA-CNSTI--PW

AT2G33620.3 AT hook motif DNA-binding family protein  e-value: 7.5e-66  % identity: 53.82
Query:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR
        P  + +    YQ +  G +     ++   ++     + SEPVK++RGRPRK+G D G M++   P A +   S      P++   GG       +K RGR
Subjt:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR

Query:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT
        PPGS++K+ ++ ALGS G+GFTPHV+TV  GEDVSSKIM+ + NGPR VCVLSANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRT
Subjt:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH
        GGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ D G KE KQ   + Q+ +     P ++AP +  M  T SSP S GT+SESS     GSP + 
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH

Query:  SAGA-CNSTI--PW
        S G   N+TI  PW
Subjt:  SAGA-CNSTI--PW

AT2G33620.4 AT hook motif DNA-binding family protein  e-value: 7.5e-66  % identity: 53.82
Query:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR
        P  + +    YQ +  G +     ++   ++     + SEPVK++RGRPRK+G D G M++   P A +   S      P++   GG       +K RGR
Subjt:  PFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSD-GSMAV---PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR

Query:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT
        PPGS++K+ ++ ALGS G+GFTPHV+TV  GEDVSSKIM+ + NGPR VCVLSANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRT
Subjt:  PPGSANKQ-QIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH
        GGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ D G KE KQ   + Q+ +     P ++AP +  M  T SSP S GT+SESS     GSP + 
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESS-----GSPFNH

Query:  SAGA-CNSTI--PW
        S G   N+TI  PW
Subjt:  SAGA-CNSTI--PW

AT4G12080.1 AT-hook motif nuclear-localized protein 1  e-value: 1.6e-52  % identity: 48.66
Query:  VKRKRGRPRKFGSDGSMAV--PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGRPPGSANKQQIH---------ALGSAGMGFTPHVITVKTGEDVS
        +K+KRGRPRK+G DG++    P   ++  +    PPP++      AS    K+ + +P  S N+ + H         A  S G  FTPH+ITV TGEDV+
Subjt:  VKRKRGRPRKFGSDGSMAV--PSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGRPPGSANKQQIH---------ALGSAGMGFTPHVITVKTGEDVS

Query:  SKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS
         KI+SFSQ GPR++CVLSANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVGS
Subjt:  SKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS

Query:  FVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESSGSPFNHS
        F+    H++ K           + ++P    PI        SS   H T+   S  P N++
Subjt:  FVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESSGSPFNHS


Sequences
CDS sequence
ATGTCAGGATCTGAGATCGGAGTGATGAGCAGCATGCATTTACCCTTTGCCGCCGCTGCCGCCTCGCCCACCTACCAGTCCTCGGGTGTCGGGGTTTCCGGTAATGCCGG
CACCGATGTGTCTGCTCCTGATGCTTTCCCTAACATGAATTCCCAAAGCGAGCCAGTAAAGAGGAAGAGGGGAAGACCTAGGAAGTTTGGATCAGATGGCAGTATGGCAG
TCCCGTCCGCCGCCGCAACTCAGTCGAGTGGTGGTTTTTCTCCTCCACCCGCTGCTGCTGCGGCCGGAGGGTTAGCCTCTCCAACTTCTTTGAAGAAAGGCAGAGGCAGA
CCCCCTGGCTCTGCCAACAAGCAGCAGATACATGCTTTGGGGTCAGCAGGAATGGGATTTACCCCACATGTCATCACCGTGAAAACTGGAGAGGATGTATCCTCGAAGAT
AATGTCATTTTCACAGAATGGTCCTAGAACGGTATGTGTCCTTAGCGCAAATGGAGCTATATCTAATGTGACTCTACGTCAACCAGCCATGTCAGGTGGAACAGTTACTT
ACGAGGGGCGATTTGAGATTTTATCACTCTCTGGGTCATATCTCCTCTCTGAGAATGGCGGTCAGCGGAGCCGAACAGGGGGTCTAAGTGTGTCATTGTCTGGACCAGAT
GGTAGAGTACTAGGTGGTGGGGTTGCTGGTCTTCTAACGGCAGCCTCTCCTGTCCAGGTGGTGGTGGGGAGCTTCGTCAACGATGGGGGACACAAGGAATTGAAACAAGC
GAACCAAATAGAACAGCTGCCTGTGGTTGTGACTACTGCACCACATAAGCTTGCTCCAATCCGTGCTGGAATGGCGGGGACGAGCAGCAGCCCACATTCGCATGGGACTC
TCAGTGAATCCTCAGGCAGTCCGTTTAATCACAGTGCTGGAGCCTGCAATAGCACCATACCATGGAACTGA
mRNA sequence
ATGTCAGGATCTGAGATCGGAGTGATGAGCAGCATGCATTTACCCTTTGCCGCCGCTGCCGCCTCGCCCACCTACCAGTCCTCGGGTGTCGGGGTTTCCGGTAATGCCGG
CACCGATGTGTCTGCTCCTGATGCTTTCCCTAACATGAATTCCCAAAGCGAGCCAGTAAAGAGGAAGAGGGGAAGACCTAGGAAGTTTGGATCAGATGGCAGTATGGCAG
TCCCGTCCGCCGCCGCAACTCAGTCGAGTGGTGGTTTTTCTCCTCCACCCGCTGCTGCTGCGGCCGGAGGGTTAGCCTCTCCAACTTCTTTGAAGAAAGGCAGAGGCAGA
CCCCCTGGCTCTGCCAACAAGCAGCAGATACATGCTTTGGGGTCAGCAGGAATGGGATTTACCCCACATGTCATCACCGTGAAAACTGGAGAGGATGTATCCTCGAAGAT
AATGTCATTTTCACAGAATGGTCCTAGAACGGTATGTGTCCTTAGCGCAAATGGAGCTATATCTAATGTGACTCTACGTCAACCAGCCATGTCAGGTGGAACAGTTACTT
ACGAGGGGCGATTTGAGATTTTATCACTCTCTGGGTCATATCTCCTCTCTGAGAATGGCGGTCAGCGGAGCCGAACAGGGGGTCTAAGTGTGTCATTGTCTGGACCAGAT
GGTAGAGTACTAGGTGGTGGGGTTGCTGGTCTTCTAACGGCAGCCTCTCCTGTCCAGGTGGTGGTGGGGAGCTTCGTCAACGATGGGGGACACAAGGAATTGAAACAAGC
GAACCAAATAGAACAGCTGCCTGTGGTTGTGACTACTGCACCACATAAGCTTGCTCCAATCCGTGCTGGAATGGCGGGGACGAGCAGCAGCCCACATTCGCATGGGACTC
TCAGTGAATCCTCAGGCAGTCCGTTTAATCACAGTGCTGGAGCCTGCAATAGCACCATACCATGGAACTGA
Protein sequence
MSGSEIGVMSSMHLPFAAAAASPTYQSSGVGVSGNAGTDVSAPDAFPNMNSQSEPVKRKRGRPRKFGSDGSMAVPSAAATQSSGGFSPPPAAAAAGGLASPTSLKKGRGR
PPGSANKQQIHALGSAGMGFTPHVITVKTGEDVSSKIMSFSQNGPRTVCVLSANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPD
GRVLGGGVAGLLTAASPVQVVVGSFVNDGGHKELKQANQIEQLPVVVTTAPHKLAPIRAGMAGTSSSPHSHGTLSESSGSPFNHSAGACNSTIPWN
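
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch in plain Python, not the method CuGenDB uses; the names `CODON_TABLE` and `translate` are illustrative, and the standard nuclear genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown in this record.
print(translate("ATGTCAGGATCTGAGATCGGAGTG"))  # MSGSEIGV
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the listed protein sequence gives a quick consistency check between the two.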