; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013297 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013297
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationChr02:182612..185178
RNA-Seq ExpressionHG10013297
SyntenyHG10013297
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma]3.0e-12273.5Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP  + SPTYQS GVGV+GNAGAD S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        MA+  A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGHKEL+Q NQIEQ PV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_004145559.1 AT-hook motif nuclear-localized protein 10 [Cucumis sativus]7.1e-14080.91Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSV SQQPVMQ MHLPFGADGVYKPVA ASPTYQSS VGVAGNAGADGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        MA+APAVR  AATQSSGGFSP PTAAP SG SASPTSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo]2.4e-14080.91Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSVQSQQPVMQ MHLPFGADGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        M++APAVR  AATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GHKELRQVNQIEQPPV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima]8.7e-12273.5Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP ++ SPTYQS GVGV+GNAGAD S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        MA+A A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG KEL+Q NQIEQ PV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida]1.3e-14483.19Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVMSSGEPFTIGLQKNSVQSQQ VMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        MA+APAVRS AATQ SGGFSPPPTAAPPSGGSASPT LKKARGRPPGSS+KKQQLDGSGSAGVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+TDGGHKEL  VNQIEQ PV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQS  ACNNN I WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

TrEMBL top hitse value%identityAlignment
A0A0A0L0T7 AT-hook motif nuclear-localized protein3.4e-14080.91Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSV SQQPVMQ MHLPFGADGVYKPVA ASPTYQSS VGVAGNAGADGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        MA+APAVR  AATQSSGGFSP PTAAP SG SASPTSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A1S3BWC6 AT-hook motif nuclear-localized protein1.2e-14080.91Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSVQSQQPVMQ MHLPFGADGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        M++APAVR  AATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GHKELRQVNQIEQPPV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A5A7VAQ2 AT-hook motif nuclear-localized protein1.2e-14080.91Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSVQSQQPVMQ MHLPFGADGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        M++APAVR  AATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GHKELRQVNQIEQPPV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A6J1FR30 AT-hook motif nuclear-localized protein5.5e-12273.22Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP ++ SPTYQS  VGV+GNAGAD S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        MA+  A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGHKEL+Q NQIEQ PV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A6J1IXR2 AT-hook motif nuclear-localized protein4.2e-12273.5Show/hide
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP ++ SPTYQS GVGV+GNAGAD S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------
        MA+A A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAG                         
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------------------

Query:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
                           EGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG KEL+Q NQIEQ PV
Subjt:  -------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

SwissProt top hitse value%identityAlignment
O22812 AT-hook motif nuclear-localized protein 107.8e-4943.6Show/hide
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AG             
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------

Query:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
                                       EGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

O80834 AT-hook motif nuclear-localized protein 93.0e-2437.72Show/hide
Query:  GVMSSGEPFTIGLQKNSVQSQQPV--MQGMHLPF--GADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNS-----QSEPVKRKRGRPRKYGP
        G+  SG P   G    S Q QQ +  +   + PF  G+ G   P     P+  ++    AG AGA        VNM +        P+KRKRGRPRKYG 
Subjt:  GVMSSGEPFTIGLQKNSVQSQQPV--MQGMHLPF--GADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNS-----QSEPVKRKRGRPRKYGP

Query:  DGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAG-----------------
        DGS+++A          SS   S   T  P +       S K+ RGRPPG S KKQ++   G     S+G+ FTPHVI V  G                 
Subjt:  DGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAG-----------------

Query:  ---------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV
                                   EGRFEIL+LS SY+++ +GS R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGSF+
Subjt:  ---------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV

Q8GXB3 AT-hook motif nuclear-localized protein 57.9e-2537.55Show/hide
Query:  SQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAS---PTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVIT
        S+   VK+KRGRPRKY PDG +++              G SP P  +  S  S+S   P + K+ARGRPPG + +KQ+L   G     SAG+ F PHVI+
Subjt:  SQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAS---PTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVIT

Query:  VKAG--------------------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAAS
        V +G                                            EGRFEILSL GSYL++E G  +SRTGGLSVSLSGP+G V+GGG+ G+L AAS
Subjt:  VKAG--------------------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAAS

Query:  PVQVVVGSFV-------TDGGHKELRQVNQIEQPPVTAPHKL----APIRAGMTGASSP---PSRGTLSESSGGPGS
         VQVV  SFV        +  +K ++Q  + +Q P  +  +     AP  A  TG  +P   P++G       G GS
Subjt:  PVQVVVGSFV-------TDGGHKELRQVNQIEQPPVTAPHKL----APIRAGMTGASSP---PSRGTLSESSGGPGS

Q940I0 AT-hook motif nuclear-localized protein 131.3e-2434.53Show/hide
Query:  VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDG----------SMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPP
        +G  G+  +  + ++  +      + VK+KRGRPRKY  DG          ++ +AP    P+A+ S GG +        +G +A  S    K+ RGRPP
Subjt:  VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDG----------SMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPP

Query:  GSSSKKQQLDG-SGSAGVGFTPHVITVKAG----------------------------------------------EGRFEILSLSGSYLLSENGSQRSR
        GS   K+QLD   G+ GVGFTPHVI VK G                                              EGRFEI+SLSGS+L SE+    ++
Subjt:  GSSSKKQQLDG-SGSAGVGFTPHVITVKAG----------------------------------------------EGRFEILSLSGSYLLSENGSQRSR

Query:  TGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG-GHKELRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIA
        TG LSVSL+G +GR++GG V G+L A S VQV+VGSFV DG   K+     Q    P +AP  +     G+ G  SP S+G    S     +  N     
Subjt:  TGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG-GHKELRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIA

Query:  CNNNTIS
         +NN  S
Subjt:  CNNNTIS

Q9FIR1 AT-hook motif nuclear-localized protein 81.3e-2436.78Show/hide
Query:  VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDG-SGSAGVGFTPHVITVKA
        ++  +Q   VK+KRGRPRKY PDGS+A+  A  SP  + +S  +           G++    +K+ RGRPPGSS  K+QLD   G++GVGFTPHVI V  
Subjt:  VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDG-SGSAGVGFTPHVITVKA

Query:  G--------------------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ
        G                                            EGRFEI++LSGS L  E     +R+G LSV+L+GPDG ++GG V G L AA+ VQ
Subjt:  G--------------------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ

Query:  VVVGSFVTDGGHKELRQVN--QIEQP-PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGP
        V+VGSFV +    +   VN  + + P P +AP  +    +   G SS  S       SG P
Subjt:  VVVGSFVTDGGHKELRQVN--QIEQP-PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGP

Arabidopsis top hitse value%identityAlignment
AT1G63470.1 AT hook motif DNA-binding family protein5.6e-2637.55Show/hide
Query:  SQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAS---PTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVIT
        S+   VK+KRGRPRKY PDG +++              G SP P  +  S  S+S   P + K+ARGRPPG + +KQ+L   G     SAG+ F PHVI+
Subjt:  SQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAS---PTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVIT

Query:  VKAG--------------------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAAS
        V +G                                            EGRFEILSL GSYL++E G  +SRTGGLSVSLSGP+G V+GGG+ G+L AAS
Subjt:  VKAG--------------------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAAS

Query:  PVQVVVGSFV-------TDGGHKELRQVNQIEQPPVTAPHKL----APIRAGMTGASSP---PSRGTLSESSGGPGS
         VQVV  SFV        +  +K ++Q  + +Q P  +  +     AP  A  TG  +P   P++G       G GS
Subjt:  PVQVVVGSFV-------TDGGHKELRQVNQIEQPPVTAPHKL----APIRAGMTGASSP---PSRGTLSESSGGPGS

AT2G33620.1 AT hook motif DNA-binding family protein5.6e-5043.6Show/hide
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AG             
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------

Query:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
                                       EGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

AT2G33620.2 AT hook motif DNA-binding family protein5.6e-5043.6Show/hide
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AG             
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------

Query:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
                                       EGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

AT2G33620.3 AT hook motif DNA-binding family protein5.6e-5043.6Show/hide
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AG             
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------

Query:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
                                       EGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

AT2G33620.4 AT hook motif DNA-binding family protein5.6e-5043.6Show/hide
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AG             
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAG-------------

Query:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
                                       EGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  -------------------------------EGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGATCCGAGACCGGAGTGATGTCCAGCGGCGAACCTTTCACCATCGGTCTCCAGAAGAATTCAGTACAGTCACAACAGCCGGTCATGCAGGGCATGCATTTACC
CTTCGGCGCCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGTAATGCTGGTGCCGATGGATCTGCTCGTGAAG
CTTTCGTGAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAGTATGGCGATGGCTCCTGCGGTCCGCTCTCCC
GCTGCAACTCAGTCGAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCTCTCCAACTTCTTTGAAGAAAGCCAGAGGCAGACCCCCTGG
CTCGTCTAGCAAAAAGCAGCAGTTGGATGGTTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAAGCTGGAGAGGGGCGATTCGAAATTTTGTCAC
TATCTGGGTCCTATCTCCTCTCCGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTCTAAGTGTTTCATTATCTGGACCAGATGGTAGAGTATTAGGTGGTGGGGTGGCT
GGTCTTCTAACGGCAGCATCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGATGGGGGGCACAAGGAATTGAGACAAGTAAACCAAATAGAACAGCCGCCTGTTAC
TGCACCCCATAAACTTGCTCCGATCCGTGCTGGAATGACGGGGGCCAGCAGCCCGCCATCACGTGGGACTCTCAGTGAATCCTCAGGAGGGCCTGGGAGTCCGTTTAATC
AGAGTGCTATAGCCTGCAATAACAACACCATATCTTGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGGATCCGAGACCGGAGTGATGTCCAGCGGCGAACCTTTCACCATCGGTCTCCAGAAGAATTCAGTACAGTCACAACAGCCGGTCATGCAGGGCATGCATTTACC
CTTCGGCGCCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGTAATGCTGGTGCCGATGGATCTGCTCGTGAAG
CTTTCGTGAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAGTATGGCGATGGCTCCTGCGGTCCGCTCTCCC
GCTGCAACTCAGTCGAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCTCTCCAACTTCTTTGAAGAAAGCCAGAGGCAGACCCCCTGG
CTCGTCTAGCAAAAAGCAGCAGTTGGATGGTTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAAGCTGGAGAGGGGCGATTCGAAATTTTGTCAC
TATCTGGGTCCTATCTCCTCTCCGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTCTAAGTGTTTCATTATCTGGACCAGATGGTAGAGTATTAGGTGGTGGGGTGGCT
GGTCTTCTAACGGCAGCATCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGATGGGGGGCACAAGGAATTGAGACAAGTAAACCAAATAGAACAGCCGCCTGTTAC
TGCACCCCATAAACTTGCTCCGATCCGTGCTGGAATGACGGGGGCCAGCAGCCCGCCATCACGTGGGACTCTCAGTGAATCCTCAGGAGGGCCTGGGAGTCCGTTTAATC
AGAGTGCTATAGCCTGCAATAACAACACCATATCTTGGAAGTGA
Protein sequenceShow/hide protein sequence
MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSP
AATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVA
GLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK