CuGenDBv2

ClCG11G019040 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG11G019040
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: AT-hook motif nuclear-localized protein
Genome location: CG_Chr11:32052269..32055673
RNA-Seq Expression: ClCG11G019040
Synteny: ClCG11G019040
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
    GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
    IPR005175 - PPC domain
    IPR017956 - AT hook, DNA-binding motif
    IPR039605 - AT-hook motif nuclear-localized protein
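
The genome location field above follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this) and using hypothetical names (GenomeLocation, parse_location) that are not part of CuGenDB, the field can be split and the genomic span computed like this:

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'CG_Chr11:32052269..32055673'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(seqid, start, end)


loc = parse_location("CG_Chr11:32052269..32055673")
print(loc.seqid, loc.start, loc.end, loc.length)
# CG_Chr11 32052269 32055673 3405
```

Under that assumption the gene spans 3405 bp on CG_Chr11. This is longer than the mRNA listed under Sequences, which would be consistent with introns, although the page does not list the exon structure.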


Homology
GenBank top hits (e value, % identity, alignment)
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 4.1e-148; identity: 83.77%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HLPFG DGVYKP  + SPTYQS GVGV+GNAGAD S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+  A  S AATQS GGF PPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG HKEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

XP_004145559.1 AT-hook motif nuclear-localized protein 10 [Cucumis sativus] (e value: 2.5e-166; identity: 91.95%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSV SQQPVMQ MHLPFG DGVYKPVA ASPTYQSS VGVAGNAGADGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVR  AATQSSGGF P PTAAP SG SA+ TSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo] (e value: 8.7e-167; identity: 91.95%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSVQSQQPVMQ MHLPFG DGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGF P PTAAP SGGS + TSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD  HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima] (e value: 1.2e-147; identity: 83.77%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HLPFG DGVYKP ++ SPTYQS GVGV+GNAGAD S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+A A  S AATQS GGF PPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida] (e value: 2.6e-171; identity: 93.97%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGEPFTIGL KNSVQSQQ VMQGMHLPFG DGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVRS AATQ SGGF PPPTAAPPSGGSA+ T LKKARGRPPGSS+KKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+TDG HKEL  VNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQS  ACNNN I
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L0T7 AT-hook motif nuclear-localized protein (e value: 1.2e-166; identity: 91.95%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSV SQQPVMQ MHLPFG DGVYKPVA ASPTYQSS VGVAGNAGADGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVR  AATQSSGGF P PTAAP SG SA+ TSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

A0A1S3BWC6 AT-hook motif nuclear-localized protein (e value: 4.2e-167; identity: 91.95%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSVQSQQPVMQ MHLPFG DGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGF P PTAAP SGGS + TSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD  HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

A0A5A7VAQ2 AT-hook motif nuclear-localized protein (e value: 4.2e-167; identity: 91.95%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSVQSQQPVMQ MHLPFG DGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGF P PTAAP SGGS + TSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD  HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

A0A6J1FR30 AT-hook motif nuclear-localized protein (e value: 7.5e-148; identity: 83.48%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HLPFG DGVYKP ++ SPTYQS  VGV+GNAGAD S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+  A  S AATQS GGF PPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG HKEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

A0A6J1IXR2 AT-hook motif nuclear-localized protein (e value: 5.7e-148; identity: 83.77%)
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HLPFG DGVYKP ++ SPTYQS GVGV+GNAGAD S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+A A  S AATQS GGF PPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

SwissProt top hits (e value, % identity, alignment)
O22812 AT-hook motif nuclear-localized protein 10 (e value: 1.0e-77; identity: 53.7%)
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

O80834 AT-hook motif nuclear-localized protein 9 (e value: 3.2e-47; identity: 45.33%)
Query:  GVISSGEPFTIGLHKNSVQSQQPV--MQGMHLPF--GVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNS-----QSEPVKRKRGRPRKYGP
        G+  SG P   G    S Q QQ +  +   + PF  G  G   P     P+  ++    AG AGA        VNM +        P+KRKRGRPRKYG 
Subjt:  GVISSGEPFTIGLHKNSVQSQQPV--MQGMHLPF--GVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNS-----QSEPVKRKRGRPRKYGP

Query:  DGSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA
        DGS+++A +  S +            T  P         S K+ RGRPPG S KKQ++   G     S+G+ FTPHVI V  GED++SK+++FSQ GPRA
Subjt:  DGSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA

Query:  VCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV
        +C+L+A+GA+S  TL QP+ S G + YEGRFEIL+LS SY+++ +GS R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGSF+
Subjt:  VCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV

Q8VYJ2 AT-hook motif nuclear-localized protein 1 (e value: 4.1e-50; identity: 50.42%)
Query:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK
        +K+KRGRPRKYGPDG+ +A++P   S A         P P+  PP          S K+++ +P  S ++ +   Q++  G     S G  FTPH+ITV 
Subjt:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK

Query:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV
         GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++G  RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPV
Subjt:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV

Query:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI
        QVVVGSF+    H++ +         +++P    PI
Subjt:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI

Q940I0 AT-hook motif nuclear-localized protein 13 (e value: 4.7e-46; identity: 41.97%)
Query:  VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDG----------SMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTS--LKKARGRPP
        +G  G+  +  + ++  +      + VK+KRGRPRKY  DG          ++ +AP    P+A+ S GG          +G +A  +    K+ RGRPP
Subjt:  VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDG----------SMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTS--LKKARGRPP

Query:  GSSSKKQQLDG-SGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGSQRSR
        GS   K+QLD   G+ GVGFTPHVI VK GED+++KI++F+  GPRA+CIL+A GA++NV LRQ   S   GTV YEGRFEI+SLSGS+L SE+    ++
Subjt:  GSSSKKQQLDG-SGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGSQRSR

Query:  TGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQS--PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAI
        TG LSVSL+G +GR++GG V G+L A S VQV+VGSFV DG  K+ +   + + +  P +AP  +     G+ G  SP S+G    S     +  N    
Subjt:  TGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQS--PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAI

Query:  ACNNN
          +NN
Subjt:  ACNNN

Q9FIR1 AT-hook motif nuclear-localized protein 8 (e value: 1.4e-47; identity: 44.96%)
Query:  VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDG-SGSAGVGFTPHVITVKA
        ++  +Q   VK+KRGRPRKY PDGS+A+  A  SP  + +S  +           G++    +K+ RGRPPGSS  K+QLD   G++GVGFTPHVI V  
Subjt:  VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDG-SGSAGVGFTPHVITVKA

Query:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ
        GED++SK+M+FS  G R +CIL+A+GA+S V LRQ + S G VTYEGRFEI++LSGS L  E     +R+G LSV+L+GPDG ++GG V G L AA+ VQ
Subjt:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ

Query:  VVVGSFVTDGVHKELRQVNQI---EQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTIY
        V+VGSFV +    +   VN        P +AP  +    +   G SS  S       SG P    +      NNN IY
Subjt:  VVVGSFVTDGVHKELRQVNQI---EQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTIY

Arabidopsis top hits (e value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein (e value: 7.3e-79; identity: 53.7%)
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT2G33620.2 AT hook motif DNA-binding family protein (e value: 7.3e-79; identity: 53.7%)
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT2G33620.3 AT hook motif DNA-binding family protein (e value: 7.3e-79; identity: 53.7%)
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT2G33620.4 AT hook motif DNA-binding family protein (e value: 7.3e-79; identity: 53.7%)
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMHLP--FGVD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT4G12080.1 AT-hook motif nuclear-localized protein 1 (e value: 2.9e-51; identity: 50.42%)
Query:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK
        +K+KRGRPRKYGPDG+ +A++P   S A         P P+  PP          S K+++ +P  S ++ +   Q++  G     S G  FTPH+ITV 
Subjt:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFCPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK

Query:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV
         GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++G  RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPV
Subjt:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV

Query:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI
        QVVVGSF+    H++ +         +++P    PI
Subjt:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI
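
The alignments above follow the usual BLAST-style layout, in which the unlabeled middle line repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank at a mismatch or gap. As a rough sketch of how a % identity figure relates to that middle line, assuming identity is counted as identical columns divided by alignment length (the page does not document its formula) and using a hypothetical helper name percent_identity:

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    Assumption: identity = identical columns / total alignment columns,
    with gap columns ('-' in the query line) counted in the denominator.
    """
    if not query:
        raise ValueError("empty alignment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces of the midline are often lost when text is copied,
    # so pad it back to the query length before counting.
    midline = midline.ljust(len(query))
    # A letter in the midline marks an identity; '+' and ' ' do not.
    identical = sum(1 for symbol in midline if symbol.isalpha())
    return round(100.0 * identical / len(query), 2)


# First ten columns of the first GenBank alignment block on this page.
print(percent_identity("MSGSETGVIS", "M+GSETGV++"))
# 70.0
```

To score a whole hit, concatenate the Query lines and the middle lines of all its blocks (after removing the "Query:" label and the matching indentation) before calling the function.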


Sequences
CDS sequence
ATGTCGGGATCTGAGACCGGAGTGATTTCCAGCGGCGAACCCTTCACCATCGGTCTCCACAAGAATTCAGTACAGTCACAACAGCCGGTCATGCAGGGCATGCATTTACC
CTTCGGCGTCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGTAATGCTGGTGCCGATGGATCTGCTCGTGAAG
CTTTCGTTAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAGTATGGCAATGGCTCCTGCAGTCCGCTCTCCC
GCCGCAACTCAGTCCAGTGGAGGTTTTTGTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCACTCAAACTTCTTTGAAGAAAGCCAGAGGCAGACCCCCTGG
CTCTTCTAGCAAAAAGCAGCAGTTGGATGGCTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAAGCTGGAGAGGATGTATCTTCGAAAATAATGT
CATTTTCACAGAATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACTCTACGTCAACCAGCCATGTCTGGTGGAACCGTGACTTACGAG
GGGCGATTCGAGATTTTGTCGCTATCTGGGTCCTATCTCCTCTCTGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTCTAAGTGTTTCATTATCTGGTCCAGATGGTAG
AGTATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCGTCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGATGGGGTGCACAAGGAATTGAGACAAGTAAACC
AAATAGAACAGTCCCCTGTTACTGCACCCCATAAACTTGCTCCAATCCGTGCTGGAATGACGGGAGCCAGCAGCCCGCCATCACGTGGGACTCTCAGTGAATCCTCAGGA
GGGCCCGGGAGTCCGTTTAATCAGAGTGCTATAGCCTGCAATAATAACACCATATATGCACCTGTAATTTATGCTAAGCTAAAGGATATTGTAACTCTTTCCAGTTTGGG
ATGGCTTGGAACCTTGGCCTTGTGCCCGCTGGAAACGAAAGACAGACAGACGACACAGATGGTTTCAGTTAGGAATTAG
mRNA sequence
TTTTTTTTTAAAAGAAAAATTTGGAAAATAATTAGTTACTGAATGATGTGGATCTTCTTCTTCCCGGAGTGGCGAGTGCTAAGATCTGTGCTTGAAAACGGAAAATTGCT
GCAGTTTTATGAAAGTTTATTTGGCTGTTAGGGTTTTTGTTCGAATTTTACTGATTTTGTTCCGAGATTTAAGGTCGGGGAGTGGGACTTCCTTGTATGGAATTTTAGTT
CCAGCGAAGGAGAACAATGTCGGGATCTGAGACCGGAGTGATTTCCAGCGGCGAACCCTTCACCATCGGTCTCCACAAGAATTCAGTACAGTCACAACAGCCGGTCATGC
AGGGCATGCATTTACCCTTCGGCGTCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGTAATGCTGGTGCCGAT
GGATCTGCTCGTGAAGCTTTCGTTAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAGTATGGCAATGGCTCC
TGCAGTCCGCTCTCCCGCCGCAACTCAGTCCAGTGGAGGTTTTTGTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCACTCAAACTTCTTTGAAGAAAGCCA
GAGGCAGACCCCCTGGCTCTTCTAGCAAAAAGCAGCAGTTGGATGGCTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAAGCTGGAGAGGATGTA
TCTTCGAAAATAATGTCATTTTCACAGAATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACTCTACGTCAACCAGCCATGTCTGGTGG
AACCGTGACTTACGAGGGGCGATTCGAGATTTTGTCGCTATCTGGGTCCTATCTCCTCTCTGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTCTAAGTGTTTCATTAT
CTGGTCCAGATGGTAGAGTATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCGTCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGATGGGGTGCACAAGGAA
TTGAGACAAGTAAACCAAATAGAACAGTCCCCTGTTACTGCACCCCATAAACTTGCTCCAATCCGTGCTGGAATGACGGGAGCCAGCAGCCCGCCATCACGTGGGACTCT
CAGTGAATCCTCAGGAGGGCCCGGGAGTCCGTTTAATCAGAGTGCTATAGCCTGCAATAATAACACCATATATGCACCTGTAATTTATGCTAAGCTAAAGGATATTGTAA
CTCTTTCCAGTTTGGGATGGCTTGGAACCTTGGCCTTGTGCCCGCTGGAAACGAAAGACAGACAGACGACACAGATGGTTTCAGTTAGGAATTAG
Protein sequence
MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLPFGVDGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSP
AATQSSGGFCPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYE
GRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESSG
GPGSPFNQSAIACNNNTIYAPVIYAKLKDIVTLSSLGWLGTLALCPLETKDRQTTQMVSVRN
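
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a small sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names CODON_TABLE and translate are illustrative and not taken from CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; line breaks and spaces are ignored."""
    seq = "".join(cds.split()).upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS shown on this page.
print(translate("ATGTCGGGATCTGAGACCGGAGTGATTTCC"))
# MSGSETGVIS
print(translate("GTTAGGAATTAG"))
# VRN
```

Both fragments agree with the start (MSGSETGVIS) and end (VRN) of the listed protein sequence. Passing the complete CDS, with its line breaks, to translate and comparing the result with the full protein would extend the check to the whole record.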